kontakt.tul.cz

Ing. Jindřich Žďánský, Ph.D.

Line: +420 48535 3066
Position: Employee
Department: Institute of Information Technology and Electronics
Office number: A02017

Publications

  1. J. Nouza, P. Červa, J. Žďánský, Lexicon-based vs. Lexicon-free ASR for Norwegian Parliament Speech Transcription, TSD 2022, 8 pages, 2022
  2. L. Matějů, F. Kynych, P. Červa, J. Málek, J. Žďánský, Overlapped Speech Detection in Broadcast Streams Using X-vectors, INTERSPEECH 2022, 4 pages, 2022
  3. J. Málek, J. Janský, Z. Koldovský, T. Kounovský, J. Čmejla, J. Žďánský, Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, IEEE, p. 2295-2309, 15 pages, ISSN: 2329-9290, n. 30, [Online], 2022
  4. J. Málek, J. Janský, T. Kounovský, Z. Koldovský, J. Žďánský, Blind extraction of moving audio source in a challenging environment supported by speaker identification via X-vectors, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, USA, IEEE, 1, p. 226-230, 5 pages, ISSN: 1520-6149, [Online], 2021
  5. P. Červa, L. Matějů, J. Žďánský, R. Šafařík, J. Nouza, Identification of related languages from spoken data: Moving from off-line to on-line scenario, Computer Speech and Language, Elsevier, 19 pages, ISSN: 0885-2308, [Online], 2021
  6. P. Červa, L. Matějů, F. Kynych, J. Žďánský, J. Nouza, Identification of Scandinavian Languages from Speech Using Bottleneck Features and X-vectors, Lecture Notes in Computer Science, Switzerland, Springer Nature Switzerland AG, ISBN: 978-303083526-2, p. 371-381, 11 pages, ISSN: 0302-9743, [Online], 2021
  7. L. Matějů, F. Kynych, P. Červa, J. Žďánský, J. Málek, Using X-vectors for Speech Activity Detection in Broadcast Streams, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, ISCA, ISBN: 978-171383690-2, p. 4161-4165, 5 pages, ISSN: 2308-457X, [Online], 2021
  8. J. Janský, J. Málek, J. Čmejla, T. Kounovský, Z. Koldovský, J. Žďánský, Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Barcelona, IEEE, 1, ISBN: 978-1-5090-6631-5, p. 676-680, 5 pages, ISSN: 1520-6149, [Online], 2020
  9. J. Chaloupka, K. Paleček, P. Červa, J. Žďánský, Optical Character Recognition for Audio-Visual Broadcast Transcription System, 11th IEEE International Conference on Cognitive Infocommunications, CogInfoCom 2020 - Proceedings, Finland, IEEE, 1, ISBN: 978-172818213-1, p. 229-232, 4 pages, [Online], 2020
  10. J. Nouza, P. Červa, J. Žďánský, Very Fast Keyword Spotting System with Real Time Factor below 0.01, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 23rd International Conference on Text, Speech, and Dialogue, TSD 2020, Switzerland, Springer Nature Switzerland, 1, ISBN: 978-303058322-4, p. 426-436, 11 pages, ISSN: 0302-9743, [Online], 2020
  11. J. Málek, J. Žďánský, Voice-activity and overlapped speech detection using x-vectors, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 23rd International Conference on Text, Speech, and Dialogue, TSD 2020, Switzerland, Springer Nature Switzerland, 1, ISBN: 978-303058322-4, p. 366-376, 11 pages, ISSN: 0302-9743, [Online], 2020
  12. L. Matějů, P. Červa, J. Žďánský, An Approach to Online Speaker Change Point Detection Using DNNs and WFSTs, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Austria, ISCA, 1, p. 649-653, 5 pages, ISSN: 2308-457X, 2019
  13. J. Málek, J. Žďánský, On Practical Aspects of Multi-condition Training Based on Augmentation for Reverberation-/Noise-Robust Speech Recognition, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Switzerland, Springer Nature Switzerland AG, 1, ISBN: 978-303027946-2, p. 251-263, 13 pages, ISSN: 0302-9743, 2019
  14. J. Málek, J. Žďánský, P. Červa, Robust Recognition of Conversational Telephone Speech via Multi-Condition Training and Data Augmentation, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 21st International Conference on Text, Speech, and Dialogue, TSD 2018, Springer Verlag, ISBN: 978-303000793-5, p. 324-333, 10 pages, ISSN: 0302-9743, 2018
  15. J. Málek, J. Žďánský, P. Červa, Robust Recognition of Speech with Background Music in Acoustically Under-Resourced Scenarios, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Canada, IEEE, 1, ISBN: 978-153864658-8, p. 5624-5628, 5 pages, ISSN: 1520-6149, 2018
  16. L. Matějů, P. Červa, J. Žďánský, R. Šafařík, Using Deep Neural Networks for Identification of Slavic Languages from Acoustic Signal, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, India, ISCA, 1, p. 1803-1807, 5 pages, ISSN: 2308-457X, 2018
  17. L. Matějů, P. Červa, J. Žďánský, Investigation into the Use of WFSTs and DNNs for Speech Activity Detection in Broadcast Data Transcription, Communications in Computer and Information Science, Germany, Springer Verlag, ISBN: 978-331967875-7, p. 341-358, 18 pages, ISSN: 1865-0929, 2017
  18. J. Nouza, P. Červa, J. Žďánský, S. Čihák, K. Bureš, Multilingvální platforma pro monitoring a analýzu multimédií [Multilingual platform for multimedia monitoring and analysis], 2017
  19. J. Málek, J. Žďánský, P. Červa, Robust Automatic Recognition of Speech with Background Music, 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017, Hilton New Orleans Riverside, New Orleans, 5 March 2017 through 9 March 2017, Article number 7953150, USA, Institute of Electrical and Electronics Engineers Inc., ISBN: 978-1-5090-4117-6, p. 5210-5214, 5 pages, ISSN: 1520-6149, 2017
  20. L. Matějů, P. Červa, J. Žďánský, J. Málek, Speech Activity Detection in Online Broadcast Transcription Using Deep Neural Networks and Weighted Finite State Transducers, 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017, USA, Institute of Electrical and Electronics Engineers Inc., ISBN: 978-1-5090-4117-6, p. 5460-5464, 5 pages, ISSN: 1520-6149, 2017
  21. L. Matějů, P. Červa, J. Žďánský, Study on the use of deep neural networks for speech activity detection in broadcast recordings, ICETE 2016 - Proceedings of the 13th International Joint Conference on e-Business and Telecommunications, Lisbon, Portugal, SciTePress, ISBN: 978-989-758-196-0, p. 45-51, 7 pages, 2016
  22. J. Málek, J. Silovský, P. Červa, Z. Koldovský, J. Nouza, J. Žďánský, Compensation of Nonlinear Distortions in Speech for Automatic Recognition, 38th International Conference on Telecommunications and Signal Processing, TSP 2015, Prague, Czech Republic, Institute of Electrical and Electronics Engineers Inc., 1, ISBN: 978-1-4799-8498-5, p. 419-423, 5 pages, 2015
  23. L. Matějů, P. Červa, J. Žďánský, Investigation into the use of deep neural networks for LVCSR of Czech, 2015 IEEE International Workshop of Electronics, Control, Measurement, Signals and their application to Mechatronics, Czech Republic, IEEE, 1, ISBN: 978-1-4799-6972-2, p. 38-41, 4 pages, 2015
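  24. J. Nouza, P. Červa, J. Žďánský, K. Blavka, M. Boháč, J. Silovský, J. Chaloupka, M. Kuchařová, J. Málek, Unikátní softwarová technologická platforma pro přepisy archivů historických i současných pořadů ČRo a jejich zpřístupnění pomocí webu [Unique software technology platform for transcribing archives of historical and contemporary Czech Radio (ČRo) programmes and making them accessible via the web], 2014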