Ing. Jindřich Žďánský, Ph.D.

Line: +420 48535 3066
Position: Employee
Department: Institute of Information Technology and Electronics
Office number: A 02017

Publications

  1. L. Matějů, J. Nouza, P. Červa, J. Žďánský, F. Kynych, Combining Multilingual Resources and Models to Develop State-of-the-Art E2E ASR for Swedish, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Dublin, ISCA, p. 3252-3256, 5 pages, ISSN: 2308-457X, [Online], 2023
  2. J. Nouza, L. Matějů, P. Červa, J. Žďánský, Developing State-of-the-Art End-to-End ASR for Norwegian, Lecture Notes in Computer Science - including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, Springer Science and Business, ISBN: 978-303140497-9, p. 200-213, 14 pages, ISSN: 0302-9743, [Online], 2023
  3. M. Poláček, P. Červa, J. Žďánský, L. Weingartová, Online Punctuation Restoration using ELECTRA Model for streaming ASR Systems, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Ireland, ISCA, p. 446-450, 5 pages, ISSN: 2308-457X, [Online], 2023
  4. F. Kynych, J. Žďánský, P. Červa, L. Matějů, Online Speaker Diarization Using Optimized SE-ResNet Architecture, Lecture Notes in Computer Science, Germany, Springer, ISBN: 978-303140497-9, p. 176-187, 12 pages, ISSN: 0302-9743, [Online], 2023
  5. J. Nouza, P. Červa, J. Žďánský, Lexicon-based vs. Lexicon-free ASR for Norwegian Parliament Speech Transcription, Lecture Notes in Computer Science, SPRINGER-VERLAG BERLIN, ISBN: 978-303116269-5, p. 401-409, 9 pages, ISSN: 0302-9743, [Online], 2022
  6. L. Matějů, F. Kynych, P. Červa, J. Málek, J. Žďánský, Overlapped Speech Detection in Broadcast Streams Using X-vectors, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, South Korea, ISCA, p. 4606-4610, 5 pages, ISSN: 2308-457X, [Online], 2022
  7. J. Málek, J. Janský, Z. Koldovský, T. Kounovský, J. Čmejla, J. Žďánský, Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, IEEE, p. 2295-2309, 15 pages, ISSN: 2329-9290, n. 30, [Online], 2022
  8. J. Málek, J. Janský, T. Kounovský, Z. Koldovský, J. Žďánský, Blind extraction of moving audio source in a challenging environment supported by speaker identification via X-vectors, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, USA, IEEE, 1, p. 226-230, 5 pages, ISSN: 1520-6149, [Online], 2021
  9. P. Červa, L. Matějů, J. Žďánský, R. Šafařík, J. Nouza, Identification of related languages from spoken data: Moving from off-line to on-line scenario, Computer Speech and Language, Elsevier, 19 pages, ISSN: 0885-2308, [Online], 2021
  10. P. Červa, L. Matějů, F. Kynych, J. Žďánský, J. Nouza, Identification of Scandinavian Languages from Speech Using Bottleneck Features and X-vectors, Lecture Notes in Computer Science, Switzerland, Springer Nature Switzerland AG, ISBN: 978-303083526-2, p. 371-381, 11 pages, ISSN: 0302-9743, [Online], 2021
  11. L. Matějů, F. Kynych, P. Červa, J. Žďánský, J. Málek, Using X-vectors for Speech Activity Detection in Broadcast Streams, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, ISCA, ISBN: 978-171383690-2, p. 4161-4165, 5 pages, ISSN: 2308-457X, [Online], 2021
  12. J. Janský, J. Málek, J. Čmejla, T. Kounovský, Z. Koldovský, J. Žďánský, Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Barcelona, IEEE, 1, ISBN: 978-1-5090-6631-5, p. 676-680, 5 pages, ISSN: 1520-6149, [Online], 2020
  13. J. Chaloupka, K. Paleček, P. Červa, J. Žďánský, Optical Character Recognition for Audio-Visual Broadcast Transcription System, 11th IEEE International Conference on Cognitive Infocommunications, CogInfoCom 2020 - Proceedings, Finland, IEEE, 1, ISBN: 978-172818213-1, p. 229-232, 4 pages, [Online], 2020
  14. J. Nouza, P. Červa, J. Žďánský, Very Fast Keyword Spotting System with Real Time Factor below 0.01, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 23rd International Conference on Text, Speech, and Dialogue, TSD 2020, Switzerland, Springer Nature Switzerland, 1, ISBN: 978-303058322-4, p. 426-436, 11 pages, ISSN: 0302-9743, [Online], 2020
  15. J. Málek, J. Žďánský, Voice-activity and overlapped speech detection using x-vectors, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 23rd International Conference on Text, Speech, and Dialogue, TSD 2020, Switzerland, Springer Nature Switzerland, 1, ISBN: 978-303058322-4, p. 366-376, 11 pages, ISSN: 0302-9743, [Online], 2020
  16. L. Matějů, P. Červa, J. Žďánský, An Approach to Online Speaker Change Point Detection Using DNNs and WFSTs, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Austria, ISCA, 1, p. 649-653, 5 pages, ISSN: 2308-457X, 2019
  17. J. Málek, J. Žďánský, On Practical Aspects of Multi-condition Training Based on Augmentation for Reverberation-/Noise-Robust Speech Recognition, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Switzerland, Springer Nature Switzerland AG., 1, ISBN: 978-303027946-2, p. 251-263, 13 pages, ISSN: 0302-9743, 2019
  18. J. Málek, J. Žďánský, P. Červa, Robust Recognition of Conversational Telephone Speech via Multi-Condition Training and Data Augmentation, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 21st International Conference on Text, Speech, and Dialogue, TSD 2018, Springer Verlag, ISBN: 978-303000793-5, p. 324-333, 10 pages, ISSN: 0302-9743, 2018
  19. J. Málek, J. Žďánský, P. Červa, Robust Recognition of Speech with Background Music in Acoustically Under-Resourced Scenarios, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Canada, IEEE, 1, ISBN: 978-153864658-8, p. 5624-5628, 5 pages, ISSN: 1520-6149, 2018
  20. L. Matějů, P. Červa, J. Žďánský, R. Šafařík, Using Deep Neural Networks for Identification of Slavic Languages from Acoustic Signal, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, India, ISCA, 1, p. 1803-1807, 5 pages, ISSN: 2308-457X, 2018
  21. L. Matějů, P. Červa, J. Žďánský, Investigation into the Use of WFSTs and DNNs for Speech Activity Detection in Broadcast Data Transcription, Communications in Computer and Information Science, Federal Republic of Germany, Springer Verlag, ISBN: 978-331967875-7, p. 341-358, 18 pages, ISSN: 1865-0929, 2017
  22. J. Nouza, P. Červa, J. Žďánský, S. Čihák, K. Bureš, Multilingvální platforma pro monitoring a analýzu multimédií [Multilingual platform for multimedia monitoring and analysis], 2017
  23. J. Málek, J. Žďánský, P. Červa, Robust Automatic Recognition of Speech with Background Music, 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017, Hilton New Orleans Riverside, New Orleans, 5 March 2017 through 9 March 2017, USA, Institute of Electrical and Electronics Engineers Inc., Article number 7953150, ISBN: 978-1-5090-4117-6, p. 5210-5214, 5 pages, ISSN: 1520-6149, 2017
  24. L. Matějů, P. Červa, J. Žďánský, J. Málek, Speech Activity Detection in Online Broadcast Transcription Using Deep Neural Networks and Weighted Finite State Transducers, 2017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017, USA, Institute of Electrical and Electronics Engineers Inc., ISBN: 978-1-5090-4117-6, p. 5460-5464, 5 pages, ISSN: 1520-6149, 2017
  25. L. Matějů, P. Červa, J. Žďánský, Study on the use of deep neural networks for speech activity detection in broadcast recordings, ICETE 2016 - Proceedings of the 13th International Joint Conference on e-Business and Telecommunications, Lisbon, Portugal, SciTePress, ISBN: 978-989-758-196-0, p. 45-51, 7 pages, 2016
  26. J. Málek, J. Silovský, P. Červa, Z. Koldovský, J. Nouza, J. Žďánský, Compensation of Nonlinear Distortions in Speech for Automatic Recognition, 38th International Conference on Telecommunications and Signal Processing, TSP 2015, Prague, Czech Republic, Institute of Electrical and Electronics Engineers Inc., 1, ISBN: 978-1-4799-8498-5, p. 419-423, 5 pages, 2015
  27. L. Matějů, P. Červa, J. Žďánský, Investigation into the use of deep neural networks for LVCSR of Czech, 2015 IEEE International Workshop of Electronics, Control, Measurement, Signals and their application to Mechatronics, Czech Republic, IEEE, 1, ISBN: 978-1-4799-6972-2, p. 38-41, 4 pages, 2015
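  28. J. Nouza, P. Červa, J. Žďánský, K. Blavka, M. Boháč, J. Silovský, J. Chaloupka, M. Kuchařová, J. Málek, Unikátní softwarová technologická platforma pro přepisy archivů historických i současných pořadů ČRo a jejich zpřístupnění pomocí webu [Unique software technology platform for transcribing archives of historical and contemporary ČRo (Czech Radio) programmes and making them accessible via the web], 2014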