Ing. Jindřich Žďánský, Ph.D.

Line	Position	Department	Office number
+420 48535 6857	Employee	Institute of Information Technology and Electronics	A 02021

Publications

L. Matějů, J. Nouza, P. Červa, J. Žďánský, Combining multilingual resources to enhance end-to-end speech recognition systems for Scandinavian languages, SPEECH COMMUNICATION, Elsevier B.V., 13 pages, ISSN: 0167-6393, n. MAY, [Online], 2025
M. Poláček, P. Červa, J. Žďánský, Lightweight online punctuation and capitalization restoration for streaming ASR systems, SPEECH COMMUNICATION, Elsevier B.V., 4 pages, ISSN: 0167-6393, n. SEP, [Online], 2025
F. Kynych, P. Červa, J. Žďánský, T. Svendsen, G. Salvi, A lightweight approach to real-time speaker diarization: from audio toward audio-visual data streams, Eurasip Journal on Audio, Speech, and Music Processing, Springer, 16 pages, ISSN: 1687-4722, n. 1, [Online], 2024
P. Červa, J. Nouza, J. Žďánský, L. Matějů, Softwarové moduly pro automatický přepis a zpracování mluvené dánštiny, 2024
P. Červa, J. Nouza, J. Žďánský, L. Matějů, Softwarové moduly pro automatický přepis a zpracování mluvené norštiny, 2024
P. Červa, J. Nouza, J. Žďánský, L. Matějů, Softwarové moduly pro automatický přepis a zpracování mluvené švédštiny, 2024
F. Kynych, P. Červa, J. Žďánský, Systém pro online diarizaci mluvčích v audiovizuálních datových proudech, 2024
L. Matějů, J. Nouza, P. Červa, J. Žďánský, F. Kynych, Combining Multilingual Resources and Models to Develop State-of-the-Art E2E ASR for Swedish, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Dublin, ISCA, p. 3252 - 3256, 5 pages, ISSN: 2308-457X, [Online], 2023
J. Nouza, L. Matějů, P. Červa, J. Žďánský, Developing State-of-the-Art End-to-End ASR for Norwegian, Lecture Notes in Computer Science - including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, Springer Science and Business, ISBN: 978-303140497-9, p. 200-213, 14 pages, ISSN: 03029743, [Online], 2023
M. Poláček, P. Červa, J. Žďánský, L. Weingartová, Online Punctuation Restoration using ELECTRA Model for streaming ASR Systems, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Irsko, ISCA, p. 446-450, 5 pages, ISSN: 2308-457X, [Online], 2023
F. Kynych, J. Žďánský, P. Červa, L. Matějů, Online Speaker Diarization Using Optimized SE-ResNet Architecture, Lecture Notes in Computer Science, Německo, Springer, ISBN: 978-303140497-9, p. 176-187, 12 pages, ISSN: 03029743, [Online], 2023
J. Nouza, P. Červa, J. Žďánský, Lexicon-based vs. Lexicon-free ASR for Norwegian Parliament Speech Transcription, Lecture Notes in Computer Science, SPRINGER-VERLAG BERLIN, ISBN: 978-303116269-5, p. 401-409, 9 pages, ISSN: 0302-9743, [Online], 2022
L. Matějů, F. Kynych, P. Červa, J. Málek, J. Žďánský, Overlapped Speech Detection in Broadcast Streams Using X-vectors, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Jižní Korea, ISCA, p. 4606 - 4610, 4 pages, ISSN: 2308-457X, [Online], 2022
J. Málek, J. Janský, Z. Koldovský, T. Kounovský, J. Čmejla, J. Žďánský, Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification, IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, IEEE, p. 2295-2309, 15 pages, ISSN: 2329-9290, n. 30, [Online], 2022
J. Málek, J. Janský, T. Kounovský, Z. Koldovský, J. Žďánský, Blind extraction of moving audio source in a challenging environment supported by speaker identification via X-vectors, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, USA, IEEE, 1, p. 226-230, 5 pages, ISSN: 1520-6149, [Online], 2021
P. Červa, L. Matějů, J. Žďánský, R. Šafařík, J. Nouza, Identification of related languages from spoken data: Moving from off-line to on-line scenario, Computer Speech and Language, Elsevier, 19 pages, ISSN: 0885-2308, n. JUL, [Online], 2021
P. Červa, L. Matějů, F. Kynych, J. Žďánský, J. Nouza, Identification of Scandinavian Languages from Speech Using Bottleneck Features and X-vectors, Lecture Notes in Computer Science, Switzerland, Springer Nature Switzerland AG, ISBN: 978-303083526-2, p. 371-381, 11 pages, ISSN: 0302-9743, [Online], 2021
L. Matějů, F. Kynych, P. Červa, J. Žďánský, J. Málek, Using X-vectors for Speech Activity Detection in Broadcast Streams, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, ISCA, ISBN: 978-171383690-2, p. 4161 - 4165, 5 pages, ISSN: 2308-457X, [Online], 2021
J. Janský, J. Málek, J. Čmejla, T. Kounovský, Z. Koldovský, J. Žďánský, Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Barcelona, IEEE, 1, ISBN: 978-1-5090-6631-5, p. 676-680, 5 pages, ISSN: 1520-6149, [Online], 2020
J. Chaloupka, K. Paleček, P. Červa, J. Žďánský, Optical Character Recognition for Audio-Visual Broadcast Transcription System, 11th IEEE International Conference on Cognitive Infocommunications, CogInfoCom 2020 - Proceedings, Finsko, IEEE, 1, ISBN: 978-172818213-1, p. 229-232, 4 pages, [Online], 2020
J. Nouza, P. Červa, J. Žďánský, Very Fast Keyword Spotting System with Real Time Factor below 0.01, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 23rd International Conference on Text, Speech, and Dialogue, TSD 2020, Switzerland, Springer Nature Switzerland, 1, ISBN: 978-303058322-4, p. 426-436, 11 pages, ISSN: 0302-9743, [Online], 2020
J. Málek, J. Žďánský, Voice-activity and overlapped speech detection using x-vectors, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 23rd International Conference on Text, Speech, and Dialogue, TSD 2020, Switzerland, Springer Nature Switzerland, 1, ISBN: 978-303058322-4, p. 366-376, 11 pages, ISSN: 0302-9743, [Online], 2020
L. Matějů, P. Červa, J. Žďánský, An Approach to Online Speaker Change Point Detection Using DNNs and WFSTs, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Austria, ISCA, 1, p. 649-653, 5 pages, ISSN: 2308-457X, 2019
J. Málek, J. Žďánský, On Practical Aspects of Multi-condition Training Based on Augmentation for Reverberation-/Noise-Robust Speech Recognition, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Switzerland, Springer Nature Switzerland AG., 1, ISBN: 978-303027946-2, p. 251-263, 13 pages, ISSN: 0302-9743, 2019
J. Málek, J. Žďánský, P. Červa, Robust Recognition of Conversational Telephone Speech via Multi-Condition Training and Data Augmentation, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) - 21st International Conference on Text, Speech, and Dialogue, TSD 2018, Springer Verlag, ISBN: 978-303000793-5, p. 324-333, 10 pages, ISSN: 0302-9743, 2018
J. Málek, J. Žďánský, P. Červa, Robust Recognition of Speech with Background Music in Acoustically Under-Resourced Scenarios, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Kanada, IEEE, 1, ISBN: 978-153864658-8, p. 5624-5628, 5 pages, ISSN: 1520-6149, 2018
L. Matějů, P. Červa, J. Žďánský, R. Šafařík, Using Deep Neural Networks for Identification of Slavic Languages from Acoustic Signal, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Indie, ISCA, 1, p. 1803-1807, 5 pages, ISSN: 2308-457X, 2018
L. Matějů, P. Červa, J. Žďánský, Investigation into the Use of WFSTs and DNNs for Speech Activity Detection in Broadcast Data Transcription, Communications in Computer and Information Science, Spolková republika Německo, Springer Verlag, ISBN: 978-331967875-7, p. 341-358, 18 pages, ISSN: 1865-0929, n. July, 2017
J. Nouza, P. Červa, J. Žďánský, S. Čihák, K. Bureš, Multilingvální platforma pro monitoring a analýzu multimédií, 2017
J. Málek, J. Žďánský, P. Červa, Robust Automatic Recognition of Speech with Background Music, 16 June 2017, Article number 7953150, Pages 5210-52142017 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017; Hilton New Orleans RiversideNew Orleans; United States; 5 March 2017 through 9 March 2017; Category numberCFP, USA, Institute of Electrical and Electronics Engineers Inc., ISBN: 978-1-5090-4117-6, p. 5210-5214, 5 pages, ISSN: 1520-6149, 2017
L. Matějů, P. Červa, J. Žďánský, J. Málek, Speech Activity Detection in Online Broadcast Transcription Using Deep Neural Networks and Weighted Finite State Transducers, 2017 IEEE IICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedingsnternational Conference on Acoustics, Speech, and Signal Processing, ICASSP 2017, USA, Institute of Electrical and Electronics Engineers Inc., ISBN: 978-1-5090-4117-6, p. 5460-5464, 5 pages, ISSN: 1520-6149, 2017
L. Matějů, P. Červa, J. Žďánský, Study on the use of deep neural networks for speech activity detection in broadcast recordings, ICETE 2016 - Proceedings of the 13th International Joint Conference on e-Business and Telecommunications, Lisabon, Portugalsko, SciTePress, ISBN: 978-989-758-196-0, p. 45-51, 7 pages, 2016
J. Málek, J. Silovský, P. Červa, Z. Koldovský, J. Nouza, J. Žďánský, Compensation of Nonlinear Distortions in Speech for Automatic Recognition, 38th International Conference on Telecommunications and Signal Processing, TSP 2015, Praha, Česká Republika, Institute of Electrical and Electronics Engineers Inc., 1, ISBN: 978-1-4799-8498-5, p. 419-423, 5 pages, 2015
L. Matějů, P. Červa, J. Žďánský, Investigation into the use of deep neural networks for LVCSR of Czech, 2015 IEEE International Workshop of Electronics, Control, Measurement, Signals and their application to Mechatronics, Česká Republika, IEEE, 1, ISBN: 978-1-4799-6972-2, p. 38-41, 4 pages, 2015
J. Nouza, P. Červa, J. Žďánský, K. Blavka, M. Boháč, J. Silovský, J. Chaloupka, M. Kuchařová, J. Málek, Unikátní softwarová technologická platforma pro přepisy archivů historických i současných pořadů ČRo a jejich zpřístupnění pomocí webu, 2014