Yiwen Shao

Cited by

	All	Since 2019
Citations	280	279
h-index	7	7
i10-index	7	7

20182019202020212022202320241 5 29 82 73 70 20

Public access

View all

4 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Sanjeev KhudanpurThe Johns Hopkins UniversityVerified email at jhu.edu
Shinji WatanabeCarnegie Mellon UniversityVerified email at cmu.edu
Daniel PoveyChief Speech Scientist, Xiaomi Corp.Verified email at xiaomi.com
Yiming WangMicrosoftVerified email at microsoft.com
Sonal JoshiJohns Hopkins UniversityVerified email at tcs.com
Jesús VillalbaJohns Hopkins UniversityVerified email at jhu.edu
Zili HuangJohns Hopkins UniversityVerified email at jhu.edu
Piotr ŻelaskoPrincipal Research Scientist @ NvidiaVerified email at nvidia.com
Shi-Xiong (Austin) ZhangSr. Director | AI Foundations@Capital One | ex-Microsoft, ex-Tencent, Cambridge PhDVerified email at capitalone.com
Dong Yu (俞栋)Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA FellowVerified email at global.tencent.com
Qiguang LinAgileSpeechVerified email at agilespeech.com

Yiwen Shao

Johns Hopkins University

Verified email at jhu.edu

speech recognition machine learning deep learning Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Espresso: A fast end-to-end neural speech recognition toolkit Y Wang, T Chen, H Xu, S Ding, H Lv, Y Shao, N Peng, L Xie, S Watanabe, ... 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	81	2019
Speaker diarization with region proposal network Z Huang, S Watanabe, Y Fujita, P García, Y Shao, D Povey, S Khudanpur ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	68	2020
PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR Y Shao, Y Wang, D Povey, S Khudanpur Proc. Interspeech 2020, 561-565, 2020	45	2020
Adversarial attacks and defenses for speech recognition systems P Żelasko, S Joshi, Y Shao, J Villalba, J Trmal, N Dehak, S Khudanpur arXiv preprint arXiv:2103.17122, 2021	24	2021
Using ASR methods for OCR A Arora, CC Chang, B Rekabdar, B BabaAli, D Povey, D Etter, D Raj, ... 2019 International Conference on Document Analysis and Recognition (ICDAR …, 2019	23	2019
Multi-channel multi-speaker ASR using 3D spatial feature Y Shao, SX Zhang, D Yu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	11	2022
Use of pitch continuity for robust speech activity detection Y Shao, Q Lin 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	11	2018
Defense against adversarial attacks on hybrid speech recognition using joint adversarial fine-tuning with denoiser S Joshi, S Kataria, Y Shao, P Zelasko, J Villalba, S Khudanpur, N Dehak arXiv preprint arXiv:2204.03851, 2022	7	2022
A Novel Normalization Method for Autocorrelation Function for Pitch Detection and for Speech Activity Detection. Q Lin, Y Shao Interspeech, 2097-2101, 2018	7	2018
UniX-Encoder: A Universal X-Channel Speech Encoder for AD-HOC Microphone Array Speech Processing Z Huang, Y Shao, SX Zhang, D Yu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	1	2024
Chunking Defense for Adversarial Attacks on ASR Y Shao, J Villalba, S Joshi, S Kataria, S Khudanpur, N Dehak Proc. Interspeech 2022, 2022	1	2022
Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser S Joshi, S Kataria, Y Shao, P Żelasko, J Villalba, S Khudanpur, N Dehak Proc. Interspeech 2022, 2022	1	2022
RIR-SF: Room Impulse Response Based Spatial Feature for Multi-channel Multi-talker ASR Y Shao, SX Zhang, D Yu arXiv preprint arXiv:2311.00146, 2023		2023
Challenges and Insights: Exploring 3D Spatial Features and Complex Networks on the MISP Dataset Y Shao arXiv preprint arXiv:2310.03901, 2023		2023

The system can't perform the operation now. Try again later.

Articles 1–14

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors