Follow
Yiwen Shao
Title
Cited by
Cited by
Year
Espresso: A fast end-to-end neural speech recognition toolkit
Y Wang, T Chen, H Xu, S Ding, H Lv, Y Shao, N Peng, L Xie, S Watanabe, ...
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
812019
Speaker diarization with region proposal network
Z Huang, S Watanabe, Y Fujita, P García, Y Shao, D Povey, S Khudanpur
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
682020
PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR
Y Shao, Y Wang, D Povey, S Khudanpur
Proc. Interspeech 2020, 561-565, 2020
452020
Adversarial attacks and defenses for speech recognition systems
P ¯elasko, S Joshi, Y Shao, J Villalba, J Trmal, N Dehak, S Khudanpur
arXiv preprint arXiv:2103.17122, 2021
242021
Using ASR methods for OCR
A Arora, CC Chang, B Rekabdar, B BabaAli, D Povey, D Etter, D Raj, ...
2019 International Conference on Document Analysis and Recognition (ICDAR …, 2019
232019
Multi-channel multi-speaker ASR using 3D spatial feature
Y Shao, SX Zhang, D Yu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
112022
Use of pitch continuity for robust speech activity detection
Y Shao, Q Lin
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
112018
Defense against adversarial attacks on hybrid speech recognition using joint adversarial fine-tuning with denoiser
S Joshi, S Kataria, Y Shao, P Zelasko, J Villalba, S Khudanpur, N Dehak
arXiv preprint arXiv:2204.03851, 2022
72022
A Novel Normalization Method for Autocorrelation Function for Pitch Detection and for Speech Activity Detection.
Q Lin, Y Shao
Interspeech, 2097-2101, 2018
72018
UniX-Encoder: A Universal X-Channel Speech Encoder for AD-HOC Microphone Array Speech Processing
Z Huang, Y Shao, SX Zhang, D Yu
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
12024
Chunking Defense for Adversarial Attacks on ASR
Y Shao, J Villalba, S Joshi, S Kataria, S Khudanpur, N Dehak
Proc. Interspeech 2022, 2022
12022
Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser
S Joshi, S Kataria, Y Shao, P ¯elasko, J Villalba, S Khudanpur, N Dehak
Proc. Interspeech 2022, 2022
12022
RIR-SF: Room Impulse Response Based Spatial Feature for Multi-channel Multi-talker ASR
Y Shao, SX Zhang, D Yu
arXiv preprint arXiv:2311.00146, 2023
2023
Challenges and Insights: Exploring 3D Spatial Features and Complex Networks on the MISP Dataset
Y Shao
arXiv preprint arXiv:2310.03901, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–14