Wei-Ning Hsu

Cited by

	All	Since 2019
Citations	7771	7516
h-index	38	34
i10-index	62	61

Citations per year

Year	Citations
2017	54
2018	182
2019	331
2020	499
2021	727
2022	1683
2023	3081
2024	1165

Public access

1 article available
0 articles not available

Based on funding mandates

Co-authors

James Glass	MIT Computer Science and Artificial Intelligence Laboratory	Verified email at mit.edu
Abdelrahman Mohamed	Research scientist, Facebook AI Research	Verified email at fb.com
Alexei Baevski	Facebook AI Research	Verified email at fb.com
Michael Auli	Meta, FAIR	Verified email at meta.com
Yu Zhang	OpenAI	Verified email at csail.mit.edu
Bowen Shi	Facebook AI Research	Verified email at meta.com
Emmanuel Dupoux	Professor of Cognitive Psychology, Ecole des Hautes Etudes en Sciences Sociales, Paris	Verified email at ehess.fr
Yu-An Chung	Facebook AI Research (FAIR)	Verified email at fb.com
Yuxuan Wang	ByteDance	Verified email at cse.ohio-state.edu
Apoorv Vyas	FAIR Labs Meta	Verified email at meta.com
Gabriel Synnaeve	Research scientist at Facebook AI Research	Verified email at fb.com
David Harwath	The University of Texas at Austin	Verified email at utexas.edu
Ron J Weiss	Google	Verified email at google.com
Andros Tjandra	Facebook AI (research scientist)	Verified email at fb.com
Matthew Le	Facebook AI Research	Verified email at fb.com
Awni Hannun	Machine Learning Research, Apple	Verified email at apple.com
Hsuan-Tien Lin	Professor of Computer Science and Information Engineering, National Taiwan University	Verified email at csie.ntu.edu.tw
Jonathan Le Roux	MERL	Verified email at merl.com
John Hershey	Google (formerly MERL, IBM, MSR, UCSD)	Verified email at google.com
Shinji Watanabe	Carnegie Mellon University	Verified email at cmu.edu

Wei-Ning Hsu

Facebook AI Research (FAIR)

Verified email at csail.mit.edu

Speech Processing, Machine Learning, Natural Language Processing


Title	Authors	Venue	Cited by	Year
HuBERT: Self-supervised speech representation learning by masked prediction of hidden units	WN Hsu, B Bolte, YHH Tsai, K Lakhotia, R Salakhutdinov, A Mohamed	IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3451-3460, 2021	1908	2021
Data2vec: A general framework for self-supervised learning in speech, vision and language	A Baevski, WN Hsu, Q Xu, A Babu, J Gu, M Auli	International Conference on Machine Learning, 1298-1312, 2022	648	2022
An unsupervised autoregressive model for speech representation learning	YA Chung, WN Hsu, H Tang, J Glass	INTERSPEECH, 2019	434	2019
Unsupervised learning of disentangled and interpretable representations from sequential data	WN Hsu, Y Zhang, J Glass	Thirty-first Conference on Neural Information Processing Systems (NeurIPS), 2017	397	2017
Hierarchical generative modeling for controllable speech synthesis	WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ...	Seventh International Conference on Learning Representations (ICLR), 2019	293*	2019
Unsupervised speech recognition	A Baevski, WN Hsu, A Conneau, M Auli	Advances in Neural Information Processing Systems 34, 27826-27839, 2021	265	2021
On generative spoken language modeling from raw audio	K Lakhotia, E Kharitonov, WN Hsu, Y Adi, A Polyak, B Bolte, TA Nguyen, ...	Transactions of the Association for Computational Linguistics 9, 1336-1354, 2021	243	2021
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations	A Polyak, Y Adi, J Copet, E Kharitonov, K Lakhotia, WN Hsu, A Mohamed, ...	INTERSPEECH, 2021	216	2021
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training	WN Hsu, A Sriram, A Baevski, T Likhomanenko, Q Xu, V Pratap, J Kahn, ...	INTERSPEECH, 2021	211	2021
Learning audio-visual speech representation by masked multimodal cluster prediction	B Shi, WN Hsu, K Lakhotia, A Mohamed	arXiv preprint arXiv:2201.02184, 2022	203	2022
Lingvo: a modular and scalable framework for sequence-to-sequence modeling	J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...	arXiv preprint arXiv:1902.08295, 2019	199	2019
Active learning by learning	WN Hsu, HT Lin	Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015	183	2015
Learning Latent Representations for Speech Generation and Transformation	WN Hsu, Y Zhang, J Glass	INTERSPEECH, 1273-1277, 2017	176	2017
Unsupervised domain adaptation for robust speech recognition via variational autoencoder-based data augmentation	WN Hsu, Y Zhang, J Glass	2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 16-23, 2017	159	2017
Semi-supervised training for improving data efficiency in end-to-end speech synthesis	YA Chung, Y Wang, WN Hsu, Y Zhang, RJ Skerry-Ryan	ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	136	2019
Disentangling correlated speaker and noise for speech synthesis via data augmentation and adversarial factorization	WN Hsu, Y Zhang, RJ Weiss, YA Chung, Y Wang, Y Wu, J Glass	ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	123	2019
Direct speech-to-speech translation with discrete units	A Lee, PJ Chen, C Wang, J Gu, S Popuri, X Ma, A Polyak, Y Adi, Q He, ...	arXiv preprint arXiv:2107.05604, 2021	113	2021
Scaling speech technology to 1,000+ languages	V Pratap, A Tjandra, B Shi, P Tomasello, A Babu, S Kundu, A Elkahky, ...	Journal of Machine Learning Research 25 (97), 1-52, 2024	108	2024
Textless speech-to-speech translation on real data	A Lee, H Gong, PA Duquenne, H Schwenk, PJ Chen, C Wang, S Popuri, ...	arXiv preprint arXiv:2112.08352, 2021	94	2021
Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech	D Harwath, WN Hsu, J Glass	Eighth International Conference on Learning Representations (ICLR), 2020	93	2020

Articles 1–20
