Anuj Diwan
Title
Cited by
Year
MUCS 2021: Multilingual and code-switching ASR challenges for low resource Indian languages
A Diwan, R Vaideeswaran, S Shah, A Singh, S Raghavan, S Khare, ...
Proc. Interspeech 2021, 2446-2450, 2021
97*
2021
Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality
A Diwan, L Berry, E Choi, D Harwath, K Mahowald
arXiv preprint arXiv:2211.00768, 2022
53
2022
Low Resource ASR: The Surprising Effectiveness of High Resource Transliteration
S Khare*, AR Mittal*, A Diwan*, S Sarawagi, P Jyothi, S Bharadwaj
Interspeech, 1529-1533, 2021
45
2021
Continual learning for on-device speech recognition using disentangled conformers
A Diwan, CF Yeh, WN Hsu, P Tomasello, E Choi, D Harwath, A Mohamed
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
9
2023
Reduce and Reconstruct: ASR for Low-Resource Phonetic Languages
A Diwan, P Jyothi
Proc. Interspeech 2021, 3445-3449, 2021
9*
2021
Textless Speech-to-Speech Translation With Limited Parallel Data
A Diwan, A Srinivasan, D Harwath, E Choi
Findings of the Association for Computational Linguistics: EMNLP 2024, 16208 …, 2024
6*
2024
Zero-shot Video Moment Retrieval With Off-the-Shelf Models
A Diwan, P Peng, R Mooney
Transfer Learning for Natural Language Processing Workshop, 10-21, 2023
6
2023
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
C Huang, WC Chen, S Yang, AT Liu, CA Li, YX Lin, WC Tseng, A Diwan, ...
arXiv preprint arXiv:2411.05361, 2024
3
2024
When to Use Efficient Self Attention? Profiling Text, Speech and Image Transformer Variants
A Diwan, E Choi, D Harwath
arXiv preprint arXiv:2306.08667, 2023
1
2023
Multilingual, Code-switching and Low-Resource NLP and ASR
AJ Diwan
Indian Institute of Technology Bombay, 2021
1
2021
Scheduling and control of executable jobs over compute instances
S Mitra, S Choudhary, S Garg, AJ Diwan, PK Maurya, A Aggarwal, P Jain
US Patent 12,014,217, 2024
2024
Modeling Abstract Style Prompts for Text-to-Speech Models
A Diwan, Z Zheng, D Harwath, E Choi