MUCS 2021: Multilingual and code-switching ASR challenges for low resource Indian languages A Diwan, R Vaideeswaran, S Shah, A Singh, S Raghavan, S Khare, ... Proc. Interspeech 2021, 2446-2450, 2021 | 97* | 2021 |
Why is winoground hard? investigating failures in visuolinguistic compositionality A Diwan, L Berry, E Choi, D Harwath, K Mahowald arXiv preprint arXiv:2211.00768, 2022 | 53 | 2022 |
Low Resource ASR: The Surprising Effectiveness of High Resource Transliteration. S Khare*, AR Mittal*, A Diwan*, S Sarawagi, P Jyothi, S Bharadwaj Interspeech, 1529-1533, 2021 | 45 | 2021 |
Continual learning for on-device speech recognition using disentangled conformers A Diwan, CF Yeh, WN Hsu, P Tomasello, E Choi, D Harwath, A Mohamed ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 9 | 2023 |
Reduce and Reconstruct: ASR for Low-Resource Phonetic Languages A Diwan, P Jyothi Proc. Interspeech 2021, 3445-3449, 2021 | 9* | 2021 |
Textless Speech-to-Speech Translation With Limited Parallel Data A Diwan, A Srinivasan, D Harwath, E Choi Findings of the Association for Computational Linguistics: EMNLP 2024, 16208 …, 2024 | 6* | 2024 |
Zero-shot Video Moment Retrieval With Off-the-Shelf Models A Diwan, P Peng, R Mooney Transfer Learning for Natural Language Processing Workshop, 10-21, 2023 | 6 | 2023 |
Dynamic-superb phase-2: A collaboratively expanding benchmark for measuring the capabilities of spoken language models with 180 tasks C Huang, WC Chen, S Yang, AT Liu, CA Li, YX Lin, WC Tseng, A Diwan, ... arXiv preprint arXiv:2411.05361, 2024 | 3 | 2024 |
When to Use Efficient Self Attention? Profiling Text, Speech and Image Transformer Variants A Diwan, E Choi, D Harwath arXiv preprint arXiv:2306.08667, 2023 | 1 | 2023 |
Multilingual, Code-switching and Low-Resource NLP and ASR AJ Diwan Indian Institute OF Technology Bombay, 2021 | 1 | 2021 |
Scheduling and control of executable jobs over compute instances S Mitra, S Choudhary, S Garg, AJ Diwan, PK Maurya, A Aggarwal, P Jain US Patent 12,014,217, 2024 | | 2024 |
Modeling Abstract Style Prompts for Text-to-Speech Models A Diwan, Z Zheng, D Harwath, E Choi | | |