SlimPajama: A 627B token cleaned and deduplicated version of RedPajama D Soboleva, F Al-Khateeb, R Myers, JR Steeves, J Hestness, N Dey June, 2023 | 30 | 2023 |
Slimpajama-dc: Understanding data combinations for llm training Z Shen, T Tao, L Ma, W Neiswanger, J Hestness, N Vassilieva, ... arXiv preprint arXiv:2309.10818, 2023 | 11 | 2023 |
Replacing human audio with synthetic audio for on-device unspoken punctuation prediction D Soboleva, O Skopek, M Šajgalík, V Cărbune, F Weissenberger, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 10 | 2021 |
Three-stage question answering system with sentence ranking D Soboleva, K Vorontsov EPiC Series in Language and Linguistics 4, 18-25, 2019 | 4 | 2019 |
Btlm-3b-8k: 7b parameter performance in a 3b parameter model N Dey, D Soboleva, F Al-Khateeb, B Yang, R Pathria, H Khachane, ... arXiv preprint arXiv:2309.11568, 2023 | 2 | 2023 |
Position Interpolation Improves ALiBi Extrapolation F Al-Khateeb, N Dey, D Soboleva, J Hestness arXiv preprint arXiv:2310.13017, 2023 | 1 | 2023 |
MULTI-PHASE TRAINING OF MACHINE LEARNING MODELS FOR SEARCH RANKING A Boymel, S Daria US Patent App. 18/074,432, 2023 | 1 | 2023 |
REPLACING HUMAN-RECORDED AUDIO WITH SYNTHETIC AUDIOFOR ON-DEVICE UNSPOKEN PUNCTUATION PREDICTION B Miklos, D Valcarce, D Soboleva, F Weissenberger, J Proskurnia, J Lu, ... | | 2021 |
Multi-Task Transformer Networks for Search Relevance Prediction and Ranking D Soboleva, A Boymel, A Gotmanov, M Ryabinin | | |