Sebastian Jaszczur
Verified email at uw.edu.pl
Title · Cited by · Year
Sparse is Enough in Scaling Transformers
S Jaszczur, A Chowdhery, A Mohiuddin, L Kaiser, W Gajewski, ...
Advances in Neural Information Processing Systems 34, 9895-9907, 2021
Cited by 71 · 2021
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
M Pióro, K Ciebiera, K Król, J Ludziejewski, S Jaszczur
arXiv preprint arXiv:2401.04081, 2024
Cited by 14 · 2024
Neural heuristics for SAT solving
S Jaszczur, M Łuszczyk, H Michalewski
arXiv preprint arXiv:2005.13406, 2020
Cited by 11 · 2020
Use of domain knowledge and feature engineering in helping AI to play Hearthstone
P Przybyszewski, S Dziewiątkowski, S Jaszczur, M Śmiech, M Szczuka
2017 Federated Conference on Computer Science and Information Systems …, 2017
Cited by 6 · 2017
Scaling Laws for Fine-Grained Mixture of Experts
J Krajewski, J Ludziejewski, K Adamczewski, M Pióro, M Krutul, ...
arXiv preprint arXiv:2402.07871, 2024
Cited by 1 · 2024
Structured Packing in LLM Training Improves Long Context Utilization
K Staniszewski, S Tworkowski, S Jaszczur, H Michalewski, Ł Kuciński, ...
arXiv preprint arXiv:2312.17296, 2023
2023
Mixture of Tokens: Efficient LLMs through Cross-Example Aggregation
S Antoniak, S Jaszczur, M Krutul, M Pióro, J Krajewski, J Ludziejewski, ...
arXiv preprint arXiv:2310.15961, 2023
2023
Sparse attention neural networks
A Chowdhery, A Mohiuddin, H Michalewski, JM Kanerva, LM Kaiser, ...
US Patent App. 17/666,400, 2022
2022
Articles 1–8