Episodic curiosity through reachability N Savinov, A Raichuk, R Marinier, D Vincent, M Pollefeys, T Lillicrap, ... arXiv preprint arXiv:1810.02274, 2018 | 221 | 2018 |
Towards accurate generative models of video: A new metric & challenges T Unterthiner, S Van Steenkiste, K Kurach, R Marinier, M Michalski, ... arXiv preprint arXiv:1812.01717, 2018 | 166 | 2018 |
What matters in on-policy reinforcement learning? a large-scale empirical study M Andrychowicz, A Raichuk, P Stańczyk, M Orsini, S Girgin, R Marinier, ... arXiv preprint arXiv:2006.05990, 2020 | 104 | 2020 |
Seed rl: Scalable and efficient deep-rl with accelerated central inference L Espeholt, R Marinier, P Stanczyk, K Wang, M Michalski arXiv preprint arXiv:1910.06591, 2019 | 97 | 2019 |
What matters for on-policy deep actor-critic methods? a large-scale study M Andrychowicz, A Raichuk, P Stańczyk, M Orsini, S Girgin, R Marinier, ... International conference on learning representations, 2021 | 65 | 2021 |
Implicit factoring with shared most significant and middle bits JC Faugère, R Marinier, G Renault Public Key Cryptography–PKC 2010: 13th International Conference on Practice …, 2010 | 33 | 2010 |
FVD: A new metric for video generation T Unterthiner, S van Steenkiste, K Kurach, R Marinier, M Michalski, ... | 24 | 2019 |
Self-attentional credit assignment for transfer in reinforcement learning J Ferret, R Marinier, M Geist, O Pietquin arXiv preprint arXiv:1907.08027, 2019 | 23 | 2019 |
Audiolm: a language modeling approach to audio generation Z Borsos, R Marinier, D Vincent, E Kharitonov, O Pietquin, M Sharifi, ... arXiv preprint arXiv:2209.03143, 2022 | 22 | 2022 |
What matters in on-policy reinforcement learning M Andrychowicz, A Raichuk, P Stanczyk, M Orsini, S Girgin, R Marinier, ... A large-scale empirical study. CoRR, abs/2006.05990, 2020 | 18 | 2020 |
Solving N-player dynamic routing games with congestion: a mean field approach T Cabannes, M Lauriere, J Perolat, R Marinier, S Girgin, S Perrin, ... arXiv preprint arXiv:2110.11943, 2021 | 10 | 2021 |
Hyperparameter selection for imitation learning L Hussenot, M Andrychowicz, D Vincent, R Dadashi, A Raichuk, S Ramos, ... International Conference on Machine Learning, 4511-4522, 2021 | 10 | 2021 |
Learning Equilibria in Mean-Field Games: Introducing Mean-Field PSRO P Muller, M Rowland, R Elie, G Piliouras, J Perolat, M Lauriere, R Marinier, ... arXiv preprint arXiv:2111.08350, 2021 | 5 | 2021 |
Rlds: an ecosystem to generate, share and use datasets in reinforcement learning S Ramos, S Girgin, L Hussenot, D Vincent, H Yakubovich, D Toyama, ... arXiv preprint arXiv:2111.02767, 2021 | 5 | 2021 |
Credit assignment as a proxy for transfer in reinforcement learning J Ferret, R Marinier, M Geist, O Pietquin arXiv preprint arXiv:1907.08027, 2019 | 5 | 2019 |
Methods, systems, and media for presenting content organized by category A Pak, F Raimundo, S Girgin, R Marinier, V Simonet US Patent 11,036,743, 2021 | 4 | 2021 |
Cryptanalysis of the Improved Cellular Message Encryption Algorithm T Chardin, R Marinier Cryptology ePrint Archive, 2008 | 2 | 2008 |
An Adaptive Chosen-plaintext Attack of the Improved Cellular Message Encryption Algorithm. T Chardin, R Marinier Int. J. Netw. Secur. 9 (2), 173-179, 2009 | 1 | 2009 |
Reinforcement learning with centralized inference and training L Espeholt, K Wang, MM Michalski, PM Stanczyk, R Marinier US Patent App. 17/764,066, 2022 | | 2022 |
Solving N player dynamic routing games with congestion: a mean field approach A Bayen, E Goubault, J Perolat, ML Lauriere, O Pietquin, R Marinier, ... | | 2022 |