Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 2354 | 2023 |
Measuring compositional generalization: A comprehensive method on realistic data D Keysers, N Schärli, N Scales, H Buisman, D Furrer, S Kashubin, ... arXiv preprint arXiv:1912.09713, 2019 | 384 | 2019 |
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020 | 270 | 2020 |
Gemma 2: Improving open language models at a practical size G Team, M Riviere, S Pathak, PG Sessa, C Hardin, S Bhupatiraju, ... arXiv preprint arXiv:2408.00118, 2024 | 256 | 2024 |
Nash learning from human feedback R Munos, M Valko, D Calandriello, MG Azar, M Rowland, ZD Guo, Y Tang, ... arXiv preprint arXiv:2312.00886, 2023 | 86 | 2023 |
Factually consistent summarization via reinforcement learning with textual entailment feedback P Roit, J Ferret, L Shani, R Aharoni, G Cideron, R Dadashi, M Geist, ... arXiv preprint arXiv:2306.00186, 2023 | 73 | 2023 |
Hyperparameter selection for imitation learning L Hussenot, M Andrychowicz, D Vincent, R Dadashi, A Raichuk, S Ramos, ... International Conference on Machine Learning, 4511-4522, 2021 | 21 | 2021 |
Bond: Aligning llms with best-of-n distillation PG Sessa, R Dadashi, L Hussenot, J Ferret, N Vieillard, A Ramé, ... arXiv preprint arXiv:2407.14622, 2024 | 18 | 2024 |
*-CFQ: Analyzing the Scalability of Machine Learning on a Compositional Task D Tsarkov, T Tihon, N Scales, N Momchev, D Sinopalnikov, N Schärli Proceedings of the AAAI Conference on Artificial Intelligence 35 (11), 9949-9957, 2021 | 16 | 2021 |
Rlds: an ecosystem to generate, share and use datasets in reinforcement learning S Ramos, S Girgin, L Hussenot, D Vincent, H Yakubovich, D Toyama, ... arXiv preprint arXiv:2111.02767, 2021 | 14 | 2021 |
Imitating language via scalable inverse reinforcement learning M Wulfmeier, M Bloesch, N Vieillard, A Ahuja, J Bornschein, S Huang, ... The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024 | | 2024 |