Learning to adapt in dynamic, real-world environments through meta-reinforcement learning A Nagabandi, I Clavera, S Liu, RS Fearing, P Abbeel, S Levine, C Finn arXiv preprint arXiv:1803.11347, 2018 | 735* | 2018 |
Model-ensemble trust-region policy optimization T Kurutach, I Clavera, Y Duan, A Tamar, P Abbeel arXiv preprint arXiv:1802.10592, 2018 | 542 | 2018 |
Benchmarking model-based reinforcement learning T Wang, X Bao, I Clavera, J Hoang, Y Wen, E Langlois, S Zhang, G Zhang, ... arXiv preprint arXiv:1907.02057, 2019 | 454 | 2019 |
Model-based reinforcement learning via meta-policy optimization I Clavera, J Rothfuss, J Schulman, Y Fujita, T Asfour, P Abbeel Conference on Robot Learning, 617-629, 2018 | 294 | 2018 |
Promp: Proximal meta-policy search J Rothfuss, D Lee, I Clavera, T Asfour, P Abbeel arXiv preprint arXiv:1810.06784, 2018 | 234 | 2018 |
Model-augmented actor-critic: Backpropagating through paths I Clavera, V Fu, P Abbeel arXiv preprint arXiv:2005.08068, 2020 | 97 | 2020 |
Sub-policy adaptation for hierarchical reinforcement learning AC Li, C Florensa, I Clavera, P Abbeel arXiv preprint arXiv:1906.05862, 2019 | 95 | 2019 |
Policy transfer via modularity and reward guiding I Clavera, D Held, P Abbeel 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2017 | 47 | 2017 |
Trajectory-wise multiple choice learning for dynamics generalization in reinforcement learning Y Seo, K Lee, I Clavera Gilaberte, T Kurutach, J Shin, P Abbeel Advances in Neural Information Processing Systems 33, 12968-12979, 2020 | 40 | 2020 |
Asynchronous methods for model-based reinforcement learning Y Zhang, I Clavera, B Tsai, P Abbeel arXiv preprint arXiv:1910.12453, 2019 | 31 | 2019 |
Mutual information maximization for robust plannable representations Y Ding, I Clavera, P Abbeel arXiv preprint arXiv:2005.08114, 2020 | 11 | 2020 |
Policy transfer via modularity I Clavera, P Abbeel IROS. IEEE, 2017 | 10 | 2017 |
Policy Transfer Via Modularity IC Gilaberte Universitat Politècnica de Catalunya. Facultat de Matemàtiques i Estadística, 2017 | | 2017 |
Towards SLAM with an events-based camera I Clavera Gilaberte, J Solà Ortega, J Andrade-Cetto | | 2016 |
R-LAtte: Attention Module for Visual Control via Reinforcement Learning M Zhao, Q Li, A Srinivas, I Clavera, K Lee, P Abbeel | | |