Seguir
Marvin Zhang
Marvin Zhang
Dirección de correo verificada de eecs.berkeley.edu - Página principal
Título
Citado por
Citado por
Año
GPT-4 technical report
OpenAI
arXiv, 2023
7251*2023
Wilds: A benchmark of in-the-wild distribution shifts
PW Koh, S Sagawa, H Marklund, SM Xie, M Zhang, A Balsubramani, ...
International conference on machine learning, 5637-5664, 2021
14612021
When to trust your model: Model-based policy optimization
M Janner, J Fu, M Zhang, S Levine
Advances in Neural Information Processing Systems (NeurIPS), 2019
10482019
Adaptive risk minimization: Learning to adapt to domain shift
M Zhang, H Marklund, N Dhawan, A Gupta, S Levine, C Finn
Advances in Neural Information Processing Systems 34, 23664-23678, 2021
316*2021
Solar: Deep structured representations for model-based reinforcement learning
M Zhang, S Vikram, L Smith, P Abbeel, M Johnson, S Levine
International conference on machine learning, 7444-7453, 2019
3102019
Memo: Test time robustness via adaptation and augmentation
M Zhang, S Levine, C Finn
Advances in neural information processing systems 35, 38629-38642, 2022
2792022
Combining model-based and model-free updates for trajectory-centric reinforcement learning
Y Chebotar, K Hausman, M Zhang, G Sukhatme, S Schaal, S Levine
International conference on machine learning, 703-711, 2017
2192017
Avid: Learning multi-stage tasks via pixel-level translation of human videos
L Smith, N Dhawan, M Zhang, P Abbeel, S Levine
Robotics: Science and Systems (RSS), 2019
1592019
Deep reinforcement learning for tensegrity robot locomotion
M Zhang, X Geng, J Bruce, K Caluwaerts, M Vespignani, V SunSpiral, ...
2017 IEEE international conference on robotics and automation (ICRA), 634-641, 2017
144*2017
Learning deep neural network policies with continuous memory states
M Zhang, Z McCarthy, C Finn, S Levine, P Abbeel
2016 IEEE international conference on robotics and automation (ICRA), 520-527, 2016
1022016
Guided policy search code implementation, 2016
C Finn, M Zhang, J Fu, X Tan, Z McCarthy, E Scharff, S Levine
Software available from rll. berkeley. edu/gps, 2016
282016
Gpt-4o system card
A Hurst, A Lerer, AP Goucher, A Perelman, A Ramesh, A Clark, AJ Ostrow, ...
arXiv preprint arXiv:2410.21276, 2024
192024
Adaptation Based Approaches to Distribution Shift Problems
MM Zhang
University of California, Berkeley, 2021
22021
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–13