Title |
---|
![]() Offline Actor-Critic Reinforcement Learning Scales to Large Models Jost Tobias Springenberg A. Abdolmaleki Jingwei Zhang Oliver Groth Michael Bloesch ...Sarah Bechtle Steven Kapturowski Roland Hafner N. Heess Martin Riedmiller |
![]() Learning adaptive planning representations with natural language
guidance L. Wong Jiayuan Mao Pratyusha Sharma Zachary S. Siegel Jiahai Feng Noa Korneev Joshua B. Tenenbaum Jacob Andreas |