Title |
---|
![]() Offline Actor-Critic Reinforcement Learning Scales to Large Models Jost Tobias Springenberg A. Abdolmaleki Jingwei Zhang Oliver Groth Michael Bloesch ...Sarah Bechtle Steven Kapturowski Roland Hafner N. Heess Martin Riedmiller |
![]() SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex
Interactive Tasks Bill Yuchen Lin Yicheng Fu Karina Yang Faeze Brahman Shiyu Huang Chandra Bhagavatula Prithviraj Ammanabrolu Yejin Choi Xiang Ren |