Title |
---|
![]() LLM-BABYBENCH: Understanding and Evaluating Grounded Planning and Reasoning in LLMs Omar Choukrani Idriss Malek Daniil Orel Zhuohan Xie Zangir Iklassov Martin Takáč Salem Lahlou |
![]() CrafText Benchmark: Advancing Instruction Following in Complex Multimodal Open-Ended World Zoya Volovikova Gregory Gorbov Petr Kuderov Aleksandr I. Panov Alexey Skrynnik |
![]() AgentGym: Evolving Large Language Model-based Agents across Diverse
Environments Zhiheng Xi Yiwen Ding Wenxiang Chen Boyang Hong Honglin Guo ...Qi Zhang Xipeng Qiu Xuanjing Huang Zuxuan Wu Yu-Gang Jiang |
![]() Language-guided Skill Learning with Temporal Variational Inference Haotian Fu Pratyusha Sharma Elias Stengel-Eskin George Konidaris Nicolas Le Roux Marc-Alexandre Côté Xingdi Yuan |