BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning

18 October 2018
Maxime Chevalier-Boisvert
Dzmitry Bahdanau
Salem Lahlou
Lucas Willems
Chitwan Saharia
Thien Huu Nguyen
Yoshua Bengio
    ELM

Papers citing "BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning"

50 / 165 papers shown
Title
LLM-BABYBENCH: Understanding and Evaluating Grounded Planning and Reasoning in LLMs
Omar Choukrani
Idriss Malek
Daniil Orel
Zhuohan Xie
Zangir Iklassov
Martin Takáč
Salem Lahlou
LLMAG
ELM
LRM
0
0
0
17 May 2025
CrafText Benchmark: Advancing Instruction Following in Complex Multimodal Open-Ended World
Zoya Volovikova
Gregory Gorbov
Petr Kuderov
Aleksandr I. Panov
Alexey Skrynnik
0
0
0
17 May 2025
Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models
Annie Wong
Thomas Bäck
Aske Plaat
Niki van Stein
Anna V. Kononova
ReLM
ELM
LRM
45
0
0
15 May 2025
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts
Jing-Cheng Pang
Kaiyuan Li
Yixuan Wang
Si-Hang Yang
Shengyi Jiang
Yang Yu
OffRL
LLMAG
LM&Ro
LRM
19
0
0
15 May 2025
Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems
Zaid Khan
Elias Stengel-Eskin
Archiki Prasad
Jaemin Cho
Joey Tianyi Zhou
31
0
0
14 Apr 2025
SocialJax: An Evaluation Suite for Multi-agent Reinforcement Learning in Sequential Social Dilemmas
Zihao Guo
Richard Willis
Tristan Tomilin
Joel Z Leibo
Yali Du
58
0
0
18 Mar 2025
ATLaS: Agent Tuning via Learning Critical Steps
Zhixun Chen
Ming Li
Y. Huang
Yali Du
Meng Fang
Dinesh Manocha
83
3
0
04 Mar 2025
Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning
Egor Cherepanov
Nikita Kachaev
A. Kovalev
Aleksandr I. Panov
OffRL
41
0
0
14 Feb 2025
GRASP: A Grid-Based Benchmark for Evaluating Commonsense Spatial Reasoning
Zhisheng Tang
Mayank Kejriwal
LRM
59
3
0
20 Jan 2025
CAREL: Instruction-guided reinforcement learning with cross-modal auxiliary objectives
Armin Saghafian
Amirmohammad Izadi
Negin Hashemi Dijujin
M. Baghshah
66
0
0
29 Nov 2024
Learning for Long-Horizon Planning via Neuro-Symbolic Abductive Imitation
Jie-Jing Shao
Hao-Ran Hao
Xiao-Wen Yang
Yu-Feng Li
79
2
0
27 Nov 2024
Identifying and Addressing Delusions for Target-Directed Decision-Making
Mingde Zhao
Tristan Sylvain
Doina Precup
Yoshua Bengio
37
0
0
09 Oct 2024
SEAL: SEmantic-Augmented Imitation Learning via Language Model
Chengyang Gu
Yuxin Pan
Haotian Bai
Hui Xiong
Yize Chen
27
0
0
03 Oct 2024
Learning to Ground Existentially Quantified Goals
Martin Funkquist
Simon Ståhlberg
Hector Geffner
21
0
0
30 Sep 2024
RAG-Modulo: Solving Sequential Tasks using Experience, Critics, and Language Models
Abhinav Jain
Chris Jermaine
Vaibhav Unhelkar
KELM
LLMAG
31
1
0
18 Sep 2024
EPO: Hierarchical LLM Agents with Environment Preference Optimization
Qi Zhao
Haotian Fu
Chen Sun
George Konidaris
39
8
0
28 Aug 2024
HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model
Mengkang Hu
Tianxing Chen
Qiguang Chen
Yao Mu
Wenqi Shao
Ping Luo
LM&Ro
LLMAG
RALM
29
4
0
18 Aug 2024
AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation
Mengkang Hu
Yixiao Wang
Can Xu
Lingfeng Sun
Chensheng Peng
T. Hannagan
Nicola Poerio
Saravan Rajmohan
LM&Ro
LLMAG
69
15
0
01 Aug 2024
LLM-Empowered State Representation for Reinforcement Learning
Boyuan Wang
Yun Qu
Yuhang Jiang
Jianzhun Shao
Chang-rui Liu
Wenming Yang
Xiangyang Ji
40
7
0
18 Jul 2024
ELCC: the Emergent Language Corpus Collection
Brendon Boldt
David R. Mortensen
35
0
0
04 Jul 2024
Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models
Fuxiang Zhang
Junyou Li
Yi-Chen Li
Zongzhang Zhang
Yang Yu
Deheng Ye
OffRL
KELM
47
1
0
04 Jul 2024
AI Agents That Matter
Sayash Kapoor
Benedikt Stroebl
Zachary S. Siegel
Nitya Nadgir
Arvind Narayanan
49
36
0
01 Jul 2024
Program Synthesis Benchmark for Visual Programming in XLogoOnline Environment
Chao Wen
Jacqueline Staub
Adish Singla
ELM
44
3
0
17 Jun 2024
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
Zhiheng Xi
Yiwen Ding
Wenxiang Chen
Boyang Hong
Honglin Guo
...
Qi Zhang
Xipeng Qiu
Xuanjing Huang
Zuxuan Wu
Yu-Gang Jiang
LLMAG
LM&Ro
38
29
0
06 Jun 2024
Policy Learning with a Language Bottleneck
Megha Srivastava
Cédric Colas
Dorsa Sadigh
Jacob Andreas
40
3
0
07 May 2024
Learning Generalized Policies for Fully Observable Non-Deterministic Planning Domains
Till Hofmann
Hector Geffner
OffRL
32
2
0
03 Apr 2024
A Survey on Large Language Model-Based Game Agents
Sihao Hu
Tiansheng Huang
Gaowen Liu
Ramana Rao Kompella
Selim Furkan Tekin
Yichang Xu
Zachary Yahn
Ling Liu
LLMAG
LM&Ro
AI4CE
LM&MA
71
51
0
02 Apr 2024
Sharing the Cost of Success: A Game for Evaluating and Learning Collaborative Multi-Agent Instruction Giving and Following Policies
P. Sadler
Sherzod Hakimov
David Schlangen
LLMAG
30
2
0
26 Mar 2024
Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning
Zhaoxun Ju
Chao Yang
Hongbo Wang
Yu Qiao
Gang Hua
LM&Ro
28
3
0
27 Feb 2024
Language-guided Skill Learning with Temporal Variational Inference
Haotian Fu
Pratyusha Sharma
Elias Stengel-Eskin
George Konidaris
Nicolas Le Roux
Marc-Alexandre Côté
Xingdi Yuan
38
7
0
26 Feb 2024
Vision-Language Navigation with Embodied Intelligence: A Survey
Peng Gao
Peng Wang
Feng Gao
Fei-Yue Wang
Ruyue Yuan
LM&Ro
40
2
0
22 Feb 2024
All Language Models Large and Small
Zhixun Chen
Yali Du
D. Mguni
24
0
0
19 Feb 2024
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent
Quentin Gallouedec
E. Beeching
Clément Romac
Emmanuel Dellandréa
21
11
0
15 Feb 2024
Towards Unified Alignment Between Agents, Humans, and Environment
Zonghan Yang
An Liu
Zijun Liu
Kai Liu
Fangzhou Xiong
...
Zhenhe Zhang
Fuwen Luo
Zhicheng Guo
Peng Li
Yang Liu
32
4
0
12 Feb 2024
Analyzing Adversarial Inputs in Deep Reinforcement Learning
Davide Corsi
Guy Amir
Guy Katz
Alessandro Farinelli
AAML
33
7
0
07 Feb 2024
Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game
P. Sadler
Sherzod Hakimov
David Schlangen
29
1
0
07 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
21
7
0
02 Feb 2024
True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning
Weihao Tan
Wentao Zhang
Shanqi Liu
Longtao Zheng
Xinrun Wang
Bo An
OffRL
44
17
0
25 Jan 2024
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents
Chang Ma
Junlei Zhang
Zhihao Zhu
Cheng Yang
Yujiu Yang
Yaohui Jin
Zhenzhong Lan
Lingpeng Kong
Junxian He
ELM
LLMAG
37
54
0
24 Jan 2024
Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning
Yiqi Wang
Wentao Chen
Xiaotian Han
Xudong Lin
Haiteng Zhao
Yongfei Liu
Bohan Zhai
Jianbo Yuan
Quanzeng You
Hongxia Yang
LRM
47
69
0
10 Jan 2024
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Filippos Christianos
Georgios Papoudakis
Matthieu Zimmer
Thomas Coste
Zhihao Wu
...
Yicheng Luo
Jianye Hao
Kun Shao
Haitham Bou-Ammar
Jun Wang
30
19
0
22 Dec 2023
Active Reinforcement Learning for Robust Building Control
Doseok Jang
Larry Yan
Lucas Spangher
C. Spanos
20
1
0
16 Dec 2023
LiFT: Unsupervised Reinforcement Learning with Foundation Models as Teachers
Taewook Nam
Juyong Lee
Jesse Zhang
Sung Ju Hwang
Joseph J. Lim
Karl Pertsch
OffRL
LRM
43
5
0
14 Dec 2023
Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis
Yafei Hu
Quanting Xie
Vidhi Jain
Jonathan M Francis
Jay Patrikar
...
Xiaolong Wang
Sebastian A. Scherer
Z. Kira
Fei Xia
Yonatan Bisk
LM&Ro
AI4CE
34
63
0
14 Dec 2023
diff History for Neural Language Agents
Ulyana Piterbarg
Lerrel Pinto
Rob Fergus
27
3
0
12 Dec 2023
No Prior Mask: Eliminate Redundant Action for Deep Reinforcement Learning
Dianyu Zhong
Yiqin Yang
Qianchuan Zhao
27
6
0
11 Dec 2023
Is Feedback All You Need? Leveraging Natural Language Feedback in Goal-Conditioned Reinforcement Learning
Sabrina McCallum
Max Taylor-Davies
Stefano V. Albrecht
Alessandro Suglia
21
1
0
07 Dec 2023
Robot Learning in the Era of Foundation Models: A Survey
Xuan Xiao
Jiahang Liu
Zhipeng Wang
Yanmin Zhou
Yong Qi
Qian Cheng
Bin He
Shuo Jiang
AI4CE
LM&Ro
26
27
0
24 Nov 2023
Emergence of Abstract State Representations in Embodied Sequence Modeling
Tian Yun
Zilai Zeng
Kunal Handa
Ashish V. Thapliyal
Bo Pang
Ellie Pavlick
Chen Sun
LM&Ro
35
7
0
03 Nov 2023
LLaMA Rider: Spurring Large Language Models to Explore the Open World
Yicheng Feng
Yuxuan Wang
Jiazheng Liu
Sipeng Zheng
Zongqing Lu
LLMAG
LRM
18
16
0
13 Oct 2023