ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.08272
  4. Cited By
BabyAI: A Platform to Study the Sample Efficiency of Grounded Language
  Learning

BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning

18 October 2018
Maxime Chevalier-Boisvert
Dzmitry Bahdanau
Salem Lahlou
Lucas Willems
Chitwan Saharia
Thien Huu Nguyen
Yoshua Bengio
    ELM
ArXivPDFHTML

Papers citing "BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning"

50 / 165 papers shown
Title
ELDEN: Exploration via Local Dependencies
ELDEN: Exploration via Local Dependencies
Jiaheng Hu
Zizhao Wang
Peter Stone
Roberto Martin-Martin
35
8
0
12 Oct 2023
Consciousness-Inspired Spatio-Temporal Abstractions for Better
  Generalization in Reinforcement Learning
Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning
Mingde Zhao
Safa Alver
H. V. Seijen
Romain Laroche
Doina Precup
Yoshua Bengio
15
3
0
30 Sep 2023
A Data Source for Reasoning Embodied Agents
A Data Source for Reasoning Embodied Agents
Jack Lanchantin
Sainbayar Sukhbaatar
Gabriel Synnaeve
Yuxuan Sun
Kavya Srinet
Arthur Szlam
LM&Ro
LRM
25
5
0
14 Sep 2023
Self-driven Grounding: Large Language Model Agents with Automatical
  Language-aligned Skill Learning
Self-driven Grounding: Large Language Model Agents with Automatical Language-aligned Skill Learning
Shaohui Peng
Xingui Hu
Qi Yi
Rui Zhang
Jiaming Guo
...
Rui Chen
Zidong Du
Qi Guo
Yunji Chen
Ling Li
LLMAG
LRM
LM&Ro
29
9
0
04 Sep 2023
Learning to Identify Critical States for Reinforcement Learning from
  Videos
Learning to Identify Critical States for Reinforcement Learning from Videos
Haozhe Liu
Mingchen Zhuge
Bing Li
Yu‐Han Wang
Francesco Faccio
Guohao Li
Jürgen Schmidhuber
OffRL
28
6
0
15 Aug 2023
Pre-Trained Large Language Models for Industrial Control
Pre-Trained Large Language Models for Industrial Control
Lei Song
Chuheng Zhang
Li Zhao
Jiang Bian
LM&Ro
AI4CE
32
12
0
06 Aug 2023
COLLIE: Systematic Construction of Constrained Text Generation Tasks
COLLIE: Systematic Construction of Constrained Text Generation Tasks
Shunyu Yao
Howard Chen
Austin W. Hanjie
Runzhe Yang
Karthik Narasimhan
47
32
0
17 Jul 2023
The SocialAI School: Insights from Developmental Psychology Towards
  Artificial Socio-Cultural Agents
The SocialAI School: Insights from Developmental Psychology Towards Artificial Socio-Cultural Agents
Grgur Kovač
Rémy Portelas
Peter Ford Dominey
Pierre-Yves Oudeyer
18
19
0
15 Jul 2023
Minigrid & Miniworld: Modular & Customizable Reinforcement Learning
  Environments for Goal-Oriented Tasks
Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks
Maxime Chevalier-Boisvert
Bolun Dai
Mark Towers
Rodrigo de Lazcano
Lucas Willems
Salem Lahlou
Suman Pal
Pablo Samuel Castro
Jordan Terry
VGen
13
182
0
24 Jun 2023
Deep Reinforcement Learning with Task-Adaptive Retrieval via
  Hypernetwork
Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork
Yonggang Jin
Chenxu Wang
Tianyu Zheng
Liuyu Xiang
Yao-Chun Yang
Junge Zhang
Jie Fu
Zhaofeng He
3DH
42
0
0
19 Jun 2023
Enabling Intelligent Interactions between an Agent and an LLM: A
  Reinforcement Learning Approach
Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach
Bin-Bin Hu
Chenyang Zhao
Pushi Zhang
Zihao Zhou
Yuanhang Yang
Zenglin Xu
Bin Liu
LM&Ro
LLMAG
25
21
0
06 Jun 2023
Improved Compositional Generalization by Generating Demonstrations for
  Meta-Learning
Improved Compositional Generalization by Generating Demonstrations for Meta-Learning
Sam Spilsbury
Alexander Ilin
48
1
0
22 May 2023
Yes, this Way! Learning to Ground Referring Expressions into Actions
  with Intra-episodic Feedback from Supportive Teachers
Yes, this Way! Learning to Ground Referring Expressions into Actions with Intra-episodic Feedback from Supportive Teachers
P. Sadler
Sherzod Hakimov
David Schlangen
41
1
0
22 May 2023
Can Agents Run Relay Race with Strangers? Generalization of RL to
  Out-of-Distribution Trajectories
Can Agents Run Relay Race with Strangers? Generalization of RL to Out-of-Distribution Trajectories
Li-Cheng Lan
Huan Zhang
Cho-Jui Hsieh
OODD
20
9
0
26 Apr 2023
Bridging RL Theory and Practice with the Effective Horizon
Bridging RL Theory and Practice with the Effective Horizon
Cassidy Laidlaw
Stuart J. Russell
Anca Dragan
OffRL
9
28
0
19 Apr 2023
Think Before You Act: Unified Policy for Interleaving Language Reasoning
  with Actions
Think Before You Act: Unified Policy for Interleaving Language Reasoning with Actions
Lina Mezghani
Piotr Bojanowski
Alahari Karteek
Sainbayar Sukhbaatar
LM&Ro
OffRL
LRM
21
8
0
18 Apr 2023
Foundation Models for Decision Making: Problems, Methods, and
  Opportunities
Foundation Models for Decision Making: Problems, Methods, and Opportunities
Sherry Yang
Ofir Nachum
Yilun Du
Jason W. Wei
Pieter Abbeel
Dale Schuurmans
LM&Ro
OffRL
LRM
AI4CE
98
156
0
07 Mar 2023
Read and Reap the Rewards: Learning to Play Atari with the Help of
  Instruction Manuals
Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals
Yue Wu
Yewen Fan
Paul Pu Liang
A. Azaria
Yuan-Fang Li
Tom Michael Mitchell
OffRL
26
47
0
09 Feb 2023
Grounding Large Language Models in Interactive Environments with Online
  Reinforcement Learning
Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning
Thomas Carta
Clément Romac
Thomas Wolf
Sylvain Lamprier
Olivier Sigaud
Pierre-Yves Oudeyer
LM&Ro
LLMAG
22
182
0
06 Feb 2023
Composing Task Knowledge with Modular Successor Feature Approximators
Composing Task Knowledge with Modular Successor Feature Approximators
Wilka Carvalho
Angelos Filos
Richard L. Lewis
Honglak Lee
Satinder Singh
17
7
0
28 Jan 2023
Open-World Multi-Task Control Through Goal-Aware Representation Learning
  and Adaptive Horizon Prediction
Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction
Shaofei Cai
Zihao Wang
Xiaojian Ma
Guy Van den Broeck
Yitao Liang
50
40
0
21 Jan 2023
"No, to the Right" -- Online Language Corrections for Robotic
  Manipulation via Shared Autonomy
"No, to the Right" -- Online Language Corrections for Robotic Manipulation via Shared Autonomy
Yuchen Cui
Siddharth Karamcheti
Raj Palleti
Nidhya Shivakumar
Percy Liang
Dorsa Sadigh
LM&Ro
40
76
0
06 Jan 2023
On Realization of Intelligent Decision-Making in the Real World: A
  Foundation Decision Model Perspective
On Realization of Intelligent Decision-Making in the Real World: A Foundation Decision Model Perspective
Ying Wen
Bo Liu
M. Zhou
Shufang Hou
Zhe Cao
Chenyang Le
Jingxiao Chen
Zheng Tian
Weinan Zhang
Jun Wang
AI4CE
23
10
0
24 Dec 2022
Hippocampus-Inspired Cognitive Architecture (HICA) for Operant
  Conditioning
Hippocampus-Inspired Cognitive Architecture (HICA) for Operant Conditioning
Deokgun Park
Md Ashaduzzaman Rubel Mondol
Sm Mazharul Islam
Aishwarya Pothula
20
0
0
16 Dec 2022
Language-Conditioned Reinforcement Learning to Solve Misunderstandings
  with Action Corrections
Language-Conditioned Reinforcement Learning to Solve Misunderstandings with Action Corrections
Frank Röder
Manfred Eppe
CLL
LRM
27
3
0
18 Nov 2022
Curriculum-based Asymmetric Multi-task Reinforcement Learning
Curriculum-based Asymmetric Multi-task Reinforcement Learning
H. Huang
Deheng Ye
Li Shen
Wei Liu
32
12
0
07 Nov 2022
lilGym: Natural Language Visual Reasoning with Reinforcement Learning
lilGym: Natural Language Visual Reasoning with Reinforcement Learning
Anne Wu
Kianté Brantley
Noriyuki Kojima
Yoav Artzi
ReLM
OffRL
LRM
27
3
0
03 Nov 2022
Discrete Factorial Representations as an Abstraction for Goal
  Conditioned Reinforcement Learning
Discrete Factorial Representations as an Abstraction for Goal Conditioned Reinforcement Learning
Riashat Islam
Hongyu Zang
Anirudh Goyal
Alex Lamb
Kenji Kawaguchi
Xin-hui Li
Romain Laroche
Yoshua Bengio
Rémi Tachet des Combes
OffRL
AI4CE
23
9
0
01 Nov 2022
Benchmarking Learning Efficiency in Deep Reservoir Computing
Benchmarking Learning Efficiency in Deep Reservoir Computing
Hugo Cisneros
Josef Sivic
Tomáš Mikolov
14
2
0
29 Sep 2022
Trust in Language Grounding: a new AI challenge for human-robot teams
Trust in Language Grounding: a new AI challenge for human-robot teams
David M. Bossens
C. Evers
36
1
0
05 Sep 2022
Language-Based Causal Representation Learning
Language-Based Causal Representation Learning
Blai Bonet
Hector Geffner
35
0
0
12 Jul 2022
Compositional Generalization in Grounded Language Learning via Induced
  Model Sparsity
Compositional Generalization in Grounded Language Learning via Induced Model Sparsity
Sam Spilsbury
Alexander Ilin
16
7
0
06 Jul 2022
ZeroC: A Neuro-Symbolic Model for Zero-shot Concept Recognition and
  Acquisition at Inference Time
ZeroC: A Neuro-Symbolic Model for Zero-shot Concept Recognition and Acquisition at Inference Time
Tailin Wu
Megan Tjandrasuwita
Zhengxuan Wu
Xuelin Yang
Kevin Liu
Rok Sosivc
J. Leskovec
16
22
0
30 Jun 2022
EAGER: Asking and Answering Questions for Automatic Reward Shaping in
  Language-guided RL
EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL
Thomas Carta
Pierre-Yves Oudeyer
Olivier Sigaud
Sylvain Lamprier
OffRL
28
24
0
20 Jun 2022
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale
  Knowledge
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Linxi Fan
Guanzhi Wang
Yunfan Jiang
Ajay Mandlekar
Yuncong Yang
Haoyi Zhu
Andrew Tang
De-An Huang
Yuke Zhu
Anima Anandkumar
LM&Ro
48
348
0
17 Jun 2022
Towards Understanding How Machines Can Learn Causal Overhypotheses
Towards Understanding How Machines Can Learn Causal Overhypotheses
Eliza Kosoy
David M. Chan
Adrian Liu
Jasmine Collins
Bryanna Kaufmann
Sandy Han Huang
Jessica B. Hamrick
John F. Canny
Nan Rosemary Ke
Alison Gopnik
CML
AI4CE
28
18
0
16 Jun 2022
Intra-agent speech permits zero-shot task acquisition
Intra-agent speech permits zero-shot task acquisition
Chen Yan
Federico Carnevale
Petko Georgiev
Adam Santoro
Aurelia Guy
Alistair Muldal
Chia-Chun Hung
Josh Abramson
Timothy Lillicrap
Greg Wayne
LM&Ro
36
9
0
07 Jun 2022
Hierarchies of Reward Machines
Hierarchies of Reward Machines
Daniel Furelos-Blanco
Mark Law
Anders Jonsson
Krysia Broda
A. Russo
24
8
0
31 May 2022
Learning to Query Internet Text for Informing Reinforcement Learning
  Agents
Learning to Query Internet Text for Informing Reinforcement Learning Agents
Kolby Nottingham
Alekhya Pyla
Sameer Singh
Roy Fox
RALM
11
3
0
25 May 2022
A Generalist Agent
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&Ro
LLMAG
AI4CE
59
787
0
12 May 2022
Asking for Knowledge: Training RL Agents to Query External Knowledge
  Using Language
Asking for Knowledge: Training RL Agents to Query External Knowledge Using Language
Iou-Jen Liu
Xingdi Yuan
Marc-Alexandre Côté
Pierre-Yves Oudeyer
A. Schwing
RALM
19
12
0
12 May 2022
Learning Generalized Policies Without Supervision Using GNNs
Learning Generalized Policies Without Supervision Using GNNs
Simon Ståhlberg
Blai Bonet
Hector Geffner
OffRL
26
27
0
12 May 2022
Counterfactual Explanations for Natural Language Interfaces
Counterfactual Explanations for Natural Language Interfaces
George Tolkachev
Stephen Mell
Steve Zdancewic
Osbert Bastani
LRM
AAML
16
4
0
27 Apr 2022
Grounding Hindsight Instructions in Multi-Goal Reinforcement Learning
  for Robotics
Grounding Hindsight Instructions in Multi-Goal Reinforcement Learning for Robotics
Frank Röder
Manfred Eppe
S. Wermter
29
7
0
08 Apr 2022
Contrastive language and vision learning of general fashion concepts
Contrastive language and vision learning of general fashion concepts
P. Chia
Giuseppe Attanasio
Federico Bianchi
Silvia Terragni
A. Magalhães
Diogo Gonçalves
C. Greco
Jacopo Tagliabue
CLIP
21
42
0
08 Apr 2022
Teachable Reinforcement Learning via Advice Distillation
Teachable Reinforcement Learning via Advice Distillation
Olivia Watkins
Trevor Darrell
Pieter Abbeel
Jacob Andreas
Abhishek Gupta
OffRL
14
3
0
19 Mar 2022
Zipfian environments for Reinforcement Learning
Zipfian environments for Reinforcement Learning
Stephanie C. Y. Chan
Andrew Kyle Lampinen
Pierre Harvey Richemond
Felix Hill
OffRL
13
15
0
15 Mar 2022
One-Shot Learning from a Demonstration with Hierarchical Latent Language
One-Shot Learning from a Demonstration with Hierarchical Latent Language
Nathaniel Weir
Xingdi Yuan
Marc-Alexandre Côté
Matthew J. Hausknecht
Romain Laroche
Ida Momennejad
H. V. Seijen
Benjamin Van Durme
BDL
21
6
0
09 Mar 2022
LISA: Learning Interpretable Skill Abstractions from Language
LISA: Learning Interpretable Skill Abstractions from Language
Divyansh Garg
Skanda Vaidyanath
Kuno Kim
Jiaming Song
Stefano Ermon
LM&Ro
OffRL
153
29
0
28 Feb 2022
Learning Causal Overhypotheses through Exploration in Children and
  Computational Models
Learning Causal Overhypotheses through Exploration in Children and Computational Models
Eliza Kosoy
Adrian Liu
Jasmine Collins
David M. Chan
Jessica B. Hamrick
Nan Rosemary Ke
Sandy H Huang
Bryanna Kaufmann
John F. Canny
Alison Gopnik
CML
22
9
0
21 Feb 2022
Previous
1234
Next