2112.03763
Cited By
Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning
7 December 2021
DeepMind Interactive Agents Team
Josh Abramson
Arun Ahuja
Arthur Brussee
Federico Carnevale
Mary Cassin
Felix Fischer
Petko Georgiev
Alex Goldin
Mansi Gupta
Tim Harley
Felix Hill
Peter C. Humphreys
Alden Hung
Jessica Landon
Timothy Lillicrap
Hamza Merzic
Alistair Muldal
Adam Santoro
Guy Scully
Tamara von Glehn
Greg Wayne
Nathaniel Wong
Chen Yan
Rui Zhu
LM&Ro
Papers citing
"Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning"
36 / 36 papers shown
Future Research Avenues for Artificial Intelligence in Digital Gaming: An Exploratory Report
Markus Dablander
77
0
0
18 Dec 2024
IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents
Shrestha Mohanty
Negar Arabzadeh
Andrea Tupini
Yuxuan Sun
Alexey Skrynnik
Artem Zholus
Marc-Alexandre Côté
Julia Kiseleva
35
0
0
12 Jul 2024
LEGENT: Open Platform for Embodied Agents
Zhili Cheng
Zhitong Wang
Jinyi Hu
Shengding Hu
An Liu
Yuge Tu
Pengkai Li
Lei Shi
Zhiyuan Liu
Maosong Sun
VLM
30
6
0
28 Apr 2024
Scaling Instructable Agents Across Many Simulated Worlds
SIMA Team
Maria Abi Raad
Arun Ahuja
Catarina Barros
F. Besse
...
Daan Wierstra
Duncan Williams
Nathaniel Wong
Sarah York
Nick Young
LM&Ro
115
38
0
13 Mar 2024
Behavioural Cloning in VizDoom
Ryan Spick
Timothy Bradley
Ayush Raina
P. Amadori
Guy Moss
LM&Ro
24
0
0
08 Jan 2024
Vision-Language Models as a Source of Rewards
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
Harris Chan
Gheorghe Comanici
...
Yannick Schroecker
Stephen Spencer
Richie Steigerwald
Luyu Wang
Lei Zhang
VLM
LRM
42
26
0
14 Dec 2023
How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs
Haoqin Tu
Chenhang Cui
Zijun Wang
Yiyang Zhou
Bingchen Zhao
Junlin Han
Wangchunshu Zhou
Huaxiu Yao
Cihang Xie
MLLM
60
71
0
27 Nov 2023
Long-Horizon Dialogue Understanding for Role Identification in the Game of Avalon with Large Language Models
Simon Stepputtis
Joseph Campbell
Yaqi Xie
Zhengyang Qi
W. Zhang
Ruiyi Wang
Sanketh Rangreji
Michael Lewis
Katia P. Sycara
LLMAG
24
8
0
09 Nov 2023
Large Language Models as Generalizable Policies for Embodied Tasks
Andrew Szot
Max Schwarzer
Harsh Agrawal
Bogdan Mazoure
Walter A. Talbott
Katherine Metcalf
Natalie Mackraz
Devon Hjelm
Alexander Toshev
LM&Ro
31
58
0
26 Oct 2023
The SocialAI School: Insights from Developmental Psychology Towards Artificial Socio-Cultural Agents
Grgur Kovač
Rémy Portelas
Peter Ford Dominey
Pierre-Yves Oudeyer
13
19
0
15 Jul 2023
Towards Language-Based Modulation of Assistive Robots through Multimodal Models
Philipp Wicke
Lütfi Kerem Şenel
Shengqiang Zhang
Luis F. C. Figueredo
Abdeldjallil Naceri
Sami Haddadin
Hinrich Schütze
20
2
0
26 Jun 2023
Large Sequence Models for Sequential Decision-Making: A Survey
Muning Wen
Runji Lin
Hanjing Wang
Yaodong Yang
Ying Wen
Luo Mai
Jun Wang
Haifeng Zhang
Weinan Zhang
LM&Ro
LRM
37
35
0
24 Jun 2023
Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control
Longtao Zheng
R. Wang
Xinrun Wang
Bo An
LLMAG
24
57
0
13 Jun 2023
Voyager: An Open-Ended Embodied Agent with Large Language Models
Guanzhi Wang
Yuqi Xie
Yunfan Jiang
Ajay Mandlekar
Chaowei Xiao
Yuke Zhu
Linxi Fan
Anima Anandkumar
LM&Ro
SyDa
46
755
0
25 May 2023
Multimodal Web Navigation with Instruction-Finetuned Foundation Models
Hiroki Furuta
Kuang-Huei Lee
Ofir Nachum
Yutaka Matsuo
Aleksandra Faust
S. Gu
Izzeddin Gur
LM&Ro
36
92
0
19 May 2023
Contrastive Language, Action, and State Pre-training for Robot Learning
Krishan Rana
Andrew Melnik
Niko Sünderhauf
20
12
0
21 Apr 2023
Vision-Language Models as Success Detectors
Yuqing Du
Ksenia Konyushkova
Misha Denil
A. Raju
Jessica Landon
Felix Hill
Nando de Freitas
Serkan Cabi
MLLM
LRM
89
77
0
13 Mar 2023
Foundation Models for Decision Making: Problems, Methods, and Opportunities
Sherry Yang
Ofir Nachum
Yilun Du
Jason W. Wei
Pieter Abbeel
Dale Schuurmans
LM&Ro
OffRL
LRM
AI4CE
90
155
0
07 Mar 2023
Could a Large Language Model be Conscious?
D. Chalmers
LRM
AI4CE
ELM
16
84
0
04 Mar 2023
Distilling Internet-Scale Vision-Language Models into Embodied Agents
T. Sumers
Kenneth Marino
Arun Ahuja
Rob Fergus
Ishita Dasgupta
LM&Ro
28
24
0
29 Jan 2023
A Survey on Transformers in Reinforcement Learning
Wenzhe Li
Hao Luo
Zichuan Lin
Chongjie Zhang
Zongqing Lu
Deheng Ye
OffRL
MU
AI4CE
37
55
0
08 Jan 2023
User-Conditioned Neural Control Policies for Mobile Robotics
L. Bauersfeld
Elia Kaufmann
Davide Scaramuzza
23
6
0
22 Nov 2022
Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback
Josh Abramson
Arun Ahuja
Federico Carnevale
Petko Georgiev
Alex Goldin
...
Tamara von Glehn
Greg Wayne
Nathaniel Wong
Chen Yan
Rui Zhu
33
27
0
21 Nov 2022
Optimal Behavior Prior: Data-Efficient Human Models for Improved Human-AI Collaboration
Mesut Yang
Micah Carroll
Anca Dragan
32
13
0
03 Nov 2022
Instruction-Following Agents with Multimodal Transformer
Hao Liu
Lisa Lee
Kimin Lee
Pieter Abbeel
LM&Ro
32
10
0
24 Oct 2022
Interactive Language: Talking to Robots in Real Time
Corey Lynch
Ayzaan Wahid
Jonathan Tompson
Tianli Ding
James Betker
Robert Baruch
Travis Armstrong
Peter R. Florence
LM&Ro
35
214
0
12 Oct 2022
Grounding Language with Visual Affordances over Unstructured Data
Oier Mees
Jessica Borja-Diaz
Wolfram Burgard
LM&Ro
121
108
0
04 Oct 2022
Instruction-driven history-aware policies for robotic manipulations
Pierre-Louis Guhur
Shizhe Chen
Ricardo Garcia Pinel
Makarand Tapaswi
Ivan Laptev
Cordelia Schmid
LM&Ro
110
102
0
11 Sep 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
34
285
0
23 Jun 2022
Intra-agent speech permits zero-shot task acquisition
Chen Yan
Federico Carnevale
Petko Georgiev
Adam Santoro
Aurelia Guy
Alistair Muldal
Chia-Chun Hung
Josh Abramson
Timothy Lillicrap
Greg Wayne
LM&Ro
36
9
0
07 Jun 2022
Evaluating Multimodal Interactive Agents
Josh Abramson
Arun Ahuja
Federico Carnevale
Petko Georgiev
Alex Goldin
...
Adam Santoro
Tamara von Glehn
Greg Wayne
Nathaniel Wong
Chen Yan
23
3
0
26 May 2022
What Matters in Language Conditioned Robotic Imitation Learning over Unstructured Data
Oier Mees
Lukás Hermann
Wolfram Burgard
LM&Ro
30
149
0
13 Apr 2022
Semantic Exploration from Language Abstractions and Pretrained Representations
Allison C. Tam
Neil C. Rabinowitz
Andrew Kyle Lampinen
Nicholas A. Roy
Stephanie C. Y. Chan
D. Strouse
Jane X. Wang
Andrea Banino
Felix Hill
LM&Ro
33
67
0
08 Apr 2022
Mind the gap: Challenges of deep learning approaches to Theory of Mind
Jaan Aru
Aqeel Labash
Oriol Corcoll
Raul Vicente
22
26
0
30 Mar 2022
A data-driven approach for learning to control computers
Peter C. Humphreys
David Raposo
Tobias Pohlen
Gregory Thornton
Rachita Chhaparia
...
Josh Abramson
Petko Georgiev
Alex Goldin
Adam Santoro
Timothy Lillicrap
25
97
0
16 Feb 2022
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
246
4,489
0
23 Jan 2020