ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.00539
  4. Cited By
Dungeons and Data: A Large-Scale NetHack Dataset

Dungeons and Data: A Large-Scale NetHack Dataset

1 November 2022
Eric Hambro
Roberta Raileanu
Dan Rothermel
Vegard Mella
Tim Rocktaschel
Heinrich Küttler
Naila Murray
    OffRL
ArXivPDFHTML

Papers citing "Dungeons and Data: A Large-Scale NetHack Dataset"

44 / 44 papers shown
Title
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
Davide Paglieri
Bartłomiej Cupiał
Samuel Coward
Ulyana Piterbarg
Maciej Wolczyk
...
Lerrel Pinto
Rob Fergus
Jakob Foerster
Jack Parker-Holder
Tim Rocktaschel
LLMAG
LRM
181
19
0
20 Nov 2024
You Can't Count on Luck: Why Decision Transformers and RvS Fail in
  Stochastic Environments
You Can't Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments
Keiran Paster
Sheila A. McIlraith
Jimmy Ba
OffRL
216
28
0
31 May 2022
Insights From the NeurIPS 2021 NetHack Challenge
Insights From the NeurIPS 2021 NetHack Challenge
Eric Hambro
Sharada Mohanty
Dmitrii Babaev
Mi-Ra Byeon
Dipam Chakraborty
...
Dan Rothermel
Mikayel Samvelyan
Dmitry Sorokin
Maciej Sypetkowski
Michal Sypetkowski
37
19
0
22 Mar 2022
A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open
  Problems
A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems
Rafael Figueiredo Prudencio
Marcos R. O. A. Máximo
Esther Luna Colombini
OffRL
70
236
0
02 Mar 2022
Adversarially Trained Actor Critic for Offline Reinforcement Learning
Adversarially Trained Actor Critic for Offline Reinforcement Learning
Ching-An Cheng
Tengyang Xie
Nan Jiang
Alekh Agarwal
OffRL
60
129
0
05 Feb 2022
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement
  Learning
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Sabela Ramos
Sertan Girgin
Léonard Hussenot
Damien Vincent
Hanna Yakubovich
...
Piotr Stańczyk
Raphaël Marinier
Jeremiah Harmsen
Olivier Pietquin
Nikola Momchev
OffRL
58
24
0
04 Nov 2021
Offline Reinforcement Learning with Implicit Q-Learning
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
276
899
0
12 Oct 2021
BRAC+: Improved Behavior Regularized Actor Critic for Offline
  Reinforcement Learning
BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning
Chi Zhang
S. Kuppannagari
Viktor Prasanna
OffRL
68
16
0
02 Oct 2021
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning
  Research
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Mikayel Samvelyan
Robert Kirk
Vitaly Kurin
Jack Parker-Holder
Minqi Jiang
Eric Hambro
Fabio Petroni
Heinrich Küttler
Edward Grefenstette
Tim Rocktaschel
OffRL
277
90
0
27 Sep 2021
What Matters in Learning from Offline Human Demonstrations for Robot
  Manipulation
What Matters in Learning from Offline Human Demonstrations for Robot Manipulation
Ajay Mandlekar
Danfei Xu
J. Wong
Soroush Nasiriany
Chen Wang
Rohun Kulkarni
Li Fei-Fei
Silvio Savarese
Yuke Zhu
Roberto Martín-Martín
OffRL
275
502
0
06 Aug 2021
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
Rongjun Qin
Songyi Gao
Xingyuan Zhang
Zhen Xu
Shengkai Huang
Zewen Li
Weinan Zhang
Yang Yu
OffRL
184
82
0
01 Feb 2021
The MAGICAL Benchmark for Robust Imitation
The MAGICAL Benchmark for Robust Imitation
Sam Toyer
Rohin Shah
Andrew Critch
Stuart J. Russell
46
36
0
01 Nov 2020
Learning Quadrupedal Locomotion over Challenging Terrain
Learning Quadrupedal Locomotion over Challenging Terrain
Joonho Lee
Jemin Hwangbo
Lorenz Wellhausen
V. Koltun
Marco Hutter
113
1,166
0
21 Oct 2020
The NetHack Learning Environment
The NetHack Learning Environment
Heinrich Küttler
Nantas Nardelli
Alexander H. Miller
Roberta Raileanu
Marco Selvatici
Edward Grefenstette
Tim Rocktaschel
61
181
0
24 Jun 2020
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with
  Asynchronous Reinforcement Learning
Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning
Aleksei Petrenko
Zhehui Huang
T. Kumar
Gaurav Sukhatme
V. Koltun
47
104
0
21 Jun 2020
Conservative Q-Learning for Offline Reinforcement Learning
Conservative Q-Learning for Offline Reinforcement Learning
Aviral Kumar
Aurick Zhou
George Tucker
Sergey Levine
OffRL
OnRL
131
1,806
0
08 Jun 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
512
2,018
0
04 May 2020
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
210
1,359
0
15 Apr 2020
Keep Doing What Worked: Behavioral Modelling Priors for Offline
  Reinforcement Learning
Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning
Noah Y. Siegel
Jost Tobias Springenberg
Felix Berkenkamp
A. Abdolmaleki
Michael Neunert
Thomas Lampe
Roland Hafner
Nicolas Heess
Martin Riedmiller
OffRL
56
283
0
19 Feb 2020
Dota 2 with Large Scale Deep Reinforcement Learning
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
140
1,819
0
13 Dec 2019
Behavior Regularized Offline Reinforcement Learning
Behavior Regularized Offline Reinforcement Learning
Yifan Wu
George Tucker
Ofir Nachum
OffRL
85
683
0
26 Nov 2019
RoboNet: Large-Scale Multi-Robot Learning
RoboNet: Large-Scale Multi-Robot Learning
Sudeep Dasari
F. Ebert
Stephen Tian
Suraj Nair
Bernadette Bucher
Karl Schmeckpeper
Siddharth Singh
Sergey Levine
Chelsea Finn
LM&Ro
70
302
0
24 Oct 2019
Solving Rubik's Cube with a Robot Hand
Solving Rubik's Cube with a Robot Hand
OpenAI
Ilge Akkaya
Marcin Andrychowicz
Maciek Chociej
Ma-teusz Litwin
...
Peter Welinder
Lilian Weng
Qiming Yuan
Wojciech Zaremba
Lei Zhang
ODL
111
1,225
0
16 Oct 2019
Scaling data-driven robotics with reward sketching and batch
  reinforcement learning
Scaling data-driven robotics with reward sketching and batch reinforcement learning
Serkan Cabi
Sergio Gomez Colmenarejo
Alexander Novikov
Ksenia Konyushkova
Scott E. Reed
...
David Barker
Jonathan Scholz
Misha Denil
Nando de Freitas
Ziyun Wang
OffRL
65
29
0
26 Sep 2019
MineRL: A Large-Scale Dataset of Minecraft Demonstrations
MineRL: A Large-Scale Dataset of Minecraft Demonstrations
William H. Guss
Brandon Houghton
Nicholay Topin
Phillip Wang
Cayden R. Codel
Manuela Veloso
Ruslan Salakhutdinov
OffRL
55
224
0
29 Jul 2019
An Optimistic Perspective on Offline Reinforcement Learning
An Optimistic Perspective on Offline Reinforcement Learning
Rishabh Agarwal
Dale Schuurmans
Mohammad Norouzi
OffRL
OnRL
60
69
0
10 Jul 2019
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human
  Preferences in Dialog
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Natasha Jaques
Asma Ghandeharioun
J. Shen
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
99
340
0
30 Jun 2019
Simitate: A Hybrid Imitation Learning Benchmark
Simitate: A Hybrid Imitation Learning Benchmark
Raphael Memmesheimer
Ivanna Mykhalchyshyna
Viktor Seib
Dietrich Paulus
48
20
0
15 May 2019
Off-Policy Deep Reinforcement Learning without Exploration
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRL
BDL
192
1,604
0
07 Dec 2018
Multiple Interactions Made Easy (MIME): Large Scale Demonstrations Data
  for Imitation
Multiple Interactions Made Easy (MIME): Large Scale Demonstrations Data for Imitation
Pratyusha Sharma
Lekha Mohan
Lerrel Pinto
Abhinav Gupta
44
121
0
16 Oct 2018
Imitating Latent Policies from Observation
Imitating Latent Policies from Observation
Ashley D. Edwards
Himanshu Sahni
Yannick Schroecker
Charles Isbell
65
137
0
21 May 2018
Behavioral Cloning from Observation
Behavioral Cloning from Observation
F. Torabi
Garrett A. Warnell
Peter Stone
OffRL
107
720
0
04 May 2018
Kickstarting Deep Reinforcement Learning
Kickstarting Deep Reinforcement Learning
Simon Schmitt
Jonathan J. Hudson
Augustin Žídek
Simon Osindero
Carl Doersch
...
Joel Z Leibo
Heinrich Küttler
Andrew Zisserman
Karen Simonyan
S. M. Ali Eslami
OnRL
52
132
0
10 Mar 2018
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning
  and Demonstrations
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations
Aravind Rajeswaran
Vikash Kumar
Abhishek Gupta
Giulia Vezzani
John Schulman
E. Todorov
Sergey Levine
126
1,093
0
28 Sep 2017
StarCraft II: A New Challenge for Reinforcement Learning
StarCraft II: A New Challenge for Reinforcement Learning
Oriol Vinyals
T. Ewalds
Sergey Bartunov
Petko Georgiev
A. Vezhnevets
...
Anthony Brunasso
David Lawrence
Anders Ekermo
J. Repp
Rodney Tsing
76
872
0
16 Aug 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
444
18,931
0
20 Jul 2017
Emergence of Locomotion Behaviours in Rich Environments
Emergence of Locomotion Behaviours in Rich Environments
N. Heess
TB Dhruva
S. Sriram
Jay Lemmon
J. Merel
...
Tom Erez
Ziyun Wang
S. M. Ali Eslami
Martin Riedmiller
David Silver
197
936
0
07 Jul 2017
Third-Person Imitation Learning
Third-Person Imitation Learning
Bradly C. Stadie
Pieter Abbeel
Ilya Sutskever
58
234
0
06 Mar 2017
Sim-to-Real Robot Learning from Pixels with Progressive Nets
Sim-to-Real Robot Learning from Pixels with Progressive Nets
Andrei A. Rusu
Matej Vecerík
Thomas Rothörl
N. Heess
Razvan Pascanu
R. Hadsell
72
532
0
13 Oct 2016
Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning
  and Large-Scale Data Collection
Learning Hand-Eye Coordination for Robotic Grasping with Deep Learning and Large-Scale Data Collection
Sergey Levine
P. Pastor
A. Krizhevsky
Deirdre Quillen
160
2,071
0
07 Mar 2016
Playing Atari with Deep Reinforcement Learning
Playing Atari with Deep Reinforcement Learning
Volodymyr Mnih
Koray Kavukcuoglu
David Silver
Alex Graves
Ioannis Antonoglou
Daan Wierstra
Martin Riedmiller
114
12,201
0
19 Dec 2013
A Reduction of Imitation Learning and Structured Prediction to No-Regret
  Online Learning
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
194
3,211
0
02 Nov 2010
A Contextual-Bandit Approach to Personalized News Article Recommendation
A Contextual-Bandit Approach to Personalized News Article Recommendation
Lihong Li
Wei Chu
John Langford
Robert Schapire
375
2,944
0
28 Feb 2010
Learning from Logged Implicit Exploration Data
Learning from Logged Implicit Exploration Data
Alexander L. Strehl
John Langford
Sham Kakade
Lihong Li
OffRL
159
255
0
27 Feb 2010
1