1805.11592
Cited By
Playing hard exploration games by watching YouTube
29 May 2018
Y. Aytar
Tobias Pfaff
David Budden
T. Paine
Ziyun Wang
Nando de Freitas
Papers citing "Playing hard exploration games by watching YouTube"
50 / 60 papers shown
Title
Diffusion-Reward Adversarial Imitation Learning
Chun-Mao Lai
Hsiang-Chun Wang
Ping-Chun Hsieh
Yu-Chiang Frank Wang
Min-Hung Chen
Shao-Hua Sun
37
8
0
25 May 2024
CyberDemo: Augmenting Simulated Human Demonstration for Real-World Dexterous Manipulation
Jun Wang
Yuzhe Qin
Kaiming Kuang
Yigit Korkmaz
Akhilan Gurumoorthy
Hao Su
Xiaolong Wang
43
20
0
22 Feb 2024
Enhancing Reinforcement Learning Agents with Local Guides
Paul Daoudi
Bogdan Robu
Christophe Prieur
Ludovic Dos Santos
M. Barlier
OnRL
31
3
0
21 Feb 2024
Learning to Act without Actions
Dominik Schmidt
Minqi Jiang
OffRL
34
31
0
17 Dec 2023
Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning
Jialong Wu
Haoyu Ma
Chao Deng
Mingsheng Long
OffRL
34
25
0
29 May 2023
Inverse Dynamics Pretraining Learns Good Representations for Multitask Imitation
David Brandfonbrener
Ofir Nachum
Joan Bruna
AI4CE
26
21
0
26 May 2023
SIRL: Similarity-based Implicit Representation Learning
Andreea Bobu
Yi Liu
Rohin Shah
Daniel S. Brown
Anca Dragan
SSL
DRL
35
17
0
02 Jan 2023
Time-Efficient Reward Learning via Visually Assisted Cluster Ranking
David Zhang
Micah Carroll
Andreea Bobu
Anca Dragan
24
4
0
30 Nov 2022
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets
David Venuto
Sherry Yang
Pieter Abbeel
Doina Precup
Igor Mordatch
Ofir Nachum
OffRL
25
5
0
23 Nov 2022
Learning Reward Functions for Robotic Manipulation by Observing Humans
Minttu Alakuijala
Gabriel Dulac-Arnold
Julien Mairal
Jean Ponce
Cordelia Schmid
OffRL
39
27
0
16 Nov 2022
Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization
Stone Tao
Xiaochen Li
Tongzhou Mu
Zhiao Huang
Yuzhe Qin
Hao Su
22
3
0
14 Oct 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
37
35
0
19 Sep 2022
Graph Inverse Reinforcement Learning from Diverse Videos
Sateesh Kumar
Jonathan Zamora
Nicklas Hansen
Rishabh Jangir
Xiaolong Wang
35
53
0
28 Jul 2022
Aligning Robot Representations with Humans
Andreea Bobu
Andi Peng
27
0
0
15 May 2022
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
26
324
0
02 May 2022
From One Hand to Multiple Hands: Imitation Learning for Dexterous Manipulation from Single-Camera Teleoperation
Yuzhe Qin
Hao Su
Xiaolong Wang
27
99
0
26 Apr 2022
Learning Generalizable Dexterous Manipulation from Human Grasp Affordance
Yueh-hua Wu
Jiashun Wang
Xiaolong Wang
29
55
0
05 Apr 2022
Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks
Zehua Wang
Guogang Liao
Xiaowen Shi
Xiaoxu Wu
Chuheng Zhang
Yongkang Wang
Xingxing Wang
Dong Wang
OffRL
19
4
0
02 Apr 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
18
118
0
25 Mar 2022
Shaping embodied agent behavior with activity-context priors from egocentric video
Tushar Nagarajan
Kristen Grauman
EgoV
LM&Ro
55
13
0
14 Oct 2021
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos
Yuzhe Qin
Yueh-hua Wu
Shaowei Liu
Hanwen Jiang
Ruihan Yang
Yang Fu
Xiaolong Wang
134
190
0
12 Aug 2021
Self-Supervised Disentangled Representation Learning for Third-Person Imitation Learning
Jinghuan Shang
Michael S. Ryoo
SSL
24
24
0
02 Aug 2021
Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks
Ruohan Zhang
F. Torabi
Garrett A. Warnell
Peter Stone
78
28
0
13 Jul 2021
SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies
Linxi Fan
Guanzhi Wang
De-An Huang
Zhiding Yu
Li Fei-Fei
Yuke Zhu
Anima Anandkumar
OffRL
27
63
0
17 Jun 2021
XIRL: Cross-embodiment Inverse Reinforcement Learning
Kevin Zakka
Andy Zeng
Peter R. Florence
Jonathan Tompson
Jeannette Bohg
Debidatta Dwibedi
SSL
43
118
0
07 Jun 2021
Off-Policy Imitation Learning from Observations
Zhuangdi Zhu
Kaixiang Lin
Bo Dai
Jiayu Zhou
OffRL
26
86
0
25 Feb 2021
Return-Based Contrastive Representation Learning for Reinforcement Learning
Guoqing Liu
Chuheng Zhang
Li Zhao
Tao Qin
Jinhua Zhu
Jian Li
Nenghai Yu
Tie-Yan Liu
SSL
OffRL
16
47
0
22 Feb 2021
Applied Machine Learning for Games: A Graduate School Course
Yilei Zeng
Aayush Shah
Jameson Thai
M. Zyda
AI4CE
14
3
0
30 Nov 2020
Reinforcement Learning with Videos: Combining Offline Observations with Interaction
Karl Schmeckpeper
Oleh Rybkin
Kostas Daniilidis
Sergey Levine
Chelsea Finn
OffRL
16
105
0
12 Nov 2020
Slot Contrastive Networks: A Contrastive Approach for Representing Objects
Evan Racah
Sarath Chandar
OCL
DRL
21
14
0
18 Jul 2020
Learning Abstract Models for Strategic Exploration and Fast Reward Transfer
E. Liu
Ramtin Keramati
Sudarshan Seshadri
Kelvin Guu
Panupong Pasupat
Emma Brunskill
Percy Liang
OffRL
27
5
0
12 Jul 2020
FlowControl: Optical Flow Based Visual Servoing
Max Argus
Lukás Hermann
Jon Long
Thomas Brox
22
25
0
01 Jul 2020
Automatic Data Augmentation for Generalization in Deep Reinforcement Learning
Roberta Raileanu
M. Goldstein
Denis Yarats
Ilya Kostrikov
Rob Fergus
OffRL
22
109
0
23 Jun 2020
Rinascimento: using event-value functions for playing Splendor
Ivan Bravi
Simon Lucas
22
2
0
10 Jun 2020
Primal Wasserstein Imitation Learning
Robert Dadashi
Léonard Hussenot
M. Geist
Olivier Pietquin
26
124
0
08 Jun 2020
Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Z. Guo
Bernardo Avila-Pires
Bilal Piot
Jean-Bastien Grill
Florent Altché
Rémi Munos
M. G. Azar
BDL
DRL
SSL
43
139
0
30 Apr 2020
First return, then explore
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
47
350
0
27 Apr 2020
State-Only Imitation Learning for Dexterous Manipulation
Ilija Radosavovic
Xiaolong Wang
Lerrel Pinto
Jitendra Malik
OffRL
19
121
0
07 Apr 2020
Agent57: Outperforming the Atari Human Benchmark
Adria Puigdomenech Badia
Bilal Piot
Steven Kapturowski
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Charles Blundell
OffRL
27
509
0
30 Mar 2020
Visual Task Progress Estimation with Appearance Invariant Embeddings for Robot Control and Planning
Guilherme J. Maeda
Joni Väätäinen
Hironori Yoshida
22
2
0
16 Mar 2020
Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences
Daniel S. Brown
Russell Coleman
R. Srinivasan
S. Niekum
BDL
30
101
0
21 Feb 2020
Universal Value Density Estimation for Imitation Learning and Goal-Conditioned Reinforcement Learning
Yannick Schroecker
Charles Isbell
OffRL
36
12
0
15 Feb 2020
Learning Predictive Models From Observation and Interaction
Karl Schmeckpeper
Annie Xie
Oleh Rybkin
Stephen Tian
Kostas Daniilidis
Sergey Levine
Chelsea Finn
DRL
33
60
0
30 Dec 2019
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
Attentive Multi-Task Deep Reinforcement Learning
Timo Bram
Gino Brunner
Oliver Richter
Roger Wattenhofer
CLL
17
18
0
05 Jul 2019
Supervise Thyself: Examining Self-Supervised Representations in Interactive Environments
Evan Racah
C. Pal
SSL
27
2
0
27 Jun 2019
Unsupervised State Representation Learning in Atari
Ankesh Anand
Evan Racah
Sherjil Ozair
Yoshua Bengio
Marc-Alexandre Côté
R. Devon Hjelm
SSL
44
254
0
19 Jun 2019
Wasserstein Dependency Measure for Representation Learning
Sherjil Ozair
Corey Lynch
Yoshua Bengio
Aaron van den Oord
Sergey Levine
P. Sermanet
SSL
DRL
30
116
0
28 Mar 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
24
361
0
30 Jan 2019
Learning Montezuma's Revenge from a Single Demonstration
Tim Salimans
Richard J. Chen
31
136
0
08 Dec 2018