Playing hard exploration games by watching YouTube

29 May 2018
Y. Aytar
Tobias Pfaff
David Budden
T. Paine
Ziyun Wang
Nando de Freitas
arXiv:1805.11592

Papers citing "Playing hard exploration games by watching YouTube"

50 / 60 papers shown
Diffusion-Reward Adversarial Imitation Learning
Chun-Mao Lai
Hsiang-Chun Wang
Ping-Chun Hsieh
Yu-Chiang Frank Wang
Min-Hung Chen
Shao-Hua Sun
37
8
0
25 May 2024
CyberDemo: Augmenting Simulated Human Demonstration for Real-World Dexterous Manipulation
Jun Wang
Yuzhe Qin
Kaiming Kuang
Yigit Korkmaz
Akhilan Gurumoorthy
Hao Su
Xiaolong Wang
43
20
0
22 Feb 2024
Enhancing Reinforcement Learning Agents with Local Guides
Paul Daoudi
Bogdan Robu
Christophe Prieur
Ludovic Dos Santos
M. Barlier
OnRL
31
3
0
21 Feb 2024
Learning to Act without Actions
Dominik Schmidt
Minqi Jiang
OffRL
34
31
0
17 Dec 2023
Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning
Jialong Wu
Haoyu Ma
Chao Deng
Mingsheng Long
OffRL
34
25
0
29 May 2023
Inverse Dynamics Pretraining Learns Good Representations for Multitask Imitation
David Brandfonbrener
Ofir Nachum
Joan Bruna
AI4CE
26
21
0
26 May 2023
SIRL: Similarity-based Implicit Representation Learning
Andreea Bobu
Yi Liu
Rohin Shah
Daniel S. Brown
Anca Dragan
SSL
DRL
35
17
0
02 Jan 2023
Time-Efficient Reward Learning via Visually Assisted Cluster Ranking
David Zhang
Micah Carroll
Andreea Bobu
Anca Dragan
24
4
0
30 Nov 2022
Multi-Environment Pretraining Enables Transfer to Action Limited Datasets
David Venuto
Sherry Yang
Pieter Abbeel
Doina Precup
Igor Mordatch
Ofir Nachum
OffRL
25
5
0
23 Nov 2022
Learning Reward Functions for Robotic Manipulation by Observing Humans
Minttu Alakuijala
Gabriel Dulac-Arnold
Julien Mairal
Jean Ponce
Cordelia Schmid
OffRL
39
27
0
16 Nov 2022
Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization
Stone Tao
Xiaochen Li
Tongzhou Mu
Zhiao Huang
Yuzhe Qin
Hao Su
22
3
0
14 Oct 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
37
35
0
19 Sep 2022
Graph Inverse Reinforcement Learning from Diverse Videos
Sateesh Kumar
Jonathan Zamora
Nicklas Hansen
Rishabh Jangir
Xiaolong Wang
35
53
0
28 Jul 2022
Aligning Robot Representations with Humans
Andreea Bobu
Andi Peng
27
0
0
15 May 2022
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
26
324
0
02 May 2022
From One Hand to Multiple Hands: Imitation Learning for Dexterous Manipulation from Single-Camera Teleoperation
Yuzhe Qin
Hao Su
Xiaolong Wang
27
99
0
26 Apr 2022
Learning Generalizable Dexterous Manipulation from Human Grasp Affordance
Yueh-hua Wu
Jiashun Wang
Xiaolong Wang
29
55
0
05 Apr 2022
Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks
Zehua Wang
Guogang Liao
Xiaowen Shi
Xiaoxu Wu
Chuheng Zhang
Yongkang Wang
Xingxing Wang
Dong Wang
OffRL
19
4
0
02 Apr 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
18
118
0
25 Mar 2022
Shaping embodied agent behavior with activity-context priors from egocentric video
Tushar Nagarajan
Kristen Grauman
EgoV
LM&Ro
58
13
0
14 Oct 2021
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos
Yuzhe Qin
Yueh-hua Wu
Shaowei Liu
Hanwen Jiang
Ruihan Yang
Yang Fu
Xiaolong Wang
134
190
0
12 Aug 2021
Self-Supervised Disentangled Representation Learning for Third-Person Imitation Learning
Jinghuan Shang
Michael S. Ryoo
SSL
24
24
0
02 Aug 2021
Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks
Ruohan Zhang
F. Torabi
Garrett A. Warnell
Peter Stone
81
28
0
13 Jul 2021
SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies
Linxi Fan
Guanzhi Wang
De-An Huang
Zhiding Yu
Li Fei-Fei
Yuke Zhu
Anima Anandkumar
OffRL
30
63
0
17 Jun 2021
XIRL: Cross-embodiment Inverse Reinforcement Learning
Kevin Zakka
Andy Zeng
Peter R. Florence
Jonathan Tompson
Jeannette Bohg
Debidatta Dwibedi
SSL
43
118
0
07 Jun 2021
Off-Policy Imitation Learning from Observations
Zhuangdi Zhu
Kaixiang Lin
Bo Dai
Jiayu Zhou
OffRL
26
86
0
25 Feb 2021
Return-Based Contrastive Representation Learning for Reinforcement Learning
Guoqing Liu
Chuheng Zhang
Li Zhao
Tao Qin
Jinhua Zhu
Jian Li
Nenghai Yu
Tie-Yan Liu
SSL
OffRL
19
47
0
22 Feb 2021
Applied Machine Learning for Games: A Graduate School Course
Yilei Zeng
Aayush Shah
Jameson Thai
M. Zyda
AI4CE
14
3
0
30 Nov 2020
Reinforcement Learning with Videos: Combining Offline Observations with Interaction
Karl Schmeckpeper
Oleh Rybkin
Kostas Daniilidis
Sergey Levine
Chelsea Finn
OffRL
16
105
0
12 Nov 2020
Slot Contrastive Networks: A Contrastive Approach for Representing Objects
Evan Racah
Sarath Chandar
OCL
DRL
21
14
0
18 Jul 2020
Learning Abstract Models for Strategic Exploration and Fast Reward Transfer
E. Liu
Ramtin Keramati
Sudarshan Seshadri
Kelvin Guu
Panupong Pasupat
Emma Brunskill
Percy Liang
OffRL
27
5
0
12 Jul 2020
FlowControl: Optical Flow Based Visual Servoing
Max Argus
Lukás Hermann
Jon Long
Thomas Brox
25
25
0
01 Jul 2020
Automatic Data Augmentation for Generalization in Deep Reinforcement Learning
Roberta Raileanu
M. Goldstein
Denis Yarats
Ilya Kostrikov
Rob Fergus
OffRL
22
109
0
23 Jun 2020
Rinascimento: using event-value functions for playing Splendor
Ivan Bravi
Simon Lucas
24
2
0
10 Jun 2020
Primal Wasserstein Imitation Learning
Robert Dadashi
Léonard Hussenot
M. Geist
Olivier Pietquin
26
124
0
08 Jun 2020
Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Z. Guo
Bernardo Avila-Pires
Bilal Piot
Jean-Bastien Grill
Florent Altché
Rémi Munos
M. G. Azar
BDL
DRL
SSL
43
139
0
30 Apr 2020
First return, then explore
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
47
350
0
27 Apr 2020
State-Only Imitation Learning for Dexterous Manipulation
Ilija Radosavovic
Xiaolong Wang
Lerrel Pinto
Jitendra Malik
OffRL
19
121
0
07 Apr 2020
Agent57: Outperforming the Atari Human Benchmark
Adria Puigdomenech Badia
Bilal Piot
Steven Kapturowski
Pablo Sprechmann
Alex Vitvitskyi
Daniel Guo
Charles Blundell
OffRL
27
509
0
30 Mar 2020
Visual Task Progress Estimation with Appearance Invariant Embeddings for Robot Control and Planning
Guilherme J. Maeda
Joni Väätäinen
Hironori Yoshida
22
2
0
16 Mar 2020
Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences
Daniel S. Brown
Russell Coleman
R. Srinivasan
S. Niekum
BDL
30
101
0
21 Feb 2020
Universal Value Density Estimation for Imitation Learning and Goal-Conditioned Reinforcement Learning
Yannick Schroecker
Charles Isbell
OffRL
36
12
0
15 Feb 2020
Learning Predictive Models From Observation and Interaction
Karl Schmeckpeper
Annie Xie
Oleh Rybkin
Stephen Tian
Kostas Daniilidis
Sergey Levine
Chelsea Finn
DRL
33
60
0
30 Dec 2019
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
Attentive Multi-Task Deep Reinforcement Learning
Timo Bram
Gino Brunner
Oliver Richter
Roger Wattenhofer
CLL
17
18
0
05 Jul 2019
Supervise Thyself: Examining Self-Supervised Representations in Interactive Environments
Evan Racah
C. Pal
SSL
27
2
0
27 Jun 2019
Unsupervised State Representation Learning in Atari
Ankesh Anand
Evan Racah
Sherjil Ozair
Yoshua Bengio
Marc-Alexandre Côté
R. Devon Hjelm
SSL
44
254
0
19 Jun 2019
Wasserstein Dependency Measure for Representation Learning
Sherjil Ozair
Corey Lynch
Yoshua Bengio
Aaron van den Oord
Sergey Levine
P. Sermanet
SSL
DRL
30
116
0
28 Mar 2019
Go-Explore: a New Approach for Hard-Exploration Problems
Adrien Ecoffet
Joost Huizinga
Joel Lehman
Kenneth O. Stanley
Jeff Clune
AI4TS
24
361
0
30 Jan 2019
Learning Montezuma's Revenge from a Single Demonstration
Tim Salimans
Richard J. Chen
33
136
0
08 Dec 2018