Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2412.10778
Cited By
v1
v2 (latest)
Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos
14 December 2024
Xin Liu
Yaran Chen
Haoran Li
SSL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Sample-efficient Unsupervised Policy Cloning from Ensemble Self-supervised Labeled Videos"
50 / 53 papers shown
Title
Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning
Xin Liu
Yaran Chen
Dong Zhao
77
2
0
20 May 2024
Diffusion Reward: Learning Rewards via Conditional Video Diffusion
Tao Huang
Guangqi Jiang
Yanjie Ze
Huazhe Xu
VGen
115
26
0
21 Dec 2023
Learning to Act without Actions
Dominik Schmidt
Minqi Jiang
OffRL
137
38
0
17 Dec 2023
Adversarial Imitation Learning from Visual Observations using Latent Information
Vittorio Giammarino
Tomas Landelius
I. Paschalidis
97
7
0
29 Sep 2023
A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges
Maryam Zare
P. Kebria
Abbas Khosravi
Saeid Nahavandi
95
102
0
05 Sep 2023
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
Seohong Park
Dibya Ghosh
Benjamin Eysenbach
Sergey Levine
OffRL
128
61
0
22 Jul 2023
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
Jinxin Liu
Hongyin Zhang
Zifeng Zhuang
Yachen Kang
Donglin Wang
Bin Wang
OffRL
115
8
0
26 Jun 2023
TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning
Ruijie Zheng
Xiyao Wang
Yanchao Sun
Shuang Ma
Jieyu Zhao
Huazhe Xu
Hal Daumé
Furong Huang
90
40
0
22 Jun 2023
Learning from Visual Observation via Offline Pretrained State-to-Go Transformer
Bo-fan Zhou
Ke Li
Jiechuan Jiang
Zongqing Lu
ViT
OffRL
51
10
0
22 Jun 2023
Video Prediction Models as Rewards for Reinforcement Learning
Alejandro Escontrela
Ademi Adeniji
Wilson Yan
Ajay Jain
Xue Bin Peng
Ken Goldberg
Youngwoon Lee
Danijar Hafner
Pieter Abbeel
108
59
0
23 May 2023
Reinforcement Learning from Passive Data via Latent Intentions
Dibya Ghosh
Chethan Bhateja
Sergey Levine
OffRL
97
48
0
10 Apr 2023
Cross-domain Random Pre-training with Prototypes for Reinforcement Learning
Xin Liu
Yaran Chen
Haoran Li
Boyu Li
Dong Zhao
SSL
129
10
0
11 Feb 2023
Multi-View Masked World Models for Visual Robotic Manipulation
Younggyo Seo
Junsup Kim
Stephen James
Kimin Lee
Jinwoo Shin
Pieter Abbeel
VGen
105
60
0
05 Feb 2023
STEPS: Joint Self-supervised Nighttime Image Enhancement and Depth Estimation
Yupeng Zheng
Chengliang Zhong
Pengfei Li
Huan-ang Gao
Yuhang Zheng
...
Ling Wang
Hao Zhao
Guyue Zhou
Qichao Zhang
Dong Zhao
85
38
0
02 Feb 2023
Visual Imitation Learning with Patch Rewards
Minghuan Liu
Tairan He
Weinan Zhang
Shuicheng Yan
Zhongwen Xu
SSL
99
14
0
02 Feb 2023
Mastering Diverse Domains through World Models
Danijar Hafner
J. Pašukonis
Jimmy Ba
Timothy Lillicrap
94
617
0
10 Jan 2023
Masked World Models for Visual Control
Younggyo Seo
Danijar Hafner
Hao Liu
Fangchen Liu
Stephen James
Kimin Lee
Pieter Abbeel
OffRL
177
149
0
28 Jun 2022
Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning
Yang Yue
Bingyi Kang
Zhongwen Xu
Gao Huang
Shuicheng Yan
OffRL
99
13
0
25 Jun 2022
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Bowen Baker
Ilge Akkaya
Peter Zhokhov
Joost Huizinga
Jie Tang
Adrien Ecoffet
Brandon Houghton
Raul Sampedro
Jeff Clune
OffRL
159
304
0
23 Jun 2022
A Survey on Model-based Reinforcement Learning
Fan Luo
Tian Xu
Hang Lai
Xiong-Hui Chen
Weinan Zhang
Yang Yu
OffRL
LRM
121
110
0
19 Jun 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
107
123
0
25 Mar 2022
Masked Visual Pre-training for Motor Control
Tete Xiao
Ilija Radosavovic
Trevor Darrell
Jitendra Malik
SSL
119
250
0
11 Mar 2022
Human-Level Control through Directly-Trained Deep Spiking Q-Networks
Guisong Liu
Wenjie Deng
Xiurui Xie
Li Huang
Huajin Tang
OffRL
63
46
0
13 Dec 2021
URLB: Unsupervised Reinforcement Learning Benchmark
Michael Laskin
Denis Yarats
Hao Liu
Kimin Lee
Albert Zhan
Kevin Lu
Catherine Cang
Lerrel Pinto
Pieter Abbeel
SSL
OffRL
86
140
0
28 Oct 2021
DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations
Fei Deng
Ingook Jang
Sungjin Ahn
VLM
79
62
0
27 Oct 2021
Intrinsically Motivated Self-supervised Learning in Reinforcement Learning
Yue Zhao
Chenzhuang Du
Hang Zhao
Tiejun Li
SSL
60
5
0
26 Jun 2021
Reinforcement Learning with Prototypical Representations
Denis Yarats
Rob Fergus
A. Lazaric
Lerrel Pinto
SSL
83
226
0
22 Feb 2021
Mastering Atari with Discrete World Models
Danijar Hafner
Timothy Lillicrap
Mohammad Norouzi
Jimmy Ba
DRL
192
875
0
05 Oct 2020
Decoupling Representation Learning from Reinforcement Learning
Adam Stooke
Kimin Lee
Pieter Abbeel
Michael Laskin
SSL
DRL
380
346
0
14 Sep 2020
Deep Reinforcement Learning based Automatic Exploration for Navigation in Unknown Environment
Haoran Li
Qichao Zhang
Dongbin Zhao
60
198
0
23 Jul 2020
Data-Efficient Reinforcement Learning with Self-Predictive Representations
Max Schwarzer
Ankesh Anand
Rishab Goel
R. Devon Hjelm
Aaron Courville
Philip Bachman
118
321
0
12 Jul 2020
CURL: Contrastive Unsupervised Representations for Reinforcement Learning
A. Srinivas
Michael Laskin
Pieter Abbeel
SSL
DRL
OffRL
145
1,097
0
08 Apr 2020
A Simple Framework for Contrastive Learning of Visual Representations
Ting-Li Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
SSL
434
18,988
0
13 Feb 2020
Deep Reinforcement Learning for Autonomous Driving: A Survey
B. R. Kiran
Ibrahim Sobh
V. Talpaert
Patrick Mannion
A. A. Sallab
S. Yogamani
P. Pérez
367
1,710
0
02 Feb 2020
Dream to Control: Learning Behaviors by Latent Imagination
Danijar Hafner
Timothy Lillicrap
Jimmy Ba
Mohammad Norouzi
VLM
190
1,378
0
03 Dec 2019
Leveraging Procedural Generation to Benchmark Reinforcement Learning
K. Cobbe
Christopher Hesse
Jacob Hilton
John Schulman
129
557
0
03 Dec 2019
Momentum Contrast for Unsupervised Visual Representation Learning
Kaiming He
Haoqi Fan
Yuxin Wu
Saining Xie
Ross B. Girshick
SSL
275
12,174
0
13 Nov 2019
On Mutual Information Maximization for Representation Learning
Michael Tschannen
Josip Djolonga
Paul Kishan Rubenstein
Sylvain Gelly
Mario Lucic
SSL
196
502
0
31 Jul 2019
Model-Based Reinforcement Learning for Atari
Lukasz Kaiser
Mohammad Babaeizadeh
Piotr Milos
B. Osinski
R. Campbell
...
Sergey Levine
Afroz Mohiuddin
Ryan Sepassi
George Tucker
Henryk Michalewski
OffRL
207
870
0
01 Mar 2019
Generative Adversarial Imitation from Observation
F. Torabi
Garrett A. Warnell
Peter Stone
GAN
98
245
0
17 Jul 2018
QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation
Dmitry Kalashnikov
A. Irpan
P. Pastor
Julian Ibarz
Alexander Herzog
...
Deirdre Quillen
E. Holly
Mrinal Kalakrishnan
Vincent Vanhoucke
Sergey Levine
184
1,473
0
27 Jun 2018
Imitating Latent Policies from Observation
Ashley D. Edwards
Himanshu Sahni
Yannick Schroecker
Charles Isbell
108
139
0
21 May 2018
Behavioral Cloning from Observation
F. Torabi
Garrett A. Warnell
Peter Stone
OffRL
145
732
0
04 May 2018
Zero-Shot Visual Imitation
Deepak Pathak
Parsa Mahmoudieh
Guanghao Luo
Pulkit Agrawal
Dian Chen
Yide Shentu
Evan Shelhamer
Jitendra Malik
Alexei A. Efros
Trevor Darrell
LM&Ro
126
301
0
23 Apr 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
276
1,609
0
05 Feb 2018
Neural Discrete Representation Learning
Aaron van den Oord
Oriol Vinyals
Koray Kavukcuoglu
BDL
SSL
OCL
259
5,093
0
02 Nov 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
698
19,363
0
20 Jul 2017
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
121
1,229
0
16 Nov 2016
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
203
3,132
0
10 Jun 2016
Continuous control with deep reinforcement learning
Timothy Lillicrap
Jonathan J. Hunt
Alexander Pritzel
N. Heess
Tom Erez
Yuval Tassa
David Silver
Daan Wierstra
415
13,333
0
09 Sep 2015
1
2
Next