arXiv:2012.06899
Semi-supervised reward learning for offline reinforcement learning
12 December 2020
Ksenia Konyushkova
Konrad Zolna
Y. Aytar
Alexander Novikov
Scott E. Reed
Serkan Cabi
Nando de Freitas
SSL
OffRL
Papers citing "Semi-supervised reward learning for offline reinforcement learning"
25 / 25 papers shown
Offline Learning from Demonstrations and Unlabeled Experience
Konrad Zolna
Alexander Novikov
Ksenia Konyushkova
Çağlar Gülçehre
Ziyun Wang
Y. Aytar
Misha Denil
Nando de Freitas
Scott E. Reed
SSL
OffRL
57
67
0
27 Nov 2020
Hyperparameter Selection for Offline Reinforcement Learning
T. Paine
Cosmin Paduraru
Andrea Michi
Çağlar Gülçehre
Konrad Zolna
Alexander Novikov
Ziyun Wang
Nando de Freitas
GP
OffRL
74
147
0
17 Jul 2020
Critic Regularized Regression
Ziyun Wang
Alexander Novikov
Konrad Zolna
Jost Tobias Springenberg
Scott E. Reed
...
Noah Y. Siegel
J. Merel
Çağlar Gülçehre
N. Heess
Nando de Freitas
OffRL
104
320
0
26 Jun 2020
Acme: A Research Framework for Distributed Reinforcement Learning
Matthew W. Hoffman
Bobak Shahriari
John Aslanides
Gabriel Barth-Maron
Nikola Momchev
...
Srivatsan Srinivasan
A. Cowie
Ziyun Wang
Bilal Piot
Nando de Freitas
84
225
0
01 Jun 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
448
1,994
0
04 May 2020
Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning
Noah Y. Siegel
Jost Tobias Springenberg
Felix Berkenkamp
A. Abdolmaleki
Michael Neunert
Thomas Lampe
Roland Hafner
Nicolas Heess
Martin Riedmiller
OffRL
24
283
0
19 Feb 2020
Self-training with Noisy Student improves ImageNet classification
Qizhe Xie
Minh-Thang Luong
Eduard H. Hovy
Quoc V. Le
NoLa
152
2,375
0
11 Nov 2019
Positive-Unlabeled Reward Learning
Danfei Xu
Misha Denil
39
38
0
01 Nov 2019
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Xinyue Chen
Zijian Zhou
Ziyi Wang
Che Wang
Yanqiu Wu
George Andriopoulos
OffRL
44
121
0
27 Oct 2019
Task-Relevant Adversarial Imitation Learning
Konrad Zolna
Scott E. Reed
Alexander Novikov
Sergio Gomez Colmenarejo
David Budden
Serkan Cabi
Misha Denil
Nando de Freitas
Ziyun Wang
GAN
101
61
0
02 Oct 2019
Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Xue Bin Peng
Aviral Kumar
Grace Zhang
Sergey Levine
OffRL
100
548
0
01 Oct 2019
End-to-End Robotic Reinforcement Learning without Reward Engineering
Avi Singh
Larry Yang
Kristian Hartikainen
Chelsea Finn
Sergey Levine
SSL
OffRL
51
267
0
16 Apr 2019
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto
David Meger
Doina Precup
OffRL
BDL
108
1,586
0
07 Dec 2018
An Algorithmic Perspective on Imitation Learning
Takayuki Osa
Joni Pajarinen
Gerhard Neumann
J. Andrew Bagnell
Pieter Abbeel
Jan Peters
67
833
0
16 Nov 2018
Reinforcement and Imitation Learning for Diverse Visuomotor Skills
Yuke Zhu
Ziyun Wang
J. Merel
Andrei A. Rusu
Tom Erez
...
S. Tunyasuvunakool
János Kramár
R. Hadsell
Nando de Freitas
N. Heess
SSL
52
317
0
26 Feb 2018
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
Justin Fu
Katie Z Luo
Sergey Levine
87
746
0
30 Oct 2017
A Distributional Perspective on Reinforcement Learning
Marc G. Bellemare
Will Dabney
Rémi Munos
OffRL
61
1,497
0
21 Jul 2017
Vision-Based Multi-Task Manipulation for Inexpensive Robots Using End-To-End Learning from Demonstration
Rouhollah Rahmatizadeh
P. Abolghasemi
Ladislau Bölöni
Sergey Levine
60
255
0
10 Jul 2017
Learning human behaviors from motion capture by adversarial imitation
J. Merel
Yuval Tassa
TB Dhruva
S. Srinivasan
Jay Lemmon
Ziyun Wang
Greg Wayne
N. Heess
GAN
47
202
0
07 Jul 2017
Positive-Unlabeled Learning with Non-Negative Risk Estimator
Ryuichi Kiryo
Gang Niu
M. C. D. Plessis
Masashi Sugiyama
55
471
0
02 Mar 2017
Unsupervised Perceptual Rewards for Imitation Learning
P. Sermanet
Kelvin Xu
Sergey Levine
SSL
53
158
0
20 Dec 2016
Perceptual Reward Functions
Ashley D. Edwards
Charles Isbell
A. Takanishi
XAI
42
17
0
12 Aug 2016
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
95
3,084
0
10 Jun 2016
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization
Chelsea Finn
Sergey Levine
Pieter Abbeel
71
946
0
01 Mar 2016
Weakly Supervised Object Localization with Multi-fold Multiple Instance Learning
R. G. Cinbis
Jakob Verbeek
Cordelia Schmid
WSOD
SSL
48
430
0
03 Mar 2015