Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.08847
Cited By
v1
v2 (latest)
Provable RL with Exogenous Distractors via Multistep Inverse Dynamics
17 October 2021
Yonathan Efroni
Dipendra Kumar Misra
A. Krishnamurthy
Alekh Agarwal
John Langford
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Provable RL with Exogenous Distractors via Multistep Inverse Dynamics"
26 / 26 papers shown
Title
Object-Centric Latent Action Learning
Albina Klepach
Alexander Nikulin
Ilya Zisman
Denis Tarasov
Alexander Derevyagin
Andrei Polubarov
Nikita Lyubaykin
Vladislav Kurenkov
115
0
0
13 Feb 2025
Planning from Pixels using Inverse Dynamics Models
Keiran Paster
Sheila A. McIlraith
Jimmy Ba
BDL
51
41
0
04 Dec 2020
Reinforcement Learning with Trajectory Feedback
Yonathan Efroni
Nadav Merlis
Shie Mannor
75
45
0
13 Aug 2020
FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs
Alekh Agarwal
Sham Kakade
A. Krishnamurthy
Wen Sun
OffRL
170
227
0
18 Jun 2020
Learning Invariant Representations for Reinforcement Learning without Reconstruction
Amy Zhang
R. McAllister
Roberto Calandra
Y. Gal
Sergey Levine
OOD
SSL
116
479
0
18 Jun 2020
CURL: Contrastive Unsupervised Representations for Reinforcement Learning
A. Srinivas
Michael Laskin
Pieter Abbeel
SSL
DRL
OffRL
100
1,092
0
08 Apr 2020
Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning
Dipendra Kumar Misra
Mikael Henaff
A. Krishnamurthy
John Langford
79
151
0
13 Nov 2019
Sample Complexity of Reinforcement Learning using Linearly Combined Model Ensembles
Aditya Modi
Nan Jiang
Ambuj Tewari
Satinder Singh
70
132
0
23 Oct 2019
Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs
Lior Shani
Yonathan Efroni
Shie Mannor
57
176
0
06 Sep 2019
On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift
Alekh Agarwal
Sham Kakade
Jason D. Lee
G. Mahajan
72
321
0
01 Aug 2019
DeepMDP: Learning Continuous Latent Space Models for Representation Learning
Carles Gelada
Saurabh Kumar
Jacob Buckman
Ofir Nachum
Marc G. Bellemare
BDL
88
288
0
06 Jun 2019
Online Convex Optimization in Adversarial Markov Decision Processes
Aviv A. Rosenberg
Yishay Mansour
54
138
0
19 May 2019
Information-Theoretic Considerations in Batch Reinforcement Learning
Jinglin Chen
Nan Jiang
OOD
OffRL
161
378
0
01 May 2019
Provably efficient RL with Rich Observations via Latent State Decoding
S. Du
A. Krishnamurthy
Nan Jiang
Alekh Agarwal
Miroslav Dudík
John Langford
OffRL
74
230
0
25 Jan 2019
Learning Latent Dynamics for Planning from Pixels
Danijar Hafner
Timothy Lillicrap
Ian S. Fischer
Ruben Villegas
David R Ha
Honglak Lee
James Davidson
BDL
92
1,448
0
12 Nov 2018
Exploration by Random Network Distillation
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
161
1,345
0
30 Oct 2018
Large-Scale Study of Curiosity-Driven Learning
Yuri Burda
Harrison Edwards
Deepak Pathak
Amos Storkey
Trevor Darrell
Alexei A. Efros
LRM
72
707
0
13 Aug 2018
Discovering and Removing Exogenous State Variables and Rewards for Reinforcement Learning
Thomas G. Dietterich
George Trimponias
Zhitang Chen
BDL
OffRL
57
32
0
05 Jun 2018
On Oracle-Efficient PAC RL with Rich Observations
Christoph Dann
Nan Jiang
A. Krishnamurthy
Alekh Agarwal
John Langford
Robert Schapire
49
98
0
01 Mar 2018
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
541
19,296
0
20 Jul 2017
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
125
2,451
0
15 May 2017
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning
Christoph Dann
Tor Lattimore
Emma Brunskill
83
311
0
22 Mar 2017
Variational Intrinsic Control
Karol Gregor
Danilo Jimenez Rezende
Daan Wierstra
DRL
OffRL
88
429
0
22 Nov 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
111
775
0
15 Nov 2016
Contextual Decision Processes with Low Bellman Rank are PAC-Learnable
Nan Jiang
A. Krishnamurthy
Alekh Agarwal
John Langford
Robert Schapire
156
421
0
29 Oct 2016
Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits
Alekh Agarwal
Daniel J. Hsu
Satyen Kale
John Langford
Lihong Li
Robert Schapire
OffRL
410
510
0
04 Feb 2014
1