
Provable RL with Exogenous Distractors via Multistep Inverse Dynamics

17 October 2021

Papers citing "Provable RL with Exogenous Distractors via Multistep Inverse Dynamics"


Title | Authors | Tags | Counts | Date
Object-Centric Latent Action Learning | Albina Klepach, Alexander Nikulin, Ilya Zisman, Denis Tarasov, Alexander Derevyagin, Andrei Polubarov, Nikita Lyubaykin, Vladislav Kurenkov | - | 115 0 0 | 13 Feb 2025
Planning from Pixels using Inverse Dynamics Models | Keiran Paster, Sheila A. McIlraith, Jimmy Ba | BDL | 51 41 0 | 04 Dec 2020
Reinforcement Learning with Trajectory Feedback | Yonathan Efroni, Nadav Merlis, Shie Mannor | - | 75 45 0 | 13 Aug 2020
FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs | Alekh Agarwal, Sham Kakade, A. Krishnamurthy, Wen Sun | OffRL | 170 227 0 | 18 Jun 2020
Learning Invariant Representations for Reinforcement Learning without Reconstruction | Amy Zhang, R. McAllister, Roberto Calandra, Y. Gal, Sergey Levine | OOD SSL | 116 479 0 | 18 Jun 2020
CURL: Contrastive Unsupervised Representations for Reinforcement Learning | A. Srinivas, Michael Laskin, Pieter Abbeel | SSL DRL OffRL | 100 1,092 0 | 08 Apr 2020
Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning | Dipendra Kumar Misra, Mikael Henaff, A. Krishnamurthy, John Langford | - | 79 151 0 | 13 Nov 2019
Sample Complexity of Reinforcement Learning using Linearly Combined Model Ensembles | Aditya Modi, Nan Jiang, Ambuj Tewari, Satinder Singh | - | 70 132 0 | 23 Oct 2019
Adaptive Trust Region Policy Optimization: Global Convergence and Faster Rates for Regularized MDPs | Lior Shani, Yonathan Efroni, Shie Mannor | - | 57 176 0 | 06 Sep 2019
On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift | Alekh Agarwal, Sham Kakade, Jason D. Lee, G. Mahajan | - | 72 321 0 | 01 Aug 2019
DeepMDP: Learning Continuous Latent Space Models for Representation Learning | Carles Gelada, Saurabh Kumar, Jacob Buckman, Ofir Nachum, Marc G. Bellemare | BDL | 88 288 0 | 06 Jun 2019
Online Convex Optimization in Adversarial Markov Decision Processes | Aviv A. Rosenberg, Yishay Mansour | - | 54 138 0 | 19 May 2019
Information-Theoretic Considerations in Batch Reinforcement Learning | Jinglin Chen, Nan Jiang | OOD OffRL | 161 378 0 | 01 May 2019
Provably efficient RL with Rich Observations via Latent State Decoding | S. Du, A. Krishnamurthy, Nan Jiang, Alekh Agarwal, Miroslav Dudík, John Langford | OffRL | 74 230 0 | 25 Jan 2019
Learning Latent Dynamics for Planning from Pixels | Danijar Hafner, Timothy Lillicrap, Ian S. Fischer, Ruben Villegas, David R Ha, Honglak Lee, James Davidson | BDL | 92 1,448 0 | 12 Nov 2018
Exploration by Random Network Distillation | Yuri Burda, Harrison Edwards, Amos Storkey, Oleg Klimov | - | 161 1,345 0 | 30 Oct 2018
Large-Scale Study of Curiosity-Driven Learning | Yuri Burda, Harrison Edwards, Deepak Pathak, Amos Storkey, Trevor Darrell, Alexei A. Efros | LRM | 72 707 0 | 13 Aug 2018
Discovering and Removing Exogenous State Variables and Rewards for Reinforcement Learning | Thomas G. Dietterich, George Trimponias, Zhitang Chen | BDL OffRL | 57 32 0 | 05 Jun 2018
On Oracle-Efficient PAC RL with Rich Observations | Christoph Dann, Nan Jiang, A. Krishnamurthy, Alekh Agarwal, John Langford, Robert Schapire | - | 49 98 0 | 01 Mar 2018
Proximal Policy Optimization Algorithms | John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, Oleg Klimov | OffRL | 541 19,296 0 | 20 Jul 2017
Curiosity-driven Exploration by Self-supervised Prediction | Deepak Pathak, Pulkit Agrawal, Alexei A. Efros, Trevor Darrell | LRM SSL | 125 2,451 0 | 15 May 2017
Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning | Christoph Dann, Tor Lattimore, Emma Brunskill | - | 83 311 0 | 22 Mar 2017
Variational Intrinsic Control | Karol Gregor, Danilo Jimenez Rezende, Daan Wierstra | DRL OffRL | 88 429 0 | 22 Nov 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning | Haoran Tang, Rein Houthooft, Davis Foote, Adam Stooke, Xi Chen, Yan Duan, John Schulman, F. Turck, Pieter Abbeel | OffRL | 111 775 0 | 15 Nov 2016
Contextual Decision Processes with Low Bellman Rank are PAC-Learnable | Nan Jiang, A. Krishnamurthy, Alekh Agarwal, John Langford, Robert Schapire | - | 156 421 0 | 29 Oct 2016
Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits | Alekh Agarwal, Daniel J. Hsu, Satyen Kale, John Langford, Lihong Li, Robert Schapire | OffRL | 410 510 0 | 04 Feb 2014