ResearchTrend.AI
Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement

25 February 2020
Benjamin Eysenbach
Xinyang Geng
Sergey Levine
Ruslan Salakhutdinov
    OffRL

Papers citing "Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement"

38 / 38 papers shown
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Yen-Ru Lai
Fu-Chieh Chang
Pei-Yuan Wu
OffRL
135
1
0
22 Aug 2024
Generalized Hindsight for Reinforcement Learning
Alexander C. Li
Lerrel Pinto
Pieter Abbeel
61
70
0
26 Feb 2020
Gradient Surgery for Multi-Task Learning
Tianhe Yu
Saurabh Kumar
Abhishek Gupta
Sergey Levine
Karol Hausman
Chelsea Finn
197
1,230
0
19 Jan 2020
Learning to Reach Goals via Iterated Supervised Learning
Dibya Ghosh
Abhishek Gupta
Ashwin Reddy
Justin Fu
Coline Devin
Benjamin Eysenbach
Sergey Levine
109
35
0
12 Dec 2019
Scalability in Perception for Autonomous Driving: Waymo Open Dataset
Pei Sun
Henrik Kretzschmar
Xerxes Dotiwalla
Aurelien Chouard
Vijaysai Patnaik
...
Shuyang Cheng
Yu Zhang
Jonathon Shlens
Zhifeng Chen
Dragomir Anguelov
152
2,910
0
10 Dec 2019
Argoverse: 3D Tracking and Forecasting with Rich Maps
Ming-Fang Chang
John Lambert
Patsorn Sangkloy
Jagjeet Singh
Sławomir Bąk
...
De Wang
Peter Carr
Simon Lucey
Deva Ramanan
James Hays
3DPC
153
1,298
0
06 Nov 2019
RoboNet: Large-Scale Multi-Robot Learning
Sudeep Dasari
F. Ebert
Stephen Tian
Suraj Nair
Bernadette Bucher
Karl Schmeckpeper
Siddharth Singh
Sergey Levine
Chelsea Finn
LM&Ro
97
304
0
24 Oct 2019
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
Tianhe Yu
Deirdre Quillen
Zhanpeng He
Ryan Julian
Avnish Narayan
Hayden Shively
Adithya Bellathur
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
266
1,182
0
24 Oct 2019
Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
Xue Bin Peng
Aviral Kumar
Grace Zhang
Sergey Levine
OffRL
157
570
0
01 Oct 2019
Search on the Replay Buffer: Bridging Planning and Reinforcement Learning
Benjamin Eysenbach
Ruslan Salakhutdinov
Sergey Levine
OffRL
86
293
0
12 Jun 2019
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning
Rui Zhao
Xudong Sun
Volker Tresp
65
83
0
21 May 2019
nuScenes: A multimodal dataset for autonomous driving
Holger Caesar
Varun Bankiti
Alex H. Lang
Sourabh Vora
Venice Erin Liong
Qiang Xu
Anush Krishnan
Yuxin Pan
G. Baldan
Oscar Beijbom
3DPC
306
5,790
0
26 Mar 2019
Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables
Kate Rakelly
Aurick Zhou
Deirdre Quillen
Chelsea Finn
Sergey Levine
OffRL
88
661
0
19 Mar 2019
Learning Latent Plans from Play
Corey Lynch
Mohi Khansari
Ted Xiao
Vikash Kumar
Jonathan Tompson
Sergey Levine
P. Sermanet
SSL, LM&Ro
111
406
0
05 Mar 2019
A Commute in Data: The comma2k19 Dataset
H. Schafer
Eder Santana
A. Haden
R. Biasini
3DV
66
74
0
14 Dec 2018
Soft Actor-Critic Algorithms and Applications
Tuomas Haarnoja
Aurick Zhou
Kristian Hartikainen
George Tucker
Sehoon Ha
...
Vikash Kumar
Henry Zhu
Abhishek Gupta
Pieter Abbeel
Sergey Levine
152
2,453
0
13 Dec 2018
Multi-task Deep Reinforcement Learning with PopArt
Matteo Hessel
Hubert Soyer
L. Espeholt
Wojciech M. Czarnecki
Simon Schmitt
H. V. Hasselt
144
320
0
12 Sep 2018
Self-Imitation Learning
Junhyuk Oh
Yijie Guo
Satinder Singh
Honglak Lee
SSL
76
251
0
14 Jun 2018
Maximum a Posteriori Policy Optimisation
A. Abdolmaleki
Jost Tobias Springenberg
Yuval Tassa
Rémi Munos
N. Heess
Martin Riedmiller
85
478
0
14 Jun 2018
Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
Sergey Levine
AI4CE, BDL
99
674
0
02 May 2018
Semi-parametric Topological Memory for Navigation
Nikolay Savinov
Alexey Dosovitskiy
V. Koltun
82
383
0
01 Mar 2018
Learning by Playing - Solving Sparse Reward Tasks from Scratch
Martin Riedmiller
Roland Hafner
Thomas Lampe
Michael Neunert
Jonas Degrave
T. Wiele
Volodymyr Mnih
N. Heess
Jost Tobias Springenberg
95
449
0
28 Feb 2018
Investigating Human Priors for Playing Video Games
Rachit Dubey
Pulkit Agrawal
Deepak Pathak
Thomas Griffiths
Alexei A. Efros
OffRL
120
146
0
28 Feb 2018
Temporal Difference Models: Model-Free Deep RL for Model-Based Control
Vitchyr H. Pong
S. Gu
Murtaza Dalal
Sergey Levine
OffRL
116
240
0
25 Feb 2018
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
L. Espeholt
Hubert Soyer
Rémi Munos
Karen Simonyan
Volodymyr Mnih
...
Vlad Firoiu
Tim Harley
Iain Dunning
Shane Legg
Koray Kavukcuoglu
249
1,609
0
05 Feb 2018
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja
Aurick Zhou
Pieter Abbeel
Sergey Levine
319
8,432
0
04 Jan 2018
DeepMind Control Suite
Yuval Tassa
Yotam Doron
Alistair Muldal
Tom Erez
Yazhe Li
...
A. Abdolmaleki
J. Merel
Andrew Lefrancq
Timothy Lillicrap
Martin Riedmiller
ELM, LM&Ro, BDL
150
1,144
0
02 Jan 2018
Divide-and-Conquer Reinforcement Learning
Dibya Ghosh
Avi Singh
Aravind Rajeswaran
Vikash Kumar
Sergey Levine
OffRL
102
127
0
27 Nov 2017
Inverse Reward Design
Dylan Hadfield-Menell
S. Milli
Pieter Abbeel
Stuart J. Russell
Anca Dragan
96
400
0
08 Nov 2017
Distral: Robust Multitask Reinforcement Learning
Yee Whye Teh
V. Bapst
Wojciech M. Czarnecki
John Quan
J. Kirkpatrick
R. Hadsell
N. Heess
Razvan Pascanu
184
553
0
13 Jul 2017
Revisiting Unreasonable Effectiveness of Data in Deep Learning Era
Chen Sun
Abhinav Shrivastava
Saurabh Singh
Abhinav Gupta
VLM
212
2,411
0
10 Jul 2017
Hindsight Experience Replay
Marcin Andrychowicz
Dwight Crow
Alex Ray
Jonas Schneider
Rachel Fong
Peter Welinder
Bob McGrew
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
OffRL
293
2,339
0
05 Jul 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
118
1,350
0
27 Feb 2017
Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision
Chen Liang
Jonathan Berant
Quoc V. Le
Kenneth D. Forbus
Ni Lao
NAI
122
406
0
31 Oct 2016
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization
Chelsea Finn
Sergey Levine
Pieter Abbeel
112
952
0
01 Mar 2016
Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning
Emilio Parisotto
Jimmy Lei Ba
Ruslan Salakhutdinov
OffRL
117
599
0
19 Nov 2015
Policy Distillation
Andrei A. Rusu
Sergio Gomez Colmenarejo
Çağlar Gülçehre
Guillaume Desjardins
J. Kirkpatrick
Razvan Pascanu
Volodymyr Mnih
Koray Kavukcuoglu
R. Hadsell
105
697
0
19 Nov 2015
Shared Autonomy via Hindsight Optimization
Shervin Javdani
S. Srinivasa
J. Andrew Bagnell
97
195
0
26 Mar 2015