Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2007.07443
Cited By
v1
v2 (latest)
Deep PQR: Solving Inverse Reinforcement Learning using Anchor Actions
15 July 2020
Sinong Geng
Houssam Nassif
Carlos A. Manzanares
A. M. Reppen
R. Sircar
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep PQR: Solving Inverse Reinforcement Learning using Anchor Actions"
19 / 19 papers shown
Title
Temporal Poisson Square Root Graphical Models
Sinong Geng
Zhaobin Kuang
P. Peissig
David Page
53
13
0
12 May 2020
Stochastic Learning for Sparse Discrete Markov Random Fields with Controlled Gradient Approximation Error
Sinong Geng
Zhaobin Kuang
Jie Liu
S. Wright
David Page
32
13
0
12 May 2020
Ivy: Instrumental Variable Synthesis for Causal Inference
Zhaobin Kuang
Frederic Sala
N. Sohoni
Sen Wu
A. Córdova-Palomera
Jared A. Dunnmon
J. Priest
Christopher Ré
CML
43
27
0
11 Apr 2020
Learning Human Objectives by Evaluating Hypothetical Behavior
S. Reddy
Anca Dragan
Sergey Levine
Shane Legg
Jan Leike
68
75
0
05 Dec 2019
Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning
Abhishek Gupta
Vikash Kumar
Corey Lynch
Sergey Levine
Karol Hausman
95
433
0
25 Oct 2019
SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards
S. Reddy
Anca Dragan
Sergey Levine
OffRL
61
52
0
27 May 2019
Fine-Grained Analysis of Optimization and Generalization for Overparameterized Two-Layer Neural Networks
Sanjeev Arora
S. Du
Wei Hu
Zhiyuan Li
Ruosong Wang
MLT
221
974
0
24 Jan 2019
A Theoretical Analysis of Deep Q-Learning
Jianqing Fan
Zhuoran Yang
Yuchen Xie
Zhaoran Wang
190
606
0
01 Jan 2019
ChauffeurNet: Learning to Drive by Imitating the Best and Synthesizing the Worst
Mayank Bansal
A. Krizhevsky
A. Ogale
OOD
100
742
0
07 Dec 2018
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
122
420
0
19 Nov 2018
Gradient Descent Finds Global Minima of Deep Neural Networks
S. Du
Jason D. Lee
Haochuan Li
Liwei Wang
Masayoshi Tomizuka
ODL
229
1,136
0
09 Nov 2018
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
Ilya Kostrikov
Kumar Krishna Agrawal
Debidatta Dwibedi
Sergey Levine
Jonathan Tompson
100
260
0
09 Sep 2018
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
Justin Fu
Katie Z Luo
Sergey Levine
131
757
0
30 Oct 2017
An Efficient Pseudo-likelihood Method for Sparse Binary Pairwise Markov Network Estimation
Sinong Geng
Zhaobin Kuang
David Page
139
11
0
27 Feb 2017
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
118
1,348
0
27 Feb 2017
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
165
3,125
0
10 Jun 2016
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization
Chelsea Finn
Sergey Levine
Pieter Abbeel
108
952
0
01 Mar 2016
Maximum Entropy Deep Inverse Reinforcement Learning
Markus Wulfmeier
Peter Ondruska
Ingmar Posner
OOD
116
406
0
17 Jul 2015
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
259
3,238
0
02 Nov 2010
1