ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2007.07443
  4. Cited By
Deep PQR: Solving Inverse Reinforcement Learning using Anchor Actions
v1v2 (latest)

Deep PQR: Solving Inverse Reinforcement Learning using Anchor Actions

15 July 2020
Sinong Geng
Houssam Nassif
Carlos A. Manzanares
A. M. Reppen
R. Sircar
ArXiv (abs)PDFHTML

Papers citing "Deep PQR: Solving Inverse Reinforcement Learning using Anchor Actions"

19 / 19 papers shown
Title
Temporal Poisson Square Root Graphical Models
Temporal Poisson Square Root Graphical Models
Sinong Geng
Zhaobin Kuang
P. Peissig
David Page
53
13
0
12 May 2020
Stochastic Learning for Sparse Discrete Markov Random Fields with
  Controlled Gradient Approximation Error
Stochastic Learning for Sparse Discrete Markov Random Fields with Controlled Gradient Approximation Error
Sinong Geng
Zhaobin Kuang
Jie Liu
S. Wright
David Page
32
13
0
12 May 2020
Ivy: Instrumental Variable Synthesis for Causal Inference
Ivy: Instrumental Variable Synthesis for Causal Inference
Zhaobin Kuang
Frederic Sala
N. Sohoni
Sen Wu
A. Córdova-Palomera
Jared A. Dunnmon
J. Priest
Christopher Ré
CML
43
27
0
11 Apr 2020
Learning Human Objectives by Evaluating Hypothetical Behavior
Learning Human Objectives by Evaluating Hypothetical Behavior
S. Reddy
Anca Dragan
Sergey Levine
Shane Legg
Jan Leike
68
75
0
05 Dec 2019
Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and
  Reinforcement Learning
Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning
Abhishek Gupta
Vikash Kumar
Corey Lynch
Sergey Levine
Karol Hausman
95
433
0
25 Oct 2019
SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards
SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards
S. Reddy
Anca Dragan
Sergey Levine
OffRL
61
52
0
27 May 2019
Fine-Grained Analysis of Optimization and Generalization for
  Overparameterized Two-Layer Neural Networks
Fine-Grained Analysis of Optimization and Generalization for Overparameterized Two-Layer Neural Networks
Sanjeev Arora
S. Du
Wei Hu
Zhiyuan Li
Ruosong Wang
MLT
223
974
0
24 Jan 2019
A Theoretical Analysis of Deep Q-Learning
A Theoretical Analysis of Deep Q-Learning
Jianqing Fan
Zhuoran Yang
Yuchen Xie
Zhaoran Wang
190
609
0
01 Jan 2019
ChauffeurNet: Learning to Drive by Imitating the Best and Synthesizing
  the Worst
ChauffeurNet: Learning to Drive by Imitating the Best and Synthesizing the Worst
Mayank Bansal
A. Krizhevsky
A. Ogale
OOD
100
742
0
07 Dec 2018
Scalable agent alignment via reward modeling: a research direction
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
124
420
0
19 Nov 2018
Gradient Descent Finds Global Minima of Deep Neural Networks
Gradient Descent Finds Global Minima of Deep Neural Networks
S. Du
Jason D. Lee
Haochuan Li
Liwei Wang
Masayoshi Tomizuka
ODL
231
1,136
0
09 Nov 2018
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward
  Bias in Adversarial Imitation Learning
Discriminator-Actor-Critic: Addressing Sample Inefficiency and Reward Bias in Adversarial Imitation Learning
Ilya Kostrikov
Kumar Krishna Agrawal
Debidatta Dwibedi
Sergey Levine
Jonathan Tompson
100
259
0
09 Sep 2018
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
Learning Robust Rewards with Adversarial Inverse Reinforcement Learning
Justin Fu
Katie Z Luo
Sergey Levine
131
757
0
30 Oct 2017
An Efficient Pseudo-likelihood Method for Sparse Binary Pairwise Markov
  Network Estimation
An Efficient Pseudo-likelihood Method for Sparse Binary Pairwise Markov Network Estimation
Sinong Geng
Zhaobin Kuang
David Page
141
11
0
27 Feb 2017
Reinforcement Learning with Deep Energy-Based Policies
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
118
1,348
0
27 Feb 2017
Generative Adversarial Imitation Learning
Generative Adversarial Imitation Learning
Jonathan Ho
Stefano Ermon
GAN
165
3,125
0
10 Jun 2016
Guided Cost Learning: Deep Inverse Optimal Control via Policy
  Optimization
Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization
Chelsea Finn
Sergey Levine
Pieter Abbeel
108
952
0
01 Mar 2016
Maximum Entropy Deep Inverse Reinforcement Learning
Maximum Entropy Deep Inverse Reinforcement Learning
Markus Wulfmeier
Peter Ondruska
Ingmar Posner
OOD
116
406
0
17 Jul 2015
A Reduction of Imitation Learning and Structured Prediction to No-Regret
  Online Learning
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
259
3,238
0
02 Nov 2010
1