Trust Region Policy Optimization

19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
arXiv:1502.05477

Papers citing "Trust Region Policy Optimization"

50 / 3,098 papers shown
Effects of Loss Functions And Target Representations on Adversarial Robustness
Sean Saito
S. Roy
AAML
19
7
0
01 Dec 2018
An Introduction to Deep Reinforcement Learning
Vincent François-Lavet
Peter Henderson
Riashat Islam
Marc G. Bellemare
Joelle Pineau
OffRL
AI4CE
88
1,236
0
30 Nov 2018
Neural probabilistic motor primitives for humanoid control
J. Merel
Leonard Hasenclever
Alexandre Galashov
Arun Ahuja
Vu Pham
Greg Wayne
Yee Whye Teh
N. Heess
31
156
0
28 Nov 2018
Understanding the impact of entropy on policy optimization
Zafarali Ahmed
Nicolas Le Roux
Mohammad Norouzi
Dale Schuurmans
28
226
0
27 Nov 2018
Connecting the Dots Between MLE and RL for Sequence Prediction
Bowen Tan
Zhiting Hu
Zichao Yang
Ruslan Salakhutdinov
Eric Xing
28
24
0
24 Nov 2018
Scalable agent alignment via reward modeling: a research direction
Jan Leike
David M. Krueger
Tom Everitt
Miljan Martic
Vishal Maini
Shane Legg
34
397
0
19 Nov 2018
Learning Actionable Representations with Goal-Conditioned Policies
Dibya Ghosh
Abhishek Gupta
Sergey Levine
32
109
0
19 Nov 2018
Policy Optimization with Model-based Explorations
Feiyang Pan
Qingpeng Cai
Anxiang Zeng
C. Pan
Qing Da
Hua-Lin He
Qing He
Pingzhong Tang
36
11
0
18 Nov 2018
Parameter Sharing Reinforcement Learning Architecture for Multi Agent Driving Behaviors
Meha Kaushik
S. Phaniteja
K. M. Krishna
AI4CE
25
11
0
17 Nov 2018
An Algorithmic Perspective on Imitation Learning
Takayuki Osa
Joni Pajarinen
Gerhard Neumann
J. Andrew Bagnell
Pieter Abbeel
Jan Peters
50
830
0
16 Nov 2018
Reward-estimation variance elimination in sequential decision processes
S. Pankov
19
5
0
15 Nov 2018
Intervention Aided Reinforcement Learning for Safe and Practical Policy Optimization in Navigation
Fan Wang
Bo Zhou
Ke Chen
Tingxiang Fan
Xi Zhang
Jiangyong Li
Hao Tian
Jia Pan
19
26
0
15 Nov 2018
Natural Environment Benchmarks for Reinforcement Learning
Amy Zhang
Yuxin Wu
Joelle Pineau
OffRL
OOD
28
69
0
14 Nov 2018
Importance Weighted Evolution Strategies
Victor Campos
Xavier Giró-i-Nieto
Jordi Torres
27
1
0
12 Nov 2018
Learning from Demonstration in the Wild
Bertrand Higy
K. Shiarlis
Xi Chen
Vitaly Kurin
Sudhanshu Kasewa
...
João Gomes
Supratik Paul
F. Oliehoek
João Messias
Shimon Whiteson
27
76
0
08 Nov 2018
Meta-Learning for Multi-objective Reinforcement Learning
Xi Chen
Ali Ghadirzadeh
Mårten Björkman
Pablo G. Cámara
OffRL
23
54
0
08 Nov 2018
Correlation Filter Selection for Visual Tracking Using Reinforcement Learning
Yanchun Xie
Jimin Xiao
Hassan Jameel Asghar
Jeyarajan Thiyagalingam
Dali Kaafar
18
21
0
08 Nov 2018
Deep Reinforcement Learning via L-BFGS Optimization
Chris Paxton
Roummel F. Marcia
OffRL
21
0
0
06 Nov 2018
A Closer Look at Deep Policy Gradients
Andrew Ilyas
Logan Engstrom
Shibani Santurkar
Dimitris Tsipras
Firdaus Janoos
Larry Rudolph
Aleksander Madry
30
50
0
06 Nov 2018
Managing engineering systems with large state and action spaces through deep reinforcement learning
Varun Chandrasekaran
K. Papakonstantinou
AI4CE
18
161
0
05 Nov 2018
Learning to Defend by Learning to Attack
Haoming Jiang
Zhehui Chen
Yuyang Shi
Bo Dai
T. Zhao
21
22
0
03 Nov 2018
VIREL: A Variational Inference Framework for Reinforcement Learning
M. Fellows
Anuj Mahajan
Tim G. J. Rudner
Shimon Whiteson
DRL
38
54
0
03 Nov 2018
Horizon: Facebook's Open Source Applied Reinforcement Learning Platform
J. Gauci
Edoardo Conti
Yitao Liang
Kittipat Virochsiri
Yuchen He
Zachary Kaden
Vivek Narayanan
Xiaohui Ye
Zhengxing Chen
Scott Fujimoto
30
139
0
01 Nov 2018
Differentiable MPC for End-to-end Planning and Control
Brandon Amos
I. D. Rodriguez
Jacob Sacks
Byron Boots
J. Zico Kolter
30
366
0
31 Oct 2018
Relative Importance Sampling For Off-Policy Actor-Critic in Deep Reinforcement Learning
Mahammad Humayoo
Xueqi Cheng
BDL
OffRL
21
5
0
30 Oct 2018
Assessing Generalization in Deep Reinforcement Learning
Charles Packer
Katelyn Gao
Jernej Kos
Philipp Krahenbuhl
V. Koltun
D. Song
OffRL
18
233
0
29 Oct 2018
Watch the Unobserved: A Simple Approach to Parallelizing Monte Carlo Tree Search
Hoang Trung-Dung
Jianshu Chen
Mingze Yu
Yu Zhai
Xuewen Zhou
Ji Liu
17
30
0
28 Oct 2018
Learning and Management for Internet-of-Things: Accounting for Adaptivity and Scalability
Tianyi Chen
Sergio Barbarossa
Xin Wang
G. Giannakis
Zhi-Li Zhang
9
79
0
27 Oct 2018
Stability-certified reinforcement learning: A control-theoretic perspective
Ming Jin
Javad Lavaei
33
85
0
26 Oct 2018
Differential Variable Speed Limits Control for Freeway Recurrent Bottlenecks via Deep Reinforcement learning
Yuankai Wu
Huachun Tan
B. Ran
AI4CE
29
17
0
25 Oct 2018
Sample-Efficient Learning of Nonprehensile Manipulation Policies via Physics-Based Informed State Distributions
Lerrel Pinto
Aditya Mandalika
Brian Hou
S. Srinivasa
33
13
0
24 Oct 2018
Inverse reinforcement learning for video games
Aaron David Tucker
Adam Gleave
Stuart J. Russell
21
48
0
24 Oct 2018
Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks
Michelle A. Lee
Yuke Zhu
K. Srinivasan
Parth Shah
Silvio Savarese
Li Fei-Fei
Animesh Garg
Jeannette Bohg
SSL
35
368
0
24 Oct 2018
Reconciling λ-Returns with Experience Replay
Brett Daley
Chris Amato
24
4
0
23 Oct 2018
The Faults in Our Pi Stars: Security Issues and Open Challenges in Deep Reinforcement Learning
Vahid Behzadan
Arslan Munir
27
27
0
23 Oct 2018
Hierarchical Approaches for Reinforcement Learning in Parameterized Action Space
E. Wei
Drew Wicke
S. Luke
BDL
30
35
0
23 Oct 2018
Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement
Samuel Neumann
Sungsu Lim
A. Joseph
Yangchen Pan
Adam White
Martha White
28
7
0
22 Oct 2018
Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
S. Srinivasan
Marc Lanctot
V. Zambaldi
Julien Perolat
K. Tuyls
Rémi Munos
Michael Bowling
13
148
0
21 Oct 2018
First-order and second-order variants of the gradient descent in a unified framework
Thomas Pierrot
Nicolas Perrin
Olivier Sigaud
ODL
30
7
0
18 Oct 2018
Policy Gradient in Partially Observable Environments: Approximation and Convergence
Kamyar Azizzadenesheli
Manish Kumar Bera
Anima Anandkumar
OffRL
30
8
0
18 Oct 2018
Security Matters: A Survey on Adversarial Machine Learning
Guofu Li
Pengjia Zhu
Jin Li
Zhemin Yang
Ning Cao
Zhiyi Chen
AAML
26
24
0
16 Oct 2018
ProMP: Proximal Meta-Policy Search
Jonas Rothfuss
Dennis Lee
I. Clavera
Tamim Asfour
Pieter Abbeel
35
209
0
16 Oct 2018
Predictor-Corrector Policy Optimization
Ching-An Cheng
Xinyan Yan
Nathan D. Ratliff
Byron Boots
OnRL
18
23
0
15 Oct 2018
Deep Reinforcement Learning
Yuxi Li
VLM
OffRL
28
144
0
15 Oct 2018
Dexterous Manipulation with Deep Reinforcement Learning: Efficient, General, and Low-Cost
Henry Zhu
Abhishek Gupta
Aravind Rajeswaran
Sergey Levine
Vikash Kumar
OffRL
29
196
0
14 Oct 2018
Policy Transfer with Strategy Optimization
Wenhao Yu
Chenxi Liu
Greg Turk
38
80
0
12 Oct 2018
A Survey and Critique of Multiagent Deep Reinforcement Learning
Pablo Hernandez-Leal
Bilal Kartal
Matthew E. Taylor
OffRL
48
553
0
12 Oct 2018
Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space
Jiechao Xiong
Qing Wang
Zhuoran Yang
Peng Sun
Lei Han
Yang Zheng
Haobo Fu
Tong Zhang
Ji Liu
Han Liu
37
169
0
10 Oct 2018
Reinforcement Learning for Improving Agent Design
David R Ha
40
124
0
09 Oct 2018
Fast Context Adaptation via Meta-Learning
L. Zintgraf
K. Shiarlis
Vitaly Kurin
Katja Hofmann
Shimon Whiteson
22
37
0
08 Oct 2018