Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.05477
Cited By
v1
v2
v3
v4
v5 (latest)
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Trust Region Policy Optimization"
50 / 2,008 papers shown
Title
Deep Reinforcement Learning in Surgical Robotics: Enhancing the Automation Level
Cheng Qian
Hongliang Ren
83
4
0
02 Sep 2023
End-to-end Lidar-Driven Reinforcement Learning for Autonomous Racing
Meraj Mammadov
47
3
0
01 Sep 2023
R^3: On-device Real-Time Deep Reinforcement Learning for Autonomous Robotics
Zexin Li
Aritra Samanta
Yufei Li
Andrea Soltoggio
Hyoseung Kim
Cong Liu
115
7
0
29 Aug 2023
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
211
13
0
28 Aug 2023
Learn With Imagination: Safe Set Guided State-wise Constrained Policy Optimization
Weiye Zhao
Yifan Sun
Fei Li
Rui Chen
Tianhao Wei
Changliu Liu
124
6
0
25 Aug 2023
Prompt-Based Length Controlled Generation with Reinforcement Learning
Renlong Jie
Xiaojun Meng
Lifeng Shang
Xin Jiang
Qun Liu
89
11
0
23 Aug 2023
ILCAS: Imitation Learning-Based Configuration-Adaptive Streaming for Live Video Analytics with Cross-Camera Collaboration
Duo Wu
Dayou Zhang
Miao Zhang
Ruoyu Zhang
Fang Wang
Shuguang Cui
65
9
0
19 Aug 2023
Fast Decision Support for Air Traffic Management at Urban Air Mobility Vertiports using Graph Learning
Prajit KrisshnaKumar
Jhoel Witter
Steve Paul
Han-Seon Cho
Karthik Dantu
Souma Chowdhury
53
3
0
17 Aug 2023
Reinforcement Learning for Financial Index Tracking
X. Peng
Chen Gong
X. He
103
1
0
05 Aug 2023
Controlling the Solo12 Quadruped Robot with Deep Reinforcement Learning
M. Aractingi
Pierre-Alexandre Léziart
He Cao
Julien Perez
Yuan Yao
Philippe Souères
92
34
0
02 Aug 2023
PeRP: Personalized Residual Policies For Congestion Mitigation Through Co-operative Advisory Systems
Aamir Hasan
Neeloy Chakraborty
Haonan Chen
Jung-Hoon Cho
Cathy Wu
Katherine Driggs-Campbell
91
6
0
01 Aug 2023
Curiosity-Driven Reinforcement Learning based Low-Level Flight Control
Amir Ramezani Dooraki
Alexandros Iosifidis
36
0
0
28 Jul 2023
Counterfactual Explanation Policies in RL
Shripad Deshmukh
R Srivatsan
Supriti Vijay
Jayakumar Subramanian
Chirag Agarwal
OffRL
63
0
0
25 Jul 2023
Exploring reinforcement learning techniques for discrete and continuous control tasks in the MuJoCo environment
Vaddadi Sai Rahul
Debajyoti Chakraborty
16
2
0
20 Jul 2023
Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models
T. Westenbroek
Jacob Levy
David Fridovich-Keil
79
0
0
16 Jul 2023
Probabilistic Constraint for Safety-Critical Reinforcement Learning
Weiqin Chen
D. Subramanian
Santiago Paternain
87
15
0
29 Jun 2023
SARC: Soft Actor Retrospective Critic
Sukriti Verma
Ayush Chopra
J. Subramanian
Mausoom Sarkar
Nikaash Puri
Piyush B. Gupta
Balaji Krishnamurthy
48
0
0
28 Jun 2023
Rethinking Closed-loop Training for Autonomous Driving
Chris Zhang
R. Guo
Wenyuan Zeng
Yuwen Xiong
Binbin Dai
Rui Hu
Mengye Ren
R. Urtasun
OffRL
103
30
0
27 Jun 2023
Towards Optimal Pricing of Demand Response -- A Nonparametric Constrained Policy Optimization Approach
Jun Song
Chaoyue Zhao
OffRL
28
0
0
24 Jun 2023
Comprehensive Training and Evaluation on Deep Reinforcement Learning for Automated Driving in Various Simulated Driving Maneuvers
Yongqi Dong
Tobias Datema
Vincent Wassenaar
Joris van de Weg
Cahit Tolga Kopar
Harim Suleman
OffRL
73
1
0
20 Jun 2023
Cooperative Multi-Agent Learning for Navigation via Structured State Abstraction
Mohamed K. Abdel-Aziz
Mohammed S. Elbamby
S. Samarakoon
M. Bennis
69
5
0
20 Jun 2023
AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents
Timothée Mathieu
R. D. Vecchia
Alena Shilova
M. Centa
Hector Kohler
Odalric-Ambrym Maillard
Philippe Preux
51
0
0
19 Jun 2023
Optimal Execution Using Reinforcement Learning
Cong Zheng
Jiafa He
Can Yang
28
0
0
19 Jun 2023
Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork
Yonggang Jin
Chenxu Wang
Tianyu Zheng
Liuyu Xiang
Yao-Chun Yang
Junge Zhang
Jie Fu
Zhaofeng He
3DH
103
0
0
19 Jun 2023
Robust Reinforcement Learning through Efficient Adversarial Herding
Juncheng Dong
Hao-Lun Hsu
Qitong Gao
Vahid Tarokh
Miroslav Pajic
88
4
0
12 Jun 2023
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach
Dong-hwan Lee
83
2
0
09 Jun 2023
For SALE: State-Action Representation Learning for Deep Reinforcement Learning
Scott Fujimoto
Wei-Di Chang
Edward James Smith
S. Gu
Doina Precup
David Meger
OffRL
109
55
0
04 Jun 2023
PAGAR: Taming Reward Misalignment in Inverse Reinforcement Learning-Based Imitation Learning with Protagonist Antagonist Guided Adversarial Reward
Weichao Zhou
Wenchao Li
63
0
0
02 Jun 2023
Deep Q-Learning versus Proximal Policy Optimization: Performance Comparison in a Material Sorting Task
Reuf Kozlica
S. Wegenkittl
Simon Hirlaender
OffRL
41
4
0
02 Jun 2023
Progressive Learning for Physics-informed Neural Motion Planning
Ruiqi Ni
A. H. Qureshi
105
12
0
01 Jun 2023
Learning for Edge-Weighted Online Bipartite Matching with Robustness Guarantees
Pengfei Li
Jianyi Yang
Shaolei Ren
OffRL
82
4
0
31 May 2023
Representation-Driven Reinforcement Learning
Ofir Nabati
Guy Tennenholtz
Shie Mannor
116
1
0
31 May 2023
On the Linear Convergence of Policy Gradient under Hadamard Parameterization
Jiacai Liu
Jinchi Chen
Ke Wei
65
3
0
31 May 2023
Deep Reinforcement Learning with Plasticity Injection
Evgenii Nikishin
Junhyuk Oh
Georg Ostrovski
Clare Lyle
Razvan Pascanu
Will Dabney
André Barreto
OffRL
71
52
0
24 May 2023
Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees
Sharan Vaswani
A. Kazemi
Reza Babanezhad
Nicolas Le Roux
OffRL
90
4
0
24 May 2023
Combining Multi-Objective Bayesian Optimization with Reinforcement Learning for TinyML
M. Deutel
G. Kontes
Christopher Mutschler
Jürgen Teich
214
0
0
23 May 2023
Proximal Policy Gradient Arborescence for Quality Diversity Reinforcement Learning
Sumeet Batra
Bryon Tjanaka
Matthew C. Fontaine
Aleksei Petrenko
Stefanos Nikolaidis
Gaurav Sukhatme
OffRL
103
17
0
23 May 2023
Wasserstein Gradient Flows for Optimizing Gaussian Mixture Policies
Hanna Ziesche
Leonel Rozo
82
6
0
17 May 2023
Mixture of personality improved Spiking actor network for efficient multi-agent cooperation
Xiyun Li
Ziyi Ni
Jingqing Ruan
Linghui Meng
Jing Shi
Tielin Zhang
Bo Xu
100
4
0
10 May 2023
Truncating Trajectories in Monte Carlo Reinforcement Learning
Riccardo Poiani
Alberto Maria Metelli
Marcello Restelli
62
5
0
07 May 2023
Federated Ensemble-Directed Offline Reinforcement Learning
Desik Rengarajan
N. Ragothaman
D. Kalathil
S. Shakkottai
OffRL
93
1
0
04 May 2023
Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Tuomas Haarnoja
Ben Moran
Guy Lever
Sandy H. Huang
Dhruva Tirumala
...
Andrea Huber
N. Hurley
F. Nori
R. Hadsell
N. Heess
141
153
0
26 Apr 2023
System III: Learning with Domain Knowledge for Safety Constraints
Fazl Barez
Hosien Hasanbieg
Alesandro Abbate
68
4
0
23 Apr 2023
Variance-Reduced Gradient Estimation via Noise-Reuse in Online Evolution Strategies
Oscar Li
James Harrison
Jascha Narain Sohl-Dickstein
Virginia Smith
Luke Metz
114
6
0
21 Apr 2023
Heterogeneous-Agent Reinforcement Learning
Yifan Zhong
J. Kuba
Xidong Feng
Siyi Hu
Jiaming Ji
Yaodong Yang
84
47
0
19 Apr 2023
Cooperative Multi-Agent Reinforcement Learning for Inventory Management
Madhav Khirwar
Karthik S. Gurumoorthy
Ankit Jain
Shantala Manchenahally
61
4
0
18 Apr 2023
Searching for ribbons with machine learning
Sergei Gukov
James Halverson
Ciprian Manolescu
Fabian Ruehle
97
13
0
18 Apr 2023
Experimentation Platforms Meet Reinforcement Learning: Bayesian Sequential Decision-Making for Continuous Monitoring
Runzhe Wan
Yu Liu
James McQueen
Doug Hains
Rui Song
OffRL
65
6
0
02 Apr 2023
Personalizing Task-oriented Dialog Systems via Zero-shot Generalizable Reward Function
A.B. Siddique
M. H. Maqbool
Kshitija Taywade
H. Foroosh
68
12
0
24 Mar 2023
Boosting Reinforcement Learning and Planning with Demonstrations: A Survey
Tongzhou Mu
H. Su
OffRL
81
1
0
23 Mar 2023
Previous
1
2
3
4
5
6
...
39
40
41
Next