Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.05477
Cited By
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Trust Region Policy Optimization"
50 / 3,098 papers shown
Title
Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence Beyond the Minty Property
Ioannis Anagnostides
Ioannis Panageas
Gabriele Farina
T. Sandholm
43
3
0
19 Dec 2023
PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping
Nai-Chieh Huang
Ping-Chun Hsieh
Kuo-Hao Ho
I-Chen Wu
29
8
0
19 Dec 2023
Exploring Gradient Explosion in Generative Adversarial Imitation Learning: A Probabilistic Perspective
Wanying Wang
Yichen Zhu
Yirui Zhou
Yaxin Peng
Jian Tang
Zhiyuan Xu
Chaomin Shen
Yangchun Zhang
34
4
0
18 Dec 2023
Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling
Jakob J. Hollenstein
Georg Martius
J. Piater
22
3
0
18 Dec 2023
Multi-Agent Reinforcement Learning for Connected and Automated Vehicles Control: Recent Advancements and Future Prospects
Min Hua
Dong Chen
Xinda Qi
Kun Jiang
Z. Liu
Quan Zhou
Hongming Xu
28
10
0
18 Dec 2023
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-Trajectory Reward
Hao-Chu Lin
Hongqiu Wu
Jiaji Zhang
Yihao Sun
Junyin Ye
Yang Yu
27
2
0
17 Dec 2023
Constrained Meta-Reinforcement Learning for Adaptable Safety Guarantee with Differentiable Convex Programming
Minjae Cho
Chuangchuang Sun
35
3
0
15 Dec 2023
Multi-Objective Reinforcement Learning-based Approach for Pressurized Water Reactor Optimization
Paul Seurin
K. Shirvan
24
10
0
15 Dec 2023
Communication-Efficient Soft Actor-Critic Policy Collaboration via Regulated Segment Mixture in Internet of Vehicles
Xiaoxue Yu
Rongpeng Li
Chengchao Liang
Zhifeng Zhao
33
0
0
15 Dec 2023
Gradient Informed Proximal Policy Optimization
Sanghyun Son
L. Zheng
Ryan Sullivan
Yi-Ling Qiao
Ming-Chyuan Lin
37
7
0
14 Dec 2023
World Models via Policy-Guided Trajectory Diffusion
Marc Rigter
Jun Yamada
Ingmar Posner
34
19
0
13 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
80
5
0
13 Dec 2023
A dynamical clipping approach with task feedback for Proximal Policy Optimization
Ziqi Zhang
Jingzehua Xu
Zifeng Zhuang
Jinxin Liu
Donglin Wang
Shuai Zhang
24
1
0
12 Dec 2023
DiffAIL: Diffusion Adversarial Imitation Learning
Bingzheng Wang
Guoqiang Wu
Teng Pang
Yan Zhang
Yilong Yin
27
11
0
11 Dec 2023
DCIR: Dynamic Consistency Intrinsic Reward for Multi-Agent Reinforcement Learning
Kun-Li Channing Lin
Yufeng Wang
Peihao Chen
Runhao Zeng
Siyuan Zhou
Mingkui Tan
Chuang Gan
AI4CE
34
0
0
10 Dec 2023
The Generalization Gap in Offline Reinforcement Learning
Ishita Mediratta
Qingfei You
Minqi Jiang
Roberta Raileanu
OffRL
92
10
0
10 Dec 2023
On Task-Relevant Loss Functions in Meta-Reinforcement Learning and Online LQR
Jaeuk Shin
Giho Kim
Howon Lee
Joonho Han
Insoon Yang
OffRL
41
1
0
09 Dec 2023
Guaranteed Trust Region Optimization via Two-Phase KL Penalization
K.R. Zentner
Ujjwal Puri
Zhehui Huang
Gaurav Sukhatme
OffRL
24
0
0
08 Dec 2023
A Review of Cooperation in Multi-agent Learning
Yali Du
Joel Z Leibo
Usman Islam
Richard Willis
P. Sunehag
43
31
0
08 Dec 2023
MIMo: A Multi-Modal Infant Model for Studying Cognitive Development
Dominik Mattern
Pierre Schumacher
F. M. López
Marcel C. Raabe
M. Ernst
A. Aubret
Jochen Triesch
31
4
0
07 Dec 2023
Language Model Alignment with Elastic Reset
Michael Noukhovitch
Samuel Lavoie
Florian Strub
Aaron Courville
KELM
100
25
0
06 Dec 2023
DanZero+: Dominating the GuanDan Game through Reinforcement Learning
Youpeng Zhao
Yudong Lu
Jian Zhao
Wen-gang Zhou
Houqiang Li
41
6
0
05 Dec 2023
Modular Control Architecture for Safe Marine Navigation: Reinforcement Learning and Predictive Safety Filters
Aksel Vaaler
Svein Jostein Husa
Daniel Menges
T. N. Larsen
Adil Rasheed
24
2
0
04 Dec 2023
LLM A*: Human in the Loop Large Language Models Enabled A* Search for Robotics
Hengjia Xiao
Peng Wang
Mingzhe Yu
Mattia Robbiani
29
21
0
04 Dec 2023
Safe Reinforcement Learning in Tensor Reproducing Kernel Hilbert Space
Xiaoyuan Cheng
Boli Chen
Liz Varga
Yukun Hu
28
0
0
01 Dec 2023
TRC: Trust Region Conditional Value at Risk for Safe Reinforcement Learning
Dohyeong Kim
Songhwai Oh
22
19
0
01 Dec 2023
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk
Dohyeong Kim
Songhwai Oh
OffRL
27
19
0
01 Dec 2023
Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning Algorithms
Xiangyuan Zhang
Weichao Mao
S. Mowlavi
M. Benosman
Tamer Basar
OffRL
AI4CE
24
2
0
30 Nov 2023
Handling Cost and Constraints with Off-Policy Deep Reinforcement Learning
Jared Markowitz
Jesse Silverberg
Gary Collins
OffRL
23
0
0
30 Nov 2023
Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control
Bernd Frauenknecht
Tobias Ehlgen
Sebastian Trimpe
42
3
0
30 Nov 2023
Deep Reinforcement Learning Graphs: Feedback Motion Planning via Neural Lyapunov Verification
A. Ghanbarzadeh
Esmaeil Najafi
14
0
0
29 Nov 2023
Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
Yijun Yang
Tianyi Zhou
Kanxue Li
Dapeng Tao
Lusong Li
Li Shen
Xiaodong He
Jing Jiang
Yuhui Shi
LLMAG
LM&Ro
35
35
0
28 Nov 2023
Temporal Transfer Learning for Traffic Optimization with Coarse-grained Advisory Autonomy
Jung-Hoon Cho
Sirui Li
Jeongyun Kim
Cathy Wu
29
3
0
27 Nov 2023
Interactive Autonomous Navigation with Internal State Inference and Interactivity Estimation
Jiachen Li
David Isele
Kanghoon Lee
Jinkyoo Park
K. Fujimura
Mykel J. Kochenderfer
36
7
0
27 Nov 2023
Networked Multiagent Safe Reinforcement Learning for Low-carbon Demand Management in Distribution Network
Jichen Zhang
Linwei Sang
Yinliang Xu
Hongbin Sun
13
13
0
27 Nov 2023
Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning
Melrose Roderick
Gaurav Manek
Felix Berkenkamp
J. Zico Kolter
OffRL
OnRL
36
0
0
25 Nov 2023
Curriculum Learning and Imitation Learning for Model-free Control on Financial Time-series
Woosung Koh
Insu Choi
Yuntae Jang
Gimin Kang
Woo Chang Kim
33
1
0
22 Nov 2023
Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts
Ahmed Hendawy
Jan Peters
Carlo DÉramo
MoE
33
15
0
19 Nov 2023
Towards a Standardized Reinforcement Learning Framework for AAM Contingency Management
Luis E. Alvarez
Marc W. Brittain
Kara Breeden
16
2
0
17 Nov 2023
Asymptotically Fair Participation in Machine Learning Models: an Optimal Control Perspective
Zhuotong Chen
Qianxiao Li
Zheng Zhang
FaML
17
1
0
16 Nov 2023
Supported Trust Region Optimization for Offline Reinforcement Learning
Yongyi Mao
Hongchang Zhang
Chong Chen
Yi Tian Xu
Xiangyang Ji
OffRL
45
14
0
15 Nov 2023
Efficiently Escaping Saddle Points for Non-Convex Policy Optimization
Sadegh Khorasani
Saber Salehkaleybar
Negar Kiyavash
Niao He
Matthias Grossglauser
29
1
0
15 Nov 2023
Joint User Pairing and Beamforming Design of Multi-STAR-RISs-Aided NOMA in the Indoor Environment via Multi-Agent Reinforcement Learning
Y. Park
Y. Tun
Choong Seon Hong
16
1
0
15 Nov 2023
Adversarial Imitation Learning On Aggregated Data
Pierre Le Pelletier de Woillemont
Rémi Labory
Vincent Corruble
19
0
0
14 Nov 2023
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees
Yifei Zhou
Ayush Sekhari
Yuda Song
Wen Sun
OffRL
OnRL
30
8
0
14 Nov 2023
On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling
Nicholas Corrado
Josiah P. Hanna
OffRL
20
1
0
14 Nov 2023
Model-assisted Reinforcement Learning of a Quadrotor
Arshad Javeed
16
0
0
12 Nov 2023
Clipped-Objective Policy Gradients for Pessimistic Policy Optimization
Jared Markowitz
Edward W. Staley
OffRL
21
2
0
10 Nov 2023
Two Complementary Perspectives to Continual Learning: Ask Not Only What to Optimize, But Also How
Timm Hess
Tinne Tuytelaars
Gido M. van de Ven
46
7
0
08 Nov 2023
A Method to Improve the Performance of Reinforcement Learning Based on the Y Operator for a Class of Stochastic Differential Equation-Based Child-Mother Systems
Cheng Yin
Yi Chen
17
0
0
07 Nov 2023
Previous
1
2
3
...
9
10
11
...
60
61
62
Next