ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.05477
  4. Cited By
Trust Region Policy Optimization
v1v2v3v4v5 (latest)

Trust Region Policy Optimization

19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
ArXiv (abs)PDFHTML

Papers citing "Trust Region Policy Optimization"

50 / 2,008 papers shown
Title
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
Shangding Gu
Bilgehan Sel
Yuhao Ding
Lu Wang
Qingwei Lin
Ming Jin
Alois Knoll
106
10
0
02 May 2024
No Representation, No Trust: Connecting Representation, Collapse, and
  Trust Issues in PPO
No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO
Skander Moalla
Andrea Miele
Razvan Pascanu
Çağlar Gülçehre
95
6
0
01 May 2024
Deep Reinforcement Learning for Bipedal Locomotion: A Brief Survey
Deep Reinforcement Learning for Bipedal Locomotion: A Brief Survey
Lingfan Bao
Josephine N. Humphreys
Tianhu Peng
Chengxu Zhou
134
9
0
25 Apr 2024
ViViDex: Learning Vision-based Dexterous Manipulation from Human Videos
ViViDex: Learning Vision-based Dexterous Manipulation from Human Videos
Zerui Chen
Shizhe Chen
Cordelia Schmid
Ivan Laptev
Cordelia Schmid
121
16
0
24 Apr 2024
Self-playing Adversarial Language Game Enhances LLM Reasoning
Self-playing Adversarial Language Game Enhances LLM Reasoning
Pengyu Cheng
Tianhao Hu
Han Xu
Zhisong Zhang
Yong Dai
Lei Han
Nan Du
Nan Du
Xiaolong Li
SyDaLRMReLM
195
38
0
16 Apr 2024
Learn Your Reference Model for Real Good Alignment
Learn Your Reference Model for Real Good Alignment
Alexey Gorbatovski
Boris Shaposhnikov
Alexey Malakhov
Nikita Surnachev
Yaroslav Aksenov
Ian Maksimov
Nikita Balagansky
Daniil Gavrilov
OffRL
138
35
0
15 Apr 2024
Graph Reinforcement Learning for Combinatorial Optimization: A Survey
  and Unifying Perspective
Graph Reinforcement Learning for Combinatorial Optimization: A Survey and Unifying Perspective
Victor-Alexandru Darvariu
Stephen Hailes
Mirco Musolesi
AI4CE
124
8
0
09 Apr 2024
Heterogeneous Multi-Agent Reinforcement Learning for Zero-Shot Scalable
  Collaboration
Heterogeneous Multi-Agent Reinforcement Learning for Zero-Shot Scalable Collaboration
Xudong Guo
Daming Shi
Junjie Yu
Wenhui Fan
90
5
0
05 Apr 2024
Safe and Robust Reinforcement Learning: Principles and Practice
Safe and Robust Reinforcement Learning: Principles and Practice
Taku Yamagata
Raúl Santos-Rodríguez
OffRL
101
2
0
27 Mar 2024
Sequential Decision-Making for Inline Text Autocomplete
Sequential Decision-Making for Inline Text Autocomplete
Rohan Chitnis
Shentao Yang
A. Geramifard
LRM
104
1
0
21 Mar 2024
Scheduling Drone and Mobile Charger via Hybrid-Action Deep Reinforcement
  Learning
Scheduling Drone and Mobile Charger via Hybrid-Action Deep Reinforcement Learning
Jizhe Dou
Haotian Zhang
Guodong Sun
91
0
0
16 Mar 2024
Learning to Watermark LLM-generated Text via Reinforcement Learning
Learning to Watermark LLM-generated Text via Reinforcement Learning
Xiaojun Xu
Yuanshun Yao
Yang Liu
101
14
0
13 Mar 2024
Constrained Optimal Fuel Consumption of HEV: A Constrained Reinforcement
  Learning Approach
Constrained Optimal Fuel Consumption of HEV: A Constrained Reinforcement Learning Approach
Shuchang Yan
45
1
0
12 Mar 2024
Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate
Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate
Yifan Lin
Yuhao Wang
Enlu Zhou
146
0
0
01 Mar 2024
Performance Improvement Bounds for Lipschitz Configurable Markov
  Decision Processes
Performance Improvement Bounds for Lipschitz Configurable Markov Decision Processes
Alberto Maria Metelli
54
0
0
21 Feb 2024
Conservative and Risk-Aware Offline Multi-Agent Reinforcement Learning
  for Digital Twins
Conservative and Risk-Aware Offline Multi-Agent Reinforcement Learning for Digital Twins
Eslam Eldeeb
Houssem Sifaou
Osvaldo Simeone
M. Shehab
Hirley Alves
OffRL
110
4
0
13 Feb 2024
Off-policy Distributional Q($λ$): Distributional RL without
  Importance Sampling
Off-policy Distributional Q(λλλ): Distributional RL without Importance Sampling
Yunhao Tang
Mark Rowland
Rémi Munos
Bernardo Avila-Pires
Will Dabney
OffRL
60
1
0
08 Feb 2024
Assessing the Impact of Distribution Shift on Reinforcement Learning
  Performance
Assessing the Impact of Distribution Shift on Reinforcement Learning Performance
Ted Fujimoto
Joshua Suetterlein
Samrat Chatterjee
A. Ganguly
OffRL
86
4
0
05 Feb 2024
Regularized Q-Learning with Linear Function Approximation
Regularized Q-Learning with Linear Function Approximation
Jiachen Xi
Alfredo Garcia
P. Momcilovic
120
2
0
26 Jan 2024
Multi-agent deep reinforcement learning with centralized training and
  decentralized execution for transportation infrastructure management
Multi-agent deep reinforcement learning with centralized training and decentralized execution for transportation infrastructure management
M. Saifullah
K. G. Papakonstantinou
C. Andriotis
S. M. Stoffels
AI4CE
103
2
0
23 Jan 2024
AgentMixer: Multi-Agent Correlated Policy Factorization
AgentMixer: Multi-Agent Correlated Policy Factorization
Zhiyuan Li
Wenshuai Zhao
Lijun Wu
Joni Pajarinen
OffRL
84
2
0
16 Jan 2024
Personalized Reinforcement Learning with a Budget of Policies
Personalized Reinforcement Learning with a Budget of Policies
Dmitry Ivanov
Omer Ben-Porat
OffRL
23
2
0
12 Jan 2024
Physics-Informed Multi-Agent Reinforcement Learning for Distributed Multi-Robot Problems
Physics-Informed Multi-Agent Reinforcement Learning for Distributed Multi-Robot Problems
Eduardo Sebastián
T. Duong
Nikolay Atanasov
Eduardo Montijano
C. Sagüés
141
3
0
30 Dec 2023
Agnostic Interactive Imitation Learning: New Theory and Practical
  Algorithms
Agnostic Interactive Imitation Learning: New Theory and Practical Algorithms
Yichen Li
Chicheng Zhang
OffRL
84
0
0
28 Dec 2023
Adaptive trajectory-constrained exploration strategy for deep
  reinforcement learning
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
Guojian Wang
Faguo Wu
Xiao Zhang
Ning Guo
Zhiming Zheng
75
3
0
27 Dec 2023
Risk-Sensitive Stochastic Optimal Control as Rao-Blackwellized Markovian
  Score Climbing
Risk-Sensitive Stochastic Optimal Control as Rao-Blackwellized Markovian Score Climbing
Hany Abdulsamad
Sahel Iqbal
Adrien Corenflos
Simo Särkkä
94
2
0
21 Dec 2023
Multi-Agent Reinforcement Learning for Connected and Automated Vehicles
  Control: Recent Advancements and Future Prospects
Multi-Agent Reinforcement Learning for Connected and Automated Vehicles Control: Recent Advancements and Future Prospects
Min Hua
Dong Chen
Xinda Qi
Kun Jiang
Z. Liu
Quan Zhou
Hongming Xu
79
10
0
18 Dec 2023
Episodic Return Decomposition by Difference of Implicitly Assigned
  Sub-Trajectory Reward
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-Trajectory Reward
Hao-Chu Lin
Hongqiu Wu
Jiaji Zhang
Yihao Sun
Junyin Ye
Yang Yu
73
2
0
17 Dec 2023
An Invitation to Deep Reinforcement Learning
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRLOOD
197
5
0
13 Dec 2023
DCIR: Dynamic Consistency Intrinsic Reward for Multi-Agent Reinforcement
  Learning
DCIR: Dynamic Consistency Intrinsic Reward for Multi-Agent Reinforcement Learning
Kun-Li Channing Lin
Yufeng Wang
Peihao Chen
Runhao Zeng
Siyuan Zhou
Mingkui Tan
Chuang Gan
AI4CE
57
0
0
10 Dec 2023
DanZero+: Dominating the GuanDan Game through Reinforcement Learning
DanZero+: Dominating the GuanDan Game through Reinforcement Learning
Youpeng Zhao
Yudong Lu
Jian Zhao
Wen-gang Zhou
Houqiang Li
96
6
0
05 Dec 2023
LLM A*: Human in the Loop Large Language Models Enabled A* Search for Robotics
LLM A*: Human in the Loop Large Language Models Enabled A* Search for Robotics
Hengjia Xiao
Peng Wang
Mingzhe Yu
Mattia Robbiani
69
25
0
04 Dec 2023
TRC: Trust Region Conditional Value at Risk for Safe Reinforcement
  Learning
TRC: Trust Region Conditional Value at Risk for Safe Reinforcement Learning
Dohyeong Kim
Songhwai Oh
82
20
0
01 Dec 2023
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region
  Conditional Value at Risk
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk
Dohyeong Kim
Songhwai Oh
OffRL
88
19
0
01 Dec 2023
Towards a Standardized Reinforcement Learning Framework for AAM
  Contingency Management
Towards a Standardized Reinforcement Learning Framework for AAM Contingency Management
Luis E. Alvarez
Marc W. Brittain
Kara Breeden
49
3
0
17 Nov 2023
Offline RL with Observation Histories: Analyzing and Improving Sample
  Complexity
Offline RL with Observation Histories: Analyzing and Improving Sample Complexity
Joey Hong
Anca Dragan
Sergey Levine
OffRL
66
5
0
31 Oct 2023
Information-Theoretic Trust Regions for Stochastic Gradient-Based
  Optimization
Information-Theoretic Trust Regions for Stochastic Gradient-Based Optimization
Philipp Dahlinger
P. Becker
Maximilian Hüttenrauch
Gerhard Neumann
46
0
0
31 Oct 2023
Dropout Strategy in Reinforcement Learning: Limiting the Surrogate
  Objective Variance in Policy Optimization Methods
Dropout Strategy in Reinforcement Learning: Limiting the Surrogate Objective Variance in Policy Optimization Methods
Zhengpeng Xie
Changdong Yu
Weizheng Qiao
98
1
0
31 Oct 2023
Robot Control based on Motor Primitives -- A Comparison of Two
  Approaches
Robot Control based on Motor Primitives -- A Comparison of Two Approaches
Moses C. Nah
Johannes Lachner
Neville Hogan
33
3
0
28 Oct 2023
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
Dexter Neo
Tsuhan Chen
54
1
0
26 Oct 2023
Reinforcement Learning in the Era of LLMs: What is Essential? What is
  needed? An RL Perspective on RLHF, Prompting, and Beyond
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Hao Sun
OffRL
87
23
0
09 Oct 2023
When is Agnostic Reinforcement Learning Statistically Tractable?
When is Agnostic Reinforcement Learning Statistically Tractable?
Zeyu Jia
Gene Li
Alexander Rakhlin
Ayush Sekhari
Nathan Srebro
OffRL
120
5
0
09 Oct 2023
Deep Reinforcement Learning Algorithms for Hybrid V2X Communication: A
  Benchmarking Study
Deep Reinforcement Learning Algorithms for Hybrid V2X Communication: A Benchmarking Study
Fouzi Boukhalfa
Réda Alami
Mastane Achab
Eric Moulines
M. Bennis
26
1
0
04 Oct 2023
Distill Knowledge in Multi-task Reinforcement Learning with
  Optimal-Transport Regularization
Distill Knowledge in Multi-task Reinforcement Learning with Optimal-Transport Regularization
Bang Giang Le
Viet-Cuong Ta
OT
85
1
0
27 Sep 2023
A Structured Prediction Approach for Robot Imitation Learning
A Structured Prediction Approach for Robot Imitation Learning
Anqing Duan
Iason Batzianoulis
Raffaello Camoriano
Lorenzo Rosasco
Daniele Pucci
A. Billard
52
5
0
26 Sep 2023
Iterative Reachability Estimation for Safe Reinforcement Learning
Iterative Reachability Estimation for Safe Reinforcement Learning
Milan Ganai
Zheng Gong
Chenning Yu
Sylvia Herbert
Sicun Gao
79
18
0
24 Sep 2023
Projected Task-Specific Layers for Multi-Task Reinforcement Learning
Projected Task-Specific Layers for Multi-Task Reinforcement Learning
Josselin Somerville Roberts
Julia Di
45
1
0
15 Sep 2023
Deep Multi-Agent Reinforcement Learning for Decentralized Active
  Hypothesis Testing
Deep Multi-Agent Reinforcement Learning for Decentralized Active Hypothesis Testing
Hadar Szostak
Kobi Cohen
64
4
0
14 Sep 2023
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation
  Strategies towards Equal Long-term Benefit Rate
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate
Yuancheng Xu
Chenghao Deng
Yanchao Sun
Ruijie Zheng
Xiyao Wang
Jieyu Zhao
Furong Huang
87
5
0
07 Sep 2023
Addressing imperfect symmetry: A novel symmetry-learning actor-critic extension
Addressing imperfect symmetry: A novel symmetry-learning actor-critic extension
Miguel Abreu
Luis Paulo Reis
Nuno Lau
124
6
0
06 Sep 2023
Previous
12345...394041
Next