ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1502.05477
  4. Cited By
Trust Region Policy Optimization

Trust Region Policy Optimization

19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
ArXivPDFHTML

Papers citing "Trust Region Policy Optimization"

50 / 1,112 papers shown
Title
Efficiently Escaping Saddle Points for Non-Convex Policy Optimization
Efficiently Escaping Saddle Points for Non-Convex Policy Optimization
Sadegh Khorasani
Saber Salehkaleybar
Negar Kiyavash
Niao He
Matthias Grossglauser
29
1
0
15 Nov 2023
Two Complementary Perspectives to Continual Learning: Ask Not Only What
  to Optimize, But Also How
Two Complementary Perspectives to Continual Learning: Ask Not Only What to Optimize, But Also How
Timm Hess
Tinne Tuytelaars
Gido M. van de Ven
44
7
0
08 Nov 2023
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
Marek Cygan
35
1
0
30 Oct 2023
Robot Control based on Motor Primitives -- A Comparison of Two
  Approaches
Robot Control based on Motor Primitives -- A Comparison of Two Approaches
Moses C. Nah
Johannes Lachner
Neville Hogan
21
3
0
28 Oct 2023
End-to-end Offline Reinforcement Learning for Glycemia Control
End-to-end Offline Reinforcement Learning for Glycemia Control
Tristan Beolet
Alice Adenis
E. Huneker
Maxime Louis
OffRL
38
1
0
16 Oct 2023
Reinforcement Learning in the Era of LLMs: What is Essential? What is
  needed? An RL Perspective on RLHF, Prompting, and Beyond
Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Hao Sun
OffRL
34
21
0
09 Oct 2023
When is Agnostic Reinforcement Learning Statistically Tractable?
When is Agnostic Reinforcement Learning Statistically Tractable?
Zeyu Jia
Gene Li
Alexander Rakhlin
Ayush Sekhari
Nathan Srebro
OffRL
32
5
0
09 Oct 2023
Increasing Entropy to Boost Policy Gradient Performance on
  Personalization Tasks
Increasing Entropy to Boost Policy Gradient Performance on Personalization Tasks
Andrew Starnes
Anton Dereventsov
Clayton Webster
24
0
0
09 Oct 2023
Confronting Reward Model Overoptimization with Constrained RLHF
Confronting Reward Model Overoptimization with Constrained RLHF
Ted Moskovitz
Aaditya K. Singh
DJ Strouse
T. Sandholm
Ruslan Salakhutdinov
Anca D. Dragan
Stephen Marcus McAleer
39
48
0
06 Oct 2023
Pairwise Proximal Policy Optimization: Harnessing Relative Feedback for
  LLM Alignment
Pairwise Proximal Policy Optimization: Harnessing Relative Feedback for LLM Alignment
Tianhao Wu
Banghua Zhu
Ruoyu Zhang
Zhaojin Wen
Kannan Ramchandran
Jiantao Jiao
44
55
0
30 Sep 2023
A Structured Prediction Approach for Robot Imitation Learning
A Structured Prediction Approach for Robot Imitation Learning
Anqing Duan
Iason Batzianoulis
Raffaello Camoriano
Lorenzo Rosasco
Daniele Pucci
A. Billard
21
4
0
26 Sep 2023
Gray-box Adversarial Attack of Deep Reinforcement Learning-based Trading
  Agents
Gray-box Adversarial Attack of Deep Reinforcement Learning-based Trading Agents
Foozhan Ataiefard
Hadi Hemmati
AAML
29
2
0
26 Sep 2023
Learning Risk-Aware Quadrupedal Locomotion using Distributional
  Reinforcement Learning
Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning
Lukas Schneider
Jonas Frey
Takahiro Miki
Marco Hutter
30
9
0
25 Sep 2023
Machine Learning Meets Advanced Robotic Manipulation
Machine Learning Meets Advanced Robotic Manipulation
Saeid Nahavandi
R. Alizadehsani
D. Nahavandi
Chee Peng Lim
Kevin Kelly
Fernando Bello
24
17
0
22 Sep 2023
Learning to Recover for Safe Reinforcement Learning
Learning to Recover for Safe Reinforcement Learning
Haoyu Wang
Xin Yuan
Qinqing Ren
34
0
0
21 Sep 2023
Contrastive Initial State Buffer for Reinforcement Learning
Contrastive Initial State Buffer for Reinforcement Learning
Nico Messikommer
Yunlong Song
Davide Scaramuzza
OffRL
44
9
0
18 Sep 2023
Trust-Region Neural Moving Horizon Estimation for Robots
Trust-Region Neural Moving Horizon Estimation for Robots
Bingheng Wang
Xuyang Chen
Lin Zhao
19
2
0
12 Sep 2023
ACT: Empowering Decision Transformer with Dynamic Programming via
  Advantage Conditioning
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning
Chenxiao Gao
Chenyang Wu
Mingjun Cao
Rui Kong
Zongzhang Zhang
Yang Yu
OffRL
34
13
0
12 Sep 2023
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation
  Strategies towards Equal Long-term Benefit Rate
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate
Yuancheng Xu
Chenghao Deng
Yanchao Sun
Ruijie Zheng
Xiyao Wang
Jieyu Zhao
Furong Huang
35
4
0
07 Sep 2023
Addressing imperfect symmetry: A novel symmetry-learning actor-critic extension
Addressing imperfect symmetry: A novel symmetry-learning actor-critic extension
Miguel Abreu
Luis Paulo Reis
N. Lau
41
5
0
06 Sep 2023
Regret Analysis of Policy Gradient Algorithm for Infinite Horizon
  Average Reward Markov Decision Processes
Regret Analysis of Policy Gradient Algorithm for Infinite Horizon Average Reward Markov Decision Processes
Qinbo Bai
Washim Uddin Mondal
Vaneet Aggarwal
34
9
0
05 Sep 2023
R^3: On-device Real-Time Deep Reinforcement Learning for Autonomous
  Robotics
R^3: On-device Real-Time Deep Reinforcement Learning for Autonomous Robotics
Zexin Li
Aritra Samanta
Yufei Li
Andrea Soltoggio
Hyoseung Kim
Cong Liu
30
6
0
29 Aug 2023
Reinforcement Learning for Sampling on Temporal Medical Imaging
  Sequences
Reinforcement Learning for Sampling on Temporal Medical Imaging Sequences
Zhishen Huang
31
1
0
28 Aug 2023
Target-independent XLA optimization using Reinforcement Learning
Target-independent XLA optimization using Reinforcement Learning
Milan Ganai
Haichen Li
Theodore Enns
Yida Wang
Randy Huang
39
0
0
28 Aug 2023
Reinforcement Learning for Generative AI: A Survey
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
46
10
0
28 Aug 2023
Prompt-Based Length Controlled Generation with Reinforcement Learning
Prompt-Based Length Controlled Generation with Reinforcement Learning
Renlong Jie
Xiaojun Meng
Lifeng Shang
Xin Jiang
Qun Liu
17
8
0
23 Aug 2023
ILCAS: Imitation Learning-Based Configuration-Adaptive Streaming for
  Live Video Analytics with Cross-Camera Collaboration
ILCAS: Imitation Learning-Based Configuration-Adaptive Streaming for Live Video Analytics with Cross-Camera Collaboration
Duo Wu
Dayou Zhang
Miao Zhang
Ruoyu Zhang
Fang Wang
Shuguang Cui
26
7
0
19 Aug 2023
Heterogeneous Multi-Agent Reinforcement Learning via Mirror Descent
  Policy Optimization
Heterogeneous Multi-Agent Reinforcement Learning via Mirror Descent Policy Optimization
Mohammad Mehdi Nasiri
M. Rezghi
38
0
0
13 Aug 2023
Reinforcement Learning for Financial Index Tracking
Reinforcement Learning for Financial Index Tracking
X. Peng
Chen Gong
X. He
19
1
0
05 Aug 2023
Wasserstein Diversity-Enriched Regularizer for Hierarchical
  Reinforcement Learning
Wasserstein Diversity-Enriched Regularizer for Hierarchical Reinforcement Learning
Haorui Li
Jiaqi Liang
Linjing Li
D. Zeng
11
0
0
02 Aug 2023
Towards Building AI-CPS with NVIDIA Isaac Sim: An Industrial Benchmark
  and Case Study for Robotics Manipulation
Towards Building AI-CPS with NVIDIA Isaac Sim: An Industrial Benchmark and Case Study for Robotics Manipulation
Zhehua Zhou
Jiayang Song
Xuan Xie
Zhan Shu
Lei Ma
Dikai Liu
Jianxiong Yin
Simon See
35
14
0
31 Jul 2023
Submodular Reinforcement Learning
Submodular Reinforcement Learning
Manish Prajapat
Mojmír Mutný
M. Zeilinger
Andreas Krause
OffRL
30
12
0
25 Jul 2023
Counterfactual Explanation Policies in RL
Counterfactual Explanation Policies in RL
Shripad Deshmukh
R Srivatsan
Supriti Vijay
Jayakumar Subramanian
Chirag Agarwal
OffRL
37
0
0
25 Jul 2023
Secrets of RLHF in Large Language Models Part I: PPO
Secrets of RLHF in Large Language Models Part I: PPO
Rui Zheng
Shihan Dou
Songyang Gao
Yuan Hua
Wei Shen
...
Hang Yan
Tao Gui
Qi Zhang
Xipeng Qiu
Xuanjing Huang
ALM
OffRL
43
159
0
11 Jul 2023
ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource
  Allocation
ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation
Abhijeet Pendyala
Justin Dettmer
Tobias Glasmachers
Asma Atamna
OffRL
19
6
0
06 Jul 2023
Dynamic Feature-based Deep Reinforcement Learning for Flow Control of
  Circular Cylinder with Sparse Surface Pressure Sensing
Dynamic Feature-based Deep Reinforcement Learning for Flow Control of Circular Cylinder with Sparse Surface Pressure Sensing
Qiulei Wang
Lei Yan
Gang Hu
Wenli Chen
Jean Rabault
B. R. Noack
AI4CE
23
24
0
05 Jul 2023
Cooperative Multi-Agent Learning for Navigation via Structured State
  Abstraction
Cooperative Multi-Agent Learning for Navigation via Structured State Abstraction
Mohamed K. Abdel-Aziz
Mohammed S. Elbamby
S. Samarakoon
M. Bennis
23
4
0
20 Jun 2023
Deep Reinforcement Learning with Task-Adaptive Retrieval via
  Hypernetwork
Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork
Yonggang Jin
Chenxu Wang
Tianyu Zheng
Liuyu Xiang
Yao-Chun Yang
Junge Zhang
Jie Fu
Zhaofeng He
3DH
42
0
0
19 Jun 2023
Actor-Critic Model Predictive Control
Actor-Critic Model Predictive Control
Angel Romero
Yunlong Song
Davide Scaramuzza
47
35
0
16 Jun 2023
Generalizable Resource Scaling of 5G Slices using Constrained
  Reinforcement Learning
Generalizable Resource Scaling of 5G Slices using Constrained Reinforcement Learning
Muhammad Sulaiman
Mahdieh Ahmadi
M. A. Salahuddin
R. Boutaba
A. Saleh
42
6
0
15 Jun 2023
Optimal Exploration for Model-Based RL in Nonlinear Systems
Optimal Exploration for Model-Based RL in Nonlinear Systems
Andrew Wagenmaker
Guanya Shi
Kevin G. Jamieson
36
14
0
15 Jun 2023
Towards AGI in Computer Vision: Lessons Learned from GPT and Large
  Language Models
Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models
Lingxi Xie
Longhui Wei
Xiaopeng Zhang
Kaifeng Bi
Xiaotao Gu
Jianlong Chang
Qi Tian
35
7
0
14 Jun 2023
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum
  Markov Games: Switching System Approach
Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach
Dong-hwan Lee
21
2
0
09 Jun 2023
Learning for Edge-Weighted Online Bipartite Matching with Robustness
  Guarantees
Learning for Edge-Weighted Online Bipartite Matching with Robustness Guarantees
Pengfei Li
Jianyi Yang
Shaolei Ren
OffRL
27
4
0
31 May 2023
First Order Methods with Markovian Noise: from Acceleration to
  Variational Inequalities
First Order Methods with Markovian Noise: from Acceleration to Variational Inequalities
Aleksandr Beznosikov
S. Samsonov
Marina Sheshukova
Alexander Gasnikov
A. Naumov
Eric Moulines
46
14
0
25 May 2023
Constrained Proximal Policy Optimization
Constrained Proximal Policy Optimization
Chengbin Xuan
Feng Zhang
Faliang Yin
H. Lam
21
0
0
23 May 2023
Combining Multi-Objective Bayesian Optimization with Reinforcement Learning for TinyML
Combining Multi-Objective Bayesian Optimization with Reinforcement Learning for TinyML
M. Deutel
G. Kontes
Christopher Mutschler
Jürgen Teich
52
0
0
23 May 2023
Training Diffusion Models with Reinforcement Learning
Training Diffusion Models with Reinforcement Learning
Kevin Black
Michael Janner
Yilun Du
Ilya Kostrikov
Sergey Levine
EGVM
44
318
0
22 May 2023
Regularization and Variance-Weighted Regression Achieves Minimax
  Optimality in Linear MDPs: Theory and Practice
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Toshinori Kitamura
Tadashi Kozuno
Yunhao Tang
Nino Vieillard
Michal Valko
...
Olivier Pietquin
M. Geist
Csaba Szepesvári
Wataru Kumagai
Yutaka Matsuo
OffRL
30
2
0
22 May 2023
Policy Representation via Diffusion Probability Model for Reinforcement
  Learning
Policy Representation via Diffusion Probability Model for Reinforcement Learning
Long Yang
Zhixiong Huang
Fenghao Lei
Yucun Zhong
Yiming Yang
Cong Fang
Shiting Wen
Binbin Zhou
Zhouchen Lin
DiffM
28
40
0
22 May 2023
Previous
12345...212223
Next