ResearchTrend.AI
Trust Region Policy Optimization

19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
arXiv: 1502.05477

Papers citing "Trust Region Policy Optimization"

50 / 3,098 papers shown
Constrained Proximal Policy Optimization
Chengbin Xuan
Feng Zhang
Faliang Yin
H. Lam
26
0
0
23 May 2023
Combining Multi-Objective Bayesian Optimization with Reinforcement Learning for TinyML
M. Deutel
G. Kontes
Christopher Mutschler
Jürgen Teich
59
0
0
23 May 2023
Proximal Policy Gradient Arborescence for Quality Diversity Reinforcement Learning
Sumeet Batra
Bryon Tjanaka
Matthew C. Fontaine
Aleksei Petrenko
Stefanos Nikolaidis
Gaurav Sukhatme
OffRL
53
12
0
23 May 2023
Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning
Ruiyang Xu
Jalaj Bhandari
D. Korenkevych
F. Liu
Yuchen He
Alex Nikulkov
Zheqing Zhu
OffRL
39
6
0
23 May 2023
GUARD: A Safe Reinforcement Learning Benchmark
Weiye Zhao
Rui Chen
Yifan Sun
Ruixuan Liu
Tianhao Wei
Changliu Liu
54
13
0
23 May 2023
Training Diffusion Models with Reinforcement Learning
Kevin Black
Michael Janner
Yilun Du
Ilya Kostrikov
Sergey Levine
EGVM
44
320
0
22 May 2023
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Toshinori Kitamura
Tadashi Kozuno
Yunhao Tang
Nino Vieillard
Michal Valko
...
Olivier Pietquin
M. Geist
Csaba Szepesvári
Wataru Kumagai
Yutaka Matsuo
OffRL
35
3
0
22 May 2023
Policy Representation via Diffusion Probability Model for Reinforcement Learning
Long Yang
Zhixiong Huang
Fenghao Lei
Yucun Zhong
Yiming Yang
Cong Fang
Shiting Wen
Binbin Zhou
Zhouchen Lin
DiffM
44
40
0
22 May 2023
Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models
Wenhao Ding
Tong Che
Ding Zhao
Marco Pavone
BDL
OffRL
22
2
0
18 May 2023
Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL
Qinghua Liu
Gellert Weisz
András György
Chi Jin
Csaba Szepesvári
OffRL
26
9
0
18 May 2023
Client Selection for Federated Policy Optimization with Environment Heterogeneity
Zhijie Xie
S. H. Song
35
3
0
18 May 2023
Wasserstein Gradient Flows for Optimizing Gaussian Mixture Policies
Hanna Ziesche
Leonel Rozo
28
5
0
17 May 2023
Multi-Agent Reinforcement Learning: Methods, Applications, Visionary Prospects, and Challenges
Ziyuan Zhou
Guanjun Liu
Ying-Si Tang
38
14
0
17 May 2023
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research
Jiaming Ji
Jiayi Zhou
Borong Zhang
Juntao Dai
Xuehai Pan
Ruiyang Sun
Weidong Huang
Yiran Geng
Mickel Liu
Yaodong Yang
OffRL
75
47
0
16 May 2023
What Matters in Reinforcement Learning for Tractography
Antoine Théberge
Christian Desrosiers
Maxime Descoteaux
Pierre-Marc Jodoin
OffRL
29
2
0
15 May 2023
MIMEx: Intrinsic Rewards from Masked Input Modeling
Toru Lin
Allan Jabri
OffRL
33
6
0
15 May 2023
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes
Han Zhong
Tong Zhang
35
26
0
15 May 2023
Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback
Tal Lancewicki
Aviv A. Rosenberg
Dmitry Sotnikov
31
3
0
13 May 2023
Quantile-Based Deep Reinforcement Learning using Two-Timescale Policy Gradient Algorithms
Jinyang Jiang
Jiaqiao Hu
Yijie Peng
20
2
0
12 May 2023
Policy Gradient Algorithms Implicitly Optimize by Continuation
Adrien Bolland
Gilles Louppe
D. Ernst
39
3
0
11 May 2023
Towards Scalable Adaptive Learning with Graph Neural Networks and Reinforcement Learning
Jean Vassoyan
Jill-Jênn Vie
Pirmin Lemberger
GNN
24
2
0
10 May 2023
Mixture of personality improved Spiking actor network for efficient multi-agent cooperation
Xiyun Li
Ziyi Ni
Jingqing Ruan
Linghui Meng
Jing Shi
Tielin Zhang
Bo Xu
61
4
0
10 May 2023
Assessment of Reinforcement Learning Algorithms for Nuclear Power Plant Fuel Optimization
Paul Seurin
K. Shirvan
32
8
0
09 May 2023
Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection
Jiajun Fan
Yuzheng Zhuang
Yuecheng Liu
Jianye Hao
Bin Wang
Jiangcheng Zhu
Hao Wang
Shutao Xia
37
17
0
09 May 2023
Reinforcement Learning for Topic Models
Jeremy Costello
Marek Reformat
BDL
OffRL
26
2
0
08 May 2023
Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning
Yulai Zhao
Zhuoran Yang
Zhaoran Wang
Jason D. Lee
48
3
0
08 May 2023
Truncating Trajectories in Monte Carlo Reinforcement Learning
Riccardo Poiani
Alberto Maria Metelli
Marcello Restelli
29
2
0
07 May 2023
Federated Ensemble-Directed Offline Reinforcement Learning
Desik Rengarajan
N. Ragothaman
D. Kalathil
S. Shakkottai
OffRL
35
1
0
04 May 2023
Correcting for Interference in Experiments: A Case Study at Douyin
Vivek F. Farias
Hao Li
Tianyi Peng
Xinyuyang Ren
B. Hassibi
A. Zheng
41
9
0
04 May 2023
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
Yash Chandak
S. Thakoor
Z. Guo
Yunhao Tang
Rémi Munos
Will Dabney
Diana Borsa
29
2
0
01 May 2023
Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement Learning
Liangyu Zhang
Yang Peng
Wenhao Yang
Zhihua Zhang
21
1
0
29 Apr 2023
Systematic Review on Reinforcement Learning in the Field of Fintech
Nadeem Malibari
Iyad A. Katib
Rashid Mehmood
OffRL
26
4
0
29 Apr 2023
Learning to Extrapolate: A Transductive Approach
Aviv Netanyahu
Abhishek Gupta
Max Simchowitz
Kaipeng Zhang
Pulkit Agrawal
51
15
0
27 Apr 2023
Distance Weighted Supervised Learning for Offline Interaction Data
Joey Hejna
Jensen Gao
Dorsa Sadigh
OffRL
38
13
0
26 Apr 2023
Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Tuomas Haarnoja
Ben Moran
Guy Lever
Sandy H. Huang
Dhruva Tirumala
...
Andrea Huber
N. Hurley
F. Nori
R. Hadsell
N. Heess
52
143
0
26 Apr 2023
Zero-shot Transfer Learning of Driving Policy via Socially Adversarial Traffic Flow
Dongkun Zhang
Jintao Xue
Yuxiang Cui
Yunkai Wang
Eryun Liu
Wei Jing
Junbo Chen
R. Xiong
Yue Wang
44
0
0
25 Apr 2023
System III: Learning with Domain Knowledge for Safety Constraints
Fazl Barez
Hosein Hasanbeig
Alessandro Abate
35
4
0
23 Apr 2023
Variance-Reduced Gradient Estimation via Noise-Reuse in Online Evolution Strategies
Oscar Li
James Harrison
Jascha Narain Sohl-Dickstein
Virginia Smith
Luke Metz
56
5
0
21 Apr 2023
A Cubic-regularized Policy Newton Algorithm for Reinforcement Learning
Mizhaan Prajit Maniyar
Akash Mondal
Prashanth L.A.
S. Bhatnagar
51
0
0
21 Apr 2023
TempoRL: laser pulse temporal shape optimization with Deep Reinforcement Learning
F. Capuano
D. Peceli
Gabriele Tiboni
Raffaello Camoriano
Bedřich Rus
14
1
0
20 Apr 2023
Robust nonlinear set-point control with reinforcement learning
Ruoqing Zhang
Per Mattsson
T. Wigren
OOD
6
2
0
20 Apr 2023
Heterogeneous-Agent Reinforcement Learning
Yifan Zhong
J. Kuba
Xidong Feng
Siyi Hu
Jiaming Ji
Yaodong Yang
25
37
0
19 Apr 2023
Perception Imitation: Towards Synthesis-free Simulator for Autonomous Vehicles
Xiaoliang Ju
Yiyang Sun
Yiming Hao
Yikang Li
Yu Qiao
Hongsheng Li
22
1
0
19 Apr 2023
Searching for ribbons with machine learning
Sergei Gukov
James Halverson
Ciprian Manolescu
Fabian Ruehle
13
13
0
18 Apr 2023
Cooperative Multi-Agent Reinforcement Learning for Inventory Management
Madhav Khirwar
Karthik S. Gurumoorthy
Ankit Jain
Shantala Manchenahally
23
4
0
18 Apr 2023
Promises and Pitfalls of the Linearized Laplace in Bayesian Optimization
Agustinus Kristiadi
Alexander Immer
Runa Eschenhagen
Vincent Fortuin
BDL
UQCV
30
8
0
17 Apr 2023
Multi-agent Policy Reciprocity with Theoretical Guarantee
Haozhi Wang
Yinchuan Li
Qing Wang
Yunfeng Shao
Jianye Hao
25
0
0
12 Apr 2023
Synthetic Sample Selection for Generalized Zero-Shot Learning
Shreyank N. Gowda
32
16
0
06 Apr 2023
Quantum Imitation Learning
Zhihao Cheng
Kaining Zhang
Li Shen
Dacheng Tao
27
1
0
04 Apr 2023
Meta-Learning with a Geometry-Adaptive Preconditioner
Suhyun Kang
Duhun Hwang
Moonjung Eo
Taesup Kim
Wonjong Rhee
AI4CE
45
15
0
04 Apr 2023