Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1502.05477
Cited By
Trust Region Policy Optimization
19 February 2015
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Trust Region Policy Optimization"
50 / 3,098 papers shown
Title
Careful at Estimation and Bold at Exploration
Xing Chen
Yijun Liu
Zhaogeng Liu
Hechang Chen
Hengshuai Yao
Yi-Ju Chang
16
0
0
22 Aug 2023
On the Opportunities and Challenges of Offline Reinforcement Learning for Recommender Systems
Xiaocong Chen
Siyu Wang
Julian McAuley
Dietmar Jannach
Lina Yao
OffRL
30
5
0
22 Aug 2023
A Homogenization Approach for Gradient-Dominated Stochastic Optimization
Jiyuan Tan
Chenyu Xue
Chuwen Zhang
Qi Deng
Dongdong Ge
Yinyu Ye
30
2
0
21 Aug 2023
Soft Decomposed Policy-Critic: Bridging the Gap for Effective Continuous Control with Discrete RL
Ye Zhang
Jian Sun
G. Wang
Zhuoxian Li
Wei Chen
OffRL
29
0
0
20 Aug 2023
ILCAS: Imitation Learning-Based Configuration-Adaptive Streaming for Live Video Analytics with Cross-Camera Collaboration
Duo Wu
Dayou Zhang
Miao Zhang
Ruoyu Zhang
Fang Wang
Shuguang Cui
26
7
0
19 Aug 2023
Fast Decision Support for Air Traffic Management at Urban Air Mobility Vertiports using Graph Learning
Prajit KrisshnaKumar
Jhoel Witter
Steve Paul
Han-Seon Cho
Karthik Dantu
Souma Chowdhury
21
3
0
17 Aug 2023
Heterogeneous Multi-Agent Reinforcement Learning via Mirror Descent Policy Optimization
Mohammad Mehdi Nasiri
M. Rezghi
43
0
0
13 Aug 2023
Learning Team-Based Navigation: A Review of Deep Reinforcement Learning Techniques for Multi-Agent Pathfinding
Jaeho Chung
Jamil Fayyad
Younes Al Younes
Homayoun Najjaran
38
13
0
11 Aug 2023
Pre-Trained Large Language Models for Industrial Control
Lei Song
Chuheng Zhang
Li Zhao
Jiang Bian
LM&Ro
AI4CE
32
12
0
06 Aug 2023
Reinforcement Learning for Financial Index Tracking
X. Peng
Chen Gong
X. He
26
1
0
05 Aug 2023
Deep Reinforcement Learning for Autonomous Spacecraft Inspection using Illumination
David van Wijk
Kyle Dunlap
M. Majji
Kerianne L. Hobbs
29
11
0
04 Aug 2023
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
Weiran Yao
Shelby Heinecke
Juan Carlos Niebles
Zhiwei Liu
Yihao Feng
...
Ran Xu
P. Mùi
Haiquan Wang
Caiming Xiong
Silvio Savarese
LLMAG
LM&Ro
41
74
0
04 Aug 2023
End-to-End Reinforcement Learning of Koopman Models for Economic Nonlinear Model Predictive Control
Daniel Mayfrank
Alexander Mitsos
Manuel Dahmen
29
3
0
03 Aug 2023
Controlling the Solo12 Quadruped Robot with Deep Reinforcement Learning
M. Aractingi
Pierre-Alexandre Léziart
He Cao
Julien Perez
Yuan Yao
Philippe Souères
33
29
0
02 Aug 2023
Wasserstein Diversity-Enriched Regularizer for Hierarchical Reinforcement Learning
Haorui Li
Jiaqi Liang
Linjing Li
D. Zeng
16
0
0
02 Aug 2023
PeRP: Personalized Residual Policies For Congestion Mitigation Through Co-operative Advisory Systems
Aamir Hasan
Neeloy Chakraborty
Haonan Chen
Jung-Hoon Cho
Cathy Wu
Katherine Driggs-Campbell
34
6
0
01 Aug 2023
Towards Building AI-CPS with NVIDIA Isaac Sim: An Industrial Benchmark and Case Study for Robotics Manipulation
Zhehua Zhou
Jiayang Song
Xuan Xie
Zhan Shu
Lei Ma
Dikai Liu
Jianxiong Yin
Simon See
35
15
0
31 Jul 2023
Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges
Giorgio Franceschelli
Mirco Musolesi
AI4CE
42
20
0
31 Jul 2023
Curiosity-Driven Reinforcement Learning based Low-Level Flight Control
Amir Ramezani Dooraki
Alexandros Iosifidis
17
0
0
28 Jul 2023
Submodular Reinforcement Learning
Manish Prajapat
Mojmír Mutný
Melanie Zeilinger
Andreas Krause
OffRL
35
12
0
25 Jul 2023
Counterfactual Explanation Policies in RL
Shripad Deshmukh
R Srivatsan
Supriti Vijay
Jayakumar Subramanian
Chirag Agarwal
OffRL
37
0
0
25 Jul 2023
Exploring reinforcement learning techniques for discrete and continuous control tasks in the MuJoCo environment
Vaddadi Sai Rahul
Debajyoti Chakraborty
11
2
0
20 Jul 2023
REX: Rapid Exploration and eXploitation for AI Agents
Rithesh Murthy
Shelby Heinecke
Juan Carlos Niebles
Zhiwei Liu
Le Xue
...
Ran Xu
P. Mùi
Haiquan Wang
Caiming Xiong
Silvio Savarese
OffRL
34
8
0
18 Jul 2023
Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
Ruida Zhou
Tao-Wen Liu
Min Cheng
D. Kalathil
P. R. Kumar
Chao Tian
45
19
0
17 Jul 2023
An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient
Yudong Luo
Guiliang Liu
Pascal Poupart
Yangchen Pan
43
10
0
17 Jul 2023
Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models
T. Westenbroek
Jacob Levy
David Fridovich-Keil
38
0
0
16 Jul 2023
Maneuver Decision-Making Through Automatic Curriculum Reinforcement Learning Without Handcrafted Reward functions
Hong-Peng Zhang
26
2
0
12 Jul 2023
Secrets of RLHF in Large Language Models Part I: PPO
Rui Zheng
Shihan Dou
Songyang Gao
Yuan Hua
Wei Shen
...
Hang Yan
Tao Gui
Qi Zhang
Xipeng Qiu
Xuanjing Huang
ALM
OffRL
55
160
0
11 Jul 2023
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
Idan Shenfeld
Zhang-Wei Hong
Aviv Tamar
Pulkit Agrawal
27
12
0
06 Jul 2023
ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation
Abhijeet Pendyala
Justin Dettmer
Tobias Glasmachers
Asma Atamna
OffRL
19
6
0
06 Jul 2023
Dynamic Feature-based Deep Reinforcement Learning for Flow Control of Circular Cylinder with Sparse Surface Pressure Sensing
Qiulei Wang
Lei Yan
Gang Hu
Wenli Chen
Jean Rabault
B. R. Noack
AI4CE
23
24
0
05 Jul 2023
RObotic MAnipulation Network (ROMAN)
\unicode
x
2013
\unicode{x2013}
\unicode
x
2013
Hybrid Hierarchical Learning for Solving Complex Sequential Tasks
Eleftherios Triantafyllidis
Fernando Acero
Zhaocheng Liu
Zhibin Li
29
0
0
30 Jun 2023
Navigation of micro-robot swarms for targeted delivery using reinforcement learning
Akshatha Jagadish
M. Varma
16
0
0
30 Jun 2023
Probabilistic Constraint for Safety-Critical Reinforcement Learning
Weiqin Chen
D. Subramanian
Santiago Paternain
37
15
0
29 Jun 2023
Policy Space Diversity for Non-Transitive Games
Jian Yao
Weiming Liu
Haobo Fu
Yaodong Yang
Stephen Marcus McAleer
Qiang Fu
Wei Yang
54
9
0
29 Jun 2023
SARC: Soft Actor Retrospective Critic
Sukriti Verma
Ayush Chopra
J. Subramanian
Mausoom Sarkar
Nikaash Puri
Piyush B. Gupta
Balaji Krishnamurthy
18
0
0
28 Jun 2023
RL
3
^3
3
: Boosting Meta Reinforcement Learning via RL inside RL
2
^2
2
Abhinav Bhatia
Samer B. Nashed
S. Zilberstein
OffRL
42
0
0
28 Jun 2023
Rethinking Closed-loop Training for Autonomous Driving
Chris Zhang
R. Guo
Wenyuan Zeng
Yuwen Xiong
Binbin Dai
Rui Hu
Mengye Ren
R. Urtasun
OffRL
33
30
0
27 Jun 2023
IIFL: Implicit Interactive Fleet Learning from Heterogeneous Human Supervisors
Gaurav Datta
Ryan Hoque
Anrui Gu
Eugen Solowjow
Ken Goldberg
51
3
0
27 Jun 2023
Inference for relative sparsity
Samuel J. Weisenthal
Sally W. Thurston
Ashkan Ertefaie
CML
28
0
0
25 Jun 2023
Provably Convergent Policy Optimization via Metric-aware Trust Region Methods
Jun Song
Niao He
Lijun Ding
Chaoyue Zhao
41
3
0
25 Jun 2023
Towards Optimal Pricing of Demand Response -- A Nonparametric Constrained Policy Optimization Approach
Jun Song
Chaoyue Zhao
OffRL
17
0
0
24 Jun 2023
Correcting discount-factor mismatch in on-policy policy gradient methods
Fengdi Che
Gautham Vasan
A. R. Mahmood
OffRL
22
9
0
23 Jun 2023
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement Learning
Jinxin Liu
Ziqi Zhang
Zhenyu Wei
Zifeng Zhuang
Yachen Kang
Sibo Gai
Donglin Wang
OffRL
33
16
0
22 Jun 2023
MP3: Movement Primitive-Based (Re-)Planning Policy
Fabian Otto
Hongyi Zhou
Onur Celik
Ge Li
Rudolf Lioutikov
Gerhard Neumann
29
5
0
22 Jun 2023
State-wise Constrained Policy Optimization
Weiye Zhao
Rui Chen
Yifan Sun
Tianhao Wei
Changliu Liu
31
10
0
21 Jun 2023
Learning to Generate Better Than Your LLM
Jonathan D. Chang
Kianté Brantley
Rajkumar Ramamurthy
Dipendra Kumar Misra
Wen Sun
27
42
0
20 Jun 2023
Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning
Huiguo He
Tianfu Wang
Huan Yang
Jianlong Fu
N. Yuan
Jian Yin
Hongyang Chao
Qi Zhang
EGVM
40
9
0
20 Jun 2023
IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL
Pascal Leroy
P. G. Morato
J. Pisane
A. Kolios
D. Ernst
OffRL
48
9
0
20 Jun 2023
Comprehensive Training and Evaluation on Deep Reinforcement Learning for Automated Driving in Various Simulated Driving Maneuvers
Yongqi Dong
Tobias Datema
Vincent Wassenaar
Joris van de Weg
Cahit Tolga Kopar
Harim Suleman
OffRL
29
0
0
20 Jun 2023
Previous
1
2
3
...
12
13
14
...
60
61
62
Next