Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.06347
Cited By
v1
v2 (latest)
Proximal Policy Optimization Algorithms
20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Proximal Policy Optimization Algorithms"
50 / 626 papers shown
Title
Learning Goal-Directed Object Pushing in Cluttered Scenes with Location-Based Attention
Nils Dengler
Juan Del Aguila Ferrandis
João Moura
S. Vijayakumar
Maren Bennewitz
93
0
0
26 Mar 2024
Scenario-Based Curriculum Generation for Multi-Agent Autonomous Driving
Axel Brunnbauer
Luigi Berducci
P. Priller
D. Ničković
Radu Grosu
109
2
0
26 Mar 2024
Leveraging Symmetry in RL-based Legged Locomotion Control
Zhi Su
Xiaoyu Huang
Daniel Felipe Ordoñez Apraez
Yunfei Li
Zhongyu Li
...
Giulio Turrisi
Massimiliano Pontil
Claudio Semini
Yi Wu
Koushil Sreenath
89
16
0
26 Mar 2024
Learning To Guide Human Decision Makers With Vision-Language Models
Debodeep Banerjee
Stefano Teso
Burcu Sayin
Andrea Passerini
77
1
0
25 Mar 2024
Carbon Footprint Reduction for Sustainable Data Centers in Real-Time
Soumyendu Sarkar
Avisek Naug
Ricardo Luna
Antonio Guillen
Vineet Gundecha
Sahand Ghorbanpour
Sajad Mousavi
Dejan Markovikj
Ashwin Ramesh Babu
AI4CE
79
8
0
21 Mar 2024
Task-optimal data-driven surrogate models for eNMPC via differentiable simulation and optimization
Daniel Mayfrank
Na Young Ahn
Alexander Mitsos
Manuel Dahmen
116
2
0
21 Mar 2024
Reward-Driven Automated Curriculum Learning for Interaction-Aware Self-Driving at Unsignalized Intersections
Zeng Peng
Xiao Zhou
Lei Zheng
Yubin Wang
Jun Ma
205
5
0
20 Mar 2024
ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models
Runyu Ma
Jelle Luijkx
Zlatan Ajanović
Jens Kober
LM&Ro
LRM
91
9
0
14 Mar 2024
Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate
Yifan Lin
Yuhao Wang
Enlu Zhou
122
0
0
01 Mar 2024
Symmetry-Breaking Augmentations for Ad Hoc Teamwork
Ravi Hammond
Dustin Craggs
Mingyu Guo
Jakob Foerster
Ian Reid
76
2
0
15 Feb 2024
Reinforcement Learning from Human Feedback with Active Queries
Kaixuan Ji
Jiafan He
Quanquan Gu
98
19
0
14 Feb 2024
Natural Language Reinforcement Learning
Xidong Feng
Bo Liu
Mengyue Yang
Ziyan Wang
Girish A. Koushiks
Yali Du
Ying Wen
Jun Wang
OffRL
97
5
0
11 Feb 2024
Institutional Platform for Secure Self-Service Large Language Model Exploration
V. Bumgardner
Mitchell A. Klusty
W. V. Logan
Samuel E. Armstrong
Caylin D. Hickey
Jeff Talbert
Caylin Hickey
Jeff Talbert
130
1
0
01 Feb 2024
Closure Discovery for Coarse-Grained Partial Differential Equations Using Grid-based Reinforcement Learning
Jan-Philipp von Bassewitz
Sebastian Kaltenbach
Petros Koumoutsakos
AI4CE
122
2
0
01 Feb 2024
Zero-Shot Reinforcement Learning via Function Encoders
Tyler Ingebrand
Amy Zhang
Ufuk Topcu
OffRL
81
5
0
30 Jan 2024
Self-Rewarding Language Models
Weizhe Yuan
Richard Yuanzhe Pang
Kyunghyun Cho
Xian Li
Sainbayar Sukhbaatar
Jing Xu
Jason Weston
ReLM
SyDa
ALM
LRM
370
338
0
18 Jan 2024
Crowd-PrefRL: Preference-Based Reward Learning from Crowds
David Chhan
Ellen R. Novoseller
Vernon J. Lawhern
140
5
0
17 Jan 2024
Physics-Informed Multi-Agent Reinforcement Learning for Distributed Multi-Robot Problems
Eduardo Sebastián
T. Duong
Nikolay Atanasov
Eduardo Montijano
C. Sagüés
120
3
0
30 Dec 2023
Designing a skilled soccer team for RoboCup: exploring skill-set-primitives through reinforcement learning
Miguel Abreu
Luis Paulo Reis
Nuno Lau
129
5
0
22 Dec 2023
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
148
5
0
13 Dec 2023
LLM A*: Human in the Loop Large Language Models Enabled A* Search for Robotics
Hengjia Xiao
Peng Wang
Mingzhe Yu
Mattia Robbiani
53
25
0
04 Dec 2023
Predictable Reinforcement Learning Dynamics through Entropy Rate Minimization
Daniel Jarne Ornia
Giannis Delimpaltadakis
Jens Kober
Javier Alonso-Mora
65
3
0
30 Nov 2023
Critical Influence of Overparameterization on Sharpness-aware Minimization
Sungbin Shin
Dongyeop Lee
Maksym Andriushchenko
Namhoon Lee
AAML
150
2
0
29 Nov 2023
Mission-driven Exploration for Accelerated Deep Reinforcement Learning with Temporal Logic Task Specifications
Jun Wang
Hosein Hasanbeig
Kaiyuan Tan
Zihe Sun
Y. Kantaros
109
3
0
28 Nov 2023
Real-Time Recurrent Reinforcement Learning
Julian Lemmel
Radu Grosu
95
2
0
08 Nov 2023
Rule-Based Lloyd Algorithm for Multi-Robot Motion Planning and Control with Safety and Convergence Guarantees
Manuel Boldrer
Álvaro Serra-Gómez
Lorenzo Lyons
Vít Krátký
Javier Alonso-Mora
Laura Ferranti
131
4
0
30 Oct 2023
LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models
Saaket Agashe
Yue Fan
Anthony Reyna
Xin Eric Wang
LLMAG
LRM
139
14
0
05 Oct 2023
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving
Hao Sha
Yao Mu
Yuxuan Jiang
Li Chen
Chenfeng Xu
Ping Luo
Shengbo Eben Li
Masayoshi Tomizuka
Wei Zhan
Mingyu Ding
256
179
0
04 Oct 2023
Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation
Benjamin Steenhoek
Michele Tufano
Neel Sundaresan
Alexey Svyatkovskiy
OffRL
ALM
131
21
0
03 Oct 2023
On Generating Explanations for Reinforcement Learning Policies: An Empirical Study
Mikihisa Yuasa
Huy T. Tran
R. Sreenivas
FAtt
LRM
137
1
0
29 Sep 2023
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control
Botian Xu
Feng Gao
Chao Yu
Chao Yu
Yi Wu
Yu Wang
105
30
0
22 Sep 2023
Baichuan 2: Open Large-scale Language Models
Ai Ming Yang
Bin Xiao
Bingning Wang
Borong Zhang
Ce Bian
...
Youxin Jiang
Yuchen Gao
Yupeng Zhang
Guosheng Dong
Zhiying Wu
ELM
LRM
260
751
0
19 Sep 2023
GraspGF: Learning Score-based Grasping Primitive for Human-assisting Dexterous Grasping
Tianhao Wu
Mingdong Wu
Jiyao Zhang
Yunchong Gan
Hao Dong
116
25
0
12 Sep 2023
FLM-101B: An Open LLM and How to Train It with
100
K
B
u
d
g
e
t
100K Budget
100
K
B
u
d
g
e
t
Xiang Li
Yiqun Yao
Xin Jiang
Xuezhi Fang
Xuying Meng
...
Li Du
Bowen Qin
Zheng Zhang
Aixin Sun
Yequan Wang
135
22
0
07 Sep 2023
Addressing imperfect symmetry: A novel symmetry-learning actor-critic extension
Miguel Abreu
Luis Paulo Reis
Nuno Lau
88
6
0
06 Sep 2023
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
167
13
0
28 Aug 2023
SafeSteps: Learning Safer Footstep Planning Policies for Legged Robots via Model-Based Priors
Shafeef Omar
Lorenzo Amatucci
Victor Barasuol
Giulio Turrisi
Claudio Semini
68
4
0
24 Jul 2023
Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?
Yihe Zhou
Shunyu Liu
Yunpeng Qing
Kaixuan Chen
Tongya Zheng
Jie Song
Mingli Song
69
20
0
27 May 2023
Combining Multi-Objective Bayesian Optimization with Reinforcement Learning for TinyML
M. Deutel
G. Kontes
Christopher Mutschler
Jürgen Teich
187
0
0
23 May 2023
Improving robot navigation in crowded environments using intrinsic rewards
Diego Martínez Baselga
L. Riazuelo
Luis Montano
112
14
0
13 Feb 2023
Cross-domain Random Pre-training with Prototypes for Reinforcement Learning
Xin Liu
Yaran Chen
Haoran Li
Boyu Li
Dong Zhao
SSL
123
10
0
11 Feb 2023
DITTO: Offline Imitation Learning with World Models
Branton DeMoss
Paul Duckworth
Nick Hawes
Ingmar Posner
Ingmar Posner
OffRL
66
18
0
06 Feb 2023
Online Reinforcement Learning in Non-Stationary Context-Driven Environments
Pouya Hamadanian
Arash Nasr-Esfahany
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
CLL
OffRL
113
0
0
04 Feb 2023
Proximal Policy Optimization with Graph Neural Networks for Optimal Power Flow
Ángela López-Cardona
Guillermo Bernárdez
Pere Barlet-Ros
A. Cabellos-Aparicio
193
4
0
23 Dec 2022
Experiential Explanations for Reinforcement Learning
Amal Alabdulkarim
Madhuri Singh
Gennie Mansi
Kaely Hall
Mark O. Riedl
Mark O. Riedl
OffRL
121
3
0
10 Oct 2022
Learning Progress Driven Multi-Agent Curriculum
Wenshuai Zhao
Zhiyuan Li
Joni Pajarinen
88
0
0
20 May 2022
User-Oriented Robust Reinforcement Learning
Haoyi You
Beichen Yu
Haiming Jin
Zhaoxing Yang
Jiahui Sun
OffRL
57
0
0
15 Feb 2022
Generative Design by Reinforcement Learning: Enhancing the Diversity of Topology Optimization Designs
Seowoo Jang
Soyoung Yoo
Namwoo Kang
AI4CE
74
72
0
17 Aug 2020
Explainability in Deep Reinforcement Learning
Alexandre Heuillet
Fabien Couthouis
Natalia Díaz Rodríguez
XAI
199
283
0
15 Aug 2020
Overcoming Model Bias for Robust Offline Deep Reinforcement Learning
Phillip Swazinna
Steffen Udluft
Thomas Runkler
OffRL
64
84
0
12 Aug 2020
Previous
1
2
3
...
11
12
13
Next