ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.06347
  4. Cited By
Proximal Policy Optimization Algorithms

Proximal Policy Optimization Algorithms

20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
    OffRL
ArXivPDFHTML

Papers citing "Proximal Policy Optimization Algorithms"

50 / 6,868 papers shown
Title
Population-Guided Parallel Policy Search for Reinforcement Learning
Population-Guided Parallel Policy Search for Reinforcement Learning
Whiyoung Jung
Giseung Park
Y. Sung
OffRL
24
38
0
09 Jan 2020
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for
  Addressing Value Estimation Errors
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Jingliang Duan
Yang Guan
Shengbo Eben Li
Yangang Ren
B. Cheng
OffRL
22
174
0
09 Jan 2020
On Computation and Generalization of Generative Adversarial Imitation
  Learning
On Computation and Generalization of Generative Adversarial Imitation Learning
Minshuo Chen
Yizhou Wang
Tianyi Liu
Zhuoran Yang
Xingguo Li
Zhaoran Wang
T. Zhao
37
40
0
09 Jan 2020
Learning to Move with Affordance Maps
Learning to Move with Affordance Maps
William Qi
Ravi Teja Mullapudi
Saurabh Gupta
Deva Ramanan
24
35
0
08 Jan 2020
An Exploration of Embodied Visual Exploration
An Exploration of Embodied Visual Exploration
Santhosh Kumar Ramakrishnan
Dinesh Jayaraman
Kristen Grauman
LM&Ro
32
98
0
07 Jan 2020
Joint Goal and Strategy Inference across Heterogeneous Demonstrators via
  Reward Network Distillation
Joint Goal and Strategy Inference across Heterogeneous Demonstrators via Reward Network Distillation
Letian Chen
Rohan R. Paleja
Muyleng Ghuy
Matthew C. Gombolay
14
38
0
02 Jan 2020
Continuous-Discrete Reinforcement Learning for Hybrid Control in
  Robotics
Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics
Michael Neunert
A. Abdolmaleki
Markus Wulfmeier
Thomas Lampe
Jost Tobias Springenberg
Roland Hafner
Francesco Romano
J. Buchli
N. Heess
Martin Riedmiller
21
91
0
02 Jan 2020
Gait Library Synthesis for Quadruped Robots via Augmented Random Search
Gait Library Synthesis for Quadruped Robots via Augmented Random Search
Sashank Tirumala
Aditya Sagi
Kartik Paigwar
Ashish Joglekar
S. Bhatnagar
A. Ghosal
B. Amrutur
Shishir Kolathaya
22
6
0
30 Dec 2019
Deep Innovation Protection: Confronting the Credit Assignment Problem in
  Training Heterogeneous Neural Architectures
Deep Innovation Protection: Confronting the Credit Assignment Problem in Training Heterogeneous Neural Architectures
S. Risi
Kenneth O. Stanley
30
4
0
29 Dec 2019
SLM Lab: A Comprehensive Benchmark and Modular Software Framework for
  Reproducible Deep Reinforcement Learning
SLM Lab: A Comprehensive Benchmark and Modular Software Framework for Reproducible Deep Reinforcement Learning
Keng Wah Loon
L. Graesser
Milan Cvitkovic
OffRL
18
13
0
28 Dec 2019
RecVAE: a New Variational Autoencoder for Top-N Recommendations with
  Implicit Feedback
RecVAE: a New Variational Autoencoder for Top-N Recommendations with Implicit Feedback
Ilya Shenbin
Anton M. Alekseev
E. Tutubalina
Valentin Malykh
Sergey I. Nikolenko
BDL
DRL
18
196
0
24 Dec 2019
A Survey of Deep Reinforcement Learning in Video Games
A Survey of Deep Reinforcement Learning in Video Games
Kun Shao
Zhentao Tang
Yuanheng Zhu
Nannan Li
Dongbin Zhao
OffRL
AI4TS
43
188
0
23 Dec 2019
Monte-Carlo Tree Search for Policy Optimization
Monte-Carlo Tree Search for Policy Optimization
Xiaobai Ma
Katherine Driggs-Campbell
Zongzhang Zhang
Mykel J. Kochenderfer
20
6
0
23 Dec 2019
Direct and indirect reinforcement learning
Direct and indirect reinforcement learning
Yang Guan
Shengbo Eben Li
Jingliang Duan
Jie Li
Yangang Ren
Qi Sun
B. Cheng
OffRL
38
34
0
23 Dec 2019
Predictive Coding for Boosting Deep Reinforcement Learning with Sparse
  Rewards
Predictive Coding for Boosting Deep Reinforcement Learning with Sparse Rewards
Xingyu Lu
Stas Tiomkin
Pieter Abbeel
OffRL
31
4
0
21 Dec 2019
Optimizing Collision Avoidance in Dense Airspace using Deep
  Reinforcement Learning
Optimizing Collision Avoidance in Dense Airspace using Deep Reinforcement Learning
Sheng Li
M. Egorov
Mykel Kochenderfer
34
32
0
20 Dec 2019
Learning Convex Optimization Control Policies
Learning Convex Optimization Control Policies
Akshay Agrawal
Shane T. Barratt
Stephen P. Boyd
Bartolomeo Stellato
30
66
0
19 Dec 2019
Taming an autonomous surface vehicle for path following and collision
  avoidance using deep reinforcement learning
Taming an autonomous surface vehicle for path following and collision avoidance using deep reinforcement learning
Eivind Meyer
Haakon Robinson
Adil Rasheed
Omer San
25
65
0
18 Dec 2019
Dota 2 with Large Scale Deep Reinforcement Learning
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI OpenAI
:
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN
VLM
CLL
AI4CE
LRM
46
1,799
0
13 Dec 2019
A pedestrian path-planning model in accordance with obstacle's danger
  with reinforcement learning
A pedestrian path-planning model in accordance with obstacle's danger with reinforcement learning
Thanh-Trung Trinh
Dinh-Minh Vu
M. Kimura
11
2
0
06 Dec 2019
Leveraging Procedural Generation to Benchmark Reinforcement Learning
Leveraging Procedural Generation to Benchmark Reinforcement Learning
K. Cobbe
Christopher Hesse
Jacob Hilton
John Schulman
45
541
0
03 Dec 2019
SafeLife 1.0: Exploring Side Effects in Complex Environments
SafeLife 1.0: Exploring Side Effects in Complex Environments
Carroll L. Wainwright
P. Eckersley
27
12
0
03 Dec 2019
MnasFPN: Learning Latency-aware Pyramid Architecture for Object
  Detection on Mobile Devices
MnasFPN: Learning Latency-aware Pyramid Architecture for Object Detection on Mobile Devices
Bo Chen
Golnaz Ghiasi
Hanxiao Liu
Nayeon Lee
Dmitry Kalenichenko
Hartwig Adam
Quoc V. Le
ObjD
33
53
0
02 Dec 2019
Policy Optimization Reinforcement Learning with Entropy Regularization
Policy Optimization Reinforcement Learning with Entropy Regularization
Jingbin Liu
Xinyang Gu
Shuai Liu
20
4
0
02 Dec 2019
Multi-Vehicle Mixed-Reality Reinforcement Learning for Autonomous
  Multi-Lane Driving
Multi-Vehicle Mixed-Reality Reinforcement Learning for Autonomous Multi-Lane Driving
Rupert Mitchell
Jenny Fletcher
Jacopo Panerati
Amanda Prorok
30
17
0
26 Nov 2019
Adaptive dynamic programming for nonaffine nonlinear optimal control
  problem with state constraints
Adaptive dynamic programming for nonaffine nonlinear optimal control problem with state constraints
Jingliang Duan
Zhengyu Liu
Shengbo Eben Li
Qi Sun
Zhenzhong Jia
B. Cheng
15
64
0
26 Nov 2019
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and
  Algorithms
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Kaipeng Zhang
Zhuoran Yang
Tamer Basar
63
1,184
0
24 Nov 2019
Search to Distill: Pearls are Everywhere but not the Eyes
Search to Distill: Pearls are Everywhere but not the Eyes
Yu Liu
Xuhui Jia
Mingxing Tan
Raviteja Vemulapalli
Yukun Zhu
Bradley Green
Xiaogang Wang
30
68
0
20 Nov 2019
Object Finding in Cluttered Scenes Using Interactive Perception
Object Finding in Cluttered Scenes Using Interactive Perception
Tonci Novkovic
Remi Pautrat
Fadri Furrer
Michel Breyer
Roland Siegwart
Juan I. Nieto
24
66
0
18 Nov 2019
IKEA Furniture Assembly Environment for Long-Horizon Complex
  Manipulation Tasks
IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks
Youngwoon Lee
E. Hu
Zhengyu Yang
Alexander Yin
Joseph J. Lim
31
122
0
17 Nov 2019
S2DNAS:Transforming Static CNN Model for Dynamic Inference via Neural
  Architecture Search
S2DNAS:Transforming Static CNN Model for Dynamic Inference via Neural Architecture Search
Zhihang Yuan
Bingzhe Wu
Zheng Liang
Shiwan Zhao
Weichen Bi
Guangyu Sun
27
30
0
16 Nov 2019
Kinematic State Abstraction and Provably Efficient Rich-Observation
  Reinforcement Learning
Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning
Dipendra Kumar Misra
Mikael Henaff
A. Krishnamurthy
John Langford
26
151
0
13 Nov 2019
Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using
  Proximal Policy Optimization
Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using Proximal Policy Optimization
Eivind Bøhn
E. M. Coates
Signe Moe
T. Johansen
12
129
0
13 Nov 2019
Learning Representations in Reinforcement Learning:An Information
  Bottleneck Approach
Learning Representations in Reinforcement Learning:An Information Bottleneck Approach
Yingjun Pei
Xinwen Hou
SSL
37
10
0
12 Nov 2019
Multi-Path Policy Optimization
Multi-Path Policy Optimization
L. Pan
Qingpeng Cai
Longbo Huang
18
2
0
11 Nov 2019
Worst Cases Policy Gradients
Worst Cases Policy Gradients
Yichuan Tang
Jian Zhang
Ruslan Salakhutdinov
21
75
0
09 Nov 2019
DeepRacer: Educational Autonomous Racing Platform for Experimentation
  with Sim2Real Reinforcement Learning
DeepRacer: Educational Autonomous Racing Platform for Experimentation with Sim2Real Reinforcement Learning
Bharathan Balaji
S. Mallya
Sahika Genc
Saurabh Gupta
Leo Dirac
...
Yunzhe Tao
Brian Townsend
E. Calleja
Sunil Muralidhara
Dhanasekar Karuppasamy
19
56
0
05 Nov 2019
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Explicit Explore-Exploit Algorithms in Continuous State Spaces
Mikael Henaff
OffRL
22
31
0
01 Nov 2019
Experienced Deep Reinforcement Learning with Generative Adversarial
  Networks (GANs) for Model-Free Ultra Reliable Low Latency Communication
Experienced Deep Reinforcement Learning with Generative Adversarial Networks (GANs) for Model-Free Ultra Reliable Low Latency Communication
Ali Taleb
Ieee Walid Saad Fellow
Ieee Mohammad Mozaffari Member
Ieee H. Vincent Poor Fellow
8
91
0
01 Nov 2019
Learning Fairness in Multi-Agent Systems
Learning Fairness in Multi-Agent Systems
Jiechuan Jiang
Zongqing Lu
16
69
0
31 Oct 2019
Feedback Linearization for Unknown Systems via Reinforcement Learning
Feedback Linearization for Unknown Systems via Reinforcement Learning
T. Westenbroek
David Fridovich-Keil
Eric Mazumdar
Shreyas Arora
Valmik Prabhu
S. Shankar Sastry
Claire Tomlin
14
28
0
29 Oct 2019
Navigation Agents for the Visually Impaired: A Sidewalk Simulator and
  Experiments
Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments
Martin Weiss
Simon Chamorro
Roger Girgis
Margaux Luck
Samira Ebrahimi Kahou
Joseph Paul Cohen
Derek Nowrouzezahrai
Doina Precup
Florian Golemo
C. Pal
42
10
0
29 Oct 2019
Learning to Predict Without Looking Ahead: World Models Without Forward
  Prediction
Learning to Predict Without Looking Ahead: World Models Without Forward Prediction
C. Freeman
Luke Metz
David R Ha
33
35
0
29 Oct 2019
Asynchronous Methods for Model-Based Reinforcement Learning
Asynchronous Methods for Model-Based Reinforcement Learning
Yunzhi Zhang
I. Clavera
Bo-Yu Tsai
Pieter Abbeel
OffRL
13
27
0
28 Oct 2019
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement
  Learning
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Xinyue Chen
Zijian Zhou
Zihan Wang
Che Wang
Yanqiu Wu
Keith Ross
OffRL
27
121
0
27 Oct 2019
HRL4IN: Hierarchical Reinforcement Learning for Interactive Navigation
  with Mobile Manipulators
HRL4IN: Hierarchical Reinforcement Learning for Interactive Navigation with Mobile Manipulators
Chengshu Li
Fei Xia
R. M. Martin
Silvio Savarese
30
100
0
24 Oct 2019
Learning Hierarchical Control for Robust In-Hand Manipulation
Learning Hierarchical Control for Robust In-Hand Manipulation
Tingguang Li
K. Srinivasan
Max Q.-H. Meng
Wenzhen Yuan
Jeannette Bohg
29
41
0
24 Oct 2019
Collision Avoidance in Pedestrian-Rich Environments with Deep
  Reinforcement Learning
Collision Avoidance in Pedestrian-Rich Environments with Deep Reinforcement Learning
Michael Everett
Yu Fan Chen
Jonathan P. How
OffRL
17
169
0
24 Oct 2019
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta
  Reinforcement Learning
Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
Tianhe Yu
Deirdre Quillen
Zhanpeng He
Ryan Julian
Avnish Narayan
Hayden Shively
Adithya Bellathur
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
92
1,128
0
24 Oct 2019
Robust Visual Domain Randomization for Reinforcement Learning
Robust Visual Domain Randomization for Reinforcement Learning
Reda Bahi Slaoui
W. Clements
Jakob N. Foerster
Sébastien Toth
OOD
8
12
0
23 Oct 2019
Previous
123...130131132...136137138
Next