ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.06347
  4. Cited By
Proximal Policy Optimization Algorithms

Proximal Policy Optimization Algorithms

20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
    OffRL
ArXivPDFHTML

Papers citing "Proximal Policy Optimization Algorithms"

50 / 6,896 papers shown
Title
Meta-Learning in Neural Networks: A Survey
Meta-Learning in Neural Networks: A Survey
Timothy M. Hospedales
Antreas Antoniou
P. Micaelli
Amos Storkey
OOD
93
1,935
0
11 Apr 2020
State-Only Imitation Learning for Dexterous Manipulation
State-Only Imitation Learning for Dexterous Manipulation
Ilija Radosavovic
Xiaolong Wang
Lerrel Pinto
Jitendra Malik
OffRL
19
121
0
07 Apr 2020
A Deep Ensemble Multi-Agent Reinforcement Learning Approach for Air
  Traffic Control
A Deep Ensemble Multi-Agent Reinforcement Learning Approach for Air Traffic Control
Supriyo Ghosh
Sean Laguna
Shiau Hong Lim
L. Wynter
Hasan A. Poonawala
29
14
0
03 Apr 2020
Learning Agile Robotic Locomotion Skills by Imitating Animals
Learning Agile Robotic Locomotion Skills by Imitating Animals
Xue Bin Peng
Erwin Coumans
Tingnan Zhang
T. Lee
Jie Tan
Sergey Levine
34
496
0
02 Apr 2020
A New Challenge: Approaching Tetris Link with AI
A New Challenge: Approaching Tetris Link with AI
Matthias Muller-Brockhausen
Mike Preuss
Aske Plaat
6
2
0
01 Apr 2020
Learning to Ask Medical Questions using Reinforcement Learning
Learning to Ask Medical Questions using Reinforcement Learning
Uri Shaham
Tom Zahavy
C. Caraballo
S. Mahajan
D. Massey
H. Krumholz
OOD
24
1
0
31 Mar 2020
Optical Non-Line-of-Sight Physics-based 3D Human Pose Estimation
Optical Non-Line-of-Sight Physics-based 3D Human Pose Estimation
Mariko Isogawa
Ye Yuan
Matthew O'Toole
Kris Kitani
3DH
19
59
0
31 Mar 2020
Robotic Table Tennis with Model-Free Reinforcement Learning
Robotic Table Tennis with Model-Free Reinforcement Learning
Wenbo Gao
L. Graesser
K. Choromanski
Xingyou Song
N. Lazić
Pannag R. Sanketi
Vikas Sindhwani
Navdeep Jaitly
19
44
0
31 Mar 2020
Controlling Rayleigh-Bénard convection via Reinforcement Learning
Controlling Rayleigh-Bénard convection via Reinforcement Learning
Gerben Beintema
Alessandro Corbetta
Luca Biferale
F. Toschi
AI4CE
27
79
0
31 Mar 2020
Leverage the Average: an Analysis of KL Regularization in RL
Leverage the Average: an Analysis of KL Regularization in RL
Nino Vieillard
Tadashi Kozuno
B. Scherrer
Olivier Pietquin
Rémi Munos
M. Geist
25
42
0
31 Mar 2020
When Autonomous Systems Meet Accuracy and Transferability through AI: A
  Survey
When Autonomous Systems Meet Accuracy and Transferability through AI: A Survey
Chongzhen Zhang
Jianrui Wang
Gary G. Yen
Chaoqiang Zhao
Qiyu Sun
Yang Tang
Feng Qian
Jürgen Kurths
AAML
35
20
0
29 Mar 2020
A Survey of Deep Learning for Scientific Discovery
A Survey of Deep Learning for Scientific Discovery
M. Raghu
Erica Schmidt
OOD
AI4CE
40
120
0
26 Mar 2020
Fiber: A Platform for Efficient Development and Distributed Training for
  Reinforcement Learning and Population-Based Methods
Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods
Jiale Zhi
Rui Wang
Jeff Clune
Kenneth O. Stanley
OffRL
30
12
0
25 Mar 2020
An empirical investigation of the challenges of real-world reinforcement
  learning
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
34
120
0
24 Mar 2020
Robust Deep Reinforcement Learning against Adversarial Perturbations on
  State Observations
Robust Deep Reinforcement Learning against Adversarial Perturbations on State Observations
Huan Zhang
Hongge Chen
Chaowei Xiao
Bo-wen Li
Mingyan D. Liu
Duane S. Boning
Cho-Jui Hsieh
AAML
38
261
0
19 Mar 2020
Enhanced POET: Open-Ended Reinforcement Learning through Unbounded
  Invention of Learning Challenges and their Solutions
Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions
Rui Wang
Joel Lehman
Aditya Rawal
Jiale Zhi
Yulun Li
Jeff Clune
Kenneth O. Stanley
22
125
0
19 Mar 2020
Neuroevolution of Self-Interpretable Agents
Neuroevolution of Self-Interpretable Agents
Yujin Tang
Duong Nguyen
David R Ha
26
111
0
18 Mar 2020
Self-Supervised Discovering of Interpretable Features for Reinforcement
  Learning
Self-Supervised Discovering of Interpretable Features for Reinforcement Learning
Wenjie Shi
Gao Huang
Shiji Song
Zhuoyuan Wang
Tingyu Lin
Cheng Wu
SSL
28
18
0
16 Mar 2020
A Survey of End-to-End Driving: Architectures and Training Methods
A Survey of End-to-End Driving: Architectures and Training Methods
Ardi Tampuu
Maksym Semikin
Naveed Muhammad
D. Fishman
Tambet Matiisen
3DV
23
228
0
13 Mar 2020
Analyzing Visual Representations in Embodied Navigation Tasks
Analyzing Visual Representations in Embodied Navigation Tasks
Erik Wijmans
Julian Straub
Dhruv Batra
Irfan Essa
Judy Hoffman
Ari S. Morcos
19
2
0
12 Mar 2020
The Chef's Hat Simulation Environment for Reinforcement-Learning-Based
  Agents
The Chef's Hat Simulation Environment for Reinforcement-Learning-Based Agents
Pablo V. A. Barros
Anne C. Bloem
Inge M. Hootsmans
Lena M. Opheij
Romain H. A. Toebosch
E. Barakova
A. Sciutti
17
9
0
12 Mar 2020
Online Meta-Critic Learning for Off-Policy Actor-Critic Methods
Online Meta-Critic Learning for Off-Policy Actor-Critic Methods
Wei Zhou
Yiying Li
Yongxin Yang
Huaimin Wang
Timothy M. Hospedales
OffRL
30
46
0
11 Mar 2020
Fast Online Adaptation in Robotics through Meta-Learning Embeddings of
  Simulated Priors
Fast Online Adaptation in Robotics through Meta-Learning Embeddings of Simulated Priors
Rituraj Kaushik
Timothée Anne
Jean-Baptiste Mouret
30
52
0
10 Mar 2020
Stable Policy Optimization via Off-Policy Divergence Regularization
Stable Policy Optimization via Off-Policy Divergence Regularization
Ahmed Touati
Amy Zhang
Joelle Pineau
Pascal Vincent
OffRL
30
17
0
09 Mar 2020
Efficiency and Equity are Both Essential: A Generalized Traffic Signal
  Controller with Deep Reinforcement Learning
Efficiency and Equity are Both Essential: A Generalized Traffic Signal Controller with Deep Reinforcement Learning
Shengchao Yan
Jingwei Zhang
Daniel Buescher
Wolfram Burgard
6
8
0
09 Mar 2020
Balance Between Efficient and Effective Learning: Dense2Sparse Reward
  Shaping for Robot Manipulation with Environment Uncertainty
Balance Between Efficient and Effective Learning: Dense2Sparse Reward Shaping for Robot Manipulation with Environment Uncertainty
Yongle Luo
Kun Dong
Lili Zhao
Zhiyong Sun
Chao Zhou
Bo Song
34
13
0
05 Mar 2020
Hierarchically Decoupled Imitation for Morphological Transfer
Hierarchically Decoupled Imitation for Morphological Transfer
D. Hejna
Pieter Abbeel
Lerrel Pinto
LM&Ro
25
40
0
03 Mar 2020
Safe Reinforcement Learning for Autonomous Vehicles through Parallel
  Constrained Policy Optimization
Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization
Lu Wen
Jingliang Duan
Shengbo Eben Li
Shaobing Xu
H. Peng
24
68
0
03 Mar 2020
Rapidly Adaptable Legged Robots via Evolutionary Meta-Learning
Rapidly Adaptable Legged Robots via Evolutionary Meta-Learning
Xingyou Song
Yuxiang Yang
K. Choromanski
Ken Caluwaerts
Wenbo Gao
Chelsea Finn
Jie Tan
114
79
0
02 Mar 2020
AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep
  Reinforcement Learning
AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning
Qijing Huang
Ameer Haj-Ali
William S. Moses
J. Xiang
Ion Stoica
Krste Asanović
J. Wawrzynek
21
56
0
02 Mar 2020
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization
Dongsheng Ding
Xiaohan Wei
Zhuoran Yang
Zhaoran Wang
M. Jovanović
20
159
0
01 Mar 2020
TAdam: A Robust Stochastic Gradient Optimizer
TAdam: A Robust Stochastic Gradient Optimizer
Wendyam Eric Lionel Ilboudo
Taisuke Kobayashi
Kenji Sugimoto
ODL
22
12
0
29 Feb 2020
Off-Policy Deep Reinforcement Learning with Analogous Disentangled
  Exploration
Off-Policy Deep Reinforcement Learning with Analogous Disentangled Exploration
Guy Van den Broeck
Yitao Liang
Mathias Niepert
OffRL
16
3
0
25 Feb 2020
On Reinforcement Learning for Turn-based Zero-sum Markov Games
On Reinforcement Learning for Turn-based Zero-sum Markov Games
Devavrat Shah
Varun Somani
Qiaomin Xie
Zhi Xu
21
11
0
25 Feb 2020
Provable Representation Learning for Imitation Learning via Bi-level
  Optimization
Provable Representation Learning for Imitation Learning via Bi-level Optimization
Sanjeev Arora
S. Du
Sham Kakade
Yuping Luo
Nikunj Saunshi
23
60
0
24 Feb 2020
Sketch Less for More: On-the-Fly Fine-Grained Sketch Based Image
  Retrieval
Sketch Less for More: On-the-Fly Fine-Grained Sketch Based Image Retrieval
A. Bhunia
Yongxin Yang
Timothy M. Hospedales
Tao Xiang
Yi-Zhe Song
35
102
0
24 Feb 2020
Reinforcement Learning Framework for Deep Brain Stimulation Study
Reinforcement Learning Framework for Deep Brain Stimulation Study
Dmitrii Krylov
Rémi Tachet des Combes
Romain Laroche
M. Rosenblum
Dmitry Dylov
OffRL
13
12
0
22 Feb 2020
Guided Constrained Policy Optimization for Dynamic Quadrupedal Robot
  Locomotion
Guided Constrained Policy Optimization for Dynamic Quadrupedal Robot Locomotion
Siddhant Gangapurwala
Alexander L. Mitchell
Ioannis Havoutis
27
54
0
22 Feb 2020
Safe Imitation Learning via Fast Bayesian Reward Inference from
  Preferences
Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences
Daniel S. Brown
Russell Coleman
R. Srinivasan
S. Niekum
BDL
30
101
0
21 Feb 2020
Learning to Walk in the Real World with Minimal Human Effort
Learning to Walk in the Real World with Minimal Human Effort
Sehoon Ha
P. Xu
Zhenyu Tan
Sergey Levine
Jie Tan
29
169
0
20 Feb 2020
Learn to Design the Heuristics for Vehicle Routing Problem
Learn to Design the Heuristics for Vehicle Routing Problem
Lei Gao
Mingxiang Chen
Qichang Chen
Ganzhong Luo
Nuoyi Zhu
Zhixin Liu
19
51
0
20 Feb 2020
Keep Doing What Worked: Behavioral Modelling Priors for Offline
  Reinforcement Learning
Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning
Noah Y. Siegel
Jost Tobias Springenberg
Felix Berkenkamp
A. Abdolmaleki
Michael Neunert
Thomas Lampe
Roland Hafner
Nicolas Heess
Martin Riedmiller
OffRL
22
282
0
19 Feb 2020
Value-driven Hindsight Modelling
Value-driven Hindsight Modelling
A. Guez
Fabio Viola
T. Weber
Lars Buesing
Steven Kapturowski
Doina Precup
David Silver
N. Heess
OffRL
16
12
0
19 Feb 2020
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer
Tianpei Yang
Jianye Hao
Zhaopeng Meng
Zongzhang Zhang
Yujing Hu
...
Changjie Fan
Weixun Wang
Wulong Liu
Zhaodong Wang
J. Peng
OffRL
22
12
0
19 Feb 2020
An Efficient Transfer Learning Framework for Multiagent Reinforcement
  Learning
An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning
Tianpei Yang
Weixun Wang
Hongyao Tang
Jianye Hao
Zhaopeng Meng
...
Wulong Liu
Chen Zhang
Yujing Hu
Yingfeng Chen
Changjie Fan
26
22
0
19 Feb 2020
Reinforcement Learning for Molecular Design Guided by Quantum Mechanics
Reinforcement Learning for Molecular Design Guided by Quantum Mechanics
G. Simm
Robert Pinsler
José Miguel Hernández-Lobato
AI4CE
21
82
0
18 Feb 2020
Learning Zero-Sum Simultaneous-Move Markov Games Using Function
  Approximation and Correlated Equilibrium
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium
Qiaomin Xie
Yudong Chen
Zhaoran Wang
Zhuoran Yang
39
124
0
17 Feb 2020
Jelly Bean World: A Testbed for Never-Ending Learning
Jelly Bean World: A Testbed for Never-Ending Learning
Emmanouil Antonios Platanios
Abulhair Saparov
Tom Michael Mitchell
VLM
18
23
0
15 Feb 2020
Robust Reinforcement Learning via Adversarial training with Langevin
  Dynamics
Robust Reinforcement Learning via Adversarial training with Langevin Dynamics
Parameswaran Kamalaruban
Yu-ting Huang
Ya-Ping Hsieh
Paul Rolland
C. Shi
V. Cevher
31
60
0
14 Feb 2020
Learning Functionally Decomposed Hierarchies for Continuous Control
  Tasks with Path Planning
Learning Functionally Decomposed Hierarchies for Continuous Control Tasks with Path Planning
Sammy Christen
Lukás Jendele
Emre Aksan
Otmar Hilliges
OffRL
27
25
0
14 Feb 2020
Previous
123...129130131...136137138
Next