ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.06347
  4. Cited By
Proximal Policy Optimization Algorithms

Proximal Policy Optimization Algorithms

20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
    OffRL
ArXivPDFHTML

Papers citing "Proximal Policy Optimization Algorithms"

50 / 6,731 papers shown
Title
Policy Search by Target Distribution Learning for Continuous Control
Policy Search by Target Distribution Learning for Continuous Control
Chuheng Zhang
Yuanqi Li
Jian Li
19
6
0
27 May 2019
Composing Task-Agnostic Policies with Deep Reinforcement Learning
Composing Task-Agnostic Policies with Deep Reinforcement Learning
A. H. Qureshi
Jacob J. Johnson
Yuzhe Qin
Taylor Henderson
Byron Boots
Michael C. Yip
OffRL
22
30
0
25 May 2019
Adversarial Policies: Attacking Deep Reinforcement Learning
Adversarial Policies: Attacking Deep Reinforcement Learning
Adam Gleave
Michael Dennis
Cody Wild
Neel Kant
Sergey Levine
Stuart J. Russell
AAML
27
349
0
25 May 2019
Adaptive Symmetric Reward Noising for Reinforcement Learning
Adaptive Symmetric Reward Noising for Reinforcement Learning
R. Vivanti
Talya D. Sohlberg-Baris
Shlomo Cohen
Orna Cohen
AAML
11
1
0
24 May 2019
Arena: A General Evaluation Platform and Building Toolkit for
  Multi-Agent Intelligence
Arena: A General Evaluation Platform and Building Toolkit for Multi-Agent Intelligence
Yuhang Song
Andrzej Wojcicki
Thomas Lukasiewicz
Jianyi Wang
Abi Aryan
Zhenghua Xu
Mai Xu
Zihan Ding
Lianlong Wu
AI4CE
ELM
27
33
0
17 May 2019
Meta reinforcement learning as task inference
Meta reinforcement learning as task inference
Jan Humplik
Alexandre Galashov
Leonard Hasenclever
Pedro A. Ortega
Yee Whye Teh
N. Heess
OffRL
31
127
0
15 May 2019
Trajectory-Based Off-Policy Deep Reinforcement Learning
Trajectory-Based Off-Policy Deep Reinforcement Learning
Andreas Doerr
Michael Volpp
Marc Toussaint
Sebastian Trimpe
Christian Daniel
OffRL
26
2
0
14 May 2019
Task-Agnostic Dynamics Priors for Deep Reinforcement Learning
Task-Agnostic Dynamics Priors for Deep Reinforcement Learning
Yilun Du
Karthik Narasimhan
19
33
0
13 May 2019
Design of Artificial Intelligence Agents for Games using Deep
  Reinforcement Learning
Design of Artificial Intelligence Agents for Games using Deep Reinforcement Learning
A. Roibu
27
1
0
10 May 2019
Attention-based Deep Reinforcement Learning for Multi-view Environments
Attention-based Deep Reinforcement Learning for Multi-view Environments
Elaheh Barati
Xuewen Chen
Z. Zhong
22
6
0
10 May 2019
P3O: Policy-on Policy-off Policy Optimization
P3O: Policy-on Policy-off Policy Optimization
Rasool Fakoor
Pratik Chaudhari
Alex Smola
OffRL
20
51
0
05 May 2019
Scaling and Benchmarking Self-Supervised Visual Representation Learning
Scaling and Benchmarking Self-Supervised Visual Representation Learning
Priya Goyal
D. Mahajan
Abhinav Gupta
Ishan Misra
SSL
24
396
0
03 May 2019
From Video Game to Real Robot: The Transfer between Action Spaces
From Video Game to Real Robot: The Transfer between Action Spaces
Janne Karttunen
Anssi Kanervisto
Ville Kyrki
Ville Hautamaki
12
8
0
02 May 2019
DAC: The Double Actor-Critic Architecture for Learning Options
DAC: The Double Actor-Critic Architecture for Learning Options
Shangtong Zhang
Shimon Whiteson
22
72
0
29 Apr 2019
Arbitrage of Energy Storage in Electricity Markets with Deep
  Reinforcement Learning
Arbitrage of Energy Storage in Electricity Markets with Deep Reinforcement Learning
Hanchen Xu
Xiao Li
Xiangyu Zhang
Junbo Zhang
15
26
0
28 Apr 2019
Neural Logic Reinforcement Learning
Neural Logic Reinforcement Learning
Zhengyao Jiang
Shan Luo
NAI
27
71
0
24 Apr 2019
Model-free Deep Reinforcement Learning for Urban Autonomous Driving
Model-free Deep Reinforcement Learning for Urban Autonomous Driving
Jianyu Chen
Bodi Yuan
Masayoshi Tomizuka
24
262
0
20 Apr 2019
ConvLab: Multi-Domain End-to-End Dialog System Platform
ConvLab: Multi-Domain End-to-End Dialog System Platform
Sungjin Lee
Qi Zhu
Ryuichi Takanobu
Xiang Li
Yaoqin Zhang
...
Jinchao Li
Baolin Peng
Xiujun Li
Minlie Huang
Jianfeng Gao
VLM
27
110
0
18 Apr 2019
Decoupled Data Based Approach for Learning to Control Nonlinear
  Dynamical Systems
Decoupled Data Based Approach for Learning to Control Nonlinear Dynamical Systems
Ran A. Wang
Karthikeya S. Parunandi
Dan Yu
D. Kalathil
S. Chakravorty
15
11
0
17 Apr 2019
Learning to Guide: Guidance Law Based on Deep Meta-learning and Model
  Predictive Path Integral Control
Learning to Guide: Guidance Law Based on Deep Meta-learning and Model Predictive Path Integral Control
Chen Liang
Weihong Wang
Zhenghua Liu
Chao Lai
Benchun Zhou
19
28
0
15 Apr 2019
Model-Free Reinforcement Learning for Financial Portfolios: A Brief
  Survey
Model-Free Reinforcement Learning for Financial Portfolios: A Brief Survey
Yoshiharu Sato
OffRL
24
32
0
10 Apr 2019
Policy Gradient Search: Online Planning and Expert Iteration without
  Search Trees
Policy Gradient Search: Online Planning and Expert Iteration without Search Trees
Thomas W. Anthony
Robert Nishihara
Philipp Moritz
Tim Salimans
John Schulman
25
30
0
07 Apr 2019
Multi-Preference Actor Critic
Multi-Preference Actor Critic
Ishan Durugkar
Matthew J. Hausknecht
Adith Swaminathan
Patrick MacAlpine
14
1
0
05 Apr 2019
Architecture Search of Dynamic Cells for Semantic Video Segmentation
Architecture Search of Dynamic Cells for Semantic Video Segmentation
Vladimir Nekrasov
Hao Chen
Chunhua Shen
Ian Reid
32
20
0
04 Apr 2019
PaintBot: A Reinforcement Learning Approach for Natural Media Painting
PaintBot: A Reinforcement Learning Approach for Natural Media Painting
Biao Jia
Chen Fang
Jonathan Brandt
Byungmoon Kim
Tianyi Zhou
22
15
0
03 Apr 2019
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning
  Without a Supercomputer
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer
E. Beeching
Christian Wolf
J. Dibangoye
Olivier Simonin
OffRL
LRM
35
25
0
03 Apr 2019
Autoregressive Policies for Continuous Control Deep Reinforcement
  Learning
Autoregressive Policies for Continuous Control Deep Reinforcement Learning
D. Korenkevych
A. R. Mahmood
G. Vasan
James Bergstra
19
28
0
27 Mar 2019
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Q-Learning for Continuous Actions with Cross-Entropy Guided Policies
Riley Simmons-Edler
Ben Eisner
E. Mitchell
Sebastian Seung
Daniel D. Lee
44
28
0
25 Mar 2019
Learning a Multi-Modal Policy via Imitating Demonstrations with Mixed
  Behaviors
Learning a Multi-Modal Policy via Imitating Demonstrations with Mixed Behaviors
Fang-I Hsiao
Jui-Hsuan Kuo
Min Sun
OffRL
18
14
0
25 Mar 2019
HouseExpo: A Large-scale 2D Indoor Layout Dataset for Learning-based
  Algorithms on Mobile Robots
HouseExpo: A Large-scale 2D Indoor Layout Dataset for Learning-based Algorithms on Mobile Robots
Tingguang Li
Danny Ho
Chenming Li
Delong Zhu
Chaoqun Wang
Max Q.-H. Meng
3DV
19
55
0
23 Mar 2019
Macro Action Reinforcement Learning with Sequence Disentanglement using
  Variational Autoencoder
Macro Action Reinforcement Learning with Sequence Disentanglement using Variational Autoencoder
Heecheol Kim
Masanori Yamada
Kosuke Miyoshi
Hiroshi Yamakawa
DRL
16
6
0
22 Mar 2019
Sim-to-(Multi)-Real: Transfer of Low-Level Robust Control Policies to
  Multiple Quadrotors
Sim-to-(Multi)-Real: Transfer of Low-Level Robust Control Policies to Multiple Quadrotors
Artem Molchanov
Tao Chen
Wolfgang Hönig
James A. Preiss
Nora Ayanian
Gaurav Sukhatme
24
107
0
11 Mar 2019
Learning to Paint With Model-based Deep Reinforcement Learning
Learning to Paint With Model-based Deep Reinforcement Learning
Zhewei Huang
Wen Heng
Shuchang Zhou
GAN
21
16
0
11 Mar 2019
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy
  Critics
Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Denis Steckelmacher
Hélène Plisnier
D. Roijers
A. Nowé
OffRL
23
17
0
11 Mar 2019
Pixel-Attentive Policy Gradient for Multi-Fingered Grasping in Cluttered
  Scenes
Pixel-Attentive Policy Gradient for Multi-Fingered Grasping in Cluttered Scenes
Bohan Wu
Iretiayo Akinola
Peter K. Allen
22
34
0
08 Mar 2019
Provably Robust Blackbox Optimization for Reinforcement Learning
Provably Robust Blackbox Optimization for Reinforcement Learning
K. Choromanski
Aldo Pacchiano
Jack Parker-Holder
Yunhao Tang
Deepali Jain
Yuxiang Yang
Atil Iscen
Jasmine Hsu
Vikas Sindhwani
13
5
0
07 Mar 2019
Using Natural Language for Reward Shaping in Reinforcement Learning
Using Natural Language for Reward Shaping in Reinforcement Learning
Prasoon Goyal
S. Niekum
Raymond J. Mooney
LM&Ro
41
175
0
05 Mar 2019
Episodic Learning with Control Lyapunov Functions for Uncertain Robotic
  Systems
Episodic Learning with Control Lyapunov Functions for Uncertain Robotic Systems
Andrew J. Taylor
Victor D. Dorobantu
Hoang Minh Le
Yisong Yue
Aaron D. Ames
117
78
0
04 Mar 2019
Sim-to-Real Transfer for Biped Locomotion
Sim-to-Real Transfer for Biped Locomotion
Wenhao Yu
Visak C. V. Kumar
Greg Turk
Chenxi Liu
12
113
0
04 Mar 2019
Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space
Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space
Zhou Fan
Ruilong Su
Weinan Zhang
Yong Yu
14
133
0
04 Mar 2019
Efficient Reinforcement Learning for StarCraft by Abstract Forward
  Models and Transfer Learning
Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning
Ruo-Ze Liu
Haifeng Guo
Xiaozhong Ji
Yang Yu
Zhen-Jia Pang
Zitai Xiao
Yuzhou Wu
Tong Lu
OffRL
19
13
0
02 Mar 2019
Regularity Normalization: Neuroscience-Inspired Unsupervised Attention
  across Neural Network Layers
Regularity Normalization: Neuroscience-Inspired Unsupervised Attention across Neural Network Layers
Baihan Lin
16
2
0
27 Feb 2019
Neural Packet Classification
Neural Packet Classification
Eric Liang
Hang Zhu
Xin Jin
Ion Stoica
OffRL
35
120
0
27 Feb 2019
Design of intentional backdoors in sequential models
Design of intentional backdoors in sequential models
Zhaoyuan Yang
N. Iyer
Johan Reimann
Nurali Virani
SILM
AAML
17
38
0
26 Feb 2019
Cooperative Learning of Disjoint Syntax and Semantics
Cooperative Learning of Disjoint Syntax and Semantics
Serhii Havrylov
Germán Kruszewski
Armand Joulin
18
48
0
25 Feb 2019
Investigating Generalisation in Continuous Deep Reinforcement Learning
Investigating Generalisation in Continuous Deep Reinforcement Learning
Chenyang Zhao
Olivier Sigaud
F. Stulp
Timothy M. Hospedales
OffRL
22
48
0
19 Feb 2019
Neural-encoding Human Experts' Domain Knowledge to Warm Start
  Reinforcement Learning
Neural-encoding Human Experts' Domain Knowledge to Warm Start Reinforcement Learning
Andrew Silva
Matthew C. Gombolay
OffRL
27
20
0
15 Feb 2019
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy
  Observations
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations
Yuhui Wang
Hao He
Xiaoyang Tan
30
9
0
15 Feb 2019
Learn a Prior for RHEA for Better Online Planning
Learn a Prior for RHEA for Better Online Planning
Xinyao Tong
W. Liu
Bin Li
OffRL
32
0
0
14 Feb 2019
Non-Asymptotic Analysis of Monte Carlo Tree Search
Non-Asymptotic Analysis of Monte Carlo Tree Search
Devavrat Shah
Qiaomin Xie
Zhi Xu
13
9
0
14 Feb 2019
Previous
123...131132133134135
Next