ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.06347
  4. Cited By
Proximal Policy Optimization Algorithms

Proximal Policy Optimization Algorithms

20 July 2017
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
    OffRL
ArXivPDFHTML

Papers citing "Proximal Policy Optimization Algorithms"

50 / 6,944 papers shown
Title
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Meta-Gradient Reinforcement Learning with an Objective Discovered Online
Zhongwen Xu
H. V. Hasselt
Matteo Hessel
Junhyuk Oh
Satinder Singh
David Silver
19
77
0
16 Jul 2020
Weighing Counts: Sequential Crowd Counting by Reinforcement Learning
Weighing Counts: Sequential Crowd Counting by Reinforcement Learning
Liang Liu
Hao Lu
Hongwei Zou
Haipeng Xiong
Zhiguo Cao
Chunhua Shen
OffRL
27
70
0
16 Jul 2020
Efficient Empowerment Estimation for Unsupervised Stabilization
Efficient Empowerment Estimation for Unsupervised Stabilization
Ruihan Zhao
Kevin Lu
Pieter Abbeel
Stas Tiomkin
32
8
0
14 Jul 2020
Explore and Explain: Self-supervised Navigation and Recounting
Explore and Explain: Self-supervised Navigation and Recounting
Roberto Bigazzi
Federico Landi
Marcella Cornia
S. Cascianelli
Lorenzo Baraldi
Rita Cucchiara
EgoV
LM&Ro
24
17
0
14 Jul 2020
Robustifying Reinforcement Learning Agents via Action Space Adversarial
  Training
Robustifying Reinforcement Learning Agents via Action Space Adversarial Training
Kai Liang Tan
Yasaman Esfandiari
Xian Yeow Lee
Aakanksha
S. Sarkar
AAML
26
55
0
14 Jul 2020
Lifelong Policy Gradient Learning of Factored Policies for Faster
  Training Without Forgetting
Lifelong Policy Gradient Learning of Factored Policies for Faster Training Without Forgetting
Jorge Armando Mendez Mendez
Boyu Wang
Eric Eaton
CLL
36
38
0
14 Jul 2020
Reinforcement Learning of Musculoskeletal Control from Functional
  Simulations
Reinforcement Learning of Musculoskeletal Control from Functional Simulations
Emanuel Joos
Fabien Péan
Orçun Göksel
AI4CE
25
12
0
13 Jul 2020
Relational-Grid-World: A Novel Relational Reasoning Environment and An
  Agent Model for Relational Information Extraction
Relational-Grid-World: A Novel Relational Reasoning Environment and An Agent Model for Relational Information Extraction
Faruk Küçüksubasi
Elif Surer
24
2
0
12 Jul 2020
Long-Term Planning with Deep Reinforcement Learning on Autonomous Drones
Long-Term Planning with Deep Reinforcement Learning on Autonomous Drones
Ugurkan Ates
23
10
0
11 Jul 2020
An Asymptotically Optimal Multi-Armed Bandit Algorithm and
  Hyperparameter Optimization
An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter Optimization
Yimin Huang
Yujun Li
Hanrong Ye
Zhenguo Li
Zhihua Zhang
30
7
0
11 Jul 2020
Learning to plan with uncertain topological maps
Learning to plan with uncertain topological maps
E. Beeching
J. Dibangoye
Olivier Simonin
Christian Wolf
19
40
0
10 Jul 2020
A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis
  and Application to Actor-Critic
A Two-Timescale Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic
Mingyi Hong
Hoi-To Wai
Zhaoran Wang
Zhuoran Yang
18
135
0
10 Jul 2020
Auxiliary Tasks Speed Up Learning PointGoal Navigation
Auxiliary Tasks Speed Up Learning PointGoal Navigation
Joel Ye
Dhruv Batra
Erik Wijmans
Abhishek Das
3DPC
EgoV
17
79
0
09 Jul 2020
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Adam Stooke
Joshua Achiam
Pieter Abbeel
31
287
0
08 Jul 2020
Natural Emergence of Heterogeneous Strategies in Artificially
  Intelligent Competitive Teams
Natural Emergence of Heterogeneous Strategies in Artificially Intelligent Competitive Teams
A. Deka
Katia P. Sycara
AAML
31
32
0
06 Jul 2020
Decentralized Reinforcement Learning: Global Decision-Making via Local
  Economic Transactions
Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions
Michael Chang
Sid Kaushik
Matthew C. Weinberg
Thomas Griffiths
Sergey Levine
12
16
0
05 Jul 2020
Variational Policy Gradient Method for Reinforcement Learning with
  General Utilities
Variational Policy Gradient Method for Reinforcement Learning with General Utilities
Junyu Zhang
Alec Koppel
Amrit Singh Bedi
Csaba Szepesvári
Mengdi Wang
27
138
0
04 Jul 2020
Discount Factor as a Regularizer in Reinforcement Learning
Discount Factor as a Regularizer in Reinforcement Learning
Ron Amit
Ron Meir
K. Ciosek
OffRL
25
72
0
04 Jul 2020
Reinforcement Learning based Control of Imitative Policies for
  Near-Accident Driving
Reinforcement Learning based Control of Imitative Policies for Near-Accident Driving
Zhangjie Cao
Erdem Biyik
Woodrow Z. Wang
Allan Raventos
Adrien Gaidon
Guy Rosman
Dorsa Sadigh
34
65
0
01 Jul 2020
MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning
MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning
Elise van der Pol
Daniel E. Worrall
H. V. Hoof
F. Oliehoek
Max Welling
BDL
AI4CE
31
156
0
30 Jun 2020
Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential
  Advertising
Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
Xiaotian Hao
Zhaoqing Peng
Yi Ma
Guanlong Wang
Junqi Jin
...
Zhenzhe Zheng
Chuan Yu
Han Li
Jian Xu
Kun Gai
15
26
0
29 Jun 2020
Active Finite Reward Automaton Inference and Reinforcement Learning
  Using Queries and Counterexamples
Active Finite Reward Automaton Inference and Reinforcement Learning Using Queries and Counterexamples
Zhe Xu
Bo Wu
Aditya Ojha
Daniel Neider
Ufuk Topcu
OffRL
19
30
0
28 Jun 2020
A Unifying Framework for Reinforcement Learning and Planning
A Unifying Framework for Reinforcement Learning and Planning
Thomas M. Moerland
Joost Broekens
Aske Plaat
Catholijn M. Jonker
OffRL
36
9
0
26 Jun 2020
Online 3D Bin Packing with Constrained Deep Reinforcement Learning
Online 3D Bin Packing with Constrained Deep Reinforcement Learning
Hang Zhao
Qijin She
Chenyang Zhu
Yin Yang
Kai Xu
OffRL
19
26
0
26 Jun 2020
A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
Shengyi Huang
Santiago Ontañón
35
310
0
25 Jun 2020
Control-Aware Representations for Model-based Reinforcement Learning
Control-Aware Representations for Model-based Reinforcement Learning
Brandon Cui
Yinlam Chow
Mohammad Ghavamzadeh
BDL
18
13
0
24 Jun 2020
Automatic Data Augmentation for Generalization in Deep Reinforcement
  Learning
Automatic Data Augmentation for Generalization in Deep Reinforcement Learning
Roberta Raileanu
M. Goldstein
Denis Yarats
Ilya Kostrikov
Rob Fergus
OffRL
22
109
0
23 Jun 2020
Safe Reinforcement Learning via Curriculum Induction
Safe Reinforcement Learning via Curriculum Induction
M. Turchetta
Andrey Kolobov
S. Shah
Andreas Krause
Alekh Agarwal
23
91
0
22 Jun 2020
Automated Optical Multi-layer Design via Deep Reinforcement Learning
Automated Optical Multi-layer Design via Deep Reinforcement Learning
Haozhu Wang
Zeyu Zheng
Chengang Ji
L. J. Guo
20
3
0
21 Jun 2020
Breaking the Curse of Many Agents: Provable Mean Embedding Q-Iteration
  for Mean-Field Reinforcement Learning
Breaking the Curse of Many Agents: Provable Mean Embedding Q-Iteration for Mean-Field Reinforcement Learning
Lingxiao Wang
Zhuoran Yang
Zhaoran Wang
32
26
0
21 Jun 2020
An adaptive stochastic gradient-free approach for high-dimensional
  blackbox optimization
An adaptive stochastic gradient-free approach for high-dimensional blackbox optimization
Anton Dereventsov
Clayton Webster
Joseph Daws
22
10
0
18 Jun 2020
Reparameterized Variational Divergence Minimization for Stable Imitation
Reparameterized Variational Divergence Minimization for Stable Imitation
Dilip Arumugam
Debadeepta Dey
Alekh Agarwal
Asli Celikyilmaz
E. Nouri
W. Dolan
33
3
0
18 Jun 2020
Deep Reinforcement Learning amidst Lifelong Non-Stationarity
Deep Reinforcement Learning amidst Lifelong Non-Stationarity
Annie Xie
James Harrison
Chelsea Finn
CLL
OffRL
35
64
0
18 Jun 2020
DREAM: Deep Regret minimization with Advantage baselines and Model-free
  learning
DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Eric Steinberger
Adam Lerer
Noam Brown
36
53
0
18 Jun 2020
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using
  Deep Reinforcement Learning
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using Deep Reinforcement Learning
Eivind Meyer
Amalie Heiberg
Adil Rasheed
Omer San
38
74
0
16 Jun 2020
Solving the Order Batching and Sequencing Problem using Deep
  Reinforcement Learning
Solving the Order Batching and Sequencing Problem using Deep Reinforcement Learning
Bram Cals
Yingqian Zhang
R. Dijkman
Claudy van Dorst
OffRL
22
29
0
16 Jun 2020
Semantic Curiosity for Active Visual Learning
Semantic Curiosity for Active Visual Learning
Devendra Singh Chaplot
Helen Jiang
Saurabh Gupta
Abhinav Gupta
ObjD
16
72
0
16 Jun 2020
ShieldNN: A Provably Safe NN Filter for Unsafe NN Controllers
ShieldNN: A Provably Safe NN Filter for Unsafe NN Controllers
James Ferlez
Mahmoud M. Elnaggar
Yasser Shoukry
C. Fleming
AAML
62
33
0
16 Jun 2020
Designing high-fidelity multi-qubit gates for semiconductor quantum dots
  through deep reinforcement learning
Designing high-fidelity multi-qubit gates for semiconductor quantum dots through deep reinforcement learning
Sahar Daraeizadeh
S. Premaratne
A. Matsuura
14
5
0
15 Jun 2020
Efficient Model-Based Reinforcement Learning through Optimistic Policy
  Search and Planning
Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning
Sebastian Curi
Felix Berkenkamp
Andreas Krause
33
82
0
15 Jun 2020
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in
  Cooperative Tasks
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Georgios Papoudakis
Filippos Christianos
Lukas Schafer
Stefano V. Albrecht
OffRL
26
220
0
14 Jun 2020
Reinforcement Learning with Supervision from Noisy Demonstrations
Reinforcement Learning with Supervision from Noisy Demonstrations
Kun-Peng Ning
Sheng-Jun Huang
14
7
0
14 Jun 2020
Non-local Policy Optimization via Diversity-regularized Collaborative
  Exploration
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration
Zhenghao Peng
Hao Sun
Bolei Zhou
18
18
0
14 Jun 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
33
24
0
12 Jun 2020
Residual Force Control for Agile Human Behavior Imitation and Extended
  Motion Synthesis
Residual Force Control for Agile Human Behavior Imitation and Extended Motion Synthesis
Ye Yuan
Kris Kitani
29
75
0
12 Jun 2020
Deep Reinforcement and InfoMax Learning
Deep Reinforcement and InfoMax Learning
Bogdan Mazoure
Rémi Tachet des Combes
T. Doan
Philip Bachman
R. Devon Hjelm
AI4CE
25
108
0
12 Jun 2020
SAMBA: Safe Model-Based & Active Reinforcement Learning
SAMBA: Safe Model-Based & Active Reinforcement Learning
Alexander I. Cowen-Rivers
Daniel Palenicek
Vincent Moens
Mohammed Abdullah
Aivar Sootla
Jun Wang
Haitham Bou-Ammar
23
44
0
12 Jun 2020
Does Unsupervised Architecture Representation Learning Help Neural
  Architecture Search?
Does Unsupervised Architecture Representation Learning Help Neural Architecture Search?
Shen Yan
Yu Zheng
Wei Ao
Xiao Zeng
Mi Zhang
SSL
AI4CE
32
99
0
12 Jun 2020
Avoiding Side Effects in Complex Environments
Avoiding Side Effects in Complex Environments
Alexander Matt Turner
Neale Ratzlaff
Prasad Tadepalli
30
34
0
11 Jun 2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale
  Empirical Study
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study
Marcin Andrychowicz
Anton Raichuk
Piotr Stańczyk
Manu Orsini
Sertan Girgin
...
M. Geist
Olivier Pietquin
Marcin Michalski
Sylvain Gelly
Olivier Bachem
OffRL
31
214
0
10 Jun 2020
Previous
123...128129130...137138139
Next