Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2011.02614
Cited By
Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity
5 November 2020
Tanmay Gangwani
Jian Peng
Yuanshuo Zhou
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity"
26 / 26 papers shown
Title
GenDICE: Generalized Offline Estimation of Stationary Values
Ruiyi Zhang
Bo Dai
Lihong Li
Dale Schuurmans
OffRL
185
174
0
21 Feb 2020
Imitation Learning via Off-Policy Distribution Matching
Ilya Kostrikov
Ofir Nachum
Jonathan Tompson
OOD
OffRL
152
204
0
10 Dec 2019
Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning
T. Doan
Bogdan Mazoure
Moloud Abdar
A. Durand
Joelle Pineau
R. Devon Hjelm
60
15
0
17 Sep 2019
Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination
Shauharda Khadka
Somdeb Majumdar
Santiago Miret
Stephen McAleer
Kagan Tumer
49
60
0
18 Jun 2019
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
Ofir Nachum
Yinlam Chow
Bo Dai
Lihong Li
OffRL
151
337
0
10 Jun 2019
Collaborative Evolutionary Reinforcement Learning
Shauharda Khadka
Somdeb Majumdar
Tarek Nassar
Zach Dwiel
E. Tumer
Santiago Miret
Yinyin Liu
Kagan Tumer
50
100
0
02 May 2019
Off-Policy Policy Gradient with State Distribution Correction
Yao Liu
Adith Swaminathan
Alekh Agarwal
Emma Brunskill
OffRL
157
67
0
17 Apr 2019
Emergent Coordination Through Competition
Siqi Liu
Guy Lever
J. Merel
S. Tunyasuvunakool
N. Heess
T. Graepel
87
150
0
19 Feb 2019
Hardware Conditioned Policies for Multi-Robot Transfer Learning
Tao Chen
Adithyavairavan Murali
Abhinav Gupta
59
102
0
24 Nov 2018
Learning Self-Imitating Diverse Policies
Tanmay Gangwani
Qiang Liu
Jian Peng
61
67
0
25 May 2018
Diversity is All You Need: Learning Skills without a Reward Function
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
99
1,085
0
16 Feb 2018
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
Zhang-Wei Hong
Tzu-Yun Shann
Shih-Yang Su
Yi-Hsiang Chang
Chun-Yi Lee
62
124
0
13 Feb 2018
Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents
Edoardo Conti
Vashisht Madhavan
F. Such
Joel Lehman
Kenneth O. Stanley
Jeff Clune
68
347
0
18 Dec 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
517
19,065
0
20 Jul 2017
Quality and Diversity Optimization: A Unifying Modular Framework
Antoine Cully
Y. Demiris
66
271
0
12 May 2017
Stochastic Neural Networks for Hierarchical Reinforcement Learning
Carlos Florensa
Yan Duan
Pieter Abbeel
BDL
90
361
0
10 Apr 2017
Stein Variational Policy Gradient
Yang Liu
Prajit Ramachandran
Qiang Liu
Jian-wei Peng
69
139
0
07 Apr 2017
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Tim Salimans
Jonathan Ho
Xi Chen
Szymon Sidor
Ilya Sutskever
92
1,541
0
10 Mar 2017
Stein Variational Gradient Descent: A General Purpose Bayesian Inference Algorithm
Qiang Liu
Dilin Wang
BDL
73
1,092
0
16 Aug 2016
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
223
5,077
0
05 Jun 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
424
576
0
04 Apr 2016
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
101
3,414
0
08 Jun 2015
Illuminating search spaces by mapping elites
Jean-Baptiste Mouret
Jeff Clune
76
735
0
20 Apr 2015
An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning
R. Sutton
A. R. Mahmood
Martha White
88
271
0
14 Mar 2015
Robots that can adapt like animals
Antoine Cully
Jeff Clune
Danesh Tarapore
Jean-Baptiste Mouret
87
1,037
0
13 Jul 2014
Estimating divergence functionals and the likelihood ratio by convex risk minimization
X. Nguyen
Martin J. Wainwright
Michael I. Jordan
223
803
0
04 Sep 2008
1