ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.02614
  4. Cited By
Harnessing Distribution Ratio Estimators for Learning Agents with
  Quality and Diversity

Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity

5 November 2020
Tanmay Gangwani
Jian Peng
Yuanshuo Zhou
ArXiv (abs)PDFHTML

Papers citing "Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity"

26 / 26 papers shown
Title
GenDICE: Generalized Offline Estimation of Stationary Values
GenDICE: Generalized Offline Estimation of Stationary Values
Ruiyi Zhang
Bo Dai
Lihong Li
Dale Schuurmans
OffRL
185
174
0
21 Feb 2020
Imitation Learning via Off-Policy Distribution Matching
Imitation Learning via Off-Policy Distribution Matching
Ilya Kostrikov
Ofir Nachum
Jonathan Tompson
OODOffRL
152
204
0
10 Dec 2019
Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement
  Learning
Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning
T. Doan
Bogdan Mazoure
Moloud Abdar
A. Durand
Joelle Pineau
R. Devon Hjelm
60
15
0
17 Sep 2019
Evolutionary Reinforcement Learning for Sample-Efficient Multiagent
  Coordination
Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination
Shauharda Khadka
Somdeb Majumdar
Santiago Miret
Stephen McAleer
Kagan Tumer
49
60
0
18 Jun 2019
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary
  Distribution Corrections
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
Ofir Nachum
Yinlam Chow
Bo Dai
Lihong Li
OffRL
151
337
0
10 Jun 2019
Collaborative Evolutionary Reinforcement Learning
Collaborative Evolutionary Reinforcement Learning
Shauharda Khadka
Somdeb Majumdar
Tarek Nassar
Zach Dwiel
E. Tumer
Santiago Miret
Yinyin Liu
Kagan Tumer
50
100
0
02 May 2019
Off-Policy Policy Gradient with State Distribution Correction
Off-Policy Policy Gradient with State Distribution Correction
Yao Liu
Adith Swaminathan
Alekh Agarwal
Emma Brunskill
OffRL
157
67
0
17 Apr 2019
Emergent Coordination Through Competition
Emergent Coordination Through Competition
Siqi Liu
Guy Lever
J. Merel
S. Tunyasuvunakool
N. Heess
T. Graepel
87
150
0
19 Feb 2019
Hardware Conditioned Policies for Multi-Robot Transfer Learning
Hardware Conditioned Policies for Multi-Robot Transfer Learning
Tao Chen
Adithyavairavan Murali
Abhinav Gupta
59
102
0
24 Nov 2018
Learning Self-Imitating Diverse Policies
Learning Self-Imitating Diverse Policies
Tanmay Gangwani
Qiang Liu
Jian Peng
61
67
0
25 May 2018
Diversity is All You Need: Learning Skills without a Reward Function
Diversity is All You Need: Learning Skills without a Reward Function
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
99
1,085
0
16 Feb 2018
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
Zhang-Wei Hong
Tzu-Yun Shann
Shih-Yang Su
Yi-Hsiang Chang
Chun-Yi Lee
62
124
0
13 Feb 2018
Improving Exploration in Evolution Strategies for Deep Reinforcement
  Learning via a Population of Novelty-Seeking Agents
Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents
Edoardo Conti
Vashisht Madhavan
F. Such
Joel Lehman
Kenneth O. Stanley
Jeff Clune
68
347
0
18 Dec 2017
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
517
19,065
0
20 Jul 2017
Quality and Diversity Optimization: A Unifying Modular Framework
Quality and Diversity Optimization: A Unifying Modular Framework
Antoine Cully
Y. Demiris
66
271
0
12 May 2017
Stochastic Neural Networks for Hierarchical Reinforcement Learning
Stochastic Neural Networks for Hierarchical Reinforcement Learning
Carlos Florensa
Yan Duan
Pieter Abbeel
BDL
90
361
0
10 Apr 2017
Stein Variational Policy Gradient
Stein Variational Policy Gradient
Yang Liu
Prajit Ramachandran
Qiang Liu
Jian-wei Peng
69
139
0
07 Apr 2017
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
Tim Salimans
Jonathan Ho
Xi Chen
Szymon Sidor
Ilya Sutskever
92
1,541
0
10 Mar 2017
Stein Variational Gradient Descent: A General Purpose Bayesian Inference
  Algorithm
Stein Variational Gradient Descent: A General Purpose Bayesian Inference Algorithm
Qiang Liu
Dilin Wang
BDL
73
1,092
0
16 Aug 2016
OpenAI Gym
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRLODL
223
5,077
0
05 Jun 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
424
576
0
04 Apr 2016
High-Dimensional Continuous Control Using Generalized Advantage
  Estimation
High-Dimensional Continuous Control Using Generalized Advantage Estimation
John Schulman
Philipp Moritz
Sergey Levine
Michael I. Jordan
Pieter Abbeel
OffRL
101
3,414
0
08 Jun 2015
Illuminating search spaces by mapping elites
Illuminating search spaces by mapping elites
Jean-Baptiste Mouret
Jeff Clune
76
735
0
20 Apr 2015
An Emphatic Approach to the Problem of Off-policy Temporal-Difference
  Learning
An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning
R. Sutton
A. R. Mahmood
Martha White
88
271
0
14 Mar 2015
Robots that can adapt like animals
Robots that can adapt like animals
Antoine Cully
Jeff Clune
Danesh Tarapore
Jean-Baptiste Mouret
87
1,037
0
13 Jul 2014
Estimating divergence functionals and the likelihood ratio by convex
  risk minimization
Estimating divergence functionals and the likelihood ratio by convex risk minimization
X. Nguyen
Martin J. Wainwright
Michael I. Jordan
223
803
0
04 Sep 2008
1