ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.00088
  4. Cited By
Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to
  Find a Set of Diverse Policies

Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies

31 May 2019
M. A. Masood
Finale Doshi-Velez
ArXivPDFHTML

Papers citing "Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies"

14 / 14 papers shown
Title
Diverse Exploration via Conjugate Policies for Policy Gradient Methods
Diverse Exploration via Conjugate Policies for Policy Gradient Methods
Andrew Cohen
Xingye Qiao
Lei Yu
E. Way
Xiangrong Tong
44
9
0
10 Feb 2019
Learning Self-Imitating Diverse Policies
Learning Self-Imitating Diverse Policies
Tanmay Gangwani
Qiang Liu
Jian Peng
56
67
0
25 May 2018
Stein Points
Stein Points
W. Chen
Lester W. Mackey
Jackson Gorham
François‐Xavier Briol
Chris J. Oates
60
102
0
27 Mar 2018
Diverse Exploration for Fast and Safe Policy Improvement
Diverse Exploration for Fast and Safe Policy Improvement
Andrew Cohen
Lei Yu
Robert Wright
OffRL
41
15
0
22 Feb 2018
Diversity is All You Need: Learning Skills without a Reward Function
Diversity is All You Need: Learning Skills without a Reward Function
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
97
1,085
0
16 Feb 2018
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
Zhang-Wei Hong
Tzu-Yun Shann
Shih-Yang Su
Yi-Hsiang Chang
Chun-Yi Lee
57
124
0
13 Feb 2018
Proximal Policy Optimization Algorithms
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
487
19,019
0
20 Jul 2017
Noisy Networks for Exploration
Noisy Networks for Exploration
Meire Fortunato
M. G. Azar
Bilal Piot
Jacob Menick
Ian Osband
...
Rémi Munos
Demis Hassabis
Olivier Pietquin
Charles Blundell
Shane Legg
79
895
0
30 Jun 2017
Stein Variational Policy Gradient
Stein Variational Policy Gradient
Yang Liu
Prajit Ramachandran
Qiang Liu
Jian-wei Peng
69
139
0
07 Apr 2017
EX2: Exploration with Exemplar Models for Deep Reinforcement Learning
EX2: Exploration with Exemplar Models for Deep Reinforcement Learning
Justin Fu
John D. Co-Reyes
Sergey Levine
OffRL
52
155
0
03 Mar 2017
Reinforcement Learning with Deep Energy-Based Policies
Reinforcement Learning with Deep Energy-Based Policies
Tuomas Haarnoja
Haoran Tang
Pieter Abbeel
Sergey Levine
100
1,340
0
27 Feb 2017
Unifying Count-Based Exploration and Intrinsic Motivation
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
167
1,478
0
06 Jun 2016
Non-Deterministic Policies in Markovian Decision Processes
Non-Deterministic Policies in Markovian Decision Processes
M. M. Fard
Joelle Pineau
92
28
0
16 Jan 2014
A Kernel Method for the Two-Sample Problem
A Kernel Method for the Two-Sample Problem
Arthur Gretton
Karsten Borgwardt
Malte J. Rasch
Bernhard Schölkopf
Alex Smola
231
2,357
0
15 May 2008
1