1805.10309
Learning Self-Imitating Diverse Policies
25 May 2018
Tanmay Gangwani
Qiang Liu
Jian Peng
Papers citing
"Learning Self-Imitating Diverse Policies"
14 / 14 papers shown
Title
Preference-Guided Reinforcement Learning for Efficient Exploration
Guojian Wang
Faguo Wu
Xiao Zhang
Tianyuan Chen
Xuyang Chen
Lin Zhao
40
0
0
09 Jul 2024
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
Guojian Wang
Faguo Wu
Xiao Zhang
Ning Guo
Zhiming Zheng
41
3
0
27 Dec 2023
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
26
324
0
02 May 2022
A Closer Look at Advantage-Filtered Behavioral Cloning in High-Noise Datasets
J. E. Grigsby
Yanjun Qi
OffRL
29
5
0
10 Oct 2021
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Jianye Hao
Tianpei Yang
Hongyao Tang
Chenjia Bai
Jinyi Liu
Zhaopeng Meng
Peng Liu
Zhen Wang
OffRL
36
93
0
14 Sep 2021
Discovering Generalizable Skills via Automated Generation of Diverse Tasks
Kuan Fang
Yuke Zhu
Silvio Savarese
Li Fei-Fei
48
6
0
26 Jun 2021
Simplifying Deep Reinforcement Learning via Self-Supervision
Daochen Zha
Kwei-Herng Lai
Kaixiong Zhou
Xia Hu
SSL
49
15
0
10 Jun 2021
Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity
Tanmay Gangwani
Jian Peng
Yuanshuo Zhou
26
10
0
05 Nov 2020
Self-Imitation Learning via Generalized Lower Bound Q-learning
Yunhao Tang
SSL
33
24
0
12 Jun 2020
Novel Policy Seeking with Constrained Optimization
Hao Sun
Zhenghao Peng
Bo Dai
Jian Guo
Dahua Lin
Bolei Zhou
24
13
0
21 May 2020
Population-Guided Parallel Policy Search for Reinforcement Learning
Whiyoung Jung
Giseung Park
Y. Sung
OffRL
24
38
0
09 Jan 2020
Multi-Path Policy Optimization
L. Pan
Qingpeng Cai
Longbo Huang
18
2
0
11 Nov 2019
Active Domain Randomization
Bhairav Mehta
Manfred Diaz
Florian Golemo
C. Pal
Liam Paull
24
256
0
09 Apr 2019
Amplifying the Imitation Effect for Reinforcement Learning of UCAV's Mission Execution
G. Lee
Chang Ouk Kim
18
4
0
17 Jan 2019