Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards

12 May 2019
Yuhang Song
Jianyi Wang
Thomas Lukasiewicz
Zhenghua Xu
Shangtong Zhang
Andrzej Wojcicki
Mai Xu
    LRM

Papers citing "Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards"

34 / 34 papers shown
Fast Task Inference with Variational Intrinsic Successor Features
Steven Hansen
Will Dabney
André Barreto
T. Wiele
David Warde-Farley
Volodymyr Mnih
BDL
76
152
0
12 Jun 2019
Unsupervised Control Through Non-Parametric Discriminative Rewards
David Warde-Farley
T. Wiele
Tejas D. Kulkarni
Catalin Ionescu
Steven Hansen
Volodymyr Mnih
DRL, OffRL, SSL
86
178
0
28 Nov 2018
Diversity-Driven Extensible Hierarchical Reinforcement Learning
Yuhang Song
Jianyi Wang
Thomas Lukasiewicz
Zhenghua Xu
Mai Xu
42
18
0
10 Nov 2018
Contingency-Aware Exploration in Reinforcement Learning
Jongwook Choi
Yijie Guo
Marcin Moczulski
Junhyuk Oh
Neal Wu
Mohammad Norouzi
Honglak Lee
65
73
0
05 Nov 2018
Exploration by Random Network Distillation
Yuri Burda
Harrison Edwards
Amos Storkey
Oleg Klimov
159
1,342
0
30 Oct 2018
Large-Scale Study of Curiosity-Driven Learning
Yuri Burda
Harrison Edwards
Deepak Pathak
Amos Storkey
Trevor Darrell
Alexei A. Efros
LRM
72
706
0
13 Aug 2018
Curiosity Driven Exploration of Learned Disentangled Goal Spaces
A. Laversanne-Finot
Alexandre Péré
Pierre-Yves Oudeyer
DRL
71
88
0
04 Jul 2018
Relational inductive biases, deep learning, and graph networks
Peter W. Battaglia
Jessica B. Hamrick
V. Bapst
Alvaro Sanchez-Gonzalez
V. Zambaldi
...
Pushmeet Kohli
M. Botvinick
Oriol Vinyals
Yujia Li
Razvan Pascanu
AI4CE, NAI
766
3,129
0
04 Jun 2018
Generalisation of structural knowledge in the hippocampal-entorhinal system
James C. R. Whittington
Timothy H. Muller
Shirley Mark
Caswell Barry
Timothy Edward John Behrens
98
53
0
23 May 2018
Unsupervised Video Object Segmentation for Deep Reinforcement Learning
Vikrant Goel
James Weng
Pascal Poupart
OCL
69
66
0
20 May 2018
World Models
David R Ha
Jürgen Schmidhuber
SyDa
143
1,098
0
27 Mar 2018
Relational Neural Expectation Maximization: Unsupervised Discovery of Objects and their Interactions
Sjoerd van Steenkiste
Michael Chang
Klaus Greff
Jürgen Schmidhuber
BDL, OCL, DRL
209
291
0
28 Feb 2018
Diversity is All You Need: Learning Skills without a Reward Function
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
103
1,088
0
16 Feb 2018
Intrinsically Motivated Goal Exploration Processes with Automatic Curriculum Learning
Sébastien Forestier
Rémy Portelas
Yoan Mollard
Pierre-Yves Oudeyer
86
188
0
07 Aug 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
529
19,237
0
20 Jul 2017
Count-Based Exploration in Feature Space for Reinforcement Learning
Jarryd Martin
S. N. Sasikumar
Tom Everitt
Marcus Hutter
68
124
0
25 Jun 2017
Visual Interaction Networks
Nicholas Watters
Andrea Tacchetti
T. Weber
Razvan Pascanu
Peter W. Battaglia
Daniel Zoran
PINN, 3DH
96
279
0
05 Jun 2017
A simple neural network module for relational reasoning
Adam Santoro
David Raposo
David Barrett
Mateusz Malinowski
Razvan Pascanu
Peter W. Battaglia
Timothy Lillicrap
GNN, NAI
189
1,615
0
05 Jun 2017
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM, SSL
113
2,449
0
15 May 2017
Stochastic Neural Networks for Hierarchical Reinforcement Learning
Carlos Florensa
Yan Duan
Pieter Abbeel
BDL
92
361
0
10 Apr 2017
Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Joshua Achiam
S. Shankar Sastry
71
238
0
06 Mar 2017
Count-Based Exploration with Neural Density Models
Georg Ostrovski
Marc G. Bellemare
Aaron van den Oord
Rémi Munos
86
625
0
03 Mar 2017
Interaction Networks for Learning about Objects, Relations and Physics
Peter W. Battaglia
Razvan Pascanu
Matthew Lai
Danilo Jimenez Rezende
Koray Kavukcuoglu
AI4CE, OCL, PINN, GNN
543
1,412
0
01 Dec 2016
Variational Intrinsic Control
Karol Gregor
Danilo Jimenez Rezende
Daan Wierstra
DRL, OffRL
88
429
0
22 Nov 2016
Reinforcement Learning with Unsupervised Auxiliary Tasks
Max Jaderberg
Volodymyr Mnih
Wojciech M. Czarnecki
Tom Schaul
Joel Z Leibo
David Silver
Koray Kavukcuoglu
SSL
109
1,229
0
16 Nov 2016
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Haoran Tang
Rein Houthooft
Davis Foote
Adam Stooke
Xi Chen
Yan Duan
John Schulman
F. Turck
Pieter Abbeel
OffRL
106
775
0
15 Nov 2016
Unifying Count-Based Exploration and Intrinsic Motivation
Marc G. Bellemare
S. Srinivasan
Georg Ostrovski
Tom Schaul
D. Saxton
Rémi Munos
176
1,483
0
06 Jun 2016
Information Theoretically Aided Reinforcement Learning for Embodied Agents
Guido Montúfar
K. Zahedi
Nihat Ay
45
10
0
31 May 2016
From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification
André F. T. Martins
Ramón Fernández Astudillo
184
726
0
05 Feb 2016
Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning
S. Mohamed
Danilo Jimenez Rezende
DRL, SSL
99
402
0
29 Sep 2015
Action-Conditional Video Prediction using Deep Networks in Atari Games
Junhyuk Oh
Xiaoxiao Guo
Honglak Lee
Richard L. Lewis
Satinder Singh
106
854
0
31 Jul 2015
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe
Christian Szegedy
OOD
463
43,328
0
11 Feb 2015
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
348
10,079
0
10 Feb 2015
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
575
27,325
0
01 Sep 2014