Curiosity-driven Exploration by Self-supervised Prediction

15 May 2017
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM, SSL
arXiv: 1705.05363

Papers citing "Curiosity-driven Exploration by Self-supervised Prediction"

50 / 1,353 papers shown
Title
LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward
DaeJin Jo
Sungwoong Kim
D. W. Nam
Taehwan Kwon
Seungeun Rho
Jongmin Kim
Donghoon Lee
OffRL
73
10
0
11 Oct 2022
The Role of Exploration for Task Transfer in Reinforcement Learning
Jonathan C. Balloch
Julia Kim
Jessica B. Langebrake Inman
Mark O. Riedl
OffRL
119
3
0
11 Oct 2022
ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward
Zixian Ma
Rose E. Wang
Li Fei-Fei
Michael S. Bernstein
Ranjay Krishna
73
17
0
09 Oct 2022
Elastic Step DQN: A novel multi-step algorithm to alleviate overestimation in Deep QNetworks
Adrian Ly
Richard Dazeley
Peter Vamplew
Francisco Cruz
Sunil Aryal
116
13
0
07 Oct 2022
Generative Augmented Flow Networks
L. Pan
Dinghuai Zhang
Aaron Courville
Longbo Huang
Yoshua Bengio
188
49
0
07 Oct 2022
Exploration via Planning for Information about the Optimal Trajectory
Viraj Mehta
I. Char
J. Abbate
R. Conlin
M. Boyer
Stefano Ermon
J. Schneider
Willie Neiswanger
OffRL
79
6
0
06 Oct 2022
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Yan Zheng
Yujing Hu
Jinyi Liu
Yingfeng Chen
Changjie Fan
128
15
0
02 Oct 2022
Visuo-Tactile Transformers for Manipulation
Yizhou Chen
A. Sipos
Mark Van der Merwe
Nima Fazeli
ViT
93
36
0
30 Sep 2022
Does Zero-Shot Reinforcement Learning Exist?
Ahmed Touati
Jérémy Rapin
Yann Ollivier
OffRL
116
46
0
29 Sep 2022
Opportunities and Challenges from Using Animal Videos in Reinforcement Learning for Navigation
Vittorio Giammarino
James Queeney
Lucas C. Carstensen
Michael Hasselmo
I. Paschalidis
OffRL
84
5
0
25 Sep 2022
Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels
Sai Rajeswar
Pietro Mazzaglia
Tim Verbelen
Alexandre Piché
Bart Dhoedt
Rameswar Panda
Alexandre Lacoste
SSL
104
21
0
24 Sep 2022
First-order Policy Optimization for Robust Markov Decision Process
Yan Li
Guanghui Lan
Tuo Zhao
170
25
0
21 Sep 2022
Deep Model Predictive Variable Impedance Control
Akhil S. Anand
Fares J. Abu-Dakka
J. Gravdahl
62
12
0
20 Sep 2022
Graph Value Iteration
Dieqiao Feng
Carla P. Gomes
B. Selman
63
0
0
20 Sep 2022
Active Predicting Coding: Brain-Inspired Reinforcement Learning for Sparse Reward Robotic Control Problems
Alexander Ororbia
A. Mali
96
8
0
19 Sep 2022
An information-theoretic perspective on intrinsic motivation in reinforcement learning: a survey
A. Aubret
L. Matignon
S. Hassas
96
37
0
19 Sep 2022
Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
89
12
0
19 Sep 2022
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping
Hao Sun
Lei Han
Rui Yang
Xiaoteng Ma
Jian Guo
Bolei Zhou
OffRL, OnRL
84
11
0
15 Sep 2022
Self-supervised Sequential Information Bottleneck for Robust Exploration in Deep Reinforcement Learning
Bang You
Jingming Xie
Youping Chen
Jan Peters
Oleg Arenz
45
2
0
12 Sep 2022
Task-Agnostic Learning to Accomplish New Tasks
Xianqi Zhang
Xingtao Wang
Xu Liu
Wenrui Wang
Xiaopeng Fan
Debin Zhao
OffRL
141
0
0
09 Sep 2022
A taxonomy of surprise definitions
Alireza Modirshanechi
Johanni Brea
W. Gerstner
31
32
0
02 Sep 2022
Cell-Free Latent Go-Explore
Quentin Gallouedec
Emmanuel Dellandrea
100
2
0
31 Aug 2022
Unsupervised Representation Learning in Deep Reinforcement Learning: A Review
N. Botteghi
M. Poel
C. Brune
SSL, OffRL
105
13
0
27 Aug 2022
Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning
Zijian Gao
Kele Xu
Yuanzhao Zhai
Dawei Feng
Bo Ding
Xinjun Mao
Huaimin Wang
105
1
0
24 Aug 2022
Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration
Zijian Gao
Yiying Li
Kele Xu
Yuanzhao Zhai
Dawei Feng
Bo Ding
Xinjun Mao
Huaimin Wang
91
0
0
24 Aug 2022
Some Supervision Required: Incorporating Oracle Policies in Reinforcement Learning via Epistemic Uncertainty Metrics
Jun Jet Tai
Jordan Terry
M. Innocente
J. Brusey
N. Horri
80
2
0
22 Aug 2022
Spectral Decomposition Representation for Reinforcement Learning
Zhaolin Ren
Tianjun Zhang
Lisa Lee
Joseph E. Gonzalez
Dale Schuurmans
Bo Dai
OffRL
100
29
0
19 Aug 2022
Intelligent problem-solving as integrated hierarchical reinforcement learning
Manfred Eppe
Christian Gumbsch
Matthias Kerzel
Phuong D. H. Nguyen
Martin Volker Butz
S. Wermter
94
78
0
18 Aug 2022
Learning to Coordinate for a Worker-Station Multi-robot System in Planar Coverage Tasks
Jing Tang
Yuan Gao
Tin Lun Lam
104
7
0
05 Aug 2022
Impact Makes a Sound and Sound Makes an Impact: Sound Guides Representations and Explorations
Xufeng Zhao
C. Weber
Muhammad Burhan Hafez
S. Wermter
75
9
0
04 Aug 2022
Reinforcement learning with experience replay and adaptation of action dispersion
Pawel Wawrzyński
Wojciech Masarczyk
M. Ostaszewski
27
1
0
30 Jul 2022
Multi-Agent Reinforcement Learning for Long-Term Network Resource Allocation through Auction: a V2X Application
Jing Tan
R. Khalili
Holger Karl
A. Hecker
OffRL
36
2
0
29 Jul 2022
Generating Teammates for Training Robust Ad Hoc Teamwork Agents via Best-Response Diversity
Arrasy Rahman
Elliot Fosong
Ignacio Carlucho
Stefano V. Albrecht
103
10
0
28 Jul 2022
Modelling non-reinforced preferences using selective attention
Noor Sajid
P. Tigas
Zafeirios Fountas
Qinghai Guo
Alexey Zakharov
Lancelot Da Costa
75
1
0
25 Jul 2022
Human-to-Robot Imitation in the Wild
Shikhar Bahl
Abhi Gupta
Deepak Pathak
126
174
0
19 Jul 2022
Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models
Alex Lamb
Riashat Islam
Yonathan Efroni
Aniket Didolkar
Dipendra Kumar Misra
Dylan J. Foster
Lekan Molu
Rajan Chari
A. Krishnamurthy
John Langford
99
24
0
17 Jul 2022
Robust AI Driving Strategy for Autonomous Vehicles
S. Nageshrao
Yousaf Rahman
V. Ivanovic
M. Janković
E. Tseng
M. Hafner
Dimitar Filev
81
5
0
16 Jul 2022
The Free Energy Principle for Perception and Action: A Deep Learning Perspective
Pietro Mazzaglia
Tim Verbelen
Ozan Çatal
Bart Dhoedt
DRL, AI4CE
70
33
0
13 Jul 2022
Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
C. Steinparz
Thomas Schmied
Fabian Paischer
Marius-Constantin Dinu
Vihang Patil
Angela Bitto-Nemling
Hamid Eghbalzadeh
Sepp Hochreiter
CLL
97
12
0
12 Jul 2022
HTRON: Efficient Outdoor Navigation with Sparse Rewards via Heavy Tailed Adaptive Reinforce Algorithm
K. Weerakoon
Souradip Chakraborty
N. Karapetyan
A. Sathyamoorthy
Amrit Singh Bedi
Tianyi Zhou
86
14
0
08 Jul 2022
Equivariant Representation Learning via Class-Pose Decomposition
Giovanni Luca Marchetti
Gustaf Tegnér
Anastasiia Varava
Danica Kragic
DRL
97
16
0
07 Jul 2022
WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Shunyu Yao
Howard Chen
John Yang
Karthik Narasimhan
LLMAG, LM&Ro
195
522
0
04 Jul 2022
GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning
Dogay Kamar
N. K. Üre
Gözde B. Ünal
18
1
0
28 Jun 2022
Causal Dynamics Learning for Task-Independent State Abstraction
Zizhao Wang
Xuesu Xiao
Zifan Xu
Yuke Zhu
Peter Stone
CML
80
58
0
27 Jun 2022
Emergence of Novelty in Evolutionary Algorithms
David Herel
Dominika Zogatova
Matej Kripner
Tomas Mikolov
11
0
0
27 Jun 2022
Guided Exploration in Reinforcement Learning via Monte Carlo Critic Optimization
Igor Kuznetsov
89
2
0
25 Jun 2022
Learning Rhetorical Structure Theory-based descriptions of observed behaviour
L. Botelho
Luís Nunes
Ricardo Ribeiro
Rui J. Lopes
23
0
0
24 Jun 2022
Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation
Cansu Sancaktar
Sebastian Blaes
Georg Martius
LM&Ro
78
26
0
22 Jun 2022
Multi-Horizon Representations with Hierarchical Forward Models for Reinforcement Learning
Trevor A. McInroe
Lukas Schafer
Stefano V. Albrecht
71
4
0
22 Jun 2022
Evolution through Large Models
Joel Lehman
Jonathan Gordon
Shawn Jain
Kamal Ndousse
Cathy Yeh
Kenneth O. Stanley
100
94
0
17 Jun 2022