ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.05363
  4. Cited By
Curiosity-driven Exploration by Self-supervised Prediction

Curiosity-driven Exploration by Self-supervised Prediction

15 May 2017
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
    LRM
    SSL
ArXivPDFHTML

Papers citing "Curiosity-driven Exploration by Self-supervised Prediction"

50 / 522 papers shown
Title
Exploration in Deep Reinforcement Learning: A Survey
Exploration in Deep Reinforcement Learning: A Survey
Pawel Ladosz
Lilian Weng
Minwoo Kim
H. Oh
OffRL
31
324
0
02 May 2022
Towards Flexible Inference in Sequential Decision Problems via
  Bidirectional Transformers
Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers
Micah Carroll
Jessy Lin
Orr Paradise
Raluca Georgescu
Mingfei Sun
...
Stephanie Milani
Katja Hofmann
Matthew J. Hausknecht
Anca Dragan
Sam Devlin
OffRL
42
10
0
28 Apr 2022
Discovering Intrinsic Reward with Contrastive Random Walk
Discovering Intrinsic Reward with Contrastive Random Walk
Zixuan Pan
Zihao Wei
Yidong Huang
Aditya Gupta
37
0
0
23 Apr 2022
Embodied Navigation at the Art Gallery
Embodied Navigation at the Art Gallery
Roberto Bigazzi
Federico Landi
S. Cascianelli
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
LM&Ro
26
3
0
19 Apr 2022
Learning Design and Construction with Varying-Sized Materials via
  Prioritized Memory Resets
Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets
Yunfei Li
Tao Kong
Lei Li
Yi Wu
58
4
0
12 Apr 2022
Semantic Exploration from Language Abstractions and Pretrained
  Representations
Semantic Exploration from Language Abstractions and Pretrained Representations
Allison C. Tam
Neil C. Rabinowitz
Andrew Kyle Lampinen
Nicholas A. Roy
Stephanie C. Y. Chan
D. Strouse
Jane X. Wang
Andrea Banino
Felix Hill
LM&Ro
41
68
0
08 Apr 2022
Habitat-Web: Learning Embodied Object-Search Strategies from Human
  Demonstrations at Scale
Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale
Ram Ramrakhya
Eric Undersander
Dhruv Batra
Abhishek Das
LM&Ro
44
109
0
07 Apr 2022
Off-Policy Evaluation with Online Adaptation for Robot Exploration in
  Challenging Environments
Off-Policy Evaluation with Online Adaptation for Robot Exploration in Challenging Environments
Yafei Hu
Junyi Geng
Chen Wang
John Keller
Sebastian Scherer
OffRL
34
15
0
07 Apr 2022
On scientific understanding with artificial intelligence
On scientific understanding with artificial intelligence
Mario Krenn
R. Pollice
S. Guo
Matteo Aldeghi
Alba Cervera-Lierta
...
Florian Hase
A. Jinich
AkshatKumar Nigam
Zhenpeng Yao
Alán Aspuru-Guzik
40
186
0
04 Apr 2022
Continuously Discovering Novel Strategies via Reward-Switching Policy
  Optimization
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization
Zihan Zhou
Wei Fu
Bingliang Zhang
Yi Wu
25
28
0
04 Apr 2022
Aggressive Quadrotor Flight Using Curiosity-Driven Reinforcement
  Learning
Aggressive Quadrotor Flight Using Curiosity-Driven Reinforcement Learning
Q. Sun
Jinbao Fang
Weixing Zheng
Yang Tang
19
27
0
26 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
23
119
0
25 Mar 2022
CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot
  Object Navigation
CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object Navigation
S. Gadre
Mitchell Wortsman
Gabriel Ilharco
Ludwig Schmidt
Shuran Song
CLIP
LM&Ro
44
142
0
20 Mar 2022
AI Autonomy : Self-Initiated Open-World Continual Learning and
  Adaptation
AI Autonomy : Self-Initiated Open-World Continual Learning and Adaptation
Bing-Quan Liu
Sahisnu Mazumder
Eric Robertson
Scott Grigsby
CLL
AI4CE
38
21
0
17 Mar 2022
Stubborn: A Strong Baseline for Indoor Object Navigation
Stubborn: A Strong Baseline for Indoor Object Navigation
Haokuan Luo
Albert Yue
Zhang-Wei Hong
Pulkit Agrawal
26
41
0
14 Mar 2022
ReVar: Strengthening Policy Evaluation via Reduced Variance Sampling
ReVar: Strengthening Policy Evaluation via Reduced Variance Sampling
Subhojyoti Mukherjee
Josiah P. Hanna
Robert D. Nowak
OffRL
29
12
0
09 Mar 2022
Rényi State Entropy for Exploration Acceleration in Reinforcement
  Learning
Rényi State Entropy for Exploration Acceleration in Reinforcement Learning
Mingqi Yuan
Man-On Pun
Dong Wang
27
23
0
08 Mar 2022
Transfer Dynamics in Emergent Evolutionary Curricula
Transfer Dynamics in Emergent Evolutionary Curricula
Aaron Dharna
Amy K. Hoover
Julian Togelius
Lisa Soros
27
6
0
03 Mar 2022
Follow your Nose: Using General Value Functions for Directed Exploration
  in Reinforcement Learning
Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning
Durgesh Kalwar
Omkar Shelke
Somjit Nath
Hardik Meisheri
H. Khadilkar
30
1
0
02 Mar 2022
Learning Robust Real-Time Cultural Transmission without Human Data
Learning Robust Real-Time Cultural Transmission without Human Data
Cultural General Intelligence Team
Avishkar Bhoopchand
Bethanie Brownfield
Adrian Collister
Agustin Dal Lago
...
Alex Platonov
Evan Senter
Sukhdeep Singh
Alexander Zacherl
Lei M. Zhang
VLM
46
11
0
01 Mar 2022
Collaborative Training of Heterogeneous Reinforcement Learning Agents in
  Environments with Sparse Rewards: What and When to Share?
Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share?
Alain Andres
Esther Villar-Rodriguez
Javier Del Ser
22
9
0
24 Feb 2022
Using Deep Reinforcement Learning with Automatic Curriculum Learning for
  Mapless Navigation in Intralogistics
Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics
Honghu Xue
Benedikt Hein
M. Bakr
Georg Schildbach
Bengt Abel
Elmar Rueckert
16
15
0
23 Feb 2022
Learning Causal Overhypotheses through Exploration in Children and
  Computational Models
Learning Causal Overhypotheses through Exploration in Children and Computational Models
Eliza Kosoy
Adrian Liu
Jasmine Collins
David M. Chan
Jessica B. Hamrick
Nan Rosemary Ke
Sandy H Huang
Bryanna Kaufmann
John F. Canny
Alison Gopnik
CML
24
9
0
21 Feb 2022
Open-Ended Reinforcement Learning with Neural Reward Functions
Open-Ended Reinforcement Learning with Neural Reward Functions
Robert Meier
Asier Mujika
37
7
0
16 Feb 2022
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
Michael Laskin
Hao Liu
Xue Bin Peng
Denis Yarats
Aravind Rajeswaran
Pieter Abbeel
SSL
78
66
0
01 Feb 2022
Efficient Reinforcement Learning in Block MDPs: A Model-free
  Representation Learning Approach
Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach
Xuezhou Zhang
Yuda Song
Masatoshi Uehara
Mengdi Wang
Alekh Agarwal
Wen Sun
OffRL
34
57
0
31 Jan 2022
Generative Adversarial Exploration for Reinforcement Learning
Generative Adversarial Exploration for Reinforcement Learning
Weijun Hong
Menghui Zhu
Minghuan Liu
Weinan Zhang
Ming Zhou
Yong Yu
Peng Sun
OnRL
39
7
0
27 Jan 2022
Learning to Act with Affordance-Aware Multimodal Neural SLAM
Learning to Act with Affordance-Aware Multimodal Neural SLAM
Zhiwei Jia
Kaixiang Lin
Yizhou Zhao
Qiaozi Gao
Govind Thattai
Gaurav Sukhatme
LM&Ro
31
15
0
24 Jan 2022
Generative Planning for Temporally Coordinated Exploration in
  Reinforcement Learning
Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning
Haichao Zhang
Wei Xu
Haonan Yu
38
10
0
24 Jan 2022
From Psychological Curiosity to Artificial Curiosity: Curiosity-Driven
  Learning in Artificial Intelligence Tasks
From Psychological Curiosity to Artificial Curiosity: Curiosity-Driven Learning in Artificial Intelligence Tasks
Chenyu Sun
Hangwei Qian
Chunyan Miao
18
10
0
20 Jan 2022
Demystifying Reinforcement Learning in Time-Varying Systems
Demystifying Reinforcement Learning in Time-Varying Systems
Pouya Hamadanian
Malte Schwarzkopf
Siddartha Sen
MohammadIman Alizadeh
47
1
0
14 Jan 2022
Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
Vincent Mai
Kaustubh Mani
Liam Paull
40
34
0
05 Jan 2022
Towards Disturbance-Free Visual Mobile Manipulation
Towards Disturbance-Free Visual Mobile Manipulation
Tianwei Ni
Kiana Ehsani
Luca Weihs
Jordi Salvador
28
9
0
17 Dec 2021
Symmetry-aware Neural Architecture for Embodied Visual Navigation
Symmetry-aware Neural Architecture for Embodied Visual Navigation
Shuang Liu
Takayuki Okatani
34
1
0
17 Dec 2021
Programmatic Reward Design by Example
Programmatic Reward Design by Example
Weichao Zhou
Wenchao Li
34
15
0
14 Dec 2021
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Model-Value Inconsistency as a Signal for Epistemic Uncertainty
Angelos Filos
Eszter Vértes
Zita Marinho
Gregory Farquhar
Diana Borsa
A. Friesen
Feryal M. P. Behbahani
Tom Schaul
André Barreto
Simon Osindero
44
7
0
08 Dec 2021
Godot Reinforcement Learning Agents
Godot Reinforcement Learning Agents
E. Beeching
Jilles Debangoye
Olivier Simonin
Christian Wolf
GP
OnRL
24
5
0
07 Dec 2021
JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical
  Reinforcement Learning
JueWu-MC: Playing Minecraft with Sample-efficient Hierarchical Reinforcement Learning
Zichuan Lin
Junyou Li
Jianing Shi
Deheng Ye
Qiang Fu
Wei Yang
BDL
45
34
0
07 Dec 2021
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent
  Learning
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning
D. Mguni
Taher Jafferjee
Jianhong Wang
Oliver Slumbers
Nicolas Perez Nieves
Feifei Tong
Yang Li
Jiangcheng Zhu
Yaodong Yang
Jun Wang
47
18
0
05 Dec 2021
SEAL: Self-supervised Embodied Active Learning using Exploration and 3D
  Consistency
SEAL: Self-supervised Embodied Active Learning using Exploration and 3D Consistency
Devendra Singh Chaplot
Murtaza Dalal
Saurabh Gupta
Jitendra Malik
Ruslan Salakhutdinov
27
74
0
02 Dec 2021
Catch Me If You Hear Me: Audio-Visual Navigation in Complex Unmapped
  Environments with Moving Sounds
Catch Me If You Hear Me: Audio-Visual Navigation in Complex Unmapped Environments with Moving Sounds
Abdelrahman Younes
Daniel Honerkamp
Tim Welschehold
Abhinav Valada
30
40
0
29 Nov 2021
Interesting Object, Curious Agent: Learning Task-Agnostic Exploration
Interesting Object, Curious Agent: Learning Task-Agnostic Exploration
Simone Parisi
Victoria Dean
Deepak Pathak
Abhinav Gupta
LM&Ro
44
50
0
25 Nov 2021
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven
  Exploration
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration
Lu Zheng
Jiarui Chen
Jianhao Wang
Jiamin He
Yujing Hu
Yingfeng Chen
Changjie Fan
Yang Gao
Chongjie Zhang
16
82
0
22 Nov 2021
Learning to Navigate in a VUCA Environment: Hierarchical Multi-expert
  Approach
Learning to Navigate in a VUCA Environment: Hierarchical Multi-expert Approach
Wenqi Zhang
Kai Zhao
Peng Li
Xiaochun Zhu
Faping Ye
Wei Jiang
Huiqiao Fu
Tao Wang
31
8
0
16 Nov 2021
CubeTR: Learning to Solve The Rubiks Cube Using Transformers
Mustafa Chasmai
ViT
37
1
0
11 Nov 2021
Play to Grade: Testing Coding Games as Classifying Markov Decision
  Process
Play to Grade: Testing Coding Games as Classifying Markov Decision Process
Allen Nie
Emma Brunskill
Chris Piech
29
11
0
27 Oct 2021
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State
  Covering and Goal Reaching
Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching
Pierre-Alexandre Kamienny
Jean Tarbouriech
Sylvain Lamprier
A. Lazaric
Ludovic Denoyer
SSL
45
18
0
27 Oct 2021
Landmark-Guided Subgoal Generation in Hierarchical Reinforcement
  Learning
Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning
Junsup Kim
Younggyo Seo
Jinwoo Shin
22
58
0
26 Oct 2021
Generalized Out-of-Distribution Detection: A Survey
Generalized Out-of-Distribution Detection: A Survey
Jingkang Yang
Kaiyang Zhou
Yixuan Li
Ziwei Liu
193
881
0
21 Oct 2021
Anti-Concentrated Confidence Bonuses for Scalable Exploration
Anti-Concentrated Confidence Bonuses for Scalable Exploration
Jordan T. Ash
Cyril Zhang
Surbhi Goel
A. Krishnamurthy
Sham Kakade
43
6
0
21 Oct 2021
Previous
123456...91011
Next