Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1705.05363
Cited By
Curiosity-driven Exploration by Self-supervised Prediction
15 May 2017
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Curiosity-driven Exploration by Self-supervised Prediction"
50 / 1,353 papers shown
Title
Reward Models in Deep Reinforcement Learning: A Survey
Rui Yu
Shenghua Wan
Yucen Wang
Chen-Xiao Gao
Le Gan
Zongzhang Zhang
De-Chuan Zhan
OffRL
27
0
0
18 Jun 2025
Efficient and Generalizable Environmental Understanding for Visual Navigation
Ruoyu Wang
Xinshu Li
Chen Wang
Lina Yao
CML
25
0
0
18 Jun 2025
Reasoning with Exploration: An Entropy Perspective
Daixuan Cheng
Shaohan Huang
Xuekai Zhu
Bo Dai
Wayne Xin Zhao
Zhenliang Zhang
Furu Wei
LRM
38
0
0
17 Jun 2025
Uncertainty Prioritized Experience Replay
Rodrigo Carrasco-Davis
Sebastian Lee
Claudia Clopath
Will Dabney
39
0
0
10 Jun 2025
WorldLLM: Improving LLMs' world modeling using curiosity-driven theory-making
Guillaume Levy
Cédric Colas
Pierre-Yves Oudeyer
Thomas Carta
Clément Romac
LRM
18
0
0
07 Jun 2025
Learning What Matters Now: A Dual-Critic Context-Aware RL Framework for Priority-Driven Information Gain
Dimitris Panagopoulos
Adolfo Perrusquía
Weisi Guo
26
0
0
07 Jun 2025
Action-Adaptive Continual Learning: Enabling Policy Generalization under Dynamic Action Spaces
Chaofan Pan
Jiafen Liu
Yanhua Li
Linbo Xiong
Fan Min
Wei Wei
Xin Yang
CLL
55
0
0
06 Jun 2025
AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification
Geonwoo Cho
Jaemoon Lee
Jaegyun Im
Subi Lee
Jihwan Lee
Sundong Kim
40
0
0
06 Jun 2025
Self-Predictive Dynamics for Generalization of Vision-based Reinforcement Learning
Kyungsoo Kim
Jeongsoo Ha
Yusung Kim
BDL
47
7
0
05 Jun 2025
An Efficient Task-Oriented Dialogue Policy: Evolutionary Reinforcement Learning Injected by Elite Individuals
Yangyang Zhao
Ben Niu
L. Qin
Shihan Wang
67
0
0
04 Jun 2025
Go-Browse: Training Web Agents with Structured Exploration
Apurva Gandhi
Graham Neubig
LLMAG
69
1
0
04 Jun 2025
Q-ARDNS-Multi: A Multi-Agent Quantum Reinforcement Learning Framework with Meta-Cognitive Adaptation for Complex 3D Environments
Umberto Gonçalves de Sousa
AI4CE
22
0
0
02 Jun 2025
Autonomous Behavior and Whole-Brain Dynamics Emerge in Embodied Zebrafish Agents with Model-based Intrinsic Motivation
Reece Keller
Alyn Tornell
Felix Pei
Xaq Pitkow
Leo Kozachkov
Aran Nayebi
26
0
0
30 May 2025
Maximizing Confidence Alone Improves Reasoning
Mihir Prabhudesai
Lili Chen
Alex Ippoliti
Katerina Fragkiadaki
Hao Liu
Deepak Pathak
OOD
OffRL
ReLM
LRM
139
3
0
28 May 2025
Universal Value-Function Uncertainties
Moritz A. Zanger
Max Weltevrede
Yaniv Oren
Pascal R. van der Vaart
Caroline Horsch
Wendelin Bohmer
M. Spaan
OffRL
79
0
0
27 May 2025
Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future Prospects
Yixin Cui
Haotian Lin
Shuo Yang
Yixiao Wang
Yanjun Huang
Hong Chen
LM&Ro
LRM
ELM
126
0
0
26 May 2025
SCAR: Shapley Credit Assignment for More Efficient RLHF
Meng Cao
Shuyuan Zhang
Xiao-Wen Chang
Doina Precup
119
0
0
26 May 2025
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning
Leander Diaz-Bone
Marco Bagatella
Jonas Hübotter
Andreas Krause
OffRL
92
0
0
26 May 2025
Imagine Beyond! Distributionally Robust Auto-Encoding for State Space Coverage in Online Reinforcement Learning
Nicolas Castanet
Olivier Sigaud
Sylvain Lamprier
OffRL
114
0
0
23 May 2025
Predictability-Based Curiosity-Guided Action Symbol Discovery
Burcu Kilic
Alper Ahmetoglu
Emre Ugur
31
0
0
23 May 2025
Navigate the Unknown: Enhancing LLM Reasoning with Intrinsic Motivation Guided Exploration
Jingtong Gao
Ling Pan
Yejing Wang
Rui Zhong
Chi Lu
Qingpeng Cai
Peng Jiang
Xiangyu Zhao
LRM
105
1
0
23 May 2025
Loss-Guided Auxiliary Agents for Overcoming Mode Collapse in GFlowNets
Idriss Malek
Abhijit Sharma
Salem Lahlou
101
1
0
21 May 2025
Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning
Alex Su
Haozhe Wang
Weiming Ren
Fangzhen Lin
Wenhu Chen
MLLM
OffRL
LRM
VLM
77
2
0
21 May 2025
Dynamic Sight Range Selection in Multi-Agent Reinforcement Learning
Wei-Chen Liao
Ti-Rong Wu
I-Chen Wu
78
0
0
19 May 2025
Action-Dependent Optimality-Preserving Reward Shaping
Grant C. Forbes
Jianxun Wang
Leonardo Villalobos-Arias
Arnav Jhala
David L. Roberts
OffRL
67
0
0
19 May 2025
Automatic Reward Shaping from Confounded Offline Data
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
OffRL
OnRL
108
0
0
16 May 2025
Exploration by Random Distribution Distillation
Zhirui Fang
Kai Yang
Jian Tao
Jiafei Lyu
Lusong Li
Li Shen
Xiu Li
121
1
0
16 May 2025
Parameter Estimation using Reinforcement Learning Causal Curiosity: Limits and Challenges
Miguel Arana-Catania
Weisi Guo
CML
102
0
0
13 May 2025
Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models
Seungjae Lee
Daniel Ekpo
Haowen Liu
Furong Huang
Abhinav Shrivastava
Jia-Bin Huang
LM&Ro
151
0
0
12 May 2025
Emotion-Gradient Metacognitive RSI (Part I): Theoretical Foundations and Single-Agent Architecture
Rintaro Ando
46
0
0
12 May 2025
ARDNS-FN-Quantum: A Quantum-Enhanced Reinforcement Learning Framework with Cognitive-Inspired Adaptive Exploration for Dynamic Environments
Umberto Gonçalves de Sousa
60
0
0
07 May 2025
Optimization of Infectious Disease Intervention Measures Based on Reinforcement Learning - Empirical analysis based on UK COVID-19 epidemic data
Baida Zhang
Yakai Chen
Huichun Li
Zhenghu Zu
63
0
0
07 May 2025
Interpretable Learning Dynamics in Unsupervised Reinforcement Learning
Shashwat Pandey
AI4CE
30
0
0
06 May 2025
A Computational Model of Inclusive Pedagogy: From Understanding to Application
Francesco Balzan
Pedro P. Santos
Maurizio Gabbrielli
Mahault Albarracin
Manuel Lopes
122
0
0
02 May 2025
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
Lang Feng
Weihao Tan
Zhiyi Lyu
Longtao Zheng
Haiyang Xu
Ming Yan
Fei Huang
Jingyi Wang
66
0
0
01 May 2025
Curiosity Driven Exploration to Optimize Structure-Property Learning in Microscopy
Aditya Vatsavai
Ganesh Narasimha
Yongtao Liu
Jan-Chi Yang
Hiroshu Funakubo
M. Ziatdinov
Rama K Vasudevan
79
0
0
28 Apr 2025
Emergence of Goal-Directed Behaviors via Active Inference with Self-Prior
Dongmin Kim
Hoshinori Kanazawa
Naoto Yoshida
Yasuo Kuniyoshi
AI4CE
67
0
0
15 Apr 2025
Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments
Licheng Luo
Mingyu Cai
101
0
0
09 Apr 2025
Wanting to be Understood
Chrisantha Fernando
Dylan Banarse
Simon Osindero
72
1
0
09 Apr 2025
An Information-Geometric Approach to Artificial Curiosity
Alexander Nedergaard
Pablo A. Morales
133
0
0
08 Apr 2025
NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices
Zhe Wang
Yifei Zhu
100
0
0
04 Apr 2025
Reciprocity-Aware Convolutional Neural Networks for Map-Based Path Loss Prediction
Ryan Dempsey
Jonathan Ethier
Halim Yanikomeroglu
64
0
0
04 Apr 2025
Exploration-Driven Generative Interactive Environments
N. Savov
Naser Kazemi
Mohammad Mahdi
Danda Pani Paudel
Xi Wang
Luc Van Gool
VGen
3DV
117
1
0
03 Apr 2025
Intrinsically-Motivated Humans and Agents in Open-World Exploration
Aly Lidayan
Yuqing Du
Eliza Kosoy
Maria Rufova
Pieter Abbeel
Alison Gopnik
108
2
0
31 Mar 2025
World Model Agents with Change-Based Intrinsic Motivation
Jeremias Ferrao
Rafael Cunha
OffRL
MoE
127
1
0
26 Mar 2025
KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies
Shih-Min Yang
Martin Magnusson
J. A. Stork
Todor Stoyanov
79
0
0
23 Mar 2025
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
Yexin Li
Pring Wong
Hanfang Zhang
Shuo Chen
Siyuan Qi
OffRL
87
1
0
23 Mar 2025
Disentangling Uncertainties by Learning Compressed Data Representation
Zhiyu An
Zhibo Hou
Wan Du
UQCV
UD
115
0
0
20 Mar 2025
Curiosity-Diffuser: Curiosity Guide Diffusion Models for Reliability
Zihao Liu
Xing Liu
Yizhai Zhang
Zhengxiong Liu
Panfeng Huang
119
0
0
19 Mar 2025
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Moritz A. Zanger
Pascal R. van der Vaart
Wendelin Bohmer
M. Spaan
UQCV
BDL
507
2
0
14 Mar 2025
1
2
3
4
...
26
27
28
Next