Curiosity-driven Exploration by Self-supervised Prediction

15 May 2017

Papers citing "Curiosity-driven Exploration by Self-supervised Prediction"

50 / 1,353 papers shown

Title
Reward Models in Deep Reinforcement Learning: A Survey Rui Yu Shenghua Wan Yucen Wang Chen-Xiao Gao Le Gan Zongzhang Zhang De-Chuan Zhan OffRL 27 0 0 18 Jun 2025
Efficient and Generalizable Environmental Understanding for Visual Navigation Ruoyu Wang Xinshu Li Chen Wang Lina Yao CML 25 0 0 18 Jun 2025
Reasoning with Exploration: An Entropy Perspective Daixuan Cheng Shaohan Huang Xuekai Zhu Bo Dai Wayne Xin Zhao Zhenliang Zhang Furu Wei LRM 38 0 0 17 Jun 2025
Uncertainty Prioritized Experience Replay Rodrigo Carrasco-Davis Sebastian Lee Claudia Clopath Will Dabney 39 0 0 10 Jun 2025
WorldLLM: Improving LLMs' world modeling using curiosity-driven theory-making Guillaume Levy Cédric Colas Pierre-Yves Oudeyer Thomas Carta Clément Romac LRM 18 0 0 07 Jun 2025
Learning What Matters Now: A Dual-Critic Context-Aware RL Framework for Priority-Driven Information Gain Dimitris Panagopoulos Adolfo Perrusquía Weisi Guo 26 0 0 07 Jun 2025
Action-Adaptive Continual Learning: Enabling Policy Generalization under Dynamic Action Spaces Chaofan Pan Jiafen Liu Yanhua Li Linbo Xiong Fan Min Wei Wei Xin Yang CLL 55 0 0 06 Jun 2025
AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification Geonwoo Cho Jaemoon Lee Jaegyun Im Subi Lee Jihwan Lee Sundong Kim 40 0 0 06 Jun 2025
Self-Predictive Dynamics for Generalization of Vision-based Reinforcement Learning Kyungsoo Kim Jeongsoo Ha Yusung Kim BDL 47 7 0 05 Jun 2025
An Efficient Task-Oriented Dialogue Policy: Evolutionary Reinforcement Learning Injected by Elite Individuals Yangyang Zhao Ben Niu L. Qin Shihan Wang 67 0 0 04 Jun 2025
Go-Browse: Training Web Agents with Structured Exploration Apurva Gandhi Graham Neubig LLMAG 69 1 0 04 Jun 2025
Q-ARDNS-Multi: A Multi-Agent Quantum Reinforcement Learning Framework with Meta-Cognitive Adaptation for Complex 3D Environments Umberto Gonçalves de Sousa AI4CE 22 0 0 02 Jun 2025
Autonomous Behavior and Whole-Brain Dynamics Emerge in Embodied Zebrafish Agents with Model-based Intrinsic Motivation Reece Keller Alyn Tornell Felix Pei Xaq Pitkow Leo Kozachkov Aran Nayebi 26 0 0 30 May 2025
Maximizing Confidence Alone Improves Reasoning Mihir Prabhudesai Lili Chen Alex Ippoliti Katerina Fragkiadaki Hao Liu Deepak Pathak OOD OffRL ReLM LRM 139 3 0 28 May 2025
Universal Value-Function Uncertainties Moritz A. Zanger Max Weltevrede Yaniv Oren Pascal R. van der Vaart Caroline Horsch Wendelin Bohmer M. Spaan OffRL 79 0 0 27 May 2025
Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future Prospects Yixin Cui Haotian Lin Shuo Yang Yixiao Wang Yanjun Huang Hong Chen LM&Ro LRM ELM 126 0 0 26 May 2025
SCAR: Shapley Credit Assignment for More Efficient RLHF Meng Cao Shuyuan Zhang Xiao-Wen Chang Doina Precup 119 0 0 26 May 2025
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement Learning Leander Diaz-Bone Marco Bagatella Jonas Hübotter Andreas Krause OffRL 92 0 0 26 May 2025
Imagine Beyond! Distributionally Robust Auto-Encoding for State Space Coverage in Online Reinforcement Learning Nicolas Castanet Olivier Sigaud Sylvain Lamprier OffRL 114 0 0 23 May 2025
Predictability-Based Curiosity-Guided Action Symbol Discovery Burcu Kilic Alper Ahmetoglu Emre Ugur 31 0 0 23 May 2025
Navigate the Unknown: Enhancing LLM Reasoning with Intrinsic Motivation Guided Exploration Jingtong Gao Ling Pan Yejing Wang Rui Zhong Chi Lu Qingpeng Cai Peng Jiang Xiangyu Zhao LRM 105 1 0 23 May 2025
Loss-Guided Auxiliary Agents for Overcoming Mode Collapse in GFlowNets Idriss Malek Abhijit Sharma Salem Lahlou 101 1 0 21 May 2025
Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning Alex Su Haozhe Wang Weiming Ren Fangzhen Lin Wenhu Chen MLLM OffRL LRM VLM 77 2 0 21 May 2025
Dynamic Sight Range Selection in Multi-Agent Reinforcement Learning Wei-Chen Liao Ti-Rong Wu I-Chen Wu 78 0 0 19 May 2025
Action-Dependent Optimality-Preserving Reward Shaping Grant C. Forbes Jianxun Wang Leonardo Villalobos-Arias Arnav Jhala David L. Roberts OffRL 67 0 0 19 May 2025
Automatic Reward Shaping from Confounded Offline Data Mingxuan Li Junzhe Zhang Elias Bareinboim OffRL OnRL 108 0 0 16 May 2025
Exploration by Random Distribution Distillation Zhirui Fang Kai Yang Jian Tao Jiafei Lyu Lusong Li Li Shen Xiu Li 121 1 0 16 May 2025
Parameter Estimation using Reinforcement Learning Causal Curiosity: Limits and Challenges Miguel Arana-Catania Weisi Guo CML 102 0 0 13 May 2025
Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models Seungjae Lee Daniel Ekpo Haowen Liu Furong Huang Abhinav Shrivastava Jia-Bin Huang LM&Ro 151 0 0 12 May 2025
Emotion-Gradient Metacognitive RSI (Part I): Theoretical Foundations and Single-Agent Architecture Rintaro Ando 46 0 0 12 May 2025
ARDNS-FN-Quantum: A Quantum-Enhanced Reinforcement Learning Framework with Cognitive-Inspired Adaptive Exploration for Dynamic Environments Umberto Gonçalves de Sousa 60 0 0 07 May 2025
Optimization of Infectious Disease Intervention Measures Based on Reinforcement Learning - Empirical analysis based on UK COVID-19 epidemic data Baida Zhang Yakai Chen Huichun Li Zhenghu Zu 63 0 0 07 May 2025
Interpretable Learning Dynamics in Unsupervised Reinforcement Learning Shashwat Pandey AI4CE 30 0 0 06 May 2025
A Computational Model of Inclusive Pedagogy: From Understanding to Application Francesco Balzan Pedro P. Santos Maurizio Gabbrielli Mahault Albarracin Manuel Lopes 122 0 0 02 May 2025
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning Lang Feng Weihao Tan Zhiyi Lyu Longtao Zheng Haiyang Xu Ming Yan Fei Huang Jingyi Wang 66 0 0 01 May 2025
Curiosity Driven Exploration to Optimize Structure-Property Learning in Microscopy Aditya Vatsavai Ganesh Narasimha Yongtao Liu Jan-Chi Yang Hiroshu Funakubo M. Ziatdinov Rama K Vasudevan 79 0 0 28 Apr 2025
Emergence of Goal-Directed Behaviors via Active Inference with Self-Prior Dongmin Kim Hoshinori Kanazawa Naoto Yoshida Yasuo Kuniyoshi AI4CE 67 0 0 15 Apr 2025
Bridging Deep Reinforcement Learning and Motion Planning for Model-Free Navigation in Cluttered Environments Licheng Luo Mingyu Cai 101 0 0 09 Apr 2025
Wanting to be Understood Chrisantha Fernando Dylan Banarse Simon Osindero 72 1 0 09 Apr 2025
An Information-Geometric Approach to Artificial Curiosity Alexander Nedergaard Pablo A. Morales 133 0 0 08 Apr 2025
NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices Zhe Wang Yifei Zhu 100 0 0 04 Apr 2025
Reciprocity-Aware Convolutional Neural Networks for Map-Based Path Loss Prediction Ryan Dempsey Jonathan Ethier Halim Yanikomeroglu 64 0 0 04 Apr 2025
Exploration-Driven Generative Interactive Environments N. Savov Naser Kazemi Mohammad Mahdi Danda Pani Paudel Xi Wang Luc Van Gool VGen 3DV 117 1 0 03 Apr 2025
Intrinsically-Motivated Humans and Agents in Open-World Exploration Aly Lidayan Yuqing Du Eliza Kosoy Maria Rufova Pieter Abbeel Alison Gopnik 108 2 0 31 Mar 2025
World Model Agents with Change-Based Intrinsic Motivation Jeremias Ferrao Rafael Cunha OffRL MoE 127 1 0 26 Mar 2025
KEA: Keeping Exploration Alive by Proactively Coordinating Exploration Strategies Shih-Min Yang Martin Magnusson J. A. Stork Todor Stoyanov 79 0 0 23 Mar 2025
CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning Yexin Li Pring Wong Hanfang Zhang Shuo Chen Siyuan Qi OffRL 87 1 0 23 Mar 2025
Disentangling Uncertainties by Learning Compressed Data Representation Zhiyu An Zhibo Hou Wan Du UQCV UD 115 0 0 20 Mar 2025
Curiosity-Diffuser: Curiosity Guide Diffusion Models for Reliability Zihao Liu Xing Liu Yizhai Zhang Zhengxiong Liu Panfeng Huang 119 0 0 19 Mar 2025
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model Moritz A. Zanger Pascal R. van der Vaart Wendelin Bohmer M. Spaan UQCV BDL 507 2 0 14 Mar 2025