ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.05363
  4. Cited By
Curiosity-driven Exploration by Self-supervised Prediction

Curiosity-driven Exploration by Self-supervised Prediction

15 May 2017
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
    LRMSSL
ArXiv (abs)PDFHTML

Papers citing "Curiosity-driven Exploration by Self-supervised Prediction"

50 / 1,353 papers shown
Title
Temporal Difference Flows
Jesse Farebrother
Matteo Pirotta
Andrea Tirinzoni
Rémi Munos
A. Lazaric
Ahmed Touati
AI4TSAIFin
168
1
0
12 Mar 2025
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models
Swift Hydra: Self-Reinforcing Generative Framework for Anomaly Detection with Multiple Mamba Models
Nguyen H K. Do
Truc Nguyen
Malik Hassanaly
Raed Alharbi
Jung Taek Seo
My T. Thai
73
0
0
09 Mar 2025
InDRiVE: Intrinsic Disagreement based Reinforcement for Vehicle Exploration through Curiosity Driven Generalized World Model
Feeza Khan Khanzada
Jaerock Kwon
84
1
0
07 Mar 2025
Curiosity-Driven Imagination: Discovering Plan Operators and Learning Associated Policies for Open-World Adaptation
Pierrick Lorang
Hong Lu
Matthias Scheutz
84
0
0
06 Mar 2025
SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models
SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models
Cansu Sancaktar
Christian Gumbsch
Andrii Zadaianchuk
Pavel Kolev
Georg Martius
LM&RoVLM
171
2
0
03 Mar 2025
M3HF: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality
M3HF: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality
Ziyan Wang
Zhicheng Zhang
Fei Fang
Yali Du
123
3
0
03 Mar 2025
Reducing Reward Dependence in RL Through Adaptive Confidence Discounting
Reducing Reward Dependence in RL Through Adaptive Confidence Discounting
Muhammed Yusuf Satici
David L. Roberts
OffRL
73
0
0
28 Feb 2025
Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids
Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids
Toru Lin
Kartik Sachdev
Linxi Fan
Jitendra Malik
Yuke Zhu
130
11
0
27 Feb 2025
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic
Stefano Viel
Luca Viano
Volkan Cevher
203
1
0
27 Feb 2025
Training a Generally Curious Agent
Training a Generally Curious Agent
Fahim Tajwar
Yiding Jiang
Abitha Thankaraj
Sumaita Sadia Rahman
J. Zico Kolter
Jeff Schneider
Ruslan Salakhutdinov
244
3
0
24 Feb 2025
Brain-Model Evaluations Need the NeuroAI Turing Test
Jenelle Feather
Meenakshi Khosla
N. Apurva Ratan Murty
Aran Nayebi
158
6
0
22 Feb 2025
Towards a Reward-Free Reinforcement Learning Framework for Vehicle Control
Towards a Reward-Free Reinforcement Learning Framework for Vehicle Control
Jielong Yang
Daoyuan Huang
106
0
0
21 Feb 2025
Improving Environment Novelty Quantification for Effective Unsupervised Environment Design
Jayden Teoh
Wenjun Li
Pradeep Varakantham
115
2
0
08 Feb 2025
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning
Wesley A. Suttle
A. Suresh
Carlos Nieto-Granda
OffRL
155
0
0
06 Feb 2025
CAIMAN: Causal Action Influence Detection for Sample-efficient Loco-manipulation
CAIMAN: Causal Action Influence Detection for Sample-efficient Loco-manipulation
Yuanchen Yuan
Jin Cheng
Núria Armengol Urpí
Stelian Coros
139
1
0
02 Feb 2025
Regularized Langevin Dynamics for Combinatorial Optimization
Regularized Langevin Dynamics for Combinatorial Optimization
Shengyu Feng
Yiming Yang
159
1
0
01 Feb 2025
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
Zishun Yu
Tengyu Xu
Di Jin
Karthik Abinav Sankararaman
Yun He
...
Eryk Helenowski
Chen Zhu
Sinong Wang
Hao Ma
Han Fang
LRM
242
11
0
29 Jan 2025
Episodic Novelty Through Temporal Distance
Y. Jiang
Qihan Liu
Yiqin Yang
Xiaoteng Ma
Dianyu Zhong
...
Jun Yang
Bin Liang
Bo Xu
Chongjie Zhang
Qianchuan Zhao
OffRL
111
1
0
28 Jan 2025
Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation
Wenzhang Liu
Lianjun Jin
Lu Ren
Chaoxu Mu
Changyin Sun
CML
92
0
0
24 Jan 2025
CuriousBot: Interactive Mobile Exploration via Actionable 3D Relational Object Graph
CuriousBot: Interactive Mobile Exploration via Actionable 3D Relational Object Graph
Yixuan Wang
Leonor Fermoselle
Tarik Kelestemur
Jiuguang Wang
Yunzhu Li
99
1
0
23 Jan 2025
Boosting MCTS with Free Energy Minimization
Boosting MCTS with Free Energy Minimization
Mawaba Pascal Dao
Adrian Peter
157
0
0
22 Jan 2025
Adaptive Data Exploitation in Deep Reinforcement Learning
Adaptive Data Exploitation in Deep Reinforcement Learning
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
OffRL
459
0
0
22 Jan 2025
Autonomous Algorithm for Training Autonomous Vehicles with Minimal Human Intervention
Autonomous Algorithm for Training Autonomous Vehicles with Minimal Human Intervention
Sang-Hyun Lee
Daehyeok Kwon
Seung-Woo Seo
140
1
0
17 Jan 2025
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
Yun Qu
Yuhang Jiang
Boyuan Wang
Yixiu Mao
Cheems Wang
Chang-Shu Liu
Xiangyang Ji
190
8
0
10 Jan 2025
CALM: Curiosity-Driven Auditing for Large Language Models
Xiang Zheng
Longxiang Wang
Yi Liu
Jie Zhang
Chao Shen
Cong Wang
MLAU
105
2
0
06 Jan 2025
PIMAEX: Multi-Agent Exploration through Peer Incentivization
Michael Kolle
Johannes Tochtermann
Julian Schonberger
Gerhard Stenzel
Philipp Altmann
Claudia Linnhoff-Popien
108
0
0
03 Jan 2025
β\betaβ-DQN: Improving Deep Q-Learning By Evolving the Behavior
Hongming Zhang
Fengshuo Bai
Chenjun Xiao
Chao Gao
Bo Xu
Martin Müller
OffRL
94
3
0
03 Jan 2025
Advances in Transformers for Robotic Applications: A Review
Advances in Transformers for Robotic Applications: A Review
Nikunj Sanghai
Nik Bear Brown
AI4CE
148
0
0
13 Dec 2024
Umbrella Reinforcement Learning -- computationally efficient tool for
  hard non-linear problems
Umbrella Reinforcement Learning -- computationally efficient tool for hard non-linear problems
Egor E. Nuzhin
Nikolai V. Brilliantov
98
1
0
21 Nov 2024
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
A. Jain
Harley Wiltzer
Jesse Farebrother
Irina Rish
Glen Berseth
Sanjiban Choudhury
141
2
0
11 Nov 2024
Grounding Video Models to Actions through Goal Conditioned Exploration
Grounding Video Models to Actions through Goal Conditioned Exploration
Yunhao Luo
Yilun Du
LM&RoVGen
150
5
0
11 Nov 2024
Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration
Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration
Xingrui Yu
Zhenglin Wan
David Mark Bossens
Yueming Lyu
Qing Guo
Ivor W. Tsang
490
0
0
11 Nov 2024
Learning World Models for Unconstrained Goal Navigation
Learning World Models for Unconstrained Goal Navigation
Yuanlin Duan
Wensen Mao
He Zhu
60
1
0
03 Nov 2024
Exploring the Edges of Latent State Clusters for Goal-Conditioned
  Reinforcement Learning
Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement Learning
Yuanlin Duan
Guofeng Cui
He Zhu
OffRL
126
0
0
03 Nov 2024
IO Transformer: Evaluating SwinV2-Based Reward Models for Computer
  Vision
IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision
Maxwell Meyer
Jack Spruyt
ViT
41
0
0
31 Oct 2024
Online Intrinsic Rewards for Decision Making Agents from Large Language
  Model Feedback
Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback
Qinqing Zheng
Mikael Henaff
Amy Zhang
Aditya Grover
Brandon Amos
LLMAGOffRL
111
3
0
30 Oct 2024
Prioritized Generative Replay
Prioritized Generative Replay
Renhao Wang
Kevin Frans
Pieter Abbeel
Sergey Levine
Alexei A. Efros
OnRLDiffM
195
7
0
23 Oct 2024
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSLOffRLOnRL
194
0
0
23 Oct 2024
Rethinking Soft Actor-Critic in High-Dimensional Action Spaces: The Cost of Ignoring Distribution Shift
Rethinking Soft Actor-Critic in High-Dimensional Action Spaces: The Cost of Ignoring Distribution Shift
Yanjun Chen
Wei Wei
Xianghui Wang
Zhiqiang Xu
Xiaoyu Shen
Wei Zhang
38
0
0
22 Oct 2024
We Urgently Need Intrinsically Kind Machines
We Urgently Need Intrinsically Kind Machines
Joshua T. S. Hewson
SyDa
27
0
0
21 Oct 2024
Novelty-based Sample Reuse for Continuous Robotics Control
Novelty-based Sample Reuse for Continuous Robotics Control
Ke Duan
Kai Yang
Houde Liu
Xueqian Wang
77
0
0
17 Oct 2024
Potential-Based Intrinsic Motivation: Preserving Optimality With
  Complex, Non-Markovian Shaping Rewards
Potential-Based Intrinsic Motivation: Preserving Optimality With Complex, Non-Markovian Shaping Rewards
Grant C. Forbes
Leonardo Villalobos-Arias
Jianxun Wang
Arnav Jhala
David L. Roberts
77
1
0
16 Oct 2024
Sample-Efficient Reinforcement Learning with Temporal Logic Objectives:
  Leveraging the Task Specification to Guide Exploration
Sample-Efficient Reinforcement Learning with Temporal Logic Objectives: Leveraging the Task Specification to Guide Exploration
Y. Kantaros
Jun Wang
112
5
0
16 Oct 2024
Disentangled Unsupervised Skill Discovery for Efficient Hierarchical
  Reinforcement Learning
Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning
Jiaheng Hu
Zizhao Wang
Peter Stone
Roberto Martín-Martín
78
2
0
15 Oct 2024
Latent-Predictive Empowerment: Measuring Empowerment without a Simulator
Latent-Predictive Empowerment: Measuring Empowerment without a Simulator
Andrew Levy
A. Allievi
George Konidaris
105
0
0
15 Oct 2024
Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control
Overcoming Slow Decision Frequencies in Continuous Control: Model-Based Sequence Reinforcement Learning for Model-Free Control
Devdhar Patel
H. Siegelmann
OffRL
125
0
0
11 Oct 2024
On the Evaluation of Generative Robotic Simulations
On the Evaluation of Generative Robotic Simulations
Feng Chen
Botian Xu
Pu Hua
Peiqi Duan
Yanchao Yang
Yi Ma
Huazhe Xu
VGen
102
0
0
10 Oct 2024
Fostering Intrinsic Motivation in Reinforcement Learning with Pretrained
  Foundation Models
Fostering Intrinsic Motivation in Reinforcement Learning with Pretrained Foundation Models
Alain Andres
Javier Del Ser
OffRL
58
0
0
09 Oct 2024
Effective Exploration Based on the Structural Information Principles
Effective Exploration Based on the Structural Information Principles
Xianghua Zeng
Hao Peng
Angsheng Li
69
2
0
09 Oct 2024
Effort Allocation for Deadline-Aware Task and Motion Planning: A
  Metareasoning Approach
Effort Allocation for Deadline-Aware Task and Motion Planning: A Metareasoning Approach
Yoonchang Sung
Shahaf S. Shperberg
Qi Wang
Peter Stone
70
0
0
08 Oct 2024
Previous
12345...262728
Next