ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1705.05363
  4. Cited By
Curiosity-driven Exploration by Self-supervised Prediction

Curiosity-driven Exploration by Self-supervised Prediction

15 May 2017
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
    LRMSSL
ArXiv (abs)PDFHTML

Papers citing "Curiosity-driven Exploration by Self-supervised Prediction"

50 / 1,353 papers shown
Title
LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning
LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning
Hyungho Na
IL-Chul Moon
69
1
0
30 May 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Mingqi Yuan
Roger Creus Castanyer
Bo Li
Xin Jin
Glen Berseth
Wenjun Zeng
176
0
0
29 May 2024
Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement
  Learning
Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning
Adriana Hugessen
Roger Creus Castanyer
Faisal Mohamed
Glen Berseth
70
0
0
27 May 2024
Knowing What Not to Do: Leverage Language Model Insights for Action
  Space Pruning in Multi-agent Reinforcement Learning
Knowing What Not to Do: Leverage Language Model Insights for Action Space Pruning in Multi-agent Reinforcement Learning
Zhihao Liu
Xianliang Yang
Zichuan Liu
Yifan Xia
Wei Jiang
Yuanyu Zhang
Lijuan Li
Guoliang Fan
Lei Song
Bian Jiang
LLMAG
82
3
0
27 May 2024
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Chenjia Bai
Rushuai Yang
Qiaosheng Zhang
Kang Xu
Yi Chen
Ting Xiao
Xuelong Li
OffRL
128
4
0
25 May 2024
Blood Glucose Control Via Pre-trained Counterfactual Invertible Neural
  Networks
Blood Glucose Control Via Pre-trained Counterfactual Invertible Neural Networks
Jingchi Jiang
Rujia Shen
Boran Wang
Yi Guan
OffRLBDL
75
1
0
23 May 2024
PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement
  Learning
PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Xuezhou Xu
Hang Su
Xingxing Zhang
Jun Zhu
132
5
0
23 May 2024
A social path to human-like artificial intelligence
A social path to human-like artificial intelligence
Edgar A. Duénez-Guzmán
Suzanne Sadedin
Jane X. Wang
Kevin R. McKee
Joel Z Leibo
GNN
98
30
0
22 May 2024
Ensuring Ground Truth Accuracy in Healthcare with the EVINCE framework
Ensuring Ground Truth Accuracy in Healthcare with the EVINCE framework
Edward Y. Chang
120
0
0
20 May 2024
Learning Future Representation with Synthetic Observations for
  Sample-efficient Reinforcement Learning
Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning
Xin Liu
Yaran Chen
Dong Zhao
79
2
0
20 May 2024
Visual Episodic Memory-based Exploration
Visual Episodic Memory-based Exploration
J. Vice
Natalie Ruiz-Sanchez
P. Douglas
G. Sukthankar
78
0
0
18 May 2024
Intrinsic Rewards for Exploration without Harm from Observational Noise:
  A Simulation Study Based on the Free Energy Principle
Intrinsic Rewards for Exploration without Harm from Observational Noise: A Simulation Study Based on the Free Energy Principle
Theodore Jerome Tinker
Kenji Doya
Jun Tani
59
0
0
13 May 2024
Enhancing Q-Learning with Large Language Model Heuristics
Enhancing Q-Learning with Large Language Model Heuristics
Xiefeng Wu
LRM
100
0
0
06 May 2024
Continuously evolving rewards in an open-ended environment
Continuously evolving rewards in an open-ended environment
Richard M. Bailey
75
1
0
02 May 2024
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through
  Exploiting State-Action Space Structure
MESA: Cooperative Meta-Exploration in Multi-Agent Learning through Exploiting State-Action Space Structure
Zhicheng Zhang
Yancheng Liang
Yi Wu
Fei Fang
72
2
0
01 May 2024
DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks
DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks
Tongzhou Mu
Minghua Liu
Hao Su
OffRL
96
4
0
25 Apr 2024
SwarmRL: Building the Future of Smart Active Systems
SwarmRL: Building the Future of Smart Active Systems
S. Tovey
Christoph Lohrmann
Tobias Merkt
David Zimmer
Konstantin Nikolaou
Simon Koppenhoefer
Anna Bushmakina
Jonas Scheunemann
Christian Holm
AI4CE
123
2
0
25 Apr 2024
Generalizing Multi-Step Inverse Models for Representation Learning to
  Finite-Memory POMDPs
Generalizing Multi-Step Inverse Models for Representation Learning to Finite-Memory POMDPs
Lili Wu
Ben Evans
Riashat Islam
Raihan Seraj
Yonathan Efroni
Alex Lamb
92
1
0
22 Apr 2024
Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned
  Reinforcement Learning
Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
61
0
0
19 Apr 2024
A Note on Loss Functions and Error Compounding in Model-based
  Reinforcement Learning
A Note on Loss Functions and Error Compounding in Model-based Reinforcement Learning
Nan Jiang
86
6
0
15 Apr 2024
RLHF Deciphered: A Critical Analysis of Reinforcement Learning from
  Human Feedback for LLMs
RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Shreyas Chaudhari
Pranjal Aggarwal
Vishvak Murahari
Tanmay Rajpurohit
Ashwin Kalyan
Karthik Narasimhan
Ameet Deshpande
Bruno Castro da Silva
91
38
0
12 Apr 2024
Is Exploration All You Need? Effective Exploration Characteristics for
  Transfer in Reinforcement Learning
Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning
Jonathan C. Balloch
Rishav Bhagat
Geigh Zollicoffer
Ruoran Jia
Julia Kim
Mark O. Riedl
OffRL
87
1
0
02 Apr 2024
Active Exploration in Bayesian Model-based Reinforcement Learning for
  Robot Manipulation
Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation
Carlos Plou
Ana C. Murillo
Ruben Martinez-Cantin
OffRL
95
0
0
02 Apr 2024
Mol-AIR: Molecular Reinforcement Learning with Adaptive Intrinsic
  Rewards for Goal-directed Molecular Generation
Mol-AIR: Molecular Reinforcement Learning with Adaptive Intrinsic Rewards for Goal-directed Molecular Generation
Jinyeong Park
Jaegyoon Ahn
Jonghwan Choi
Jibum Kim
87
4
0
29 Mar 2024
VDSC: Enhancing Exploration Timing with Value Discrepancy and State
  Counts
VDSC: Enhancing Exploration Timing with Value Discrepancy and State Counts
Marius Captari
Remo Sasso
M. Sabatelli
26
0
0
26 Mar 2024
Multistep Inverse Is Not All You Need
Multistep Inverse Is Not All You Need
Alexander Levine
Peter Stone
Amy Zhang
AI4CE
108
4
0
18 Mar 2024
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
Nicholas Zolman
Urban Fasel
J. Nathan Kutz
Steven L. Brunton
AI4CE
87
11
0
14 Mar 2024
Multi-Objective Optimization Using Adaptive Distributed Reinforcement
  Learning
Multi-Objective Optimization Using Adaptive Distributed Reinforcement Learning
Jing Tan
R. Khalili
Holger Karl
75
1
0
13 Mar 2024
Unveiling the Significance of Toddler-Inspired Reward Transition in
  Goal-Oriented Reinforcement Learning
Unveiling the Significance of Toddler-Inspired Reward Transition in Goal-Oriented Reinforcement Learning
Junseok Park
Yoonsung Kim
Hee Bin Yoo
Min Whoo Lee
Kibeom Kim
Won-Seok Choi
Minsu Lee
Byoung-Tak Zhang
OffRL
68
1
0
11 Mar 2024
Generalising Multi-Agent Cooperation through Task-Agnostic Communication
Generalising Multi-Agent Cooperation through Task-Agnostic Communication
Dulhan Jayalath
Steven D. Morad
Amanda Prorok
75
0
0
11 Mar 2024
TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation
  under Visual Corruptions
TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions
Maytus Piriyajitakonkij
Mingfei Sun
Mengmi Zhang
Wei Pan
ViT
111
1
0
04 Mar 2024
Efficient Episodic Memory Utilization of Cooperative Multi-Agent
  Reinforcement Learning
Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning
Hyungho Na
Yunkyeong Seo
IL-Chul Moon
75
7
0
02 Mar 2024
Curiosity-driven Red-teaming for Large Language Models
Curiosity-driven Red-teaming for Large Language Models
Zhang-Wei Hong
Idan Shenfeld
Tsun-Hsuan Wang
Yung-Sung Chuang
Aldo Pareja
James R. Glass
Akash Srivastava
Pulkit Agrawal
LRM
116
45
0
29 Feb 2024
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Hany Hamed
Subin Kim
Dongyeong Kim
Jaesik Yoon
Sungjin Ahn
92
4
0
29 Feb 2024
Unsupervised Zero-Shot Reinforcement Learning via Functional Reward
  Encodings
Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings
Kevin Frans
Seohong Park
Pieter Abbeel
Sergey Levine
OffRL
111
13
0
27 Feb 2024
Foundation Policies with Hilbert Representations
Foundation Policies with Hilbert Representations
Seohong Park
Tobias Kreiman
Sergey Levine
SSLOffRL
112
30
0
23 Feb 2024
RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for
  Robotic Manipulation
RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation
Hanxiao Jiang
Binghao Huang
Ruihai Wu
Zhuoran Li
Shubham Garg
H. Nayyeri
Shenlong Wang
Yunzhu Li
103
23
0
23 Feb 2024
Trajectory-wise Iterative Reinforcement Learning Framework for
  Auto-bidding
Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding
Haoming Li
Yusen Huo
Shuai Dou
Zhenzhe Zheng
Zhilin Zhang
Chuan Yu
Jian Xu
Fan Wu
OffRL
86
5
0
23 Feb 2024
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy
  Regularization
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji
Yongyuan Liang
Yan Zeng
Yu-Juan Luo
Guowei Xu
Jiawei Guo
Ruijie Zheng
Furong Huang
Gang Hua
Huazhe Xu
CML
117
12
0
22 Feb 2024
MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback
  and Dynamic Distance Constraint
MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint
Xinglin Zhou
Yifu Yuan
Shaofu Yang
Jianye Hao
77
2
0
22 Feb 2024
Data-driven Discovery with Large Generative Models
Data-driven Discovery with Large Generative Models
Bodhisattwa Prasad Majumder
Harshit Surana
Dhruv Agarwal
Sanchaita Hazra
Ashish Sabharwal
Peter Clark
87
13
0
21 Feb 2024
Discrete Probabilistic Inference as Control in Multi-path Environments
Discrete Probabilistic Inference as Control in Multi-path Environments
T. Deleu
Padideh Nouri
Nikolay Malkin
Doina Precup
Yoshua Bengio
183
31
0
15 Feb 2024
Potential-Based Reward Shaping For Intrinsic Motivation
Potential-Based Reward Shaping For Intrinsic Motivation
Grant C. Forbes
Nitish Gupta
Leonardo Villalobos-Arias
Colin M. Potts
Arnav Jhala
David L. Roberts
18
5
0
12 Feb 2024
Monitored Markov Decision Processes
Monitored Markov Decision Processes
Simone Parisi
Montaser Mohammedalamen
Alireza Kazemipour
Matthew E. Taylor
Michael Bowling
OffRL
92
4
0
09 Feb 2024
Scaling Artificial Intelligence for Digital Wargaming in Support of
  Decision-Making
Scaling Artificial Intelligence for Digital Wargaming in Support of Decision-Making
Scotty Black
Christian J. Darken
27
2
0
08 Feb 2024
Learning Diverse Policies with Soft Self-Generated Guidance
Learning Diverse Policies with Soft Self-Generated Guidance
Guojian Wang
Faguo Wu
Xiao Zhang
Jianxiang Liu
OffRL
63
4
0
07 Feb 2024
The Essential Role of Causality in Foundation World Models for Embodied
  AI
The Essential Role of Causality in Foundation World Models for Embodied AI
Tarun Gupta
Wenbo Gong
Chao Ma
Nick Pawlowski
Agrin Hilmkil
...
Jianfeng Gao
Stefan Bauer
Danica Kragic
Bernhard Schölkopf
Cheng Zhang
92
17
0
06 Feb 2024
Joint Intrinsic Motivation for Coordinated Exploration in Multi-Agent
  Deep Reinforcement Learning
Joint Intrinsic Motivation for Coordinated Exploration in Multi-Agent Deep Reinforcement Learning
Maxime Toquebiau
Nicolas Bredeche
F. Benamar
Jae-Yun Jun
77
1
0
06 Feb 2024
A call for embodied AI
A call for embodied AI
Giuseppe Paolo
Jonas Gonzalez-Billandon
Balázs Kégl
LM&Ro
92
9
0
06 Feb 2024
Reinforcement Learning from Bagged Reward
Reinforcement Learning from Bagged Reward
Yuting Tang
Xin-Qiang Cai
Yao-Xiang Ding
Qiyu Wu
Guoqing Liu
Masashi Sugiyama
OffRL
86
0
0
06 Feb 2024
Previous
12345...262728
Next