State Entropy Maximization with Random Encoders for Efficient Exploration

International Conference on Machine Learning (ICML), 2021
18 February 2021
Younggyo Seo
Lili Chen
Jinwoo Shin
Honglak Lee
Pieter Abbeel
Kimin Lee

Papers citing "State Entropy Maximization with Random Encoders for Efficient Exploration"

50 / 94 papers shown
Polychromic Objectives for Reinforcement Learning
Jubayer Ibn Hamid
Ifdita Hasan Orney
Ellen Xu
Chelsea Finn
Dorsa Sadigh
OffRL
OnRL
166
3
0
29 Sep 2025
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning
Yulei Qin
Xiaoyu Tan
Zhengbao He
Gang Li
Haojia Lin
...
Yuzheng Cai
Xuan Zhang
Sheng Ye
Ke Li
Xing Sun
539
2
0
26 Sep 2025
Diverse Mini-Batch Selection in Reinforcement Learning for Efficient Chemical Exploration in de novo Drug Design
Hampus Gummesson Svensson
Ola Engkvist
J. Janet
C. Tyrchan
M. Chehreghani
OffRL
407
1
0
26 Jun 2025
Provable Maximum Entropy Manifold Exploration via Diffusion Models
Riccardo De Santi
Marin Vlastelica
Ya-Ping Hsieh
Zebang Shen
Niao He
Andreas Krause
DiffM
249
8
0
18 Jun 2025
Predictability-Based Curiosity-Guided Action Symbol Discovery
Burcu Kilic
Alper Ahmetoglu
Emre Ugur
214
0
0
23 May 2025
Imagine, Verify, Execute: Memory-guided Agentic Exploration with Vision-Language Models
Seungjae Lee
Daniel Ekpo
Haowen Liu
Furong Huang
Abhinav Shrivastava
Jia-Bin Huang
LM&Ro
804
0
0
12 May 2025
Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story
Vincenzo De Paola
Riccardo Zamboni
Mirco Mutti
Marcello Restelli
529
3
0
02 May 2025
DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems
Se-Wook Yoo
Seung-Woo Seo
433
0
0
30 Jan 2025
Episodic Novelty Through Temporal Distance
International Conference on Learning Representations (ICLR), 2025
Y. Jiang
Qihan Liu
Yiqin Yang
Xiaoteng Ma
Dianyu Zhong
...
Jun Yang
Bin Liang
Bo Xu
Chongjie Zhang
Qianchuan Zhao
OffRL
402
10
0
28 Jan 2025
NBDI: A Simple and Effective Termination Condition for Skill Extraction from Task-Agnostic Demonstrations
Myunsoo Kim
Hayeong Lee
Seong-Woong Shim
JunHo Seo
Byung-Jun Lee
LLMAG
459
0
0
22 Jan 2025
The impact of intrinsic rewards on exploration in Reinforcement Learning
Aya Kayal
Eduardo Pignatelli
Laura Toni
305
8
0
20 Jan 2025
Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration
Adaptive Agents and Multi-Agent Systems (AAMAS), 2024
Xingrui Yu
Zhenglin Wan
David Mark Bossens
Yueming Lyu
Qing Guo
Ivor W. Tsang
1.2K
2
0
11 Nov 2024
Robot Policy Learning with Temporal Optimal Transport Reward
Neural Information Processing Systems (NeurIPS), 2024
Yuwei Fu
Haichao Zhang
Di Wu
Wei Xu
Benoit Boulet
OffRL
282
6
0
29 Oct 2024
Effective Exploration Based on the Structural Information Principles
Neural Information Processing Systems (NeurIPS), 2024
Xianghua Zeng
Hao Peng
Angsheng Li
184
10
0
09 Oct 2024
Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration
Yun Qu
Boyuan Wang
Yuhang Jiang
Jianzhun Shao
Yixiu Mao
Cheems Wang
Chang Liu
Xiangyang Ji
401
12
0
03 Oct 2024
LiRA: Light-Robust Adversary for Model-based Reinforcement Learning in Real World
Taisuke Kobayashi
559
2
0
29 Sep 2024
Hyp2Nav: Hyperbolic Planning and Curiosity for Crowd Navigation
Guido Maria D'Amely di Melendugno
Alessandro Flaborea
Pascal Mettes
Yuta Kyuragi
336
2
0
18 Jul 2024
Constrained Intrinsic Motivation for Reinforcement Learning
Xiang Zheng
Jie Zhang
Chao Shen
Cong Wang
322
5
0
12 Jul 2024
Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization
Liam Schramm
Abdeslam Boularias
281
1
0
07 Jul 2024
External Model Motivated Agents: Reinforcement Learning for Enhanced Environment Sampling
Rishav Bhagat
Jonathan C. Balloch
Zhiyu Lin
Julia Kim
Mark O. Riedl
323
0
0
28 Jun 2024
The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough
Riccardo Zamboni
Duilio Cirino
Marcello Restelli
Mirco Mutti
358
7
0
18 Jun 2024
How to Explore with Belief: State Entropy Maximization in POMDPs
Riccardo Zamboni
Duilio Cirino
Marcello Restelli
Mirco Mutti
287
6
0
04 Jun 2024
Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient
Zechu Li
Rickmer Krohn
Tao Chen
Anurag Ajay
Pulkit Agrawal
Georgia Chalvatzaki
DiffM
272
43
0
02 Jun 2024
RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning
Mingqi Yuan
Roger Creus Castanyer
Bo Li
Xin Jin
Glen Berseth
Wenjun Zeng
578
10
0
29 May 2024
Function Approximation for Reinforcement Learning Controller for Energy from Spread Waves
Soumyendu Sarkar
Vineet Gundecha
Sahand Ghorbanpour
Alexander Shmakov
Ashwin Ramesh Babu
Avisek Naug
Alexandre Frederic Julien Pichard
Mathieu Cocho
217
8
0
17 Apr 2024
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
Tianying Ji
Yongyuan Liang
Yan Zeng
Yu-Juan Luo
Guowei Xu
Jiawei Guo
Ruijie Zheng
Furong Huang
Gang Hua
Huazhe Xu
CML
386
20
0
22 Feb 2024
CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning
Chenyu Sun
Hangwei Qian
Chunyan Miao
OffRL
281
1
0
19 Dec 2023
Learning to Discover Skills through Guidance
Neural Information Processing Systems (NeurIPS), 2023
Hyunseung Kim
ByungKun Lee
Hojoon Lee
Dongyoon Hwang
Sejik Park
Kyushik Min
Jaegul Choo
451
12
0
31 Oct 2023
DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization
International Conference on Learning Representations (ICLR), 2023
Guowei Xu
Ruijie Zheng
Yongyuan Liang
Xiyao Wang
Zhecheng Yuan
...
Shuzhen Li
Yanjie Ze
Hal Daumé
Furong Huang
Huazhe Xu
389
52
0
30 Oct 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
International Conference on Machine Learning (ICML), 2023
Seongun Kim
Kyowoon Lee
Jaesik Choi
SSL
DRL
354
16
0
30 Oct 2023
Unsupervised Behavior Extraction via Random Intent Priors
Neural Information Processing Systems (NeurIPS), 2023
Haotian Hu
Yiqin Yang
Jianing Ye
Ziqing Mai
Chongjie Zhang
OffRL
318
15
0
28 Oct 2023
Improving Intrinsic Exploration by Creating Stationary Objectives
International Conference on Learning Representations (ICLR), 2023
Roger Creus Castanyer
Javier Civera
Taihú Pire
OffRL
496
4
0
27 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
International Conference on Learning Representations (ICLR), 2023
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
499
82
0
13 Oct 2023
RoboCLIP: One Demonstration is Enough to Learn Robot Policies
Neural Information Processing Systems (NeurIPS), 2023
Sumedh Anand Sontakke
Jesse Zhang
Sébastien M. R. Arnold
Karl Pertsch
Erdem Biyik
Dorsa Sadigh
Chelsea Finn
Laurent Itti
OffRL
276
135
0
11 Oct 2023
COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL
International Conference on Learning Representations (ICLR), 2023
Xiyao Wang
Ruijie Zheng
Yanchao Sun
Ruonan Jia
Wichayaporn Wongkamjan
Huazhe Xu
Furong Huang
OffRL
344
19
0
11 Oct 2023
Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration
Ziqi Zhang
Xiao Xiong
Zifeng Zhuang
Jinxin Liu
Xuetao Zhang
OffRL
OnRL
432
0
0
07 Oct 2023
RLLTE: Long-Term Evolution Project of Reinforcement Learning
AAAI Conference on Artificial Intelligence (AAAI), 2023
Tao Lv
Zequn Zhang
Yang Xu
Shihao Luo
Bo Li
Xin Jin
Wenjun Zeng
OffRL
261
4
0
28 Sep 2023
Maximum diffusion reinforcement learning
Thomas A. Berrueta
Allison Pinosky
Todd Murphey
AI4CE
DiffM
590
24
0
26 Sep 2023
Go Beyond Imagination: Maximizing Episodic Reachability with World Models
International Conference on Machine Learning (ICML), 2023
Yao Fu
Run Peng
Honglak Lee
237
1
0
25 Aug 2023
Reinforcement Learning by Guided Safe Exploration
European Conference on Artificial Intelligence (ECAI), 2023
Qisong Yang
T. D. Simão
N. Jansen
Simon Tindemans
M. Spaan
OffRL
OnRL
255
10
0
26 Jul 2023
FOCUS: Object-Centric World Models for Robotics Manipulation
Stefano Ferraro
Pietro Mazzaglia
Tim Verbelen
Bart Dhoedt
OCL
LM&Ro
311
17
0
05 Jul 2023
Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks
Neural Information Processing Systems (NeurIPS), 2023
Maxime Chevalier-Boisvert
Bolun Dai
Mark Towers
Rodrigo de Lazcano
Lucas Willems
Salem Lahlou
Suman Pal
Pablo Samuel Castro
Jordan Terry
VGen
414
345
0
24 Jun 2023
CLUE: Calibrated Latent Guidance for Offline Reinforcement Learning
Conference on Robot Learning (CoRL), 2023
Jinxin Liu
Lipeng Zu
Li He
Xuetao Zhang
OffRL
404
13
0
23 Jun 2023
Explore to Generalize in Zero-Shot RL
Neural Information Processing Systems (NeurIPS), 2023
E. Zisselman
Itai Lavie
Daniel Soudry
Aviv Tamar
416
23
0
05 Jun 2023
Accelerating Reinforcement Learning with Value-Conditional State Entropy Exploration
Neural Information Processing Systems (NeurIPS), 2023
Dongyoung Kim
Jinwoo Shin
Pieter Abbeel
Younggyo Seo
292
31
0
31 May 2023
Unlocking the Power of Representations in Long-term Novelty-based Exploration
International Conference on Learning Representations (ICLR), 2023
Alaa Saade
Steven Kapturowski
Daniele Calandriello
Charles Blundell
Pablo Sprechmann
Leopoldo Sarra
Oliver Groth
Michal Valko
Bilal Piot
OffRL
435
10
0
02 May 2023
Bridging RL Theory and Practice with the Effective Horizon
Neural Information Processing Systems (NeurIPS), 2023
Cassidy Laidlaw
Stuart J. Russell
Anca Dragan
OffRL
402
40
0
19 Apr 2023
Data-efficient, Explainable and Safe Box Manipulation: Illustrating the Advantages of Physical Priors in Model-Predictive Control
Conference on Learning for Dynamics & Control (L4DC), 2023
Achkan Salehi
Stéphane Doncieux
OffRL
245
2
0
02 Mar 2023
Self-supervised network distillation: an effective approach to exploration in sparse reward environments
Neurocomputing, 2023
Matej Pecháč
M. Chovanec
Igor Farkaš
294
10
0
22 Feb 2023
Improving robot navigation in crowded environments using intrinsic rewards
IEEE International Conference on Robotics and Automation (ICRA), 2023
Diego Martínez Baselga
L. Riazuelo
Luis Montano
468
22
0
13 Feb 2023