v1v2v3v4 (latest)

State Entropy Maximization with Random Encoders for Efficient Exploration

International Conference on Machine Learning (ICML), 2021

18 February 2021

Pieter Abbeel

ArXiv (abs)PDF HTML Github (2434★)

Papers citing "State Entropy Maximization with Random Encoders for Efficient Exploration"

50 / 94 papers shown

Polychromic Objectives for Reinforcement Learning

166

29 Sep 2025

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

...

539

26 Sep 2025

Diverse Mini-Batch Selection in Reinforcement Learning for Efficient Chemical Exploration in de novo Drug Design

Hampus Gummesson Svensson

407

26 Jun 2025

Provable Maximum Entropy Manifold Exploration via Diffusion Models

249

18 Jun 2025

Predictability-Based Curiosity-Guided Action Symbol Discovery

Burcu Kilic

Alper Ahmetoglu

Emre Ugur

214

23 May 2025

Imagine, Verify, Execute: Memory-guided Agentic Exploration with Vision-Language Models

804

12 May 2025

Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story

529

02 May 2025

DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems

Se-Wook Yoo

Seung-Woo Seo

433

30 Jan 2025

Episodic Novelty Through Temporal DistanceInternational Conference on Learning Representations (ICLR), 2025

...

402

28 Jan 2025

NBDI: A Simple and Effective Termination Condition for Skill Extraction from Task-Agnostic Demonstrations

459

22 Jan 2025

The impact of intrinsic rewards on exploration in Reinforcement Learning

Aya Kayal

Eduardo Pignatelli

Laura Toni

305

20 Jan 2025

Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive ExplorationAdaptive Agents and Multi-Agent Systems (AAMAS), 2024

1.2K

11 Nov 2024

Robot Policy Learning with Temporal Optimal Transport RewardNeural Information Processing Systems (NeurIPS), 2024

Haichao Zhang

282

29 Oct 2024

Effective Exploration Based on the Structural Information PrinciplesNeural Information Processing Systems (NeurIPS), 2024

Xianghua Zeng

Hao Peng

Angsheng Li

184

09 Oct 2024

Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration

Chang Liu

401

03 Oct 2024

LiRA: Light-Robust Adversary for Model-based Reinforcement Learning in Real World

Taisuke Kobayashi

559

29 Sep 2024

Hyp2Nav: Hyperbolic Planning and Curiosity for Crowd Navigation

Guido Maria DÁmely di Melendugno

Alessandro Flaborea

Pascal Mettes

Yuta Kyuragi

336

18 Jul 2024

Constrained Intrinsic Motivation for Reinforcement Learning

Xiang Zheng

Jie Zhang

Chao Shen

Cong Wang

322

12 Jul 2024

Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization

Liam Schramm

Abdeslam Boularias

281

07 Jul 2024

External Model Motivated Agents: Reinforcement Learning for Enhanced Environment Sampling

323

28 Jun 2024

The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough

358

18 Jun 2024

How to Explore with Belief: State Entropy Maximization in POMDPs

287

04 Jun 2024

Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient

Tao Chen

272

02 Jun 2024

RLeXplore: Accelerating Research in Intrinsically-Motivated Reinforcement Learning

Mingqi Yuan

Roger Creus Castanyer

578

29 May 2024

Function Approximation for Reinforcement Learning Controller for Energy from Spread Waves

Alexandre Frederic Julien Pichard

Mathieu Cocho

217

17 Apr 2024

ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization

Furong Huang

386

22 Feb 2024

CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning

281

19 Dec 2023

Learning to Discover Skills through GuidanceNeural Information Processing Systems (NeurIPS), 2023

451

31 Oct 2023

DrM: Mastering Visual Reinforcement Learning through Dormant Ratio MinimizationInternational Conference on Learning Representations (ICLR), 2023

...

Furong Huang

389

30 Oct 2023

Variational Curriculum Reinforcement Learning for Unsupervised Discovery of SkillsInternational Conference on Machine Learning (ICML), 2023

354

30 Oct 2023

Unsupervised Behavior Extraction via Random Intent PriorsNeural Information Processing Systems (NeurIPS), 2023

318

28 Oct 2023

Improving Intrinsic Exploration by Creating Stationary ObjectivesInternational Conference on Learning Representations (ICLR), 2023

Roger Creus Castanyer

Javier Civera

Taihú Pire

OffRL

496

27 Oct 2023

METRA: Scalable Unsupervised RL with Metric-Aware AbstractionInternational Conference on Learning Representations (ICLR), 2023

499

13 Oct 2023

RoboCLIP: One Demonstration is Enough to Learn Robot PoliciesNeural Information Processing Systems (NeurIPS), 2023

Sumedh Anand Sontakke

Jesse Zhang

Sébastien M. R. Arnold

Dorsa Sadigh

276

135

11 Oct 2023

COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RLInternational Conference on Learning Representations (ICLR), 2023

Wichayaporn Wongkamjan

Huazhe Xu

Furong Huang

OffRL

344

11 Oct 2023

Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration

432

07 Oct 2023

RLLTE: Long-Term Evolution Project of Reinforcement LearningAAAI Conference on Artificial Intelligence (AAAI), 2023

261

28 Sep 2023

Maximum diffusion reinforcement learning

590

26 Sep 2023

Go Beyond Imagination: Maximizing Episodic Reachability with World ModelsInternational Conference on Machine Learning (ICML), 2023

Yao Fu

Run Peng

Honglak Lee

237

25 Aug 2023

Reinforcement Learning by Guided Safe ExplorationEuropean Conference on Artificial Intelligence (ECAI), 2023

255

26 Jul 2023

FOCUS: Object-Centric World Models for Robotics Manipulation

311

05 Jul 2023

Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented TasksNeural Information Processing Systems (NeurIPS), 2023

Maxime Chevalier-Boisvert

414

345

24 Jun 2023

CLUE: Calibrated Latent Guidance for Offline Reinforcement LearningConference on Robot Learning (CoRL), 2023

404

23 Jun 2023

Explore to Generalize in Zero-Shot RLNeural Information Processing Systems (NeurIPS), 2023

416

05 Jun 2023

Accelerating Reinforcement Learning with Value-Conditional State Entropy ExplorationNeural Information Processing Systems (NeurIPS), 2023

Dongyoung Kim

Jinwoo Shin

Pieter Abbeel

Younggyo Seo

292

31 May 2023

Unlocking the Power of Representations in Long-term Novelty-based ExplorationInternational Conference on Learning Representations (ICLR), 2023

Daniele Calandriello

Bilal Piot

435

02 May 2023

Bridging RL Theory and Practice with the Effective HorizonNeural Information Processing Systems (NeurIPS), 2023

402

19 Apr 2023

Data-efficient, Explainable and Safe Box Manipulation: Illustrating the Advantages of Physical Priors in Model-Predictive ControlConference on Learning for Dynamics & Control (L4DC), 2023

Achkan Salehi

Stéphane Doncieux

OffRL

245

02 Mar 2023

Self-supervised network distillation: an effective approach to exploration in sparse reward environmentsNeurocomputing (Neurocomputing), 2023

Matej Pecháč

M. Chovanec

Igor Farkaš

294

22 Feb 2023

Improving robot navigation in crowded environments using intrinsic rewardsIEEE International Conference on Robotics and Automation (ICRA), 2023

Diego Martínez Baselga

L. Riazuelo

Luis Montano

468

13 Feb 2023