ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.06070
  4. Cited By
Diversity is All You Need: Learning Skills without a Reward Function

Diversity is All You Need: Learning Skills without a Reward Function

16 February 2018
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine
ArXivPDFHTML

Papers citing "Diversity is All You Need: Learning Skills without a Reward Function"

50 / 260 papers shown
Title
InfoPO: On Mutual Information Maximization for Large Language Model Alignment
InfoPO: On Mutual Information Maximization for Large Language Model Alignment
Teng Xiao
Zhen Ge
Sujay Sanghavi
Tian Wang
Julian Katz-Samuels
Marc Versage
Qingjun Cui
Trishul Chilimbi
31
0
0
13 May 2025
Explainable Reinforcement Learning Agents Using World Models
Explainable Reinforcement Learning Agents Using World Models
Madhuri Singh
Amal Alabdulkarim
Gennie Mansi
Mark O. Riedl
26
0
0
12 May 2025
Merging and Disentangling Views in Visual Reinforcement Learning for Robotic Manipulation
Merging and Disentangling Views in Visual Reinforcement Learning for Robotic Manipulation
Abdulaziz Almuzairee
Rohan Patil
Dwait Bhatt
Henrik I. Christensen
34
0
0
07 May 2025
D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection
D3HRL: A Distributed Hierarchical Reinforcement Learning Approach Based on Causal Discovery and Spurious Correlation Detection
Chenran Zhao
Dianxi Shi
Mengzhu Wang
Jianqiang Xia
Huanhuan Yang
Songchang Jin
Shaowu Yang
Chunping Qiu
45
0
0
04 May 2025
Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story
Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story
Vincenzo De Paola
Riccardo Zamboni
Mirco Mutti
Marcello Restelli
19
0
0
02 May 2025
Improving Human-AI Coordination through Adversarial Training and Generative Models
Improving Human-AI Coordination through Adversarial Training and Generative Models
Paresh Chaudhary
Yancheng Liang
Daphne Chen
S. Du
Natasha Jaques
71
0
0
21 Apr 2025
Reinforcement Learning from Multi-level and Episodic Human Feedback
Reinforcement Learning from Multi-level and Episodic Human Feedback
Muhammad Qasim Elahi
Somtochukwu Oguchienti
Maheed H. Ahmed
Mahsa Ghasemi
OffRL
55
0
0
20 Apr 2025
Causally Aligned Curriculum Learning
Causally Aligned Curriculum Learning
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
CML
67
3
0
21 Mar 2025
Behaviour Discovery and Attribution for Explainable Reinforcement Learning
Rishav Rishav
Somjit Nath
Vincent Michalski
Samira Ebrahimi Kahou
FAtt
OffRL
73
0
0
19 Mar 2025
Training a Generally Curious Agent
Training a Generally Curious Agent
Fahim Tajwar
Yiding Jiang
Abitha Thankaraj
Sumaita Sadia Rahman
J. Zico Kolter
Jeff Schneider
Ruslan Salakhutdinov
126
1
0
24 Feb 2025
Skill Expansion and Composition in Parameter Space
Skill Expansion and Composition in Parameter Space
Tenglong Liu
Junjie Li
Yinan Zheng
Haoyi Niu
Yixing Lan
Xin Xu
Xianyuan Zhan
64
4
0
09 Feb 2025
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning
Wesley A. Suttle
A. Suresh
Carlos Nieto-Granda
OffRL
100
0
0
06 Feb 2025
Measuring Diversity of Game Scenarios
Measuring Diversity of Game Scenarios
Yuchen Li
Ziqi Wang
Qingquan Zhang
Jialin Liu
Qingbin Liu
65
2
0
17 Jan 2025
Learning to Assist Humans without Inferring Rewards
Learning to Assist Humans without Inferring Rewards
Vivek Myers
Evan Ellis
Sergey Levine
Benjamin Eysenbach
Anca Dragan
48
3
0
17 Jan 2025
Dual-Force: Enhanced Offline Diversity Maximization under Imitation Constraints
Dual-Force: Enhanced Offline Diversity Maximization under Imitation Constraints
Pavel Kolev
Marin Vlastelica
Georg Martius
OffRL
53
0
0
08 Jan 2025
Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution
Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution
Ke Xue
Yutong Wang
Cong Guan
Lei Yuan
Haobo Fu
Qiang Fu
Chao Qian
Yang Yu
42
17
0
03 Jan 2025
Subgoal Discovery Using a Free Energy Paradigm and State Aggregations
Subgoal Discovery Using a Free Energy Paradigm and State Aggregations
Amirhossein Mesbah
Reshad Hosseini
Seyed Pooya Shariatpanahi
M. N. Ahmadabadi
77
0
0
21 Dec 2024
Grounding Video Models to Actions through Goal Conditioned Exploration
Grounding Video Models to Actions through Goal Conditioned Exploration
Yunhao Luo
Yilun Du
LM&Ro
VGen
90
1
0
11 Nov 2024
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSL
OffRL
OnRL
59
0
0
23 Oct 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
46
6
0
06 Aug 2024
Random Latent Exploration for Deep Reinforcement Learning
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
33
3
0
18 Jul 2024
Global Reinforcement Learning: Beyond Linear and Convex Rewards via
  Submodular Semi-gradient Methods
Global Reinforcement Learning: Beyond Linear and Convex Rewards via Submodular Semi-gradient Methods
Ric De Santi
Manish Prajapat
Andreas Krause
38
3
0
13 Jul 2024
Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey
Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey
Milan Ganai
Sicun Gao
Sylvia Herbert
42
6
0
12 Jul 2024
Exploration by Learning Diverse Skills through Successor State Measures
Exploration by Learning Diverse Skills through Successor State Measures
Paul-Antoine Le Tolguenec
Yann Besse
Florent Teichteil-Königsbuch
Dennis G. Wilson
Emmanuel Rachelson
40
0
0
14 Jun 2024
Language Guided Skill Discovery
Language Guided Skill Discovery
Seungeun Rho
Laura Smith
Tianyu Li
Sergey Levine
Xue Bin Peng
Sehoon Ha
LM&Ro
42
4
0
07 Jun 2024
Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language
  Models
Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language Models
Phat Nguyen
Tsun-Hsuan Wang
Zhang-Wei Hong
S. Karaman
Daniela Rus
LM&Ro
45
3
0
06 Jun 2024
Do's and Don'ts: Learning Desirable Skills with Instruction Videos
Do's and Don'ts: Learning Desirable Skills with Instruction Videos
Hyunseung Kim
ByungKun Lee
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Jaegul Choo
39
1
0
01 Jun 2024
Variational Offline Multi-agent Skill Discovery
Variational Offline Multi-agent Skill Discovery
Jiayu Chen
Bhargav Ganguly
Tian-Shing Lan
OffRL
69
3
0
26 May 2024
Learning Future Representation with Synthetic Observations for
  Sample-efficient Reinforcement Learning
Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning
Xin Liu
Yaran Chen
Dong Zhao
45
1
0
20 May 2024
Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned
  Reinforcement Learning
Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned Reinforcement Learning
Lisheng Wu
Ke Chen
29
0
0
19 Apr 2024
Effective Reinforcement Learning Based on Structural Information
  Principles
Effective Reinforcement Learning Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Dingli Su
Angsheng Li
40
0
0
15 Apr 2024
Active Exploration in Bayesian Model-based Reinforcement Learning for
  Robot Manipulation
Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation
Carlos Plou
Ana C. Murillo
Ruben Martinez-Cantin
OffRL
40
0
0
02 Apr 2024
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming
Hany Hamed
Subin Kim
Dongyeong Kim
Jaesik Yoon
Sungjin Ahn
47
4
0
29 Feb 2024
Align Your Intents: Offline Imitation Learning via Optimal Transport
Align Your Intents: Offline Imitation Learning via Optimal Transport
Maksim Bobrin
N. Buzun
Dmitrii Krylov
Dmitry V. Dylov
OffRL
51
3
0
20 Feb 2024
One-shot Imitation in a Non-Stationary Environment via Multi-Modal Skill
One-shot Imitation in a Non-Stationary Environment via Multi-Modal Skill
Sangwoo Shin
Daehee Lee
Minjong Yoo
Woo Kyung Kim
Honguk Woo
32
10
0
13 Feb 2024
Inverse Reinforcement Learning by Estimating Expertise of Demonstrators
Inverse Reinforcement Learning by Estimating Expertise of Demonstrators
M. Beliaev
Ramtin Pedarsani
41
2
0
02 Feb 2024
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User
  Experiences in Recommender Systems
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems
Changshuo Zhang
Sirui Chen
Xiao Zhang
Sunhao Dai
Weijie Yu
Jun Xu
OffRL
37
1
0
17 Jan 2024
Adaptive trajectory-constrained exploration strategy for deep
  reinforcement learning
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
Guojian Wang
Faguo Wu
Xiao Zhang
Ning Guo
Zhiming Zheng
41
3
0
27 Dec 2023
SkillDiffuser: Interpretable Hierarchical Planning via Skill
  Abstractions in Diffusion-Based Task Execution
SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution
Zhixuan Liang
Yao Mu
Hengbo Ma
Masayoshi Tomizuka
Mingyu Ding
Ping Luo
38
38
0
18 Dec 2023
Augmenting Unsupervised Reinforcement Learning with Self-Reference
Augmenting Unsupervised Reinforcement Learning with Self-Reference
Andrew Zhao
Erle Zhu
Rui Lu
Matthieu Lin
Yong-Jin Liu
Gao Huang
SSL
34
1
0
16 Nov 2023
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
Xuzhe Dang
Stefan Edelkamp
37
4
0
06 Nov 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery
  of Skills
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
Seongun Kim
Kyowoon Lee
Jaesik Choi
SSL
DRL
41
7
0
30 Oct 2023
Iteratively Learn Diverse Strategies with State Distance Information
Iteratively Learn Diverse Strategies with State Distance Information
Wei Fu
Weihua Du
Jingwei Li
Sunli Chen
Jingzhao Zhang
Yi Wu
53
3
0
23 Oct 2023
Cousins Of The Vendi Score: A Family Of Similarity-Based Diversity
  Metrics For Science And Machine Learning
Cousins Of The Vendi Score: A Family Of Similarity-Based Diversity Metrics For Science And Machine Learning
Amey P. Pasarkar
Adji Bousso Dieng
27
11
0
19 Oct 2023
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
Seohong Park
Oleh Rybkin
Sergey Levine
OffRL
33
34
0
13 Oct 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate
  Exploration Bias
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Max Sobol Mark
Archit Sharma
Fahim Tajwar
Rafael Rafailov
Sergey Levine
Chelsea Finn
OffRL
OnRL
34
1
0
12 Oct 2023
Generalized Animal Imitator: Agile Locomotion with Versatile Motion Prior
Generalized Animal Imitator: Agile Locomotion with Versatile Motion Prior
Ruihan Yang
Zhuoqun Chen
Jianhan Ma
Chongyi Zheng
Yiyu Chen
Quan Nguyen
Junfeng Fang
45
17
0
02 Oct 2023
Zero-Shot Reinforcement Learning from Low Quality Data
Zero-Shot Reinforcement Learning from Low Quality Data
Scott Jeen
Tom Bewley
Jonathan M. Cullen
OffRL
OnRL
40
1
0
26 Sep 2023
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement
  Learning
Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning
David Yunis
Justin Jung
Falcon Z. Dai
Matthew R. Walter
OffRL
47
0
0
08 Sep 2023
Language Reward Modulation for Pretraining Reinforcement Learning
Language Reward Modulation for Pretraining Reinforcement Learning
Ademi Adeniji
Amber Xie
Carmelo Sferrazza
Younggyo Seo
Stephen James
Pieter Abbeel
39
26
0
23 Aug 2023
123456
Next