ResearchTrend.AI

Diversity is All You Need: Learning Skills without a Reward Function

16 February 2018
Benjamin Eysenbach
Abhishek Gupta
Julian Ibarz
Sergey Levine

Papers citing "Diversity is All You Need: Learning Skills without a Reward Function"

50 / 414 papers shown
Generalizable Agent Modeling for Agent Collaboration-Competition Adaptation with Multi-Retrieval and Dynamic Generation
Chenxu Wang
Yonggang Jin
Cheng Hu
Youpeng Zhao
Zipeng Dai
Jian Zhao
Shiyu Huang
Liuyu Xiang
Junge Zhang
Zhaofeng He
19
0
0
20 Jun 2025
Task Adaptation from Skills: Information Geometry, Disentanglement, and New Objectives for Unsupervised Reinforcement Learning
Yucheng Yang
Tianyi Zhou
Qiang He
Lei Han
Mykola Pechenizkiy
Meng Fang
SSL
101
7
0
12 Jun 2025
Intention-Conditioned Flow Occupancy Models
Chongyi Zheng
S. Park
Sergey Levine
Benjamin Eysenbach
AI4TS, OffRL, AI4CE
46
0
0
10 Jun 2025
AMPED: Adaptive Multi-objective Projection for balancing Exploration and skill Diversification
Geonwoo Cho
Jaemoon Lee
Jaegyun Im
Subi Lee
Jihwan Lee
Sundong Kim
38
0
0
06 Jun 2025
Deep learning image burst stacking to reconstruct high-resolution ground-based solar observations
Christoph Schirninger
Robert Jarolim
Astrid M. Veronig
Christoph Kuckein
97
1
0
05 Jun 2025
SLAC: Simulation-Pretrained Latent Action Space for Whole-Body Real-World RL
Jiaheng Hu
Peter Stone
Roberto Martín-Martín
116
0
0
04 Jun 2025
Diversity-Aware Policy Optimization for Large Language Model Reasoning
Jian Yao
Ran Cheng
Xingyu Wu
Jibin Wu
Kay Chen Tan
LRM
99
0
0
29 May 2025
Maximizing Confidence Alone Improves Reasoning
Mihir Prabhudesai
Lili Chen
Alex Ippoliti
Katerina Fragkiadaki
Hao Liu
Deepak Pathak
OOD, OffRL, ReLM, LRM
130
3
0
28 May 2025
Training RL Agents for Multi-Objective Network Defense Tasks
Andres Molina-Markham
Luis Robaina
Sean Steinle
Akash Trivedi
Derek Tsui
Nicholas Potteiger
Lauren Brandt
Ransom K. Winder
Ahmed Ridley
34
0
0
28 May 2025
ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World
Runliang Niu
Jinglong Ji
Yi Chang
Qi Wang
64
0
0
25 May 2025
Rethinking Agent Design: From Top-Down Workflows to Bottom-Up Skill Evolution
Jiawei Du
Jinlong Wu
Yuzheng Chen
Yucheng Hu
Bing Li
Joey Tianyi Zhou
253
0
0
23 May 2025
InfoPO: On Mutual Information Maximization for Large Language Model Alignment
Teng Xiao
Zhen Ge
Sujay Sanghavi
Tian Wang
Julian Katz-Samuels
Marc Versage
Qingjun Cui
Trishul Chilimbi
203
1
0
13 May 2025
Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story
Vincenzo De Paola
Riccardo Zamboni
Mirco Mutti
Marcello Restelli
118
0
0
02 May 2025
Improving Human-AI Coordination through Online Adversarial Training and Generative Models
Paresh Chaudhary
Yancheng Liang
Daphne Chen
S. Du
Natasha Jaques
158
1
0
21 Apr 2025
Reinforcement Learning from Multi-level and Episodic Human Feedback
Muhammad Qasim Elahi
Somtochukwu Oguchienti
Maheed H. Ahmed
Mahsa Ghasemi
OffRL
92
0
0
20 Apr 2025
Evolutionary Policy Optimization
Jianren Wang
Yifan Su
Abhinav Gupta
Deepak Pathak
84
0
0
24 Mar 2025
Latent Embedding Adaptation for Human Preference Alignment in Diffusion Planners
Wen Zheng Terence Ng
Jianda Chen
Yuan Xu
Tianwei Zhang
113
0
0
24 Mar 2025
Causally Aligned Curriculum Learning
Mingxuan Li
Junzhe Zhang
Elias Bareinboim
CML
102
4
0
21 Mar 2025
Behaviour Discovery and Attribution for Explainable Reinforcement Learning
Rishav Rishav
Somjit Nath
Vincent Michalski
Samira Ebrahimi Kahou
FAtt, OffRL
168
1
0
19 Mar 2025
Training a Generally Curious Agent
Fahim Tajwar
Yiding Jiang
Abitha Thankaraj
Sumaita Sadia Rahman
J. Zico Kolter
Jeff Schneider
Ruslan Salakhutdinov
237
3
0
24 Feb 2025
Towards Empowerment Gain through Causal Structure Learning in Model-Based RL
Hongye Cao
Fan Feng
Meng Fang
Shaokang Dong
Tianpei Yang
Jing Huo
Yang Gao
124
1
0
14 Feb 2025
Skill Expansion and Composition in Parameter Space
Tenglong Liu
Junjie Li
Yinan Zheng
Haoyi Niu
Yixing Lan
Xin Xu
Xianyuan Zhan
129
4
0
09 Feb 2025
MuST: Multi-Head Skill Transformer for Long-Horizon Dexterous Manipulation with Skill Progress
Kai Gao
Fan Wang
Erica Aduh
Dylan Randle
Jane Shi
152
0
0
04 Feb 2025
Measuring Diversity of Game Scenarios
Yuchen Li
Ziqi Wang
Qingquan Zhang
Jialin Liu
Qingbin Liu
113
3
0
17 Jan 2025
Learning to Assist Humans without Inferring Rewards
Vivek Myers
Evan Ellis
Sergey Levine
Benjamin Eysenbach
Anca Dragan
138
5
0
17 Jan 2025
Dual-Force: Enhanced Offline Diversity Maximization under Imitation Constraints
Pavel Kolev
Marin Vlastelica
Georg Martius
OffRL
78
0
0
08 Jan 2025
Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution
Ke Xue
Yutong Wang
Cong Guan
Lei Yuan
Haobo Fu
Qiang Fu
Chao Qian
Yang Yu
159
18
0
03 Jan 2025
Subgoal Discovery Using a Free Energy Paradigm and State Aggregations
Amirhossein Mesbah
Reshad Hosseini
Seyed Pooya Shariatpanahi
M. N. Ahmadabadi
249
0
0
21 Dec 2024
Grounding Video Models to Actions through Goal Conditioned Exploration
Yunhao Luo
Yilun Du
LM&Ro, VGen
145
5
0
11 Nov 2024
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
Max Wilcoxson
Qiyang Li
Kevin Frans
Sergey Levine
SSL, OffRL, OnRL
189
0
0
23 Oct 2024
Latent-Predictive Empowerment: Measuring Empowerment without a Simulator
Andrew Levy
A. Allievi
George Konidaris
105
0
0
15 Oct 2024
ELO-Rated Sequence Rewards: Advancing Reinforcement Learning Models
Qi Ju
Falin Hei
Zhemei Fang
Yunfeng Luo
93
1
0
05 Sep 2024
Directed Exploration in Reinforcement Learning from Linear Temporal Logic
Marco Bagatella
Andreas Krause
Georg Martius
OffRL
74
1
0
18 Aug 2024
Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning
Haozhe Ma
Zhengding Luo
Thanh Vinh Vo
Kuankuan Sima
Tze-Yun Leong
126
9
0
06 Aug 2024
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
262
3
0
18 Jul 2024
Constrained Intrinsic Motivation for Reinforcement Learning
Xiang Zheng
Jie Zhang
Chao Shen
Cong Wang
86
3
0
12 Jul 2024
Language Guided Skill Discovery
Seungeun Rho
Laura Smith
Tianyu Li
Sergey Levine
Xue Bin Peng
Sehoon Ha
LM&Ro
74
6
0
07 Jun 2024
Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language Models
Phat Nguyen
Tsun-Hsuan Wang
Zhang-Wei Hong
S. Karaman
Daniela Rus
LM&Ro
93
7
0
06 Jun 2024
Do's and Don'ts: Learning Desirable Skills with Instruction Videos
Hyunseung Kim
ByungKun Lee
Hojoon Lee
Dongyoon Hwang
Donghu Kim
Jaegul Choo
145
1
0
01 Jun 2024
Variational Offline Multi-agent Skill Discovery
Jiayu Chen
Bhargav Ganguly
Tian-Shing Lan
OffRL
126
3
0
26 May 2024
Hierarchical Decision Making Based on Structural Information Principles
Xianghua Zeng
Hao Peng
Dingli Su
Angsheng Li
75
0
0
15 Apr 2024
Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings
Kevin Frans
Seohong Park
Pieter Abbeel
Sergey Levine
OffRL
111
13
0
27 Feb 2024
Potential-Based Reward Shaping For Intrinsic Motivation
Grant C. Forbes
Nitish Gupta
Leonardo Villalobos-Arias
Colin M. Potts
Arnav Jhala
David L. Roberts
18
5
0
12 Feb 2024
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning
Ruoqing Zhang
Ziwei Luo
Jens Sjölund
Thomas B. Schön
Per Mattsson
104
13
0
06 Feb 2024
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems
Changshuo Zhang
Sirui Chen
Xiao Zhang
Sunhao Dai
Weijie Yu
Jun Xu
OffRL
103
1
0
17 Jan 2024
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
Guojian Wang
Faguo Wu
Xiao Zhang
Ning Guo
Zhiming Zheng
67
3
0
27 Dec 2023
CLIP-Motion: Learning Reward Functions for Robotic Actions Using Consecutive Observations
Xuzhe Dang
Stefan Edelkamp
173
4
0
06 Nov 2023
Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills
Seongun Kim
Kyowoon Lee
Jaesik Choi
SSL, DRL
84
10
0
30 Oct 2023
Improving Intrinsic Exploration by Creating Stationary Objectives
Roger Creus Castanyer
Javier Civera
Taihú Pire
OffRL
103
4
0
27 Oct 2023
Iteratively Learn Diverse Strategies with State Distance Information
Wei Fu
Weihua Du
Jingwei Li
Sunli Chen
Jingzhao Zhang
Yi Wu
90
4
0
23 Oct 2023