Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.06648
Cited By
Diversity from Human Feedback
10 October 2023
Ren-Jian Wang
Ke Xue
Yutong Wang
Peng Yang
Haobo Fu
Qiang Fu
Chao Qian
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Diversity from Human Feedback"
6 / 6 papers shown
Title
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani
M. E. Taylor
OffRL
38
2
0
30 Apr 2024
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
313
11,915
0
04 Mar 2022
Replay-Guided Adversarial Environment Design
Minqi Jiang
Michael Dennis
Jack Parker-Holder
Jakob N. Foerster
Edward Grefenstette
Tim Rocktaschel
127
95
0
06 Oct 2021
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
Gaon An
Seungyong Moon
Jang-Hyun Kim
Hyun Oh Song
OffRL
99
262
0
04 Oct 2021
Quality-Diversity Optimization: a novel branch of stochastic optimization
Konstantinos Chatzilygeroudis
Antoine Cully
Vassilis Vassiliades
Jean-Baptiste Mouret
65
91
0
08 Dec 2020
Interactive Constrained MAP-Elites: Analysis and Evaluation of the Expressiveness of the Feature Dimensions
Alberto Alvarez
S. Dahlskog
J. Font
Julian Togelius
59
31
0
06 Mar 2020
1