ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.06648
  4. Cited By
Diversity from Human Feedback

Diversity from Human Feedback

10 October 2023
Ren-Jian Wang
Ke Xue
Yutong Wang
Peng Yang
Haobo Fu
Qiang Fu
Chao Qian
ArXivPDFHTML

Papers citing "Diversity from Human Feedback"

6 / 6 papers shown
Title
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning
Calarina Muslimani
M. E. Taylor
OffRL
38
2
0
30 Apr 2024
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
313
11,915
0
04 Mar 2022
Replay-Guided Adversarial Environment Design
Replay-Guided Adversarial Environment Design
Minqi Jiang
Michael Dennis
Jack Parker-Holder
Jakob N. Foerster
Edward Grefenstette
Tim Rocktaschel
127
95
0
06 Oct 2021
Uncertainty-Based Offline Reinforcement Learning with Diversified
  Q-Ensemble
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
Gaon An
Seungyong Moon
Jang-Hyun Kim
Hyun Oh Song
OffRL
99
262
0
04 Oct 2021
Quality-Diversity Optimization: a novel branch of stochastic
  optimization
Quality-Diversity Optimization: a novel branch of stochastic optimization
Konstantinos Chatzilygeroudis
Antoine Cully
Vassilis Vassiliades
Jean-Baptiste Mouret
65
91
0
08 Dec 2020
Interactive Constrained MAP-Elites: Analysis and Evaluation of the
  Expressiveness of the Feature Dimensions
Interactive Constrained MAP-Elites: Analysis and Evaluation of the Expressiveness of the Feature Dimensions
Alberto Alvarez
S. Dahlskog
J. Font
Julian Togelius
59
31
0
06 Mar 2020
1