Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.10023
Cited By
Deep Bayesian Active Learning for Preference Modeling in Large Language Models
14 June 2024
Luckeciano C. Melo
P. Tigas
Alessandro Abate
Yarin Gal
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Bayesian Active Learning for Preference Modeling in Large Language Models"
9 / 9 papers shown
Title
Uncertainty-Aware Step-wise Verification with Generative Reward Models
Zihuiwen Ye
Luckeciano C. Melo
Younesse Kaddar
Phil Blunsom
Shivalika Singh
Yarin Gal
LRM
49
1
0
16 Feb 2025
Temporal-Difference Variational Continual Learning
Luckeciano C. Melo
Alessandro Abate
Yarin Gal
BDL
CLL
VLM
46
0
0
10 Oct 2024
Preference Elicitation for Offline Reinforcement Learning
Alizée Pace
Bernhard Schölkopf
Gunnar Rätsch
Giorgia Ramponi
OffRL
69
1
0
26 Jun 2024
Advancing Deep Active Learning & Data Subset Selection: Unifying Principles with Information-Theory Intuitions
Andreas Kirsch
UQCV
53
6
0
09 Jan 2024
Sample Efficient Preference Alignment in LLMs via Active Exploration
Viraj Mehta
Vikramjeet Das
Ojash Neopane
Yijia Dai
Ilija Bogunovic
Ilija Bogunovic
W. Neiswanger
Stefano Ermon
Jeff Schneider
Willie Neiswanger
OffRL
33
12
0
01 Dec 2023
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
333
11,953
0
04 Mar 2022
Bayesian Active Learning for Sim-to-Real Robotic Perception
Jianxiang Feng
Jongseok Lee
M. Durner
Rudolph Triebel
52
13
0
23 Sep 2021
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
292
1,595
0
18 Sep 2019
Teaching Machines to Read and Comprehend
Karl Moritz Hermann
Tomás Kociský
Edward Grefenstette
L. Espeholt
W. Kay
Mustafa Suleyman
Phil Blunsom
184
3,510
0
10 Jun 2015
1