Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2504.12501
Cited By
v1
v2 (latest)
Reinforcement Learning from Human Feedback
16 April 2025
Nathan Lambert
OffRL
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Reinforcement Learning from Human Feedback"
9 / 9 papers shown
Title
Dataset Cartography for Large Language Model Alignment: Mapping and Diagnosing Preference Data
Seohyeong Lee
Eunwon Kim
Hwaran Lee
Buru Chang
45
0
0
29 May 2025
Text2Grad: Reinforcement Learning from Natural Language Feedback
Hanyang Wang
Lu Wang
Chaoyun Zhang
Tianjun Mao
Si Qin
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
51
0
0
28 May 2025
Multi-Armed Bandits Meet Large Language Models
Djallel Bouneffouf
Raphael Feraud
113
0
0
19 May 2025
Playpen: An Environment for Exploring Learning Through Conversational Interaction
Nicola Horst
Davide Mazzaccara
Antonia Schmidt
Michael Sullivan
Filippo Momentè
...
Alexander Koller
Oliver Lemon
David Schlangen
Mario Giulianelli
Alessandro Suglia
OffRL
112
0
0
11 Apr 2025
Mitigating Open-Vocabulary Caption Hallucinations
Assaf Ben-Kish
Moran Yanuka
Morris Alper
Raja Giryes
Hadar Averbuch-Elor
MLLM
VLM
77
6
0
06 Dec 2023
Can LLM-Generated Misinformation Be Detected?
Canyu Chen
Kai Shu
DeLMO
108
181
0
25 Sep 2023
Are You Worthy of My Trust?: A Socioethical Perspective on the Impacts of Trustworthy AI Systems on the Environment and Human Society
Jamell Dacon
SILM
70
1
0
18 Sep 2023
Enhancing Chat Language Models by Scaling High-quality Instructional Conversations
Ning Ding
Yulin Chen
Bokai Xu
Yujia Qin
Zhi Zheng
Shengding Hu
Zhiyuan Liu
Maosong Sun
Bowen Zhou
ALM
148
540
0
23 May 2023
Large Language Model Instruction Following: A Survey of Progresses and Challenges
Renze Lou
Kai Zhang
Wenpeng Yin
ALM
LRM
85
25
0
18 Mar 2023
1