Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1909.08593
Cited By
v1
v2 (latest)
Fine-Tuning Language Models from Human Preferences
18 September 2019
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Fine-Tuning Language Models from Human Preferences"
50 / 1,265 papers shown
Title
Adaptive Natural Language Generation for Task-oriented Dialogue via Reinforcement Learning
Atsumoto Ohashi
Ryuichiro Higashinaka
OffRL
81
6
0
16 Sep 2022
Calculus on MDPs: Potential Shaping as a Gradient
Erik Jenner
H. V. Hoof
Adam Gleave
76
4
0
20 Aug 2022
Composable Text Controls in Latent Space with ODEs
Guangyi Liu
Zeyu Feng
Yuan Gao
Zichao Yang
Xiaodan Liang
Junwei Bao
Xiaodong He
Shuguang Cui
Zhen Li
Zhiting Hu
AI4CE
DiffM
99
33
0
01 Aug 2022
Knowledge-Grounded Conversational Data Augmentation with Generative Conversational Networks
Yen-Ting Lin
Alexandros Papangelis
Seokhwan Kim
Dilek Z. Hakkani-Tür
41
8
0
22 Jul 2022
Zero-Shot Video Captioning with Evolving Pseudo-Tokens
Yoad Tewel
Yoav Shalev
Roy Nadler
Idan Schwartz
Lior Wolf
63
27
0
22 Jul 2022
MAD for Robust Reinforcement Learning in Machine Translation
Domenic Donato
Lei Yu
Wang Ling
Chris Dyer
MoE
54
7
0
18 Jul 2022
Heuristic-free Optimization of Force-Controlled Robot Search Strategies in Stochastic Environments
Bastian Alt
Darko Katic
Rainer Jäkel
Michael Beetz
63
6
0
15 Jul 2022
Know your audience: specializing grounded language models with listener subtraction
Aaditya K. Singh
David Ding
Andrew M. Saxe
Felix Hill
Andrew Kyle Lampinen
65
2
0
16 Jun 2022
Why is constrained neural language generation particularly challenging?
Cristina Garbacea
Qiaozhu Mei
137
15
0
11 Jun 2022
Learning to Generate Prompts for Dialogue Generation through Reinforcement Learning
Hsuan Su
Po-Han Chi
Shih-Cheng Huang
Chung Ho Lam
Saurav Sahay
Shang-Tse Chen
Hung-yi Lee
OffRL
36
1
0
08 Jun 2022
Offline RL for Natural Language Generation with Implicit Language Q Learning
Charles Burton Snell
Ilya Kostrikov
Yi Su
Mengjiao Yang
Sergey Levine
OffRL
221
115
0
05 Jun 2022
Models of human preference for learning reward functions
W. B. Knox
Stephane Hatgis-Kessell
Serena Booth
S. Niekum
Peter Stone
A. Allievi
128
50
0
05 Jun 2022
On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting
Tomasz Korbak
Hady ElSahar
Germán Kruszewski
Marc Dymetman
CLL
105
57
0
01 Jun 2022
A Mixture-of-Expert Approach to RL-based Dialogue Management
Yinlam Chow
Azamat Tulepbergenov
Ofir Nachum
Moonkyung Ryu
Mohammad Ghavamzadeh
Craig Boutilier
MoE
70
16
0
31 May 2022
Quark: Controllable Text Generation with Reinforced Unlearning
Ximing Lu
Sean Welleck
Jack Hessel
Liwei Jiang
Lianhui Qin
Peter West
Prithviraj Ammanabrolu
Yejin Choi
MU
176
220
0
26 May 2022
Gradient-Based Constrained Sampling from Language Models
Sachin Kumar
Biswajit Paria
Yulia Tsvetkov
BDL
99
57
0
25 May 2022
RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning
Mingkai Deng
Jianyu Wang
Cheng-Ping Hsieh
Yihan Wang
Han Guo
Tianmin Shu
Meng Song
Eric Xing
Zhiting Hu
97
345
0
25 May 2022
RL with KL penalties is better viewed as Bayesian inference
Tomasz Korbak
Ethan Perez
Christopher L. Buckley
OffRL
96
77
0
23 May 2022
Mitigating Toxic Degeneration with Empathetic Data: Exploring the Relationship Between Toxicity and Empathy
Allison Lahnala
Charles F Welch
Béla Neuendorf
Lucie Flek
101
13
0
15 May 2022
Efficient and Training-Free Control of Language Generation
Shangda Wu
Maosong Sun
52
2
0
12 May 2022
Adversarial Training for High-Stakes Reliability
Daniel M. Ziegler
Seraphina Nix
Lawrence Chan
Tim Bauman
Peter Schmidt-Nielsen
...
Noa Nabeshima
Benjamin Weinstein-Raun
D. Haas
Buck Shlegeris
Nate Thomas
AAML
137
61
0
03 May 2022
Training Language Models with Language Feedback
Jérémy Scheurer
Jon Ander Campos
Jun Shern Chan
Angelica Chen
Kyunghyun Cho
Ethan Perez
ALM
122
51
0
29 Apr 2022
Tailor: A Prompt-Based Approach to Attribute-Based Controlled Text Generation
Kexin Yang
Dayiheng Liu
Wenqiang Lei
Baosong Yang
Mingfeng Xue
Boxing Chen
Jun Xie
79
29
0
28 Apr 2022
Spurious Correlations in Reference-Free Evaluation of Text Generation
Esin Durmus
Faisal Ladhak
Tatsunori Hashimoto
62
32
0
21 Apr 2022
Can Question Rewriting Help Conversational Question Answering?
Etsuko Ishii
Yan Xu
Samuel Cahyawijaya
Bryan Wilie
100
9
0
13 Apr 2022
Make The Most of Prior Data: A Solution for Interactive Text Summarization with Preference Feedback
Duy-Hung Nguyen
Nguyen-Viet-Dung Nghiem
Bao-Sinh Nguyen
Dung Tien Le
Shahab Sabahi
Minh Le Nguyen
Hung Le
69
13
0
12 Apr 2022
The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems
Caleb Ziems
Jane A. Yu
Yi-Chia Wang
A. Halevy
Diyi Yang
87
97
0
06 Apr 2022
Preprocessing Reward Functions for Interpretability
Erik Jenner
Adam Gleave
143
8
0
25 Mar 2022
Mix and Match: Learning-free Controllable Text Generation using Energy Language Models
Fatemehsadat Mireshghallah
Kartik Goyal
Taylor Berg-Kirkpatrick
71
80
0
24 Mar 2022
Teaching language models to support answers with verified quotes
Jacob Menick
Maja Trebacz
Vladimir Mikulik
John Aslanides
Francis Song
...
Mia Glaese
Susannah Young
Lucy Campbell-Gillingham
G. Irving
Nat McAleese
ELM
RALM
316
267
0
21 Mar 2022
GRS: Combining Generation and Revision in Unsupervised Sentence Simplification
Mohammad Dehghan
Dhruv Kumar
Lukasz Golab
68
13
0
18 Mar 2022
Uncertainty Estimation for Language Reward Models
Adam Gleave
G. Irving
UQLM
84
34
0
14 Mar 2022
Compilable Neural Code Generation with Compiler Feedback
Xin Wang
Yasheng Wang
Yao Wan
Fei Mi
Yitong Li
Pingyi Zhou
Jin Liu
Hao Wu
Xin Jiang
Qun Liu
78
69
0
10 Mar 2022
Towards Robust Online Dialogue Response Generation
Leyang Cui
Fandong Meng
Yanjun Liu
Jie Zhou
Yue Zhang
38
1
0
07 Mar 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
930
13,272
0
04 Mar 2022
Capturing Failures of Large Language Models via Human Cognitive Biases
Erik Jones
Jacob Steinhardt
76
93
0
24 Feb 2022
Reward Modeling for Mitigating Toxicity in Transformer-based Language Models
Farshid Faal
K. Schmitt
Jia Yuan Yu
83
25
0
19 Feb 2022
XFBoost: Improving Text Generation with Controllable Decoders
Xiangyu Peng
Michael Sollami
70
1
0
16 Feb 2022
Generating Training Data with Language Models: Towards Zero-Shot Language Understanding
Yu Meng
Jiaxin Huang
Yu Zhang
Jiawei Han
SyDa
79
235
0
09 Feb 2022
Red Teaming Language Models with Language Models
Ethan Perez
Saffron Huang
Francis Song
Trevor Cai
Roman Ring
John Aslanides
Amelia Glaese
Nat McAleese
G. Irving
AAML
217
672
0
07 Feb 2022
A Survey of Controllable Text Generation using Transformer-based Pre-trained Language Models
Hanqing Zhang
Haolin Song
Shaoyu Li
Ming Zhou
Dawei Song
141
230
0
14 Jan 2022
Reframing Human-AI Collaboration for Generating Free-Text Explanations
Sarah Wiegreffe
Jack Hessel
Swabha Swayamdipta
Mark O. Riedl
Yejin Choi
77
149
0
16 Dec 2021
Goal-Directed Story Generation: Augmenting Generative Language Models with Reinforcement Learning
Amal Alabdulkarim
W. Li
Lara J. Martin
Mark O. Riedl
81
23
0
16 Dec 2021
Controlled Cue Generation for Play Scripts
Alara Dirik
Hilal Donmez
Pinar Yanardag
34
3
0
13 Dec 2021
Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Lavinia Dunagan
Jacob Morrison
Alexander R. Fabbri
Yejin Choi
Noah A. Smith
97
40
0
08 Dec 2021
Episodic Policy Gradient Training
Hung Le
Majid Abdolshah
Thommen George Karimpanal
Kien Do
D. Nguyen
Svetha Venkatesh
BDL
OffRL
68
6
0
03 Dec 2021
Controlling Conditional Language Models without Catastrophic Forgetting
Tomasz Korbak
Hady ElSahar
Germán Kruszewski
Marc Dymetman
CLL
AI4CE
115
35
0
01 Dec 2021
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Yoad Tewel
Yoav Shalev
Idan Schwartz
Lior Wolf
VLM
122
197
0
29 Nov 2021
Robust Deep Reinforcement Learning for Extractive Legal Summarization
Duy-Hung Nguyen
Bao-Sinh Nguyen
Nguyen-Viet-Dung Nghiem
Dung Tien Le
Mim Amina Khatun
Minh Le Nguyen
Hung Le
ELM
AILaw
AI4TS
122
18
0
13 Nov 2021
Training Conversational Agents with Generative Conversational Networks
Yen-Ting Lin
Alexandros Papangelis
Seokhwan Kim
Dilek Z. Hakkani-Tür
67
0
0
15 Oct 2021
Previous
1
2
3
...
23
24
25
26
Next