Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2009.01325
Cited By
Learning to summarize from human feedback
2 September 2020
Nisan Stiennon
Long Ouyang
Jeff Wu
Daniel M. Ziegler
Ryan J. Lowe
Chelsea Voss
Alec Radford
Dario Amodei
Paul Christiano
ALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning to summarize from human feedback"
41 / 1,441 papers shown
Title
Calibrate your listeners! Robust communication-based training for pragmatic speakers
Rose E. Wang
Julia White
Jesse Mu
Noah D. Goodman
28
7
0
11 Oct 2021
An Empirical Investigation of Learning from Biased Toxicity Labels
Neel Nanda
J. Uesato
Sven Gowal
20
0
0
04 Oct 2021
Recursively Summarizing Books with Human Feedback
Jeff Wu
Long Ouyang
Daniel M. Ziegler
Nissan Stiennon
Ryan J. Lowe
Jan Leike
Paul Christiano
ALM
35
295
0
22 Sep 2021
Learning Natural Language Generation from Scratch
Alice Martin Donati
Guillaume Quispe
Charles Ollion
Sylvain Le Corff
Florian Strub
Olivier Pietquin
LRM
26
4
0
20 Sep 2021
Generating Self-Contained and Summary-Centric Question Answer Pairs via Differentiable Reward Imitation Learning
Li Zhou
Kevin Small
Yong Zhang
Sandeep Atluri
40
2
0
10 Sep 2021
TruthfulQA: Measuring How Models Mimic Human Falsehoods
Stephanie C. Lin
Jacob Hilton
Owain Evans
HILM
57
1,742
0
08 Sep 2021
Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior
Noriyuki Kojima
Alane Suhr
Yoav Artzi
25
24
0
10 Aug 2021
High Quality Related Search Query Suggestions using Deep Reinforcement Learning
Praveen Kumar Bodigutla
AI4TS
18
2
0
10 Aug 2021
A Survey of Human-in-the-loop for Machine Learning
Xingjiao Wu
Luwei Xiao
Yixuan Sun
Junhang Zhang
Tianlong Ma
Liangbo He
SyDa
46
505
0
02 Aug 2021
Pragmatic Image Compression for Human-in-the-Loop Decision-Making
S. Reddy
Anca Dragan
Sergey Levine
OffRL
44
13
0
07 Jul 2021
Evaluating Large Language Models Trained on Code
Mark Chen
Jerry Tworek
Heewoo Jun
Qiming Yuan
Henrique Pondé
...
Bob McGrew
Dario Amodei
Sam McCandlish
Ilya Sutskever
Wojciech Zaremba
ELM
ALM
83
5,082
0
07 Jul 2021
Is Automated Topic Model Evaluation Broken?: The Incoherence of Coherence
Alexander Miserlis Hoyle
Pranav Goel
Denis Peskov
Andrew Hian-Cheong
Jordan L. Boyd-Graber
Philip Resnik
41
128
0
05 Jul 2021
The MineRL BASALT Competition on Learning from Human Feedback
Rohin Shah
Cody Wild
Steven H. Wang
Neel Alex
Brandon Houghton
...
Stephanie Milani
Nicholay Topin
Pieter Abbeel
Stuart J. Russell
Anca Dragan
35
31
0
05 Jul 2021
Cogment: Open Source Framework For Distributed Multi-actor Training, Deployment & Operations
AI Redefined
S. Gottipati
Sagar Kurandwad
Clodéric Mars
Gregory Szriftgiser
Franccois Chabot
29
8
0
21 Jun 2021
Diversity driven Query Rewriting in Search Advertising
Akash Kumar Mohankumar
Nikit Begwani
Amit Singh
20
24
0
07 Jun 2021
Grounding 'Grounding' in NLP
Khyathi Raghavi Chandu
Yonatan Bisk
A. Black
30
51
0
04 Jun 2021
Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution
Jiacheng Xu
Greg Durrett
38
16
0
03 Jun 2021
Uni-Encoder: A Fast and Accurate Response Selection Paradigm for Generation-Based Dialogue Systems
Chiyu Song
Hongliang He
Haofei Yu
Pengfei Fang
Leyang Cui
Zhenzhong Lan
24
6
0
02 Jun 2021
Hone as You Read: A Practical Type of Interactive Summarization
Tanner A. Bohn
Charles X. Ling
14
9
0
06 May 2021
Reliability Testing for Natural Language Processing Systems
Samson Tan
Chenyu You
K. Baxter
Araz Taeihagh
G. Bennett
Min-Yen Kan
15
38
0
06 May 2021
Multitasking Inhibits Semantic Drift
Athul Paul Jacob
M. Lewis
Jacob Andreas
16
12
0
15 Apr 2021
Learning What To Do by Simulating the Past
David Lindner
Rohin Shah
Pieter Abbeel
Anca Dragan
19
4
0
08 Apr 2021
Dynabench: Rethinking Benchmarking in NLP
Douwe Kiela
Max Bartolo
Yixin Nie
Divyansh Kaushik
Atticus Geiger
...
Pontus Stenetorp
Robin Jia
Joey Tianyi Zhou
Christopher Potts
Adina Williams
24
390
0
07 Apr 2021
Creativity and Machine Learning: A Survey
Giorgio Franceschelli
Mirco Musolesi
VLM
AI4CE
34
40
0
06 Apr 2021
Alignment of Language Agents
Zachary Kenton
Tom Everitt
Laura Weidinger
Iason Gabriel
Vladimir Mikulik
G. Irving
30
157
0
26 Mar 2021
Constrained Text Generation with Global Guidance -- Case Study on CommonGen
Yixian Liu
Liwen Zhang
Wenjuan Han
Yue Zhang
Kewei Tu
36
9
0
12 Mar 2021
Putting Humans in the Natural Language Processing Loop: A Survey
Zijie J. Wang
Dongjin Choi
Shenyu Xu
Diyi Yang
LM&MA
12
72
0
06 Mar 2021
Symbolic Behaviour in Artificial Intelligence
Adam Santoro
Andrew Kyle Lampinen
Kory W. Mathewson
Timothy Lillicrap
David Raposo
19
34
0
05 Feb 2021
Scaling Laws for Transfer
Danny Hernandez
Jared Kaplan
T. Henighan
Sam McCandlish
29
238
0
02 Feb 2021
Evaluating the Robustness of Collaborative Agents
P. Knott
Micah Carroll
Sam Devlin
K. Ciosek
Katja Hofmann
Anca Dragan
Rohin Shah
14
34
0
14 Jan 2021
Exploring Fluent Query Reformulations with Text-to-Text Transformers and Reinforcement Learning
Jerry Zikun Chen
S. Yu
Haoran Wang
184
5
0
18 Dec 2020
Open Problems in Cooperative AI
Allan Dafoe
Edward Hughes
Yoram Bachrach
Tantum Collins
Kevin R. McKee
Joel Z Leibo
Kate Larson
T. Graepel
42
199
0
15 Dec 2020
Offline Reinforcement Learning from Human Feedback in Real-World Sequence-to-Sequence Tasks
Julia Kreutzer
Stefan Riezler
Carolin (Haas) Lawrence
RALM
OffRL
8
15
0
04 Nov 2020
Constrained Abstractive Summarization: Preserving Factual Consistency with Constrained Generation
Yuning Mao
Xiang Ren
Heng Ji
Jiawei Han
HILM
125
38
0
24 Oct 2020
What Have We Achieved on Text Summarization?
Dandan Huang
Leyang Cui
Sen Yang
Guangsheng Bao
Kun Wang
Jun Xie
Yue Zhang
34
109
0
09 Oct 2020
Current Limitations of Language Models: What You Need is Retrieval
Aran Komatsuzaki
LRM
14
3
0
15 Sep 2020
A Survey of Evaluation Metrics Used for NLG Systems
Ananya B. Sai
Akash Kumar Mohankumar
Mitesh M. Khapra
ELM
33
230
0
27 Aug 2020
SummEval: Re-evaluating Summarization Evaluation
Alexander R. Fabbri
Wojciech Kry'sciñski
Bryan McCann
Caiming Xiong
R. Socher
Dragomir R. Radev
HILM
38
690
0
24 Jul 2020
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
301
1,610
0
18 Sep 2019
Improving a Neural Semantic Parser by Counterfactual Learning from Human Bandit Feedback
Carolin (Haas) Lawrence
Stefan Riezler
OffRL
173
56
0
03 May 2018
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,748
0
26 Sep 2016
Previous
1
2
3
...
27
28
29