ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1908.04319
  4. Cited By
Neural Text Generation with Unlikelihood Training

Neural Text Generation with Unlikelihood Training

12 August 2019
Sean Welleck
Ilia Kulikov
Stephen Roller
Emily Dinan
Kyunghyun Cho
Jason Weston
    MU
ArXivPDFHTML

Papers citing "Neural Text Generation with Unlikelihood Training"

35 / 35 papers shown
Title
Teaching Large Language Models to Reason through Learning and Forgetting
Teaching Large Language Models to Reason through Learning and Forgetting
Tianwei Ni
Allen Nie
Sapana Chaudhary
Yao Liu
Huzefa Rangwala
Rasool Fakoor
ReLM
CLL
LRM
387
0
0
15 Apr 2025
Understanding the Logic of Direct Preference Alignment through Logic
Understanding the Logic of Direct Preference Alignment through Logic
Kyle Richardson
Vivek Srikumar
Ashish Sabharwal
151
2
0
23 Dec 2024
Self-calibration for Language Model Quantization and Pruning
Self-calibration for Language Model Quantization and Pruning
Miles Williams
G. Chrysostomou
Nikolaos Aletras
MQ
372
0
0
22 Oct 2024
ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverage
ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverage
Taewhoo Lee
Chanwoong Yoon
Kyochul Jang
Donghyeon Lee
Minju Song
Hyunjae Kim
Jaewoo Kang
ELM
66
1
0
22 Oct 2024
Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both
Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both
Abhijnan Nath
Changsoo Jung
Ethan Seefried
Nikhil Krishnaswamy
390
2
0
11 Oct 2024
W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering
W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering
Jinming Nian
Zhiyuan Peng
Qifan Wang
Yi Fang
RALM
115
2
0
15 Aug 2024
Watermark Smoothing Attacks against Language Models
Watermark Smoothing Attacks against Language Models
Hongyan Chang
Hamed Hassani
Reza Shokri
WaLM
93
3
0
19 Jul 2024
ACCORD: Closing the Commonsense Measurability Gap
ACCORD: Closing the Commonsense Measurability Gap
François Roewer-Després
Jinyue Feng
Zining Zhu
Frank Rudzicz
LRM
86
0
0
04 Jun 2024
How Decoding Strategies Affect the Verifiability of Generated Text
How Decoding Strategies Affect the Verifiability of Generated Text
Luca Massarelli
Fabio Petroni
Aleksandra Piktus
Myle Ott
Tim Rocktaschel
Vassilis Plachouras
Fabrizio Silvestri
Sebastian Riedel
83
50
0
09 Nov 2019
ACUTE-EVAL: Improved Dialogue Evaluation with Optimized Questions and
  Multi-turn Comparisons
ACUTE-EVAL: Improved Dialogue Evaluation with Optimized Questions and Multi-turn Comparisons
Margaret Li
Jason Weston
Stephen Roller
56
176
0
06 Sep 2019
The Curious Case of Neural Text Degeneration
The Curious Case of Neural Text Degeneration
Ari Holtzman
Jan Buys
Li Du
Maxwell Forbes
Yejin Choi
170
3,160
0
22 Apr 2019
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
VLM
FaML
95
3,147
0
01 Apr 2019
Negative Training for Neural Dialogue Response Generation
Negative Training for Neural Dialogue Response Generation
Tianxing He
James R. Glass
52
59
0
06 Mar 2019
What makes a good conversation? How controllable attributes affect human
  judgments
What makes a good conversation? How controllable attributes affect human judgments
A. See
Stephen Roller
Douwe Kiela
Jason Weston
89
287
0
22 Feb 2019
The Second Conversational Intelligence Challenge (ConvAI2)
The Second Conversational Intelligence Challenge (ConvAI2)
Emily Dinan
V. Logacheva
Valentin Malykh
Alexander H. Miller
Kurt Shuster
...
Alexander I. Rudnicky
Jason Williams
Joelle Pineau
Andrey Kravchenko
Jason Weston
DRL
97
366
0
31 Jan 2019
Importance of Search and Evaluation Strategies in Neural Dialogue
  Modeling
Importance of Search and Evaluation Strategies in Neural Dialogue Modeling
Ilia Kulikov
Alexander H. Miller
Kyunghyun Cho
Jason Weston
65
84
0
02 Nov 2018
Adaptive Input Representations for Neural Language Modeling
Adaptive Input Representations for Neural Language Modeling
Alexei Baevski
Michael Auli
98
390
0
28 Sep 2018
Retrieve and Refine: Improved Sequence Generation Models For Dialogue
Retrieve and Refine: Improved Sequence Generation Models For Dialogue
Jason Weston
Emily Dinan
Alexander H. Miller
RALM
58
204
0
14 Aug 2018
Learning to Write with Cooperative Discriminators
Learning to Write with Cooperative Discriminators
Ari Holtzman
Jan Buys
Maxwell Forbes
Antoine Bosselut
David Golub
Yejin Choi
71
237
0
16 May 2018
Hierarchical Neural Story Generation
Hierarchical Neural Story Generation
Angela Fan
M. Lewis
Yann N. Dauphin
DiffM
170
1,615
0
13 May 2018
Personalizing Dialogue Agents: I have a dog, do you have pets too?
Personalizing Dialogue Agents: I have a dog, do you have pets too?
Saizheng Zhang
Emily Dinan
Jack Urbanek
Arthur Szlam
Douwe Kiela
Jason Weston
89
1,453
0
22 Jan 2018
Controllable Abstractive Summarization
Controllable Abstractive Summarization
Angela Fan
David Grangier
Michael Auli
80
310
0
14 Nov 2017
Classical Structured Prediction Losses for Sequence to Sequence Learning
Classical Structured Prediction Losses for Sequence to Sequence Learning
Sergey Edunov
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
AIMat
74
186
0
14 Nov 2017
Generating Sentences by Editing Prototypes
Generating Sentences by Editing Prototypes
Kelvin Guu
Tatsunori B. Hashimoto
Yonatan Oren
Percy Liang
131
316
0
26 Sep 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
628
130,942
0
12 Jun 2017
A Deep Reinforced Model for Abstractive Summarization
A Deep Reinforced Model for Abstractive Summarization
Romain Paulus
Caiming Xiong
R. Socher
AI4TS
179
1,556
0
11 May 2017
OpenNMT: Open-Source Toolkit for Neural Machine Translation
OpenNMT: Open-Source Toolkit for Neural Machine Translation
Guillaume Klein
Yoon Kim
Yuntian Deng
Jean Senellart
Alexander M. Rush
325
1,900
0
10 Jan 2017
A Simple, Fast Diverse Decoding Algorithm for Neural Generation
A Simple, Fast Diverse Decoding Algorithm for Neural Generation
Jiwei Li
Will Monroe
Dan Jurafsky
67
240
0
25 Nov 2016
Controlling Output Length in Neural Encoder-Decoders
Controlling Output Length in Neural Encoder-Decoders
Yuta Kikuchi
Graham Neubig
Ryohei Sasano
Hiroya Takamura
Manabu Okumura
54
243
0
30 Sep 2016
Pointer Sentinel Mixture Models
Pointer Sentinel Mixture Models
Stephen Merity
Caiming Xiong
James Bradbury
R. Socher
RALM
264
2,844
0
26 Sep 2016
SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
Lantao Yu
Weinan Zhang
Jun Wang
Yong Yu
GAN
62
2,396
0
18 Sep 2016
Minimum Risk Training for Neural Machine Translation
Minimum Risk Training for Neural Machine Translation
Shiqi Shen
Yong Cheng
Zhongjun He
W. He
Hua Wu
Maosong Sun
Yang Liu
108
469
0
08 Dec 2015
Sequence Level Training with Recurrent Neural Networks
Sequence Level Training with Recurrent Neural Networks
MarcÁurelio Ranzato
S. Chopra
Michael Auli
Wojciech Zaremba
96
1,614
0
20 Nov 2015
A Reduction of Imitation Learning and Structured Prediction to No-Regret
  Online Learning
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning
Stéphane Ross
Geoffrey J. Gordon
J. Andrew Bagnell
OffRL
198
3,211
0
02 Nov 2010
Search-based Structured Prediction
Search-based Structured Prediction
Hal Daumé
John Langford
Daniel Marcu
GNN
122
586
0
04 Jul 2009
1