Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2109.08705
Cited By
Relating Neural Text Degeneration to Exposure Bias
17 September 2021
Ting-Rui Chiang
Yun-Nung Chen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Relating Neural Text Degeneration to Exposure Bias"
14 / 14 papers shown
Title
LLMR: Knowledge Distillation with a Large Language Model-Induced Reward
Dongheng Li
Yongchang Hao
Lili Mou
48
1
0
19 Sep 2024
The Sound of Healthcare: Improving Medical Transcription ASR Accuracy with Large Language Models
Ayo Adedeji
Sarita Joshi
Brendan Doohan
LM&MA
24
14
0
12 Feb 2024
Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective
Huayang Li
Tian Lan
Z. Fu
Deng Cai
Lemao Liu
Nigel Collier
Taro Watanabe
Yixuan Su
34
12
0
16 Oct 2023
Language Model Decoding as Direct Metrics Optimization
Haozhe Ji
Pei Ke
Hongning Wang
Minlie Huang
11
7
0
02 Oct 2023
On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes
Rishabh Agarwal
Nino Vieillard
Yongchao Zhou
Piotr Stańczyk
Sabela Ramos
Matthieu Geist
Olivier Bachem
35
84
0
23 Jun 2023
ARIES: A Corpus of Scientific Paper Edits Made in Response to Peer Reviews
Mike DÁrcy
Alexis Ross
Erin Bransom
Bailey Kuehl
Jonathan Bragg
Tom Hope
Doug Downey
KELM
24
21
0
21 Jun 2023
A Frustratingly Simple Decoding Method for Neural Text Generation
Haoran Yang
Deng Cai
Huayang Li
Wei Bi
Wai Lam
Shuming Shi
46
11
0
22 May 2023
A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training
Nitay Calderon
Subhabrata Mukherjee
Roi Reichart
Amir Kantor
31
17
0
03 May 2023
Language Model Behavior: A Comprehensive Survey
Tyler A. Chang
Benjamin Bergen
VLM
LRM
LM&MA
27
103
0
20 Mar 2023
Reward Gaming in Conditional Text Generation
Richard Yuanzhe Pang
Vishakh Padmakumar
Thibault Sellam
Ankur P. Parikh
He He
29
24
0
16 Nov 2022
Teacher Forcing Recovers Reward Functions for Text Generation
Yongchang Hao
Yuxin Liu
Lili Mou
OffRL
32
11
0
17 Oct 2022
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
Rajkumar Ramamurthy
Prithviraj Ammanabrolu
Kianté Brantley
Jack Hessel
R. Sifa
Christian Bauckhage
Hannaneh Hajishirzi
Yejin Choi
OffRL
31
239
0
03 Oct 2022
PreTR: Spatio-Temporal Non-Autoregressive Trajectory Prediction Transformer
Lina Achaji
Thierno Barry
Thibault Fouqueray
Julien Moreau
François Aioun
François Charpillet
16
15
0
17 Mar 2022
Language Models as Knowledge Bases?
Fabio Petroni
Tim Rocktaschel
Patrick Lewis
A. Bakhtin
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
AI4MH
415
2,586
0
03 Sep 2019
1