Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2206.02369
Cited By
Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation
6 June 2022
Jin Xu
Xiaojiang Liu
Jianhao Yan
Deng Cai
Huayang Li
Jian Li
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation"
20 / 20 papers shown
Title
Induction Head Toxicity Mechanistically Explains Repetition Curse in Large Language Models
Shuxun Wang
Qingyu Yin
Chak Tou Leong
Qiang Zhang
Linyi Yang
4
0
0
17 May 2025
Repetition Neurons: How Do Language Models Produce Repetitions?
Tatsuya Hiraoka
Kentaro Inui
MILM
75
8
0
21 Feb 2025
Unveiling Attractor Cycles in Large Language Models: A Dynamical Systems View of Successive Paraphrasing
Zhilin Wang
Yafu Li
Jianhao Yan
Yu Cheng
Yue Zhang
65
0
0
21 Feb 2025
Communication is All You Need: Persuasion Dataset Construction via Multi-LLM Communication
Weicheng Ma
Hefan Zhang
Ivory Yang
Shiyu Ji
Joice Chen
...
Shubham Mohole
Ethan Gearey
Michael Macy
Saeed Hassanpour
Soroush Vosoughi
64
0
0
13 Feb 2025
Decoding Secret Memorization in Code LLMs Through Token-Level Characterization
Yuqing Nie
Chong Wang
Kaidi Wang
Guoai Xu
Guosheng Xu
Haoyu Wang
OffRL
184
1
0
11 Oct 2024
T2VSafetyBench: Evaluating the Safety of Text-to-Video Generative Models
Yibo Miao
Yifan Zhu
Yinpeng Dong
Lijia Yu
Jun Zhu
Xiao-Shan Gao
EGVM
43
12
0
08 Jul 2024
RefuteBench: Evaluating Refuting Instruction-Following for Large Language Models
Jianhao Yan
Yun Luo
Yue Zhang
ALM
LRM
38
7
0
21 Feb 2024
JumpCoder: Go Beyond Autoregressive Coder via Online Modification
Mouxiang Chen
Hao Tian
Zhongxi Liu
Xiaoxue Ren
Jianling Sun
SyDa
KELM
43
2
0
15 Jan 2024
Object Recognition as Next Token Prediction
Kaiyu Yue
Borchun Chen
Jonas Geiping
Hengduo Li
Tom Goldstein
Ser-Nam Lim
40
9
0
04 Dec 2023
Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models
J. Michaelov
Catherine Arnett
Tyler A. Chang
Benjamin Bergen
36
12
0
15 Nov 2023
ContextRef: Evaluating Referenceless Metrics For Image Description Generation
Elisa Kreiss
E. Zelikman
Christopher Potts
Nick Haber
34
5
0
21 Sep 2023
AutoConv: Automatically Generating Information-seeking Conversations with Large Language Models
Siheng Li
Cheng Yang
Yichun Yin
Xinyu Zhu
Ze-Long Cheng
Lifeng Shang
Xin Jiang
Qun Liu
Yujiu Yang
SyDa
35
3
0
12 Aug 2023
KoRC: Knowledge oriented Reading Comprehension Benchmark for Deep Text Understanding
Zijun Yao
Yantao Liu
Xin Lv
S. Cao
Jifan Yu
Lei Hou
Juanzi Li
37
10
0
06 Jul 2023
Mitigating the Learning Bias towards Repetition by Self-Contrastive Training for Open-Ended Generation
Jian Guan
Minlie Huang
32
0
0
04 Jul 2023
Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text Generation via Concentrating Attention
Wenhao Li
Xiaoyuan Yi
Jinyi Hu
Maosong Sun
Xing Xie
44
0
0
14 Nov 2022
Contrastive Search Is What You Need For Neural Text Generation
Yixuan Su
Nigel Collier
25
50
0
25 Oct 2022
A Theoretical Analysis of the Repetition Problem in Text Generation
Z. Fu
Wai Lam
Anthony Man-Cho So
Bei Shi
79
90
0
29 Dec 2020
Text Summarization with Pretrained Encoders
Yang Liu
Mirella Lapata
MILM
258
1,435
0
22 Aug 2019
OpenNMT: Open-Source Toolkit for Neural Machine Translation
Guillaume Klein
Yoon Kim
Yuntian Deng
Jean Senellart
Alexander M. Rush
273
1,896
0
10 Jan 2017
Teaching Machines to Read and Comprehend
Karl Moritz Hermann
Tomás Kociský
Edward Grefenstette
L. Espeholt
W. Kay
Mustafa Suleyman
Phil Blunsom
211
3,513
0
10 Jun 2015
1