ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2204.05424
  4. Cited By
A Call for Clarity in Beam Search: How It Works and When It Stops
v1v2v3 (latest)

A Call for Clarity in Beam Search: How It Works and When It Stops

11 April 2022
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Dragomir R. Radev
Yejin Choi
Noah A. Smith
ArXiv (abs)PDFHTML

Papers citing "A Call for Clarity in Beam Search: How It Works and When It Stops"

50 / 126 papers shown
Title
Neural Text Summarization: A Critical Evaluation
Neural Text Summarization: A Critical Evaluation
Wojciech Kry'sciñski
N. Keskar
Bryan McCann
Caiming Xiong
R. Socher
81
367
0
23 Aug 2019
On The Evaluation of Machine Translation Systems Trained With
  Back-Translation
On The Evaluation of Machine Translation Systems Trained With Back-Translation
Sergey Edunov
Myle Ott
MarcÁurelio Ranzato
Michael Auli
40
98
0
14 Aug 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
686
24,557
0
26 Jul 2019
Sampled Softmax with Random Fourier Features
Sampled Softmax with Random Fourier Features
A. S. Rawat
Jiecao Chen
Felix X. Yu
A. Suresh
Sanjiv Kumar
75
55
0
24 Jul 2019
Enhancing the Locality and Breaking the Memory Bottleneck of Transformer
  on Time Series Forecasting
Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting
Shiyang Li
Xiaoyong Jin
Yao Xuan
Xiyou Zhou
Wenhu Chen
Yu Wang
Xifeng Yan
AI4TS
112
1,429
0
29 Jun 2019
Translationese in Machine Translation Evaluation
Translationese in Machine Translation Evaluation
Yvette Graham
Barry Haddow
Philipp Koehn
51
90
0
24 Jun 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
236
8,451
0
19 Jun 2019
Defending Against Neural Fake News
Defending Against Neural Fake News
Rowan Zellers
Ari Holtzman
Hannah Rashkin
Yonatan Bisk
Ali Farhadi
Franziska Roesner
Yejin Choi
AAML
134
1,030
0
29 May 2019
Are Sixteen Heads Really Better than One?
Are Sixteen Heads Really Better than One?
Paul Michel
Omer Levy
Graham Neubig
MoE
109
1,069
0
25 May 2019
Adaptive Attention Span in Transformers
Adaptive Attention Span in Transformers
Sainbayar Sukhbaatar
Edouard Grave
Piotr Bojanowski
Armand Joulin
76
286
0
19 May 2019
Transformers with convolutional context for ASR
Transformers with convolutional context for ASR
Abdel-rahman Mohamed
Dmytro Okhonko
Luke Zettlemoyer
56
168
0
26 Apr 2019
Generating Long Sequences with Sparse Transformers
Generating Long Sequences with Sparse Transformers
R. Child
Scott Gray
Alec Radford
Ilya Sutskever
129
1,916
0
23 Apr 2019
Modeling Recurrence for Transformer
Modeling Recurrence for Transformer
Jie Hao
Xing Wang
Baosong Yang
Longyue Wang
Jinfeng Zhang
Zhaopeng Tu
75
85
0
05 Apr 2019
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
Myle Ott
Sergey Edunov
Alexei Baevski
Angela Fan
Sam Gross
Nathan Ng
David Grangier
Michael Auli
VLMFaML
120
3,156
0
01 Apr 2019
Learning to Stop in Structured Prediction for Neural Machine Translation
Learning to Stop in Structured Prediction for Neural Machine Translation
Mingbo Ma
Renjie Zheng
Liang Huang
74
5
0
01 Apr 2019
Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for
  Sampling Sequences Without Replacement
Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement
W. Kool
H. V. Hoof
Max Welling
121
220
0
14 Mar 2019
Parameter-Efficient Transfer Learning for NLP
Parameter-Efficient Transfer Learning for NLP
N. Houlsby
A. Giurgiu
Stanislaw Jastrzebski
Bruna Morrone
Quentin de Laroussilhe
Andrea Gesmundo
Mona Attariyan
Sylvain Gelly
223
4,518
0
02 Feb 2019
Pay Less Attention with Lightweight and Dynamic Convolutions
Pay Less Attention with Lightweight and Dynamic Convolutions
Felix Wu
Angela Fan
Alexei Baevski
Yann N. Dauphin
Michael Auli
89
610
0
29 Jan 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
260
3,747
0
09 Jan 2019
Efficient Attention: Attention with Linear Complexities
Efficient Attention: Attention with Linear Complexities
Zhuoran Shen
Mingyuan Zhang
Haiyu Zhao
Shuai Yi
Hongsheng Li
95
534
0
04 Dec 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLMSSLSSeg
1.8K
95,229
0
11 Oct 2018
Adaptive Input Representations for Neural Language Modeling
Adaptive Input Representations for Neural Language Modeling
Alexei Baevski
Michael Auli
111
390
0
28 Sep 2018
Document-Level Neural Machine Translation with Hierarchical Attention
  Networks
Document-Level Neural Machine Translation with Hierarchical Attention Networks
Lesly Miculicich
Dhananjay Ram
Nikolaos Pappas
James Henderson
AIMat
70
269
0
05 Sep 2018
When to Finish? Optimal Beam Search for Neural Text Generation (modulo
  beam size)
When to Finish? Optimal Beam Search for Neural Text Generation (modulo beam size)
Liang Huang
Kai Zhao
Mingbo Ma
86
54
0
31 Aug 2018
Correcting Length Bias in Neural Machine Translation
Correcting Length Bias in Neural Machine Translation
Kenton W. Murray
David Chiang
AIMat
71
158
0
29 Aug 2018
Breaking the Beam Search Curse: A Study of (Re-)Scoring Methods and
  Stopping Criteria for Neural Machine Translation
Breaking the Beam Search Curse: A Study of (Re-)Scoring Methods and Stopping Criteria for Neural Machine Translation
Yilin Yang
Liang Huang
Mingbo Ma
63
92
0
28 Aug 2018
Rational Recurrences
Rational Recurrences
Hao Peng
Roy Schwartz
Sam Thomson
Noah A. Smith
AI4CE
51
39
0
28 Aug 2018
Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional
  Neural Networks for Extreme Summarization
Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization
Shashi Narayan
Shay B. Cohen
Mirella Lapata
AILaw
149
1,684
0
27 Aug 2018
Universal Transformers
Universal Transformers
Mostafa Dehghani
Stephan Gouws
Oriol Vinyals
Jakob Uszkoreit
Lukasz Kaiser
87
755
0
10 Jul 2018
A Simple Method for Commonsense Reasoning
A Simple Method for Commonsense Reasoning
Trieu H. Trinh
Quoc V. Le
LRMReLM
97
434
0
07 Jun 2018
Scaling Neural Machine Translation
Scaling Neural Machine Translation
Myle Ott
Sergey Edunov
David Grangier
Michael Auli
AIMat
192
615
0
01 Jun 2018
A Call for Clarity in Reporting BLEU Scores
A Call for Clarity in Reporting BLEU Scores
Matt Post
181
2,998
0
23 Apr 2018
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
1.1K
7,200
0
20 Apr 2018
ListOps: A Diagnostic Dataset for Latent Tree Learning
ListOps: A Diagnostic Dataset for Latent Tree Learning
Nikita Nangia
Samuel R. Bowman
66
138
0
17 Apr 2018
Differentiable plasticity: training plastic neural networks with
  backpropagation
Differentiable plasticity: training plastic neural networks with backpropagation
Thomas Miconi
Jeff Clune
Kenneth O. Stanley
AI4CE
66
154
0
06 Apr 2018
Tensor2Tensor for Neural Machine Translation
Tensor2Tensor for Neural Machine Translation
Ashish Vaswani
Samy Bengio
E. Brevdo
François Chollet
Aidan Gomez
...
Nal Kalchbrenner
Niki Parmar
Ryan Sepassi
Noam M. Shazeer
Jakob Uszkoreit
98
530
0
16 Mar 2018
Analyzing Uncertainty in Neural Machine Translation
Analyzing Uncertainty in Neural Machine Translation
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
UQLM
113
275
0
28 Feb 2018
Image Transformer
Image Transformer
Niki Parmar
Ashish Vaswani
Jakob Uszkoreit
Lukasz Kaiser
Noam M. Shazeer
Alexander Ku
Dustin Tran
ViT
144
1,687
0
15 Feb 2018
Generating Wikipedia by Summarizing Long Sequences
Generating Wikipedia by Summarizing Long Sequences
Peter J. Liu
Mohammad Saleh
Etienne Pot
Ben Goodrich
Ryan Sepassi
Lukasz Kaiser
Noam M. Shazeer
CVBM
211
801
0
30 Jan 2018
Classical Structured Prediction Losses for Sequence to Sequence Learning
Classical Structured Prediction Losses for Sequence to Sequence Learning
Sergey Edunov
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
AIMat
115
186
0
14 Nov 2017
Non-Autoregressive Neural Machine Translation
Non-Autoregressive Neural Machine Translation
Jiatao Gu
James Bradbury
Caiming Xiong
Victor O.K. Li
R. Socher
107
798
0
07 Nov 2017
Simple Recurrent Units for Highly Parallelizable Recurrence
Simple Recurrent Units for Highly Parallelizable Recurrence
Tao Lei
Yu Zhang
Sida I. Wang
Huijing Dai
Yoav Artzi
LRM
139
276
0
08 Sep 2017
Regularizing and Optimizing LSTM Language Models
Regularizing and Optimizing LSTM Language Models
Stephen Merity
N. Keskar
R. Socher
173
1,096
0
07 Aug 2017
Bottom-Up and Top-Down Attention for Image Captioning and Visual
  Question Answering
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
Peter Anderson
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Gould
Lei Zhang
AIMat
123
4,223
0
25 Jul 2017
Six Challenges for Neural Machine Translation
Six Challenges for Neural Machine Translation
Philipp Koehn
Rebecca Knowles
AAMLAIMat
377
1,225
0
12 Jun 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
795
132,454
0
12 Jun 2017
A Broad-Coverage Challenge Corpus for Sentence Understanding through
  Inference
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
Adina Williams
Nikita Nangia
Samuel R. Bowman
524
4,497
0
18 Apr 2017
Get To The Point: Summarization with Pointer-Generator Networks
Get To The Point: Summarization with Pointer-Generator Networks
A. See
Peter J. Liu
Christopher D. Manning
3DPC
311
4,029
0
14 Apr 2017
Beam Search Strategies for Neural Machine Translation
Beam Search Strategies for Neural Machine Translation
Markus Freitag
Yaser Al-Onaizan
97
396
0
06 Feb 2017
Google's Multilingual Neural Machine Translation System: Enabling
  Zero-Shot Translation
Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation
Melvin Johnson
M. Schuster
Quoc V. Le
M. Krikun
Yonghui Wu
...
F. Viégas
Martin Wattenberg
Gregory S. Corrado
Macduff Hughes
Jeffrey Dean
129
2,096
0
14 Nov 2016
Previous
123
Next