2010.07174
The EOS Decision and Length Extrapolation
14 October 2020
Benjamin Newman
John Hewitt
Percy Liang
Christopher D. Manning
Papers citing "The EOS Decision and Length Extrapolation"
24 / 24 papers shown
Title
Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges
Nayoung Lee
Ziyang Cai
Avi Schwarzschild
Kangwook Lee
Dimitris Papailiopoulos
ReLM
VLM
LRM
AI4CE
120
7
0
03 Feb 2025
RNNs can generate bounded hierarchical languages with optimal memory
John Hewitt
Michael Hahn
Surya Ganguli
Percy Liang
Christopher D. Manning
LRM
41
54
0
15 Oct 2020
A Study of Compositional Generalization in Neural Models
Tim Klinger
D. Adjodah
Vincent Marois
Joshua Joseph
Matthew D Riemer
Alex Pentland
Murray Campbell
CoGe
NAI
176
13
0
16 Jun 2020
On the Linguistic Capacity of Real-Time Counter Automata
William Merrill
35
23
0
15 Apr 2020
Learning Compositional Rules via Neural Program Synthesis
Maxwell Nye
Armando Solar-Lezama
J. Tenenbaum
Brenden M. Lake
NAI
LRM
62
118
0
12 Mar 2020
A Benchmark for Systematic Generalization in Grounded Language Understanding
Laura Ruis
Jacob Andreas
Marco Baroni
Diane Bouchacourt
Brenden M. Lake
55
144
0
11 Mar 2020
Location Attention for Extrapolation to Longer Sequences
Yann Dubois
Gautier Dagan
Dieuwke Hupkes
Elia Bruni
46
43
0
10 Nov 2019
Compositionality decomposed: how do neural networks generalise?
Dieuwke Hupkes
Verna Dankers
Mathijs Mul
Elia Bruni
CoGe
145
336
0
22 Aug 2019
Compositional generalization through meta sequence-to-sequence learning
Brenden M. Lake
CoGe
83
199
0
12 Jun 2019
LSTM Networks Can Perform Dynamic Counting
Mirac Suzgun
Sebastian Gehrmann
Yonatan Belinkov
Stuart M. Shieber
58
75
0
09 Jun 2019
On Evaluating the Generalization of LSTM Models in Formal Languages
Mirac Suzgun
Yonatan Belinkov
Stuart M. Shieber
AI4CE
37
41
0
02 Nov 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.7K
94,770
0
11 Oct 2018
Assessing Composition in Sentence Vector Representations
Allyson Ettinger
Ahmed Elgohary
C. Phillips
Philip Resnik
CoGe
45
78
0
11 Sep 2018
Correcting Length Bias in Neural Machine Translation
Kenton W. Murray
David Chiang
AIMat
67
157
0
29 Aug 2018
Rearranging the Familiar: Testing Compositional Generalization in Recurrent Networks
J. Loula
Marco Baroni
Brenden M. Lake
KELM
CoGe
46
132
0
19 Jul 2018
Extrapolation in NLP
Jeff Mitchell
Pasquale Minervini
Pontus Stenetorp
Sebastian Riedel
21
20
0
17 May 2018
On the Practical Computational Power of Finite Precision RNNs for Language Recognition
Gail Weiss
Yoav Goldberg
Eran Yahav
74
265
0
13 May 2018
What you can cram into a single vector: Probing sentence embeddings for linguistic properties
Alexis Conneau
Germán Kruszewski
Guillaume Lample
Loïc Barrault
Marco Baroni
331
893
0
03 May 2018
A Call for Clarity in Reporting BLEU Scores
Matt Post
150
2,985
0
23 Apr 2018
Deep Learning: A Critical Appraisal
G. Marcus
HAI
VLM
122
1,040
0
02 Jan 2018
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
692
131,526
0
12 Jun 2017
OpenNMT: Open-Source Toolkit for Neural Machine Translation
Guillaume Klein
Yoon Kim
Yuntian Deng
Jean Senellart
Alexander M. Rush
327
1,900
0
10 Jan 2017
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.8K
150,039
0
22 Dec 2014
Neural Turing Machines
Alex Graves
Greg Wayne
Ivo Danihelka
97
2,327
0
20 Oct 2014