The EOS Decision and Length Extrapolation

14 October 2020

Benjamin Newman

John Hewitt

Percy Liang

Christopher D. Manning

ArXiv PDF HTML

Papers citing "The EOS Decision and Length Extrapolation"

24 / 24 papers shown

Title
Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges Nayoung Lee Ziyang Cai Avi Schwarzschild Kangwook Lee Dimitris Papailiopoulos ReLM VLM LRM AI4CE 120 7 0 03 Feb 2025
RNNs can generate bounded hierarchical languages with optimal memory John Hewitt Michael Hahn Surya Ganguli Percy Liang Christopher D. Manning LRM 41 54 0 15 Oct 2020
A Study of Compositional Generalization in Neural Models Tim Klinger D. Adjodah Vincent Marois Joshua Joseph Matthew D Riemer Alex Pentland Murray Campbell CoGe NAI 176 13 0 16 Jun 2020
On the Linguistic Capacity of Real-Time Counter Automata William Merrill 35 23 0 15 Apr 2020
Learning Compositional Rules via Neural Program Synthesis Maxwell Nye Armando Solar-Lezama J. Tenenbaum Brenden M. Lake NAI LRM 62 118 0 12 Mar 2020
A Benchmark for Systematic Generalization in Grounded Language Understanding Laura Ruis Jacob Andreas Marco Baroni Diane Bouchacourt Brenden M. Lake 55 144 0 11 Mar 2020
Location Attention for Extrapolation to Longer Sequences Yann Dubois Gautier Dagan Dieuwke Hupkes Elia Bruni 46 43 0 10 Nov 2019
Compositionality decomposed: how do neural networks generalise? Dieuwke Hupkes Verna Dankers Mathijs Mul Elia Bruni CoGe 145 336 0 22 Aug 2019
Compositional generalization through meta sequence-to-sequence learning Brenden M. Lake CoGe 83 199 0 12 Jun 2019
LSTM Networks Can Perform Dynamic Counting Mirac Suzgun Sebastian Gehrmann Yonatan Belinkov Stuart M. Shieber 58 75 0 09 Jun 2019
On Evaluating the Generalization of LSTM Models in Formal Languages Mirac Suzgun Yonatan Belinkov Stuart M. Shieber AI4CE 37 41 0 02 Nov 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Jacob Devlin Ming-Wei Chang Kenton Lee Kristina Toutanova VLM SSL SSeg 1.7K 94,770 0 11 Oct 2018
Assessing Composition in Sentence Vector Representations Allyson Ettinger Ahmed Elgohary C. Phillips Philip Resnik CoGe 45 78 0 11 Sep 2018
Correcting Length Bias in Neural Machine Translation Kenton W. Murray David Chiang AIMat 67 157 0 29 Aug 2018
Rearranging the Familiar: Testing Compositional Generalization in Recurrent Networks J. Loula Marco Baroni Brenden M. Lake KELM CoGe 46 132 0 19 Jul 2018
Extrapolation in NLP Jeff Mitchell Pasquale Minervini Pontus Stenetorp Sebastian Riedel 21 20 0 17 May 2018
On the Practical Computational Power of Finite Precision RNNs for Language Recognition Gail Weiss Yoav Goldberg Eran Yahav 74 265 0 13 May 2018
What you can cram into a single vector: Probing sentence embeddings for linguistic properties Alexis Conneau Germán Kruszewski Guillaume Lample Loïc Barrault Marco Baroni 331 893 0 03 May 2018
A Call for Clarity in Reporting BLEU Scores Matt Post 150 2,985 0 23 Apr 2018
Deep Learning: A Critical Appraisal G. Marcus HAI VLM 122 1,040 0 02 Jan 2018
Attention Is All You Need Ashish Vaswani Noam M. Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan Gomez Lukasz Kaiser Illia Polosukhin 3DV 692 131,526 0 12 Jun 2017
OpenNMT: Open-Source Toolkit for Neural Machine Translation Guillaume Klein Yoon Kim Yuntian Deng Jean Senellart Alexander M. Rush 327 1,900 0 10 Jan 2017
Adam: A Method for Stochastic Optimization Diederik P. Kingma Jimmy Ba ODL 1.8K 150,039 0 22 Dec 2014
Neural Turing Machines Alex Graves Greg Wayne Ivo Danihelka 97 2,327 0 20 Oct 2014