ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.07174
  4. Cited By
The EOS Decision and Length Extrapolation

The EOS Decision and Length Extrapolation

14 October 2020
Benjamin Newman
John Hewitt
Percy Liang
Christopher D. Manning
ArXivPDFHTML

Papers citing "The EOS Decision and Length Extrapolation"

24 / 24 papers shown
Title
Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges
Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges
Nayoung Lee
Ziyang Cai
Avi Schwarzschild
Kangwook Lee
Dimitris Papailiopoulos
ReLM
VLM
LRM
AI4CE
120
7
0
03 Feb 2025
RNNs can generate bounded hierarchical languages with optimal memory
RNNs can generate bounded hierarchical languages with optimal memory
John Hewitt
Michael Hahn
Surya Ganguli
Percy Liang
Christopher D. Manning
LRM
41
54
0
15 Oct 2020
A Study of Compositional Generalization in Neural Models
A Study of Compositional Generalization in Neural Models
Tim Klinger
D. Adjodah
Vincent Marois
Joshua Joseph
Matthew D Riemer
Alex Pentland
Murray Campbell
CoGe
NAI
176
13
0
16 Jun 2020
On the Linguistic Capacity of Real-Time Counter Automata
On the Linguistic Capacity of Real-Time Counter Automata
William Merrill
35
23
0
15 Apr 2020
Learning Compositional Rules via Neural Program Synthesis
Learning Compositional Rules via Neural Program Synthesis
Maxwell Nye
Armando Solar-Lezama
J. Tenenbaum
Brenden M. Lake
NAI
LRM
62
118
0
12 Mar 2020
A Benchmark for Systematic Generalization in Grounded Language
  Understanding
A Benchmark for Systematic Generalization in Grounded Language Understanding
Laura Ruis
Jacob Andreas
Marco Baroni
Diane Bouchacourt
Brenden M. Lake
55
144
0
11 Mar 2020
Location Attention for Extrapolation to Longer Sequences
Location Attention for Extrapolation to Longer Sequences
Yann Dubois
Gautier Dagan
Dieuwke Hupkes
Elia Bruni
46
43
0
10 Nov 2019
Compositionality decomposed: how do neural networks generalise?
Compositionality decomposed: how do neural networks generalise?
Dieuwke Hupkes
Verna Dankers
Mathijs Mul
Elia Bruni
CoGe
145
336
0
22 Aug 2019
Compositional generalization through meta sequence-to-sequence learning
Compositional generalization through meta sequence-to-sequence learning
Brenden M. Lake
CoGe
83
199
0
12 Jun 2019
LSTM Networks Can Perform Dynamic Counting
LSTM Networks Can Perform Dynamic Counting
Mirac Suzgun
Sebastian Gehrmann
Yonatan Belinkov
Stuart M. Shieber
58
75
0
09 Jun 2019
On Evaluating the Generalization of LSTM Models in Formal Languages
On Evaluating the Generalization of LSTM Models in Formal Languages
Mirac Suzgun
Yonatan Belinkov
Stuart M. Shieber
AI4CE
37
41
0
02 Nov 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.7K
94,770
0
11 Oct 2018
Assessing Composition in Sentence Vector Representations
Assessing Composition in Sentence Vector Representations
Allyson Ettinger
Ahmed Elgohary
C. Phillips
Philip Resnik
CoGe
45
78
0
11 Sep 2018
Correcting Length Bias in Neural Machine Translation
Correcting Length Bias in Neural Machine Translation
Kenton W. Murray
David Chiang
AIMat
67
157
0
29 Aug 2018
Rearranging the Familiar: Testing Compositional Generalization in
  Recurrent Networks
Rearranging the Familiar: Testing Compositional Generalization in Recurrent Networks
J. Loula
Marco Baroni
Brenden M. Lake
KELM
CoGe
46
132
0
19 Jul 2018
Extrapolation in NLP
Extrapolation in NLP
Jeff Mitchell
Pasquale Minervini
Pontus Stenetorp
Sebastian Riedel
21
20
0
17 May 2018
On the Practical Computational Power of Finite Precision RNNs for
  Language Recognition
On the Practical Computational Power of Finite Precision RNNs for Language Recognition
Gail Weiss
Yoav Goldberg
Eran Yahav
74
265
0
13 May 2018
What you can cram into a single vector: Probing sentence embeddings for
  linguistic properties
What you can cram into a single vector: Probing sentence embeddings for linguistic properties
Alexis Conneau
Germán Kruszewski
Guillaume Lample
Loïc Barrault
Marco Baroni
331
893
0
03 May 2018
A Call for Clarity in Reporting BLEU Scores
A Call for Clarity in Reporting BLEU Scores
Matt Post
150
2,985
0
23 Apr 2018
Deep Learning: A Critical Appraisal
Deep Learning: A Critical Appraisal
G. Marcus
HAI
VLM
122
1,040
0
02 Jan 2018
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
692
131,526
0
12 Jun 2017
OpenNMT: Open-Source Toolkit for Neural Machine Translation
OpenNMT: Open-Source Toolkit for Neural Machine Translation
Guillaume Klein
Yoon Kim
Yuntian Deng
Jean Senellart
Alexander M. Rush
327
1,900
0
10 Jan 2017
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.8K
150,039
0
22 Dec 2014
Neural Turing Machines
Neural Turing Machines
Alex Graves
Greg Wayne
Ivo Danihelka
97
2,327
0
20 Oct 2014
1