Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.08378
Cited By
Dynamic Evaluation of Transformer Language Models
17 April 2019
Ben Krause
Emmanuel Kahembwe
Iain Murray
Steve Renals
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Dynamic Evaluation of Transformer Language Models"
14 / 14 papers shown
Title
Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Yu Sun
Xinhao Li
Karan Dalal
Jiarui Xu
Arjun Vikram
...
Xinlei Chen
Xiaolong Wang
Sanmi Koyejo
Tatsunori Hashimoto
Carlos Guestrin
78
93
0
05 Jul 2024
Meta-Learning Fast Weight Language Models
Kevin Clark
Kelvin Guu
Ming-Wei Chang
Panupong Pasupat
Geoffrey E. Hinton
Mohammad Norouzi
KELM
32
13
0
05 Dec 2022
Optimizing Test-Time Query Representations for Dense Retrieval
Mujeen Sung
Jungsoo Park
Jaewoo Kang
Danqi Chen
Jinhyuk Lee
40
5
0
25 May 2022
Effective Cross-Utterance Language Modeling for Conversational Speech Recognition
Bi-Cheng Yan
Hsin-Wei Wang
Shih-Hsuan Chiu
Hsuan-Sheng Chiu
Berlin Chen
23
1
0
05 Nov 2021
GNN-LM: Language Modeling based on Global Contexts via GNN
Yuxian Meng
Shi Zong
Xiaoya Li
Xiaofei Sun
Tianwei Zhang
Fei Wu
Jiwei Li
LRM
29
37
0
17 Oct 2021
Vision-Language Transformer and Query Generation for Referring Segmentation
Henghui Ding
Chang-rui Liu
Suchen Wang
Xudong Jiang
40
253
0
12 Aug 2021
Temporal Adaptation of BERT and Performance on Downstream Document Classification: Insights from Social Media
Paul Röttger
J. Pierrehumbert
32
64
0
16 Apr 2021
Mind the Gap: Assessing Temporal Generalization in Neural Language Models
Angeliki Lazaridou
A. Kuncoro
E. Gribovskaya
Devang Agrawal
Adam Liska
...
Sebastian Ruder
Dani Yogatama
Kris Cao
Susannah Young
Phil Blunsom
VLM
46
207
0
03 Feb 2021
Multi-timescale Representation Learning in LSTM Language Models
Shivangi Mahto
Vy A. Vo
Javier S. Turek
Alexander G. Huth
15
29
0
27 Sep 2020
Just Add Functions: A Neural-Symbolic Language Model
David Demeter
Doug Downey
16
11
0
11 Dec 2019
Neural Machine Translation: A Review and Survey
Felix Stahlberg
3DV
AI4TS
MedIm
35
313
0
04 Dec 2019
Compressive Transformers for Long-Range Sequence Modelling
Jack W. Rae
Anna Potapenko
Siddhant M. Jayakumar
Timothy Lillicrap
RALM
VLM
KELM
13
623
0
13 Nov 2019
Generalization through Memorization: Nearest Neighbor Language Models
Urvashi Khandelwal
Omer Levy
Dan Jurafsky
Luke Zettlemoyer
M. Lewis
RALM
86
820
0
01 Nov 2019
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
Mohammad Shoeybi
M. Patwary
Raul Puri
P. LeGresley
Jared Casper
Bryan Catanzaro
MoE
247
1,836
0
17 Sep 2019
1