1707.05589
On the State of the Art of Evaluation in Neural Language Models
18 July 2017
Gábor Melis
Chris Dyer
Phil Blunsom
Papers citing
"On the State of the Art of Evaluation in Neural Language Models"
50 / 190 papers shown
On the Practical Ability of Recurrent Neural Networks to Recognize Hierarchical Languages
S. Bhattamishra
Kabir Ahuja
Navin Goyal
ReLM
73
13
0
08 Nov 2020
Improving Low Compute Language Modeling with In-Domain Embedding Initialisation
Charles F Welch
Rada Mihalcea
Jonathan K. Kummerfeld
AI4CE
70
4
0
29 Sep 2020
Multi-timescale Representation Learning in LSTM Language Models
Shivangi Mahto
Vy A. Vo
Javier S. Turek
Alexander G. Huth
61
31
0
27 Sep 2020
Black Magic in Deep Learning: How Human Skill Impacts Network Training
Kanav Anand
Ziqi Wang
Marco Loog
Jan van Gemert
HAI
70
16
0
13 Aug 2020
Shopping in the Multiverse: A Counterfactual Approach to In-Session Attribution
Jacopo Tagliabue
Bingqing Yu
70
6
0
20 Jul 2020
Hyperparameter Selection for Offline Reinforcement Learning
T. Paine
Cosmin Paduraru
Andrea Michi
Çağlar Gülçehre
Konrad Zolna
Alexander Novikov
Ziyun Wang
Nando de Freitas
GP
OffRL
204
148
0
17 Jul 2020
Do Transformers Need Deep Long-Range Memory?
Jack W. Rae
Ali Razavi
RALM
81
41
0
07 Jul 2020
Evaluating the Performance of Reinforcement Learning Algorithms
Scott M. Jordan
Yash Chandak
Daniel Cohen
Mengxue Zhang
Philip S. Thomas
73
47
0
30 Jun 2020
Green Machine Learning via Augmented Gaussian Processes and Multi-Information Source Optimization
Antonio Candelieri
R. Perego
Francesco Archetti
47
19
0
25 Jun 2020
Multi-Source Deep Domain Adaptation with Weak Supervision for Time-Series Sensor Data
Garrett Wilson
J. Doppa
D. Cook
TTA
AI4TS
AI4CE
91
149
0
22 May 2020
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular Design -- Toward a Unified Approach: State-of-the-Art and Future Directions
Abdulelah S. Alshehri
R. Gani
Fengqi You
AI4CE
98
86
0
18 May 2020
Pretraining Federated Text Models for Next Word Prediction
Joel Stremmel
Arjun Singh
FedML
99
53
0
11 May 2020
Don't Use English Dev: On the Zero-Shot Cross-Lingual Evaluation of Contextual Embeddings
Phillip Keung
Y. Lu
Julian Salazar
Vikas Bhardwaj
93
14
0
30 Apr 2020
Sequence Model Design for Code Completion in the Modern IDE
Gareth Ari Aye
Gail E. Kaiser
60
30
0
10 Apr 2020
DynaBERT: Dynamic BERT with Adaptive Width and Depth
Lu Hou
Zhiqi Huang
Lifeng Shang
Xin Jiang
Xiao Chen
Qun Liu
MQ
95
323
0
08 Apr 2020
Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program)
Joelle Pineau
Philippe Vincent-Lamarre
Koustuv Sinha
V. Larivière
A. Beygelzimer
Florence d'Alché-Buc
E. Fox
Hugo Larochelle
143
363
0
27 Mar 2020
Hyper-Parameter Optimization: A Review of Algorithms and Applications
Tong Yu
Hong Zhu
AAML
99
541
0
12 Mar 2020
Towards CRISP-ML(Q): A Machine Learning Process Model with Quality Assurance Methodology
Stefan Studer
T. Bui
C. Drescher
A. Hanuschkin
Ludwig Winkler
S. Peters
Klaus-Robert Müller
133
181
0
11 Mar 2020
Iterative Averaging in the Quest for Best Test Error
Diego Granziol
Xingchen Wan
Samuel Albanie
Stephen J. Roberts
76
3
0
02 Mar 2020
The Implicit and Explicit Regularization Effects of Dropout
Colin Wei
Sham Kakade
Tengyu Ma
131
118
0
28 Feb 2020
Using a thousand optimization tasks to learn hyperparameter search strategies
Luke Metz
Niru Maheswaranathan
Ruoxi Sun
C. Freeman
Ben Poole
Jascha Narain Sohl-Dickstein
126
46
0
27 Feb 2020
Quantized Neural Network Inference with Precision Batching
Maximilian Lam
Zachary Yedidia
Colby R. Banbury
Vijay Janapa Reddi
MQ
45
1
0
26 Feb 2020
Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity
Thomas Miconi
Aditya Rawal
Jeff Clune
Kenneth O. Stanley
80
90
0
24 Feb 2020
Fine-Tuning Pretrained Language Models: Weight Initializations, Data Orders, and Early Stopping
Jesse Dodge
Gabriel Ilharco
Roy Schwartz
Ali Farhadi
Hannaneh Hajishirzi
Noah A. Smith
110
598
0
15 Feb 2020
Transformer on a Diet
Chenguang Wang
Zihao Ye
Aston Zhang
Zheng Zhang
Alex Smola
93
8
0
14 Feb 2020
Deep Learning for Source Code Modeling and Generation: Models, Applications and Challenges
T. H. Le
Hao Chen
Muhammad Ali Babar
VLM
147
155
0
13 Feb 2020
Provably Efficient Online Hyperparameter Optimization with Population-Based Bandits
Jack Parker-Holder
Vu Nguyen
Stephen J. Roberts
OffRL
161
86
0
06 Feb 2020
Consistency of a Recurrent Language Model With Respect to Incomplete Decoding
Sean Welleck
Ilia Kulikov
Jaedeok Kim
Richard Yuanzhe Pang
Kyunghyun Cho
91
67
0
06 Feb 2020
Big-Data Science in Porous Materials: Materials Genomics and Machine Learning
Kevin Maik Jablonka
D. Ongari
S. M. Moosavi
B. Smit
AI4CE
85
365
0
18 Jan 2020
Individual predictions matter: Assessing the effect of data ordering in training fine-tuned CNNs for medical imaging
J. Zech
Jessica Zosa Forde
Michael L. Littman
38
5
0
08 Dec 2019
How Can We Know What Language Models Know?
Zhengbao Jiang
Frank F. Xu
Jun Araki
Graham Neubig
KELM
213
1,416
0
28 Nov 2019
DeFINE: DEep Factorized INput Token Embeddings for Neural Sequence Modeling
Sachin Mehta
Rik Koncel-Kedziorski
Mohammad Rastegari
Hannaneh Hajishirzi
AI4TS
117
23
0
27 Nov 2019
Relevance-Promoting Language Model for Short-Text Conversation
Xin Li
Piji Li
Wei Bi
Xiaojiang Liu
Wai Lam
78
11
0
26 Nov 2019
AutoShrink: A Topology-aware NAS for Discovering Efficient Neural Architecture
Tunhou Zhang
Hsin-Pai Cheng
Zhenwen Li
Feng Yan
Chengyu Huang
H. Li
Yiran Chen
62
9
0
21 Nov 2019
How Decoding Strategies Affect the Verifiability of Generated Text
Luca Massarelli
Fabio Petroni
Aleksandra Piktus
Myle Ott
Tim Rocktäschel
Vassilis Plachouras
Fabrizio Silvestri
Sebastian Riedel
137
50
0
09 Nov 2019
Structured Pruning of Large Language Models
Ziheng Wang
Jeremy Wohlwend
Tao Lei
96
293
0
10 Oct 2019
Probabilistic Rollouts for Learning Curve Extrapolation Across Hyperparameter Settings
Matilde Gargiani
Aaron Klein
Stefan Falkner
Frank Hutter
BDL
70
12
0
10 Oct 2019
Two Birds, One Stone: A Simple, Unified Model for Text Generation from Structured and Unstructured Data
H. Shahidi
Ming Li
Jimmy J. Lin
LMTD
68
14
0
23 Sep 2019
DECoVaC: Design of Experiments with Controlled Variability Components
Thomas Boquet
Laure Delisle
Denis Kochetkov
Nathan Schucher
Parmida Atighehchian
Boris N. Oreshkin
Julien Cornebise
55
1
0
21 Sep 2019
Goal-Embedded Dual Hierarchical Model for Task-Oriented Dialogue Generation
Yi-An Lai
Arshit Gupta
Yi Zhang
46
1
0
19 Sep 2019
Show Your Work: Improved Reporting of Experimental Results
Jesse Dodge
Suchin Gururangan
Dallas Card
Roy Schwartz
Noah A. Smith
78
255
0
06 Sep 2019
Deep Equilibrium Models
Shaojie Bai
J. Zico Kolter
V. Koltun
114
674
0
03 Sep 2019
Language Models as Knowledge Bases?
Fabio Petroni
Tim Rocktäschel
Patrick Lewis
A. Bakhtin
Yuxiang Wu
Alexander H. Miller
Sebastian Riedel
KELM
AI4MH
603
2,681
0
03 Sep 2019
On the Effectiveness of Low-Rank Matrix Factorization for LSTM Model Compression
Genta Indra Winata
Andrea Madotto
Jamin Shin
Elham J. Barezi
Pascale Fung
64
29
0
27 Aug 2019
Green AI
Roy Schwartz
Jesse Dodge
Noah A. Smith
Oren Etzioni
148
1,164
0
22 Jul 2019
Reproducibility in Machine Learning for Health
Matthew B. A. McDermott
Shirly Wang
N. Marinsek
Rajesh Ranganath
Marzyeh Ghassemi
L. Foschini
AI4TS
46
53
0
02 Jul 2019
Kite: Automatic speech recognition for unmanned aerial vehicles
Dan Oneaţă
H. Cucu
40
13
0
02 Jul 2019
Evaluating Computational Language Models with Scaling Properties of Natural Language
Shuntaro Takahashi
Kumiko Tanaka-Ishii
60
27
0
22 Jun 2019
Character n-gram Embeddings to Improve RNN Language Models
Sho Takase
Jun Suzuki
Masaaki Nagata
69
25
0
13 Jun 2019
Automated Machine Learning: State-of-The-Art and Open Challenges
Radwa El Shawi
Mohamed Maher
Sherif Sakr
58
162
0
05 Jun 2019