v1v2 (latest)

On the State of the Art of Evaluation in Neural Language Models

18 July 2017

Papers citing "On the State of the Art of Evaluation in Neural Language Models"

50 / 190 papers shown

Title
On the Practical Ability of Recurrent Neural Networks to Recognize Hierarchical Languages S. Bhattamishra Kabir Ahuja Navin Goyal ReLM 73 13 0 08 Nov 2020
Improving Low Compute Language Modeling with In-Domain Embedding Initialisation Charles F Welch Rada Mihalcea Jonathan K. Kummerfeld AI4CE 70 4 0 29 Sep 2020
Multi-timescale Representation Learning in LSTM Language Models Shivangi Mahto Vy A. Vo Javier S. Turek Alexander G. Huth 61 31 0 27 Sep 2020
Black Magic in Deep Learning: How Human Skill Impacts Network Training Kanav Anand Ziqi Wang Marco Loog Jan van Gemert HAI 70 16 0 13 Aug 2020
Shopping in the Multiverse: A Counterfactual Approach to In-Session Attribution Jacopo Tagliabue Bingqing Yu 70 6 0 20 Jul 2020
Hyperparameter Selection for Offline Reinforcement Learning T. Paine Cosmin Paduraru Andrea Michi Çağlar Gülçehre Konrad Zolna Alexander Novikov Ziyun Wang Nando de Freitas GP OffRL 204 148 0 17 Jul 2020
Do Transformers Need Deep Long-Range Memory Jack W. Rae Ali Razavi RALM 81 41 0 07 Jul 2020
Evaluating the Performance of Reinforcement Learning Algorithms Scott M. Jordan Yash Chandak Daniel Cohen Mengxue Zhang Philip S. Thomas 73 47 0 30 Jun 2020
Green Machine Learning via Augmented Gaussian Processes and Multi-Information Source Optimization Antonio Candelieri R. Perego Francesco Archetti 47 19 0 25 Jun 2020
Multi-Source Deep Domain Adaptation with Weak Supervision for Time-Series Sensor Data Garrett Wilson J. Doppa D. Cook TTA AI4TS AI4CE 91 149 0 22 May 2020
Deep Learning and Knowledge-Based Methods for Computer Aided Molecular Design -- Toward a Unified Approach: State-of-the-Art and Future Directions Abdulelah S. Alshehri R. Gani Fengqi You AI4CE 98 86 0 18 May 2020
Pretraining Federated Text Models for Next Word Prediction Joel Stremmel Arjun Singh FedML 99 53 0 11 May 2020
Don't Use English Dev: On the Zero-Shot Cross-Lingual Evaluation of Contextual Embeddings Phillip Keung Y. Lu Julian Salazar Vikas Bhardwaj 93 14 0 30 Apr 2020
Sequence Model Design for Code Completion in the Modern IDE Gareth Ari Aye Gail E. Kaiser 60 30 0 10 Apr 2020
DynaBERT: Dynamic BERT with Adaptive Width and Depth Lu Hou Zhiqi Huang Lifeng Shang Xin Jiang Xiao Chen Qun Liu MQ 95 323 0 08 Apr 2020
Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program) Joelle Pineau Philippe Vincent-Lamarre Koustuv Sinha V. Larivière A. Beygelzimer Florence dÁlché-Buc E. Fox Hugo Larochelle 143 363 0 27 Mar 2020
Hyper-Parameter Optimization: A Review of Algorithms and Applications Tong Yu Hong Zhu AAML 99 541 0 12 Mar 2020
Towards CRISP-ML(Q): A Machine Learning Process Model with Quality Assurance Methodology Stefan Studer T. Bui C. Drescher A. Hanuschkin Ludwig Winkler S. Peters Klaus-Robert Muller 133 181 0 11 Mar 2020
Iterative Averaging in the Quest for Best Test Error Diego Granziol Xingchen Wan Samuel Albanie Stephen J. Roberts 76 3 0 02 Mar 2020
The Implicit and Explicit Regularization Effects of Dropout Colin Wei Sham Kakade Tengyu Ma 131 118 0 28 Feb 2020
Using a thousand optimization tasks to learn hyperparameter search strategies Luke Metz Niru Maheswaranathan Ruoxi Sun C. Freeman Ben Poole Jascha Narain Sohl-Dickstein 126 46 0 27 Feb 2020
Quantized Neural Network Inference with Precision Batching Maximilian Lam Zachary Yedidia Colby R. Banbury Vijay Janapa Reddi MQ 45 1 0 26 Feb 2020
Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity Thomas Miconi Aditya Rawal Jeff Clune Kenneth O. Stanley 80 90 0 24 Feb 2020
Fine-Tuning Pretrained Language Models: Weight Initializations, Data Orders, and Early Stopping Jesse Dodge Gabriel Ilharco Roy Schwartz Ali Farhadi Hannaneh Hajishirzi Noah A. Smith 110 598 0 15 Feb 2020
Transformer on a Diet Chenguang Wang Zihao Ye Aston Zhang Zheng Zhang Alex Smola 93 8 0 14 Feb 2020
Deep Learning for Source Code Modeling and Generation: Models, Applications and Challenges T. H. Le Hao Chen Muhammad Ali Babar VLM 147 155 0 13 Feb 2020
Provably Efficient Online Hyperparameter Optimization with Population-Based Bandits Jack Parker-Holder Vu Nguyen Stephen J. Roberts OffRL 161 86 0 06 Feb 2020
Consistency of a Recurrent Language Model With Respect to Incomplete Decoding Sean Welleck Ilia Kulikov Jaedeok Kim Richard Yuanzhe Pang Kyunghyun Cho 91 67 0 06 Feb 2020
Big-Data Science in Porous Materials: Materials Genomics and Machine Learning Kevin Maik Jablonka D. Ongari S. M. Moosavi B. Smit AI4CE 85 365 0 18 Jan 2020
Individual predictions matter: Assessing the effect of data ordering in training fine-tuned CNNs for medical imaging J. Zech Jessica Zosa Forde Michael L. Littman 38 5 0 08 Dec 2019
How Can We Know What Language Models Know? Zhengbao Jiang Frank F. Xu Jun Araki Graham Neubig KELM 213 1,416 0 28 Nov 2019
DeFINE: DEep Factorized INput Token Embeddings for Neural Sequence Modeling Sachin Mehta Rik Koncel-Kedziorski Mohammad Rastegari Hannaneh Hajishirzi AI4TS 117 23 0 27 Nov 2019
Relevance-Promoting Language Model for Short-Text Conversation Xin Li Piji Li Wei Bi Xiaojiang Liu Wai Lam 78 11 0 26 Nov 2019
AutoShrink: A Topology-aware NAS for Discovering Efficient Neural Architecture Tunhou Zhang Hsin-Pai Cheng Zhenwen Li Feng Yan Chengyu Huang H. Li Yiran Chen 62 9 0 21 Nov 2019
How Decoding Strategies Affect the Verifiability of Generated Text Luca Massarelli Fabio Petroni Aleksandra Piktus Myle Ott Tim Rocktaschel Vassilis Plachouras Fabrizio Silvestri Sebastian Riedel 137 50 0 09 Nov 2019
Structured Pruning of Large Language Models Ziheng Wang Jeremy Wohlwend Tao Lei 96 293 0 10 Oct 2019
Probabilistic Rollouts for Learning Curve Extrapolation Across Hyperparameter Settings Matilde Gargiani Aaron Klein Stefan Falkner Frank Hutter BDL 70 12 0 10 Oct 2019
Two Birds, One Stone: A Simple, Unified Model for Text Generation from Structured and Unstructured Data H. Shahidi Ming Li Jimmy J. Lin LMTD 68 14 0 23 Sep 2019
DECoVaC: Design of Experiments with Controlled Variability Components Thomas Boquet Laure Delisle Denis Kochetkov Nathan Schucher Parmida Atighehchian Boris N. Oreshkin Julien Cornebise 55 1 0 21 Sep 2019
Goal-Embedded Dual Hierarchical Model for Task-Oriented Dialogue Generation Yi-An Lai Arshit Gupta Yi Zhang 46 1 0 19 Sep 2019
Show Your Work: Improved Reporting of Experimental Results Jesse Dodge Suchin Gururangan Dallas Card Roy Schwartz Noah A. Smith 78 255 0 06 Sep 2019
Deep Equilibrium Models Shaojie Bai J. Zico Kolter V. Koltun 114 674 0 03 Sep 2019
Language Models as Knowledge Bases? Fabio Petroni Tim Rocktaschel Patrick Lewis A. Bakhtin Yuxiang Wu Alexander H. Miller Sebastian Riedel KELM AI4MH 603 2,681 0 03 Sep 2019
On the Effectiveness of Low-Rank Matrix Factorization for LSTM Model Compression Genta Indra Winata Andrea Madotto Jamin Shin Elham J. Barezi Pascale Fung 64 29 0 27 Aug 2019
Green AI Roy Schwartz Jesse Dodge Noah A. Smith Oren Etzioni 148 1,164 0 22 Jul 2019
Reproducibility in Machine Learning for Health Matthew B. A. McDermott Shirly Wang N. Marinsek Rajesh Ranganath Marzyeh Ghassemi L. Foschini AI4TS 46 53 0 02 Jul 2019
Kite: Automatic speech recognition for unmanned aerial vehicles Dan Oneaţă H. Cucu 40 13 0 02 Jul 2019
Evaluating Computational Language Models with Scaling Properties of Natural Language Shuntaro Takahashi Kumiko Tanaka-Ishii 60 27 0 22 Jun 2019
Character n-gram Embeddings to Improve RNN Language Models Sho Takase Jun Suzuki Masaaki Nagata 69 25 0 13 Jun 2019
Automated Machine Learning: State-of-The-Art and Open Challenges Radwa El Shawi Mohamed Maher Sherif Sakr 58 162 0 05 Jun 2019