v1v2 (latest)

On the State of the Art of Evaluation in Neural Language Models

18 July 2017

Papers citing "On the State of the Art of Evaluation in Neural Language Models"

50 / 190 papers shown

Title
Transient Dynamics in Lattices of Differentiating Ring Oscillators Peter DelMastro Arjun Karuvally Hananel Hazan H. Siegelmann E. Rietman 29 0 0 08 Jun 2025
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models Yulei Qin Yuncheng Yang Pengcheng Guo Gang Li Hang Shao Yuchen Shi Zihan Xu Yun Gu Ke Li Xing Sun ALM 213 13 0 31 Dec 2024
Regularized Gradient Clipping Provably Trains Wide and Deep Neural Networks Matteo Tucat Anirbit Mukherjee Procheta Sen Mingfei Sun Omar Rivasplata MLT 93 1 0 12 Apr 2024
A Controlled Reevaluation of Coreference Resolution Models Ian Porada Xiyuan Zou Jackie Chi Kit Cheung 92 1 0 31 Mar 2024
Large Language Model Agent for Hyper-Parameter Optimization Siyi Liu Chen Gao Yong Li 135 23 0 02 Feb 2024
Autocompletion of Chief Complaints in the Electronic Health Records using Large Language Models K M Sajjadul Islam Ayesha Siddika Nipu Praveen Madiraju Priya Deshpande LM&MA 71 7 0 11 Jan 2024
Quantifying the Uniqueness of Donald Trump in Presidential Discourse Karen Zhou Alexander A. Meitus Milo Chase Grace Wang Anne Mykland William Howell Chenhao Tan 38 1 0 02 Jan 2024
An Ensemble Approach to Personalized Real Time Predictive Writing for Experts Sourav Prosad Viswa Datha Polavarapu Shrutendra Harsola 52 0 0 25 Aug 2023
Text Analysis Using Deep Neural Networks in Digital Humanities and Information Science Omri Suissa Avshalom Elmalech M. Zhitomirsky-Geffet AI4CE 57 48 0 30 Jul 2023
Decoding ChatGPT: A Taxonomy of Existing Research, Current Challenges, and Possible Future Directions S. Sohail Faiza Farhat Yassine Himeur Mohammad Nadeem D. Madsen Yashbir Singh Shadi Atalla W. Mansoor 106 123 0 26 Jul 2023
Hierarchical Attention Encoder Decoder Asier Mujika BDL 62 3 0 01 Jun 2023
Measuring Data Margaret Mitchell A. Luccioni Nathan Lambert Marissa Gerchick Angelina McMillan-Major Ezinwanne Ozoani Nazneen Rajani Tristan Thrush Yacine Jernite Douwe Kiela 95 17 0 09 Dec 2022
Bridging the Training-Inference Gap for Dense Phrase Retrieval Gyuwan Kim Jinhyuk Lee Barlas Oğuz Wenhan Xiong Yizhe Zhang Yashar Mehdad William Yang Wang 84 2 0 25 Oct 2022
Reproducibility of the Methods in Medical Imaging with Deep Learning A. Simkó A. Garpebring J. Jonsson T. Nyholm Tommy Löfstedt OOD 46 10 0 20 Oct 2022
Diffusion Models: A Comprehensive Survey of Methods and Applications Ling Yang Zhilong Zhang Yingxia Shao Shenda Hong Runsheng Xu Yue Zhao Wentao Zhang Tengjiao Wang Ming-Hsuan Yang DiffM MedIm 551 1,428 0 02 Sep 2022
Local Byte Fusion for Neural Machine Translation Makesh Narsimhan Sreedhar Xiangpeng Wan Yu-Jie Cheng Junjie Hu 106 4 0 23 May 2022
Sources of Irreproducibility in Machine Learning: A Review Odd Erik Gundersen Kevin Coakley Christine R. Kirkpatrick Yolanda Gil SyDa 133 34 0 15 Apr 2022
A Survey on Dropout Methods and Experimental Verification in Recommendation Yongqian Li Weizhi Ma C. L. Philip Chen Hao Fei Yiqun Liu Shaoping Ma Yue Yang 92 11 0 05 Apr 2022
The worst of both worlds: A comparative analysis of errors in learning from data in psychology and machine learning Jessica Hullman Sayash Kapoor Priyanka Nanayakkara Andrew Gelman Arvind Narayanan 147 39 0 12 Mar 2022
Improving Baselines in the Wild Kazuki Irie Imanol Schlag Róbert Csordás Jürgen Schmidhuber 49 3 0 31 Dec 2021
Evaluating deep transfer learning for whole-brain cognitive decoding A. Thomas U. Lindenberger Wojciech Samek K. Müller AI4CE 62 12 0 01 Nov 2021
A Systematic Investigation of Commonsense Knowledge in Large Language Models Xiang Lorraine Li A. Kuncoro Jordan Hoffmann Cyprien de Masson dÁutume Phil Blunsom Aida Nematzadeh LRM 104 59 0 31 Oct 2021
GNN-LM: Language Modeling based on Global Contexts via GNN Yuxian Meng Shi Zong Xiaoya Li Xiaofei Sun Tianwei Zhang Leilei Gan Jiwei Li LRM 127 39 0 17 Oct 2021
Autoregressive Diffusion Models Emiel Hoogeboom Alexey A. Gritsenko Jasmijn Bastings Ben Poole Rianne van den Berg Tim Salimans DiffM 127 155 0 05 Oct 2021
Simple Recurrent Neural Networks is all we need for clinical events predictions using EHR data L. Rasmy Jie Zhu Zhiheng Li Xin Hao Hong Thoai Nga Tran Yujia Zhou Firat Tiryaki Yang Xiang Hua Xu Degui Zhi 29 17 0 03 Oct 2021
Expected Validation Performance and Estimation of a Random Variable's Maximum Jesse Dodge Suchin Gururangan Dallas Card Roy Schwartz Noah A. Smith 102 9 0 01 Oct 2021
Language Models as a Knowledge Source for Cognitive Agents R. Wray James R. Kirk John E. Laird 57 15 0 17 Sep 2021
HPOBench: A Collection of Reproducible Multi-Fidelity Benchmark Problems for HPO Katharina Eggensperger Philip Muller Neeratyoy Mallik Matthias Feurer René Sass Aaron Klein Noor H. Awad Marius Lindauer Frank Hutter 243 104 0 14 Sep 2021
Deep Reinforcement Learning at the Edge of the Statistical Precipice Rishabh Agarwal Max Schwarzer Pablo Samuel Castro Aaron Courville Marc G. Bellemare OffRL 196 680 0 30 Aug 2021
Challenges for cognitive decoding using deep learning methods A. Thomas Christopher Ré R. Poldrack AI4CE 63 6 0 16 Aug 2021
Using AntiPatterns to avoid MLOps Mistakes Nikhil Muralidhar Sathappah Muthiah P. Butler Manish Jain Yu Yu ... Weipeng Li David Jones P. Arunachalam Hays Mccormick Naren Ramakrishnan 61 17 0 30 Jun 2021
Randomness In Neural Network Training: Characterizing The Impact of Tooling Donglin Zhuang Xingyao Zhang Shuaiwen Leon Song Sara Hooker 87 78 0 22 Jun 2021
Well-tuned Simple Nets Excel on Tabular Datasets Arlind Kadra Marius Lindauer Frank Hutter Josif Grabocka 68 202 0 21 Jun 2021
Context-Aware Legal Citation Recommendation using Deep Learning Zihan Huang Charles Low Mengqiu Teng Hongyi Zhang Daniel E. Ho M. Krass Matthias Grabmair AILaw HAI 73 39 0 20 Jun 2021
Uncertainty Baselines: Benchmarks for Uncertainty & Robustness in Deep Learning Zachary Nado Neil Band Mark Collier Josip Djolonga Michael W. Dusenberry ... D. Sculley Balaji Lakshminarayanan Jasper Snoek Y. Gal Dustin Tran UQCV ELM 133 96 0 07 Jun 2021
Modeling the Unigram Distribution Irene Nikkarinen Tiago Pimentel Damián E. Blasi Ryan Cotterell 46 8 0 04 Jun 2021
ByT5: Towards a token-free future with pre-trained byte-to-byte models Linting Xue Aditya Barua Noah Constant Rami Al-Rfou Sharan Narang Mihir Kale Adam Roberts Colin Raffel 158 509 0 28 May 2021
A Cognitive Regularizer for Language Modeling Jason W. Wei Clara Meister Ryan Cotterell 77 21 0 15 May 2021
Dispatcher: A Message-Passing Approach To Language Modelling A. Cetoli 84 0 0 09 May 2021
Perspectives on Machine Learning from Psychology's Reproducibility Crisis Samuel J. Bell Onno P. Kampman 61 15 0 18 Apr 2021
Quick Learner Automated Vehicle Adapting its Roadmanship to Varying Traffic Cultures with Meta Reinforcement Learning Songan Zhang Lu Wen H. Peng H. E. Tseng 41 10 0 18 Apr 2021
Lessons on Parameter Sharing across Layers in Transformers Sho Takase Shun Kiyono 111 87 0 13 Apr 2021
Is it enough to optimize CNN architectures on ImageNet? Lukas Tuggener Jürgen Schmidhuber Thilo Stadelmann 86 23 0 16 Mar 2021
Accounting for Variance in Machine Learning Benchmarks Xavier Bouthillier Pierre Delaunay Mirko Bronzi Assya Trofimov Brennan Nichyporuk ... Dmitriy Serdyuk Tal Arbel C. Pal Gaël Varoquaux Pascal Vincent 120 152 0 01 Mar 2021
On the Importance of Hyperparameter Optimization for Model-based Reinforcement Learning Bangnig Zhang Raghunandan Rajan Luis Pineda Nathan Lambert André Biedenkapp Kurtland Chua Frank Hutter Roberto Calandra 126 103 0 26 Feb 2021
Reproducibility in Evolutionary Computation Manuel López-Ibánez Juergen Branke L. Paquete 149 32 0 05 Feb 2021
PyGlove: Symbolic Programming for Automated Machine Learning Daiyi Peng Xuanyi Dong Esteban Real Mingxing Tan Yifeng Lu Hanxiao Liu Gabriel Bender Adam Kraft Chen Liang Quoc V. Le 61 31 0 21 Jan 2021
Hyperboost: Hyperparameter Optimization by Gradient Boosting surrogate models Jeroen van Hoof Joaquin Vanschoren BDL 72 10 0 06 Jan 2021
A Population-based Hybrid Approach to Hyperparameter Optimization for Neural Networks Marcello Serqueira Israel Mendonça Eduardo Bezerra 61 23 0 22 Nov 2020
Learning Associative Inference Using Fast Weight Memory Imanol Schlag Tsendsuren Munkhdalai Jürgen Schmidhuber KELM 67 47 0 16 Nov 2020