On the State of the Art of Evaluation in Neural Language Models

18 July 2017
Gábor Melis
Chris Dyer
Phil Blunsom
arXiv:1707.05589

Papers citing "On the State of the Art of Evaluation in Neural Language Models"

50 / 190 papers shown
Transient Dynamics in Lattices of Differentiating Ring Oscillators
Peter DelMastro
Arjun Karuvally
Hananel Hazan
H. Siegelmann
E. Rietman
29
0
0
08 Jun 2025
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin
Yuncheng Yang
Pengcheng Guo
Gang Li
Hang Shao
Yuchen Shi
Zihan Xu
Yun Gu
Ke Li
Xing Sun
ALM
213
13
0
31 Dec 2024
Regularized Gradient Clipping Provably Trains Wide and Deep Neural Networks
Matteo Tucat
Anirbit Mukherjee
Procheta Sen
Mingfei Sun
Omar Rivasplata
MLT
93
1
0
12 Apr 2024
A Controlled Reevaluation of Coreference Resolution Models
Ian Porada
Xiyuan Zou
Jackie Chi Kit Cheung
92
1
0
31 Mar 2024
Large Language Model Agent for Hyper-Parameter Optimization
Siyi Liu
Chen Gao
Yong Li
135
23
0
02 Feb 2024
Autocompletion of Chief Complaints in the Electronic Health Records using Large Language Models
K M Sajjadul Islam
Ayesha Siddika Nipu
Praveen Madiraju
Priya Deshpande
LM&MA
71
7
0
11 Jan 2024
Quantifying the Uniqueness of Donald Trump in Presidential Discourse
Karen Zhou
Alexander A. Meitus
Milo Chase
Grace Wang
Anne Mykland
William Howell
Chenhao Tan
38
1
0
02 Jan 2024
An Ensemble Approach to Personalized Real Time Predictive Writing for Experts
Sourav Prosad
Viswa Datha Polavarapu
Shrutendra Harsola
52
0
0
25 Aug 2023
Text Analysis Using Deep Neural Networks in Digital Humanities and Information Science
Omri Suissa
Avshalom Elmalech
M. Zhitomirsky-Geffet
AI4CE
57
48
0
30 Jul 2023
Decoding ChatGPT: A Taxonomy of Existing Research, Current Challenges, and Possible Future Directions
S. Sohail
Faiza Farhat
Yassine Himeur
Mohammad Nadeem
D. Madsen
Yashbir Singh
Shadi Atalla
W. Mansoor
106
123
0
26 Jul 2023
Hierarchical Attention Encoder Decoder
Asier Mujika
BDL
62
3
0
01 Jun 2023
Measuring Data
Margaret Mitchell
A. Luccioni
Nathan Lambert
Marissa Gerchick
Angelina McMillan-Major
Ezinwanne Ozoani
Nazneen Rajani
Tristan Thrush
Yacine Jernite
Douwe Kiela
95
17
0
09 Dec 2022
Bridging the Training-Inference Gap for Dense Phrase Retrieval
Gyuwan Kim
Jinhyuk Lee
Barlas Oğuz
Wenhan Xiong
Yizhe Zhang
Yashar Mehdad
William Yang Wang
84
2
0
25 Oct 2022
Reproducibility of the Methods in Medical Imaging with Deep Learning
A. Simkó
A. Garpebring
J. Jonsson
T. Nyholm
Tommy Löfstedt
OOD
46
10
0
20 Oct 2022
Diffusion Models: A Comprehensive Survey of Methods and Applications
Ling Yang
Zhilong Zhang
Yingxia Shao
Shenda Hong
Runsheng Xu
Yue Zhao
Wentao Zhang
Tengjiao Wang
Ming-Hsuan Yang
DiffM, MedIm
551
1,428
0
02 Sep 2022
Local Byte Fusion for Neural Machine Translation
Makesh Narsimhan Sreedhar
Xiangpeng Wan
Yu-Jie Cheng
Junjie Hu
106
4
0
23 May 2022
Sources of Irreproducibility in Machine Learning: A Review
Odd Erik Gundersen
Kevin Coakley
Christine R. Kirkpatrick
Yolanda Gil
SyDa
133
34
0
15 Apr 2022
A Survey on Dropout Methods and Experimental Verification in Recommendation
Yongqian Li
Weizhi Ma
C. L. Philip Chen
Hao Fei
Yiqun Liu
Shaoping Ma
Yue Yang
92
11
0
05 Apr 2022
The worst of both worlds: A comparative analysis of errors in learning from data in psychology and machine learning
Jessica Hullman
Sayash Kapoor
Priyanka Nanayakkara
Andrew Gelman
Arvind Narayanan
147
39
0
12 Mar 2022
Improving Baselines in the Wild
Kazuki Irie
Imanol Schlag
Róbert Csordás
Jürgen Schmidhuber
49
3
0
31 Dec 2021
Evaluating deep transfer learning for whole-brain cognitive decoding
A. Thomas
U. Lindenberger
Wojciech Samek
K. Müller
AI4CE
62
12
0
01 Nov 2021
A Systematic Investigation of Commonsense Knowledge in Large Language Models
Xiang Lorraine Li
A. Kuncoro
Jordan Hoffmann
Cyprien de Masson d'Autume
Phil Blunsom
Aida Nematzadeh
LRM
104
59
0
31 Oct 2021
GNN-LM: Language Modeling based on Global Contexts via GNN
Yuxian Meng
Shi Zong
Xiaoya Li
Xiaofei Sun
Tianwei Zhang
Leilei Gan
Jiwei Li
LRM
127
39
0
17 Oct 2021
Autoregressive Diffusion Models
Emiel Hoogeboom
Alexey A. Gritsenko
Jasmijn Bastings
Ben Poole
Rianne van den Berg
Tim Salimans
DiffM
127
155
0
05 Oct 2021
Simple Recurrent Neural Networks is all we need for clinical events predictions using EHR data
L. Rasmy
Jie Zhu
Zhiheng Li
Xin Hao
Hong Thoai Nga Tran
Yujia Zhou
Firat Tiryaki
Yang Xiang
Hua Xu
Degui Zhi
29
17
0
03 Oct 2021
Expected Validation Performance and Estimation of a Random Variable's Maximum
Jesse Dodge
Suchin Gururangan
Dallas Card
Roy Schwartz
Noah A. Smith
102
9
0
01 Oct 2021
Language Models as a Knowledge Source for Cognitive Agents
R. Wray
James R. Kirk
John E. Laird
57
15
0
17 Sep 2021
HPOBench: A Collection of Reproducible Multi-Fidelity Benchmark Problems for HPO
Katharina Eggensperger
Philipp Müller
Neeratyoy Mallik
Matthias Feurer
René Sass
Aaron Klein
Noor H. Awad
Marius Lindauer
Frank Hutter
243
104
0
14 Sep 2021
Deep Reinforcement Learning at the Edge of the Statistical Precipice
Rishabh Agarwal
Max Schwarzer
Pablo Samuel Castro
Aaron Courville
Marc G. Bellemare
OffRL
196
680
0
30 Aug 2021
Challenges for cognitive decoding using deep learning methods
A. Thomas
Christopher Ré
R. Poldrack
AI4CE
63
6
0
16 Aug 2021
Using AntiPatterns to avoid MLOps Mistakes
Nikhil Muralidhar
Sathappan Muthiah
P. Butler
Manish Jain
Yu Yu
...
Weipeng Li
David Jones
P. Arunachalam
Hays Mccormick
Naren Ramakrishnan
61
17
0
30 Jun 2021
Randomness In Neural Network Training: Characterizing The Impact of Tooling
Donglin Zhuang
Xingyao Zhang
Shuaiwen Leon Song
Sara Hooker
87
78
0
22 Jun 2021
Well-tuned Simple Nets Excel on Tabular Datasets
Arlind Kadra
Marius Lindauer
Frank Hutter
Josif Grabocka
68
202
0
21 Jun 2021
Context-Aware Legal Citation Recommendation using Deep Learning
Zihan Huang
Charles Low
Mengqiu Teng
Hongyi Zhang
Daniel E. Ho
M. Krass
Matthias Grabmair
AILaw, HAI
73
39
0
20 Jun 2021
Uncertainty Baselines: Benchmarks for Uncertainty & Robustness in Deep Learning
Zachary Nado
Neil Band
Mark Collier
Josip Djolonga
Michael W. Dusenberry
...
D. Sculley
Balaji Lakshminarayanan
Jasper Snoek
Y. Gal
Dustin Tran
UQCV, ELM
133
96
0
07 Jun 2021
Modeling the Unigram Distribution
Irene Nikkarinen
Tiago Pimentel
Damián E. Blasi
Ryan Cotterell
46
8
0
04 Jun 2021
ByT5: Towards a token-free future with pre-trained byte-to-byte models
Linting Xue
Aditya Barua
Noah Constant
Rami Al-Rfou
Sharan Narang
Mihir Kale
Adam Roberts
Colin Raffel
158
509
0
28 May 2021
A Cognitive Regularizer for Language Modeling
Jason W. Wei
Clara Meister
Ryan Cotterell
77
21
0
15 May 2021
Dispatcher: A Message-Passing Approach To Language Modelling
A. Cetoli
84
0
0
09 May 2021
Perspectives on Machine Learning from Psychology's Reproducibility Crisis
Samuel J. Bell
Onno P. Kampman
61
15
0
18 Apr 2021
Quick Learner Automated Vehicle Adapting its Roadmanship to Varying Traffic Cultures with Meta Reinforcement Learning
Songan Zhang
Lu Wen
H. Peng
H. E. Tseng
41
10
0
18 Apr 2021
Lessons on Parameter Sharing across Layers in Transformers
Sho Takase
Shun Kiyono
111
87
0
13 Apr 2021
Is it enough to optimize CNN architectures on ImageNet?
Lukas Tuggener
Jürgen Schmidhuber
Thilo Stadelmann
86
23
0
16 Mar 2021
Accounting for Variance in Machine Learning Benchmarks
Xavier Bouthillier
Pierre Delaunay
Mirko Bronzi
Assya Trofimov
Brennan Nichyporuk
...
Dmitriy Serdyuk
Tal Arbel
C. Pal
Gaël Varoquaux
Pascal Vincent
120
152
0
01 Mar 2021
On the Importance of Hyperparameter Optimization for Model-based Reinforcement Learning
Baohe Zhang
Raghunandan Rajan
Luis Pineda
Nathan Lambert
André Biedenkapp
Kurtland Chua
Frank Hutter
Roberto Calandra
126
103
0
26 Feb 2021
Reproducibility in Evolutionary Computation
Manuel López-Ibáñez
Juergen Branke
L. Paquete
149
32
0
05 Feb 2021
PyGlove: Symbolic Programming for Automated Machine Learning
Daiyi Peng
Xuanyi Dong
Esteban Real
Mingxing Tan
Yifeng Lu
Hanxiao Liu
Gabriel Bender
Adam Kraft
Chen Liang
Quoc V. Le
61
31
0
21 Jan 2021
Hyperboost: Hyperparameter Optimization by Gradient Boosting surrogate models
Jeroen van Hoof
Joaquin Vanschoren
BDL
72
10
0
06 Jan 2021
A Population-based Hybrid Approach to Hyperparameter Optimization for Neural Networks
Marcello Serqueira
Israel Mendonça
Eduardo Bezerra
61
23
0
22 Nov 2020
Learning Associative Inference Using Fast Weight Memory
Imanol Schlag
Tsendsuren Munkhdalai
Jürgen Schmidhuber
KELM
67
47
0
16 Nov 2020