2201.07040
Cited By
Benchmark datasets driving artificial intelligence development fail to capture the needs of medical professionals
18 January 2022
Kathrin Blagec
J. Kraiger
Wolfgang Frühwirt
Matthias Samwald
AI4MH
Papers citing "Benchmark datasets driving artificial intelligence development fail to capture the needs of medical professionals"
17 / 17 papers shown
Enhanced prediction of spine surgery outcomes using advanced machine learning techniques and oversampling methods
J. Benítez-Andrades
C. Prada-García
Nicolás Ordás-Reyes
Marta Esteban Blanco
Alicia Merayo
Antonio Serrano-García
59
1
0
23 Mar 2025
Can We Trust AI Benchmarks? An Interdisciplinary Review of Current Issues in AI Evaluation
Maria Eriksson
Erasmo Purificato
Arman Noroozian
Joao Vinagre
Guillaume Chaslot
Emilia Gomez
David Fernandez-Llorca
ELM
198
2
0
10 Feb 2025
Reduced, Reused and Recycled: The Life of a Dataset in Machine Learning Research
Bernard Koch
Emily L. Denton
A. Hanna
J. Foster
79
146
0
03 Dec 2021
A curated, ontology-based, large-scale knowledge graph of artificial intelligence tasks and benchmarks
Kathrin Blagec
A. Barbosa-Silva
Simon Ott
Matthias Samwald
30
26
0
04 Oct 2021
Benchmarking for Biomedical Natural Language Processing Tasks with a Domain Specific ALBERT
Usman Naseem
A. Dunn
Matloob Khushi
Jinman Kim
OOD
LM&MA
AI4MH
66
43
0
09 Jul 2021
What Will it Take to Fix Benchmarking in Natural Language Understanding?
Samuel R. Bowman
George E. Dahl
ELM
ALM
50
159
0
05 Apr 2021
A Survey on Deep Learning and Explainability for Automatic Report Generation from Medical Images
Pablo Messina
Pablo Pino
Denis Parra
Alvaro Soto
Cecilia Besa
S. Uribe
Marcelo Andía
C. Tejos
Claudia Prieto
Daniel Capurro
MedIm
56
63
0
20 Oct 2020
SLEDGE-Z: A Zero-Shot Baseline for COVID-19 Literature Search
Sean MacAvaney
Arman Cohan
Nazli Goharian
29
21
0
12 Oct 2020
Repurposing TREC-COVID Annotations to Answer the Key Questions of CORD-19
Connor T. Heaton
P. Mitra
12
2
0
27 Aug 2020
The Future of Digital Health with Federated Learning
Nicola Rieke
Jonny Hancox
Wenqi Li
Fausto Milletari
H. Roth
...
Ronald M. Summers
Andrew Trask
Daguang Xu
Maximilian Baust
M. Jorge Cardoso
OOD
239
1,746
0
18 Mar 2020
PathVQA: 30000+ Questions for Medical Visual Question Answering
Xuehai He
Yichen Zhang
Luntian Mou
Eric Xing
P. Xie
LM&MA
38
230
0
07 Mar 2020
Towards better healthcare: What could and should be automated?
Wolfgang Frühwirt
Paul Duckworth
55
17
0
21 Oct 2019
emrQA: A Large Corpus for Question Answering on Electronic Medical Records
Anusri Pampari
Preethi Raghavan
Jennifer J. Liang
Jian-wei Peng
52
203
0
03 Sep 2018
Datasheets for Datasets
Timnit Gebru
Jamie Morgenstern
Briana Vecchione
Jennifer Wortman Vaughan
Hanna M. Wallach
Hal Daumé
Kate Crawford
219
2,158
0
23 Mar 2018
An Empirical Evaluation of Deep Learning for ICD-9 Code Assignment using MIMIC-III Clinical Notes
Jinmiao Huang
C. Osorio
Luke Wicent Sy
42
111
0
07 Feb 2018
On the Automatic Generation of Medical Imaging Reports
Baoyu Jing
P. Xie
Eric Xing
MedIm
52
509
0
22 Nov 2017
Classification of Radiology Reports Using Neural Attention Models
Bonggun Shin
F. Chokshi
Timothy Lee
Jinho Choi
59
48
0
22 Aug 2017