Benchmark datasets driving artificial intelligence development fail to capture the needs of medical professionals

18 January 2022

Wolfgang Frühwirt

Matthias Samwald

Papers citing "Benchmark datasets driving artificial intelligence development fail to capture the needs of medical professionals"

Title | Authors | Tags | Metrics | Date
Enhanced prediction of spine surgery outcomes using advanced machine learning techniques and oversampling methods | J. Benítez-Andrades, C. Prada-García, Nicolás Ordás-Reyes, Marta Esteban Blanco, Alicia Merayo, Antonio Serrano-García | - | 59 1 0 | 23 Mar 2025
Can We Trust AI Benchmarks? An Interdisciplinary Review of Current Issues in AI Evaluation | Maria Eriksson, Erasmo Purificato, Arman Noroozian, Joao Vinagre, Guillaume Chaslot, Emilia Gomez, David Fernandez-Llorca | ELM | 198 2 0 | 10 Feb 2025
Reduced, Reused and Recycled: The Life of a Dataset in Machine Learning Research | Bernard Koch, Emily L. Denton, A. Hanna, J. Foster | - | 79 146 0 | 03 Dec 2021
A curated, ontology-based, large-scale knowledge graph of artificial intelligence tasks and benchmarks | Kathrin Blagec, A. Barbosa-Silva, Simon Ott, Matthias Samwald | - | 30 26 0 | 04 Oct 2021
Benchmarking for Biomedical Natural Language Processing Tasks with a Domain Specific ALBERT | Usman Naseem, A. Dunn, Matloob Khushi, Jinman Kim | OOD, LM&MA, AI4MH | 66 43 0 | 09 Jul 2021
What Will it Take to Fix Benchmarking in Natural Language Understanding? | Samuel R. Bowman, George E. Dahl | ELM, ALM | 50 159 0 | 05 Apr 2021
A Survey on Deep Learning and Explainability for Automatic Report Generation from Medical Images | Pablo Messina, Pablo Pino, Denis Parra, Alvaro Soto, Cecilia Besa, S. Uribe, Marcelo Andía, C. Tejos, Claudia Prieto, Daniel Capurro | MedIm | 56 63 0 | 20 Oct 2020
SLEDGE-Z: A Zero-Shot Baseline for COVID-19 Literature Search | Sean MacAvaney, Arman Cohan, Nazli Goharian | - | 29 21 0 | 12 Oct 2020
Repurposing TREC-COVID Annotations to Answer the Key Questions of CORD-19 | Connor T. Heaton, P. Mitra | - | 12 2 0 | 27 Aug 2020
The Future of Digital Health with Federated Learning | Nicola Rieke, Jonny Hancox, Wenqi Li, Fausto Milletari, H. Roth, ..., Ronald M. Summers, Andrew Trask, Daguang Xu, Maximilian Baust, M. Jorge Cardoso | OOD | 239 1,746 0 | 18 Mar 2020
PathVQA: 30000+ Questions for Medical Visual Question Answering | Xuehai He, Yichen Zhang, Luntian Mou, Eric Xing, P. Xie | LM&MA | 38 230 0 | 07 Mar 2020
Towards better healthcare: What could and should be automated? | Wolfgang Frühwirt, Paul Duckworth | - | 55 17 0 | 21 Oct 2019
emrQA: A Large Corpus for Question Answering on Electronic Medical Records | Anusri Pampari, Preethi Raghavan, Jennifer J. Liang, Jian-wei Peng | - | 52 203 0 | 03 Sep 2018
Datasheets for Datasets | Timnit Gebru, Jamie Morgenstern, Briana Vecchione, Jennifer Wortman Vaughan, Hanna M. Wallach, Hal Daumé, Kate Crawford | - | 219 2,158 0 | 23 Mar 2018
An Empirical Evaluation of Deep Learning for ICD-9 Code Assignment using MIMIC-III Clinical Notes | Jinmiao Huang, C. Osorio, Luke Wicent Sy | - | 42 111 0 | 07 Feb 2018
On the Automatic Generation of Medical Imaging Reports | Baoyu Jing, P. Xie, Eric Xing | MedIm | 52 509 0 | 22 Nov 2017
Classification of Radiology Reports Using Neural Attention Models | Bonggun Shin, F. Chokshi, Timothy Lee, Jinho Choi | - | 59 48 0 | 22 Aug 2017