Hidden Stratification Causes Clinically Meaningful Failures in Machine Learning for Medical Imaging

27 September 2019
Luke Oakden-Rayner
Jared A. Dunnmon
G. Carneiro
Christopher Ré
    OOD

Papers citing "Hidden Stratification Causes Clinically Meaningful Failures in Machine Learning for Medical Imaging"

50 / 163 papers shown
Title
Zipfian environments for Reinforcement Learning
Stephanie C. Y. Chan
Andrew Kyle Lampinen
Pierre Harvey Richemond
Felix Hill
OffRL
15
15
0
15 Mar 2022
Capturing Failures of Large Language Models via Human Cognitive Biases
Erik Jones
Jacob Steinhardt
33
91
0
24 Feb 2022
Symphony: Composing Interactive Interfaces for Machine Learning
Alex Bäuerle
Ángel Alexander Cabrera
Fred Hohman
Megan Maher Welsh
David Koski
Xavier Suau
Titus Barik
Dominik Moritz
27
55
0
18 Feb 2022
Agree to Disagree: Diversity through Disagreement for Better Transferability
Matteo Pagliardini
Martin Jaggi
François Fleuret
Sai Praneeth Karimireddy
28
70
0
09 Feb 2022
Diversify and Disambiguate: Learning From Underspecified Data
Yoonho Lee
Huaxiu Yao
Chelsea Finn
215
64
0
07 Feb 2022
Mapping DNN Embedding Manifolds for Network Generalization Prediction
Molly O'Brien
Julia V. Bukowski
Mathias Unberath
Aria Pezeshk
Gregory Hager
AI4CE
30
0
0
03 Feb 2022
Generalizability of Machine Learning Models: Quantitative Evaluation of Three Methodological Pitfalls
Farhad Maleki
K. Ovens
Rajiv Gupta
C. Reinhold
A. Spatz
Reza Forghani
44
68
0
01 Feb 2022
Towards Group Robustness in the presence of Partial Group Labels
Vishnu Suresh Lokhande
Kihyuk Sohn
Jinsung Yoon
Madeleine Udell
Chen-Yu Lee
Tomas Pfister
OOD
42
11
0
10 Jan 2022
BARACK: Partially Supervised Group Robustness With Guarantees
N. Sohoni
Maziar Sanjabi
Nicolas Ballas
Aditya Grover
Shaoliang Nie
Hamed Firooz
Christopher Ré
OOD
20
24
0
31 Dec 2021
Simple and near-optimal algorithms for hidden stratification and multi-group learning
Abdoreza Asadpour
Daniel J. Hsu
105
20
0
22 Dec 2021
The Effect of Model Size on Worst-Group Generalization
Alan Pham
Eunice Chan
V. Srivatsa
Dhruba Ghosh
Yaoqing Yang
Yaodong Yu
Ruiqi Zhong
Joseph E. Gonzalez
Jacob Steinhardt
23
5
0
08 Dec 2021
Evaluating deep transfer learning for whole-brain cognitive decoding
A. Thomas
U. Lindenberger
Wojciech Samek
K. Müller
AI4CE
27
12
0
01 Nov 2021
Algorithmic encoding of protected characteristics in image-based models for disease detection
Ben Glocker
Charles Jones
Mélanie Bernhardt
S. Winzeck
31
9
0
27 Oct 2021
Identifying and Benchmarking Natural Out-of-Context Prediction Problems
David Madras
D. Psaltis
CML
OOD
32
4
0
25 Oct 2021
A Principled Approach to Failure Analysis and Model Repairment: Demonstration in Medical Imaging
Thomas Henn
Yasukazu Sakamoto
Clément Jacquet
Shunsuke Yoshizawa
M. Andou
...
R. Saga
Hiroyuki Ishihara
Katsuhiko Shimizu
Yingzhen Li
Ryutaro Tanno
116
9
0
25 Sep 2021
On the Efficiency of Subclass Knowledge Distillation in Classification Tasks
A. Sajedi
Konstantinos N. Plataniotis
16
4
0
12 Sep 2021
A comparison of approaches to improve worst-case predictive model performance over patient subpopulations
Stephen R. Pfohl
Haoran Zhang
Yizhe Xu
Agata Foryciarz
Marzyeh Ghassemi
N. Shah
OOD
29
22
0
27 Aug 2021
Challenges for cognitive decoding using deep learning methods
A. Thomas
Christopher Ré
R. Poldrack
AI4CE
24
6
0
16 Aug 2021
Active Assessment of Prediction Services as Accuracy Surface Over Attribute Combinations
Vihari Piratla
Soumen Chakrabarty
Sunita Sarawagi
19
3
0
14 Aug 2021
Meta-repository of screening mammography classifiers
Benjamin Stadnick
Jan Witowski
Vishwaesh Rajiv
Jakub Chledowski
Farah E. Shamout
Kyunghyun Cho
Krzysztof J. Geras
25
11
0
10 Aug 2021
Distributionally Robust Segmentation of Abnormal Fetal Brain 3D MRI
Lucas Fidon
Michael Aertsen
Nada Mufti
Thomas Deprest
Doaa Emam
...
Andrew Melbourne
Sébastien Ourselin
Jan Deprest
Georg Langs
Tom Kamiel Magda Vercauteren
OOD
18
22
0
09 Aug 2021
Pointer Value Retrieval: A new benchmark for understanding the limits of neural network generalization
Chiyuan Zhang
M. Raghu
Jon M. Kleinberg
Samy Bengio
OOD
32
30
0
27 Jul 2021
Preventing dataset shift from breaking machine-learning biomarkers
Jérôme Dockès
Gaël Varoquaux
J B Poline
OOD
30
64
0
21 Jul 2021
Responsible and Regulatory Conform Machine Learning for Medicine: A Survey of Challenges and Solutions
Eike Petersen
Yannik Potdevin
Esfandiar Mohammadi
Stephan Zidowitz
Sabrina Breyer
...
Sandra Henn
Ludwig Pechmann
M. Leucker
P. Rostalski
Christian Herzog
FaML
AILaw
OOD
41
21
0
20 Jul 2021
Just Train Twice: Improving Group Robustness without Training Group Information
E. Liu
Behzad Haghgoo
Annie S. Chen
Aditi Raghunathan
Pang Wei Koh
Shiori Sagawa
Percy Liang
Chelsea Finn
OOD
37
540
0
19 Jul 2021
A Topological-Framework to Improve Analysis of Machine Learning Model Performance
Henry Kvinge
Colby Wight
Sarah Akers
Scott Howland
W. Choi
Xiaolong Ma
Luke J. Gosink
E. Jurrus
K. Kappagantula
Tegan H. Emerson
39
0
0
09 Jul 2021
The Spotlight: A General Method for Discovering Systematic Errors in Deep Learning Models
G. d'Eon
Jason d'Eon
J. R. Wright
Kevin Leyton-Brown
33
74
0
01 Jul 2021
Mandoline: Model Evaluation under Distribution Shift
Mayee F. Chen
Karan Goel
N. Sohoni
Fait Poms
Kayvon Fatahalian
Christopher Ré
30
69
0
01 Jul 2021
Randomness In Neural Network Training: Characterizing The Impact of Tooling
Donglin Zhuang
Xingyao Zhang
Shuaiwen Leon Song
Sara Hooker
25
75
0
22 Jun 2021
Evaluating Deep Neural Networks Trained on Clinical Images in Dermatology with the Fitzpatrick 17k Dataset
Matthew Groh
Caleb Harris
L. Soenksen
Felix Lau
Rachel Han
Aerin Kim
A. Koochek
Omar Badri
112
184
0
20 Apr 2021
An Empirical Framework for Domain Generalization in Clinical Settings
Haoran Zhang
Natalie Dullerud
Laleh Seyyed-Kalantari
Q. Morris
Shalmali Joshi
Marzyeh Ghassemi
OOD
AI4CE
29
59
0
20 Mar 2021
CheXbreak: Misclassification Identification for Deep Learning Models Interpreting Chest X-rays
E. Chen
Andy Kim
R. Krishnan
J. Long
A. Ng
Pranav Rajpurkar
26
2
0
18 Mar 2021
Detecting Spurious Correlations with Sanity Tests for Artificial Intelligence Guided Radiology Systems
U. Mahmood
Robik Shrestha
D. Bates
L. Mannelli
G. Corrias
Y. Erdi
Christopher Kanan
18
16
0
04 Mar 2021
Explaining the Black-box Smoothly- A Counterfactual Approach
Junyu Chen
Yong Du
Yufan He
W. Paul Segars
Ye Li
MedIm
FAtt
67
100
0
11 Jan 2021
Critical Evaluation of Deep Neural Networks for Wrist Fracture Detection
A. M. Raisuddin
Elias Vaattovaara
M. Nevalainen
Marko Nikki
Elina Järvenpää
...
P. Pinola
Tuula Palsio
Arttu Niemensivu
O. Tervonen
A. Tiulpin
11
0
0
04 Dec 2020
Machine Learning Systems in the IoT: Trustworthiness Trade-offs for Edge Intelligence
Wiebke Toussaint
Aaron Yi Ding
35
11
0
01 Dec 2020
Differences between human and machine perception in medical diagnosis
Taro Makino
Stanislaw Jastrzebski
Witold Oleszkiewicz
Celin Chacko
Robin Ehrenpreis
...
D. Sodickson
Laura Heacock
Linda Moy
Kyunghyun Cho
Krzysztof J. Geras
AAML
21
26
0
28 Nov 2020
No Subclass Left Behind: Fine-Grained Robustness in Coarse-Grained Classification Problems
N. Sohoni
Jared A. Dunnmon
Geoffrey Angus
Albert Gu
Christopher Ré
30
242
0
25 Nov 2020
Gradient Starvation: A Learning Proclivity in Neural Networks
Mohammad Pezeshki
Sekouba Kaba
Yoshua Bengio
Aaron Courville
Doina Precup
Guillaume Lajoie
MLT
52
258
0
18 Nov 2020
Underspecification Presents Challenges for Credibility in Modern Machine Learning
Alexander D'Amour
Katherine A. Heller
D. Moldovan
Ben Adlam
B. Alipanahi
...
Kellie Webster
Steve Yadlowsky
T. Yun
Xiaohua Zhai
D. Sculley
OffRL
77
671
0
06 Nov 2020
Surgical Data Science -- from Concepts toward Clinical Translation
Lena Maier-Hein
Matthias Eisenmann
Duygu Sarikaya
Keno März
Toby Collins
...
D. Teber
F. Uckert
Beat P. Müller-Stich
Pierre Jannin
Stefanie Speidel
AI4CE
25
223
0
30 Oct 2020
Evaluating Model Robustness and Stability to Dataset Shift
Adarsh Subbaswamy
R. Adams
Suchi Saria
OOD
26
9
0
28 Oct 2020
Selective Classification Can Magnify Disparities Across Groups
Erik Jones
Shiori Sagawa
Pang Wei Koh
Ananya Kumar
Percy Liang
39
46
0
27 Oct 2020
Large-Scale Methods for Distributionally Robust Optimization
Daniel Levy
Y. Carmon
John C. Duchi
Aaron Sidford
37
205
0
12 Oct 2020
Characterising Bias in Compressed Models
Sara Hooker
Nyalleng Moorosi
Gregory Clark
Samy Bengio
Emily L. Denton
19
183
0
06 Oct 2020
Ethical Machine Learning in Health Care
Irene Y. Chen
Emma Pierson
Sherri Rose
Shalmali Joshi
Kadija Ferryman
Marzyeh Ghassemi
AILaw
27
372
0
22 Sep 2020
Estimating Example Difficulty Using Variance of Gradients
Chirag Agarwal
Daniel D'souza
Sara Hooker
213
108
0
26 Aug 2020
Assessing the (Un)Trustworthiness of Saliency Maps for Localizing Abnormalities in Medical Imaging
N. Arun
N. Gaw
P. Singh
Ken Chang
M. Aggarwal
...
J. Patel
M. Gidwani
Julius Adebayo
M. D. Li
Jayashree Kalpathy-Cramer
FAtt
30
109
0
06 Aug 2020
Robust Benchmarking for Machine Learning of Clinical Entity Extraction
Monica Agrawal
Chloe P. O'Connell
Yasmin Fatemi
A. Levy
David Sontag
6
6
0
31 Jul 2020
The Pitfalls of Simplicity Bias in Neural Networks
Harshay Shah
Kaustav Tamuly
Aditi Raghunathan
Prateek Jain
Praneeth Netrapalli
AAML
18
349
0
13 Jun 2020