CheXternal: Generalization of deep learning models for chest X-ray interpretation to photos of chest X-rays and external clinical settings

Pranav Rajpurkar, Anirudh Joshi, Anuj Pareek, Andrew Y. Ng, Matthew P. Lungren

8 Citations (Scopus)

Abstract

Recent advances in training deep learning models have demonstrated the potential to provide accurate chest X-ray interpretation and increase access to radiology expertise. However, poor generalization due to data distribution shifts in clinical settings is a key barrier to implementation. In this study, we measured the diagnostic performance for 8 different chest X-ray models when applied to (1) smartphone photos of chest X-rays and (2) external datasets without any finetuning. All models were developed by different groups and submitted to the CheXpert challenge, and re-applied to test datasets without further tuning. We found that (1) on photos of chest X-rays, all 8 models experienced a statistically significant drop in task performance, but only 3 performed significantly worse than radiologists on average, and (2) on the external set, none of the models performed statistically significantly worse than radiologists, and five models performed statistically significantly better than radiologists. Our results demonstrate that some chest X-ray models, under clinically relevant distribution shifts, were comparable to radiologists while other models were not. Future work should investigate aspects of model training procedures and dataset collection that influence generalization in the presence of data distribution shifts.

Original languageEnglish
JournalACM CHIL 2021 - Proceedings of the 2021 ACM Conference on Health, Inference, and Learning
Pages (from-to)125-132
Number of pages8
DOIs
Publication statusPublished - 8 Apr 2021
Externally publishedYes
Event2021 ACM Conference on Health, Inference, and Learning, CHIL 2021 - Virtual, Online, United States
Duration: 8 Apr 20219 Apr 2021

Conference

Conference2021 ACM Conference on Health, Inference, and Learning, CHIL 2021
Country/TerritoryUnited States
CityVirtual, Online
Period08/04/202109/04/2021
SponsorACM

Keywords

  • chest X-ray interpretation
  • clinical deployment
  • distribution shifts
  • generalizability
  • radiology

Fingerprint

Dive into the research topics of 'CheXternal: Generalization of deep learning models for chest X-ray interpretation to photos of chest X-rays and external clinical settings'. Together they form a unique fingerprint.

Cite this