Department of Electrical and Computer Engineering, The University of Texas at Austin, 78712, Austin, TX, USA.
Department of Population Health Sciences, Weill Cornell Medicine, 10065, New York, NY, USA.
Med Image Anal. 2024 Oct;97:103224. doi: 10.1016/j.media.2024.103224. Epub 2024 May 31.
Many real-world image recognition problems, such as diagnostic medical imaging exams, are "long-tailed" - there are a few common findings followed by many more relatively rare conditions. In chest radiography, diagnosis is both a long-tailed and multi-label problem, as patients often present with multiple findings simultaneously. While researchers have begun to study the problem of long-tailed learning in medical image recognition, few have studied the interaction of label imbalance and label co-occurrence posed by long-tailed, multi-label disease classification. To engage with the research community on this emerging topic, we conducted an open challenge, CXR-LT, on long-tailed, multi-label thorax disease classification from chest X-rays (CXRs). We publicly release a large-scale benchmark dataset of over 350,000 CXRs, each labeled with at least one of 26 clinical findings following a long-tailed distribution. We synthesize common themes of top-performing solutions, providing practical recommendations for long-tailed, multi-label medical image classification. Finally, we use these insights to propose a path forward involving vision-language foundation models for few- and zero-shot disease classification.
许多现实世界的图像识别问题,如诊断性医学成像检查,都是“长尾”的——少数常见的发现后面跟着更多相对罕见的情况。在胸部 X 光摄影中,诊断既是长尾问题,也是多标签问题,因为患者通常同时出现多种发现。虽然研究人员已经开始研究医学图像识别中的长尾学习问题,但很少有人研究长尾、多标签疾病分类中标签不平衡和标签共现的相互作用。为了在这个新兴主题上与研究界进行交流,我们针对胸部 X 光(CXR)的长尾、多标签胸部疾病分类开展了一项名为 CXR-LT 的公开挑战赛。我们公开发布了一个超过 35 万张 CXR 的大规模基准数据集,每张 CXR 至少标记了 26 种临床发现中的一种,呈长尾分布。我们综合了表现最佳解决方案的常见主题,为长尾、多标签医学图像分类提供了实用建议。最后,我们利用这些见解提出了一个涉及视觉-语言基础模型的未来发展方向,用于少量和零样本疾病分类。
Data Augment Label Imperfections (2022). 2022-9
Int J Comput Assist Radiol Surg. 2019-2-7
IEEE Trans Image Process. 2021
Biomed Phys Eng Express. 2020-3-11
IEEE/ACM Trans Comput Biol Bioinform. 2024
Int J Comput Assist Radiol Surg. 2023-1
Proc IEEE Inst Electr Electron Eng. 2021-5
J Med Imaging (Bellingham). 2023-7
Nat Commun. 2023-7-28
IEEE Trans Pattern Anal Mach Intell. 2023-9
Radiol Artif Intell. 2023-1-11
Data Augment Label Imperfections (2022). 2022-9
Proc (IEEE Int Conf Healthc Inform). 2022-6
IEEE J Biomed Health Inform. 2022-12