研究机器学习中的特征提取和分类模块用于诊断低剂量计算机断层扫描筛查检测到的病变。

Examining feature extraction and classification modules in machine learning for diagnosis of low-dose computed tomographic screening-detected lesions.

作者信息

Liang Daniel D, Liang David D, Pomeroy Marc J, Gao Yongfeng, Kuo Licheng R, Li Lihong C

机构信息

Ward Melville High School, East Setauket, New York, United States.

University of Chicago, Department of Computer Science, Chicago, Illinois, United States.

出版信息

J Med Imaging (Bellingham). 2024 Jul;11(4):044501. doi: 10.1117/1.JMI.11.4.044501. Epub 2024 Jul 9.

DOI:10.1117/1.JMI.11.4.044501

PMID:38993628

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11234229/

Abstract

PURPOSE

Medical imaging-based machine learning (ML) for computer-aided diagnosis of lesions consists of two basic components or modules of (i) feature extraction from non-invasively acquired medical images and (ii) feature classification for prediction of malignancy of lesions detected or localized in the medical images. This study investigates their individual performances for diagnosis of low-dose computed tomography (CT) screening-detected lesions of pulmonary nodules and colorectal polyps.

APPROACH

Three feature extraction methods were investigated. One uses the mathematical descriptor of gray-level co-occurrence image texture measure to extract the Haralick image texture features (HFs). One uses the convolutional neural network (CNN) architecture to extract deep learning (DL) image abstractive features (DFs). The third one uses the interactions between lesion tissues and X-ray energy of CT to extract tissue-energy specific characteristic features (TFs). All the above three categories of extracted features were classified by the random forest (RF) classifier with comparison to the DL-CNN method, which reads the images, extracts the DFs, and classifies the DFs in an end-to-end manner. The ML diagnosis of lesions or prediction of lesion malignancy was measured by the area under the receiver operating characteristic curve (AUC). Three lesion image datasets were used. The lesions' tissue pathological reports were used as the learning labels.

RESULTS

Experiments on the three datasets produced AUC values of 0.724 to 0.878 for the HFs, 0.652 to 0.965 for the DFs, and 0.985 to 0.996 for the TFs, compared to the DL-CNN of 0.694 to 0.964. These experimental outcomes indicate that the RF classifier performed comparably to the DL-CNN classification module and the extraction of tissue-energy specific characteristic features dramatically improved AUC value.

CONCLUSIONS

The feature extraction module is more important than the feature classification module. Extraction of tissue-energy specific characteristic features is more important than extraction of image abstractive and characteristic features.

摘要

目的

基于医学成像的机器学习（ML）用于病变的计算机辅助诊断，由两个基本组件或模块组成：（i）从非侵入性获取的医学图像中提取特征，以及（ii）对医学图像中检测到或定位的病变的恶性程度进行预测的特征分类。本研究调查了它们在诊断低剂量计算机断层扫描（CT）筛查检测到的肺结节和结肠息肉病变中的各自表现。

方法

研究了三种特征提取方法。一种使用灰度共生图像纹理测量的数学描述符来提取哈拉里克图像纹理特征（HFs）。一种使用卷积神经网络（CNN）架构来提取深度学习（DL）图像抽象特征（DFs）。第三种方法利用病变组织与CT的X射线能量之间的相互作用来提取组织能量特定特征（TFs）。上述三类提取的特征均由随机森林（RF）分类器进行分类，并与DL-CNN方法进行比较，DL-CNN方法以端到端的方式读取图像、提取DFs并对DFs进行分类。病变的ML诊断或病变恶性程度的预测通过受试者操作特征曲线（AUC）下的面积来衡量。使用了三个病变图像数据集。病变的组织病理报告用作学习标签。

结果

在三个数据集上进行的实验中，HFs的AUC值为0.724至0.878，DFs的AUC值为0.652至0.965，TFs的AUC值为0.985至0.996，而DL-CNN的AUC值为0.694至0.964。这些实验结果表明，RF分类器的表现与DL-CNN分类模块相当，并且组织能量特定特征的提取显著提高了AUC值。

结论

特征提取模块比特征分类模块更重要。组织能量特定特征的提取比图像抽象和特征特征的提取更重要。

相似文献

Examining feature extraction and classification modules in machine learning for diagnosis of low-dose computed tomographic screening-detected lesions.研究机器学习中的特征提取和分类模块用于诊断低剂量计算机断层扫描筛查检测到的病变。

J Med Imaging (Bellingham). 2024 Jul;11(4):044501. doi: 10.1117/1.JMI.11.4.044501. Epub 2024 Jul 9.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Skin-CAD: Explainable deep learning classification of skin cancer from dermoscopic images by feature selection of dual high-level CNNs features and transfer learning.皮肤 CAD：基于双高级 CNN 特征选择和迁移学习的皮肤镜图像皮肤癌可解释深度学习分类。

Comput Biol Med. 2024 Aug;178:108798. doi: 10.1016/j.compbiomed.2024.108798. Epub 2024 Jun 25.

Fully Automated Online Adaptive Radiation Therapy Decision-Making for Cervical Cancer Using Artificial Intelligence.使用人工智能的宫颈癌全自动在线自适应放射治疗决策

Int J Radiat Oncol Biol Phys. 2025 Jul 15;122(4):1012-1021. doi: 10.1016/j.ijrobp.2025.04.012. Epub 2025 Apr 17.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗？

Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.

Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型：基于多中心队列研究的开发与验证研究

J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.

The Machine Learning Models in Major Cardiovascular Adverse Events Prediction Based on Coronary Computed Tomography Angiography: Systematic Review.基于冠状动脉计算机断层扫描血管造影术的主要心血管不良事件预测中的机器学习模型：系统评价

J Med Internet Res. 2025 Jun 13;27:e68872. doi: 10.2196/68872.

Recent advancements in feature extraction and classification based bone cancer detection - a systematic review.基于特征提取和分类的骨癌检测的最新进展——一项系统综述。

Biomed Phys Eng Express. 2025 Jul 7;11(4). doi: 10.1088/2057-1976/ade8f8.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病：网络荟萃分析。

Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

本文引用的文献

Exploring Dual-Energy CT Spectral Information for Machine Learning-Driven Lesion Diagnosis in Pre-Log Domain.在预对数域中，利用双能 CT 光谱信息进行机器学习驱动的病变诊断。

IEEE Trans Med Imaging. 2023 Jun;42(6):1835-1845. doi: 10.1109/TMI.2023.3240847. Epub 2023 Jun 1.

Haralick texture feature analysis for characterization of specific energy and absorbed dose distributions across cellular to patient length scales.基于哈拉里克纹理特征分析来表征从细胞到患者长度尺度范围内的比能和吸收剂量分布。

Phys Med Biol. 2023 Mar 21;68(7). doi: 10.1088/1361-6560/acb885.

Clinical Impact and Generalizability of a Computer-Assisted Diagnostic Tool to Risk-Stratify Lung Nodules With CT.计算机辅助诊断工具对 CT 肺结节进行风险分层的临床影响和推广性。

J Am Coll Radiol. 2023 Feb;20(2):232-242. doi: 10.1016/j.jacr.2022.08.006. Epub 2022 Sep 3.

Deep learning in CT colonography: differentiating premalignant from benign colorectal polyps.CT 结肠成像中的深度学习：区分癌前与良性结直肠息肉。

Eur Radiol. 2022 Jul;32(7):4749-4759. doi: 10.1007/s00330-021-08532-2. Epub 2022 Jan 26.

Machine Learning-based Differentiation of Benign and Premalignant Colorectal Polyps Detected with CT Colonography in an Asymptomatic Screening Population: A Proof-of-Concept Study.基于机器学习的 CT 结肠成像在无症状筛查人群中对结直肠息肉良恶性的鉴别：一项概念验证研究。

Radiology. 2021 May;299(2):326-335. doi: 10.1148/radiol.2021202363. Epub 2021 Feb 23.

Assessing the Accuracy of a Deep Learning Method to Risk Stratify Indeterminate Pulmonary Nodules.评估深度学习方法对不确定肺结节进行风险分层的准确性。

Am J Respir Crit Care Med. 2020 Jul 15;202(2):241-249. doi: 10.1164/rccm.201903-0505OC.

External validation of a convolutional neural network artificial intelligence tool to predict malignancy in pulmonary nodules.卷积神经网络人工智能工具预测肺结节良恶性的外部验证。

Thorax. 2020 Apr;75(4):306-312. doi: 10.1136/thoraxjnl-2019-214104. Epub 2020 Mar 5.

3D-GLCM CNN: A 3-Dimensional Gray-Level Co-Occurrence Matrix-Based CNN Model for Polyp Classification via CT Colonography.3D-GLCM CNN：基于三维灰度共生矩阵的卷积神经网络模型，用于通过 CT 结肠成像进行息肉分类。

IEEE Trans Med Imaging. 2020 Jun;39(6):2013-2024. doi: 10.1109/TMI.2019.2963177. Epub 2019 Dec 30.

End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography.基于低剂量 CT 的三维深度学习肺癌全流程筛查。

Nat Med. 2019 Jun;25(6):954-961. doi: 10.1038/s41591-019-0447-x. Epub 2019 May 20.

Gray-level invariant Haralick texture features.灰度不变哈雷利克纹理特征。

PLoS One. 2019 Feb 22;14(2):e0212110. doi: 10.1371/journal.pone.0212110. eCollection 2019.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验