Selvam Minmini, Sadanandan Abjasree, Chandrasekharan Anupama, Ramesh Sidharth, Murali Arunan, Krishnamurthi Ganapathy
Department of Radiology and Imaging Sciences, Sri Ramachandra Institute of Higher Education and Research, Porur, Chennai, 600 116, India.
Department of Engineering Design, Indian Institute of Technology Madras, Chennai, 600 036, India.
Sci Rep. 2024 Dec 30;14(1):32088. doi: 10.1038/s41598-024-83786-6.
Distinguishing between primary adenocarcinoma (AC) and squamous cell carcinoma (SCC) within non-small cell lung cancer (NSCLC) tumours holds significant management implications. We assessed the performance of radiomics-based models in distinguishing primary there is from SCC presenting as lung nodules on Computed Tomography (CT) scans. We studied individuals with histopathologically proven adenocarcinoma or SCC type NSCLC tumours, detected as lung nodules on Chest CT. The workflow comprised manual nodule segmentation, regions of interest creation, preprocessing data, feature extraction, and nodule classification using machine learning algorithms. The dataset comprised 46 adenocarcinoma and 28 SCC cases. For feature extraction, 101 radiomic features were extracted from the tumour regions using the 'pyradiomics' module in Python. After extensive experimentation with various feature importance techniques, the top 10 most significant radiomic features for differentiating between adenocarcinoma and squamous cell carcinoma (SCC) were identified. The Synthetic Minority Over-Sampling Technique was used to achieve a balanced distribution. Lung nodules were classified using 13 machine-learning algorithms, including Linear Discriminant Analysis, Random Forest, AdaBoost, and eXtreme Gradient Boosting. The Multilayer Perceptron (MLP) Classifier with Rectified Linear Unit (ReLu) activation was the most accurate (83% accuracy) with 83% precision and 86% sensitivity in distinguishing SCC from adenocarcinoma. It achieved a balanced F1 score of 83%, indicating well-rounded performance in both precision and sensitivity. The average Area Under the Curve score was 88%, representing good discrimination between the two classes of lung nodules. Radiomics is a powerful non-invasive tool that could potentially add to visual information obtained on CT. The MLP Classifier with ReLu activation showed good accuracy in distinguishing primary lung adenocarcinoma from SCC nodules. However, widespread multicentre trials are required to realize the full potential of radiomics in lung nodules.
在非小细胞肺癌(NSCLC)肿瘤中区分原发性腺癌(AC)和鳞状细胞癌(SCC)具有重要的管理意义。我们评估了基于放射组学的模型在区分原发性AC与在计算机断层扫描(CT)上表现为肺结节的SCC方面的性能。我们研究了组织病理学证实为腺癌或SCC型NSCLC肿瘤且在胸部CT上被检测为肺结节的个体。工作流程包括手动结节分割、感兴趣区域创建、数据预处理、特征提取以及使用机器学习算法进行结节分类。数据集包括46例腺癌和28例SCC病例。为了进行特征提取,使用Python中的“pyradiomics”模块从肿瘤区域提取了101个放射组学特征。在对各种特征重要性技术进行广泛实验后,确定了区分腺癌和鳞状细胞癌(SCC)的前10个最重要的放射组学特征。使用合成少数过采样技术来实现平衡分布。使用13种机器学习算法对肺结节进行分类,包括线性判别分析、随机森林、AdaBoost和极端梯度提升。具有整流线性单元(ReLu)激活的多层感知器(MLP)分类器在区分SCC和腺癌方面最为准确(准确率83%),精确率为83%,灵敏度为86%。它实现了83%的平衡F1分数,表明在精确率和灵敏度方面都有良好的表现。曲线下面积平均分数为88%,表明两类肺结节之间有良好的区分度。放射组学是一种强大的非侵入性工具,有可能补充从CT获得的视觉信息。具有ReLu激活的MLP分类器在区分原发性肺腺癌和SCC结节方面显示出良好的准确性。然而,需要进行广泛的多中心试验来充分发挥放射组学在肺结节中的潜力。