Biological Chemistry and Chemometrics, Institute of Chemistry, Federal University of Rio Grande do Norte, Natal 59072-970, Brazil.
Center for Education, Science and Technology of the Inhamuns Region, State University of Ceará, Tauá 63660-000, Brazil.
Analyst. 2024 Sep 23;149(19):4851-4861. doi: 10.1039/d4an00726c.
Lung cancer is one of the most commonly occurring malignant tumours worldwide. Although some reference methods such as X-ray, computed tomography or bronchoscope are widely used for clinical diagnosis of lung cancer, there is still a need to develop new methods for early detection of lung cancer. Especially needed are approaches that might be non-invasive and fast with high analytical precision and statistically reliable. Herein, we developed a swab "dip" test in saliva whereby swabs were analysed using attenuated total reflection Fourier-transform infrared (ATR-FTIR) spectroscopy harnessed to principal component analysis-quadratic discriminant analysis (QDA) and variable selection techniques employing successive projections algorithm (SPA) and genetic algorithm (GA) for feature selection/extraction combined with QDA. A total of 1944 saliva samples (56 designated as lung-cancer positive and 1888 designed as controls) were obtained in a lung cancer-screening programme being undertaken in North-West England. GA-QDA models achieved, for the test set, sensitivity and specificity values of 100.0% and 99.1%, respectively. Three wavenumbers (1422 cm, 1546 cm and 1578 cm) were identified using the GA-QDA model to distinguish between lung cancer and controls, including ring C-C stretching, CN adenine, Amide II [(NH), (CN)] and (COO) (polysaccharides, pectin). These findings highlight the potential of using biospectroscopy associated with multivariate classification algorithms to discriminate between benign saliva samples and those with underlying lung cancer.
肺癌是世界上最常见的恶性肿瘤之一。尽管 X 射线、计算机断层扫描或支气管镜等一些参考方法广泛用于肺癌的临床诊断,但仍需要开发新的方法来早期发现肺癌。特别需要的是那些可能是非侵入性的、快速的、具有高分析精度和统计学可靠性的方法。在此,我们开发了一种在唾液中进行拭子“蘸取”的测试方法,使用衰减全反射傅里叶变换红外(ATR-FTIR)光谱分析拭子,并结合主成分分析-二次判别分析(QDA)以及采用连续投影算法(SPA)和遗传算法(GA)的变量选择技术,用于特征选择/提取,并结合 QDA。在英格兰西北部进行的肺癌筛查计划中,共获得了 1944 份唾液样本(56 份被指定为肺癌阳性,1888 份被指定为对照)。GA-QDA 模型在测试集中的灵敏度和特异性值分别为 100.0%和 99.1%。使用 GA-QDA 模型在区分肺癌和对照组时,确定了三个波数(1422cm、1546cm 和 1578cm),包括环 C-C 伸缩、C-N 腺嘌呤、酰胺 II [(NH),(CN)]和(COO)(多糖、果胶)。这些发现强调了使用与多元分类算法相关的生物光谱学来区分良性唾液样本和潜在肺癌样本的潜力。