WestCHEM, Department of Pure and Applied Chemistry, University of Strathclyde, Thomas Graham Building, 295 Cathedral Street, Glasgow, Scotland G1 1XL, UK.
Neuropathology, Lancashire Teaching Hospitals NHS Trust, Royal Preston Hospital, Sharoe Green Lane, Fulwood, Preston, PR2 9HT, UK.
Analyst. 2016 Jun 7;141(12):3668-78. doi: 10.1039/c5an02452h.
Fourier transform infrared (FTIR) spectroscopy has long been established as an analytical technique for the measurement of vibrational modes of molecular systems. More recently, FTIR has been used for the analysis of biofluids with the aim of becoming a tool to aid diagnosis. For the clinician, this represents a convenient, fast, non-subjective option for the study of biofluids and the diagnosis of disease states. The patient also benefits from this method, as the procedure for the collection of serum is much less invasive and stressful than traditional biopsy. This is especially true of patients in whom brain cancer is suspected. A brain biopsy is very unpleasant for the patient, potentially dangerous and can occasionally be inconclusive. We therefore present a method for the diagnosis of brain cancer from serum samples using FTIR and machine learning techniques. The scope of the study involved 433 patients from whom were collected 9 spectra each in the range 600-4000 cm(-1). To begin the development of the novel method, various pre-processing steps were investigated and ranked in terms of final accuracy of the diagnosis. Random forest machine learning was utilised as a classifier to separate patients into cancer or non-cancer categories based upon the intensities of wavenumbers present in their spectra. Generalised 2D correlational analysis was then employed to further augment the machine learning, and also to establish spectral features important for the distinction between cancer and non-cancer serum samples. Using these methods, sensitivities of up to 92.8% and specificities of up to 91.5% were possible. Furthermore, ratiometrics were also investigated in order to establish any correlations present in the dataset. We show a rapid, computationally light, accurate, statistically robust methodology for the identification of spectral features present in differing disease states. With current advances in IR technology, such as the development of rapid discrete frequency collection, this approach is of importance to enable future clinical translation and enables IR to achieve its potential.
傅里叶变换红外(FTIR)光谱学长期以来一直被确立为一种用于测量分子系统振动模式的分析技术。最近,FTIR 已被用于生物流体的分析,旨在成为一种辅助诊断的工具。对于临床医生来说,这是一种方便、快速、客观的生物流体研究和疾病诊断方法。患者也从这种方法中受益,因为与传统活检相比,收集血清的过程侵入性更小,压力也更小。对于怀疑患有脑癌的患者尤其如此。脑活检对患者来说非常不愉快,有潜在的危险,而且有时可能无法得出明确的结论。因此,我们提出了一种使用 FTIR 和机器学习技术从血清样本中诊断脑癌的方法。该研究的范围涉及 433 名患者,从每位患者中收集了 9 个范围在 600-4000 cm(-1) 的光谱。为了开始开发新方法,研究了各种预处理步骤,并根据诊断的最终准确性对其进行了排名。随机森林机器学习被用作分类器,根据患者光谱中存在的波数强度将患者分为癌症或非癌症类别。然后使用广义二维相关分析进一步增强机器学习,并确定区分癌症和非癌症血清样本的重要光谱特征。使用这些方法,灵敏度最高可达 92.8%,特异性最高可达 91.5%。此外,还研究了比率计量学,以确定数据集中存在的任何相关性。我们展示了一种快速、计算量轻、准确、统计稳健的方法,用于识别不同疾病状态下存在的光谱特征。随着红外技术的当前进展,例如快速离散频率采集的发展,这种方法对于实现未来的临床转化和使红外技术发挥其潜力非常重要。