School of Veterinary Medicine, Texas Tech University, 7671 Evans Dr., Amarillo, TX 79106, USA.
Sensors (Basel). 2023 Sep 23;23(19):8042. doi: 10.3390/s23198042.
The digestion of protein into peptide fragments reduces the size and complexity of protein molecules. Peptide fragments can be analyzed with higher sensitivity (often > 10 fold) and resolution using MALDI-TOF mass spectrometers, leading to improved pattern recognition by common machine learning algorithms. In turn, enhanced sensitivity and specificity for bacterial sorting and/or disease diagnosis may be obtained. To test this hypothesis, four exemplar case studies have been pursued in which samples are sorted into dichotomous groups by machine learning (ML) software based on MALDI-TOF spectra. Samples were analyzed in 'intact' mode in which the proteins present in the sample were not digested with protease prior to MALDI-TOF analysis and separately after the standard overnight tryptic digestion of the same samples. For each case, sensitivity (sens), specificity (spc), and the Youdin index (J) were used to assess the ML model performance. The proteolytic digestion of samples prior to MALDI-TOF analysis substantially enhanced the sensitivity and specificity of dichotomous sorting. Two exceptions were when substantial differences in chemical composition between the samples were present and, in such cases, both 'intact' and 'digested' protocols performed similarly. The results suggest proteolytic digestion prior to analysis can improve sorting in MALDI/ML-based workflows and may enable improved biomarker discovery. However, when samples are easily distinguishable protein digestion is not necessary to obtain useful diagnostic results.
蛋白质消化成肽片段会降低蛋白质分子的大小和复杂性。使用 MALDI-TOF 质谱仪可以更灵敏(通常 > 10 倍)和更准确地分析肽片段,从而提高常见机器学习算法的模式识别能力。反过来,也可以提高细菌分类和/或疾病诊断的灵敏度和特异性。为了验证这一假设,我们进行了四个范例案例研究,其中样本通过机器学习(ML)软件根据 MALDI-TOF 光谱分为两类。样本以“完整”模式进行分析,即在 MALDI-TOF 分析之前,样本中的蛋白质未经蛋白酶消化,而在对相同样本进行标准过夜胰蛋白酶消化后再进行分析。对于每个案例,使用灵敏度(sens)、特异性(spc)和 Youdin 指数(J)来评估 ML 模型的性能。在 MALDI-TOF 分析之前对样品进行蛋白水解消化可显著提高二分类排序的灵敏度和特异性。有两个例外情况是,当样品之间存在明显的化学成分差异时,在这种情况下,“完整”和“消化”两种方案的性能相似。结果表明,在 MALDI/ML 工作流程中,分析前进行蛋白水解消化可以改善分类,并且可能有助于发现更好的生物标志物。然而,当样本容易区分时,为了获得有用的诊断结果,进行蛋白质消化并不是必需的。