Department of Biochemistry and Molecular Biology, Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States.
School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States.
J Proteome Res. 2021 Jul 2;20(7):3629-3641. doi: 10.1021/acs.jproteome.1c00213. Epub 2021 Jun 23.
Renal cell carcinoma (RCC) is diagnosed through expensive cross-sectional imaging, frequently followed by renal mass biopsy, which is not only invasive but also prone to sampling errors. Hence, there is a critical need for a noninvasive diagnostic assay. RCC exhibits altered cellular metabolism combined with the close proximity of the tumor(s) to the urine in the kidney, suggesting that urine metabolomic profiling is an excellent choice for assay development. Here, we acquired liquid chromatography-mass spectrometry (LC-MS) and nuclear magnetic resonance (NMR) data followed by the use of machine learning (ML) to discover candidate metabolomic panels for RCC. The study cohort consisted of 105 RCC patients and 179 controls separated into two subcohorts: the model cohort and the test cohort. Univariate, wrapper, and embedded methods were used to select discriminatory features using the model cohort. Three ML techniques, each with different induction biases, were used for training and hyperparameter tuning. Assessment of RCC status prediction was evaluated using the test cohort with the selected biomarkers and the optimally tuned ML algorithms. A seven-metabolite panel predicted RCC in the test cohort with 88% accuracy, 94% sensitivity, 85% specificity, and 0.98 AUC. Metabolomics Workbench Study IDs are ST001705 and ST001706.
肾细胞癌(RCC)通过昂贵的横截面成像进行诊断,随后通常进行肾肿瘤活检,这不仅具有侵入性,而且容易出现采样误差。因此,迫切需要一种非侵入性诊断检测方法。RCC 表现出改变的细胞代谢,加上肿瘤与肾脏中的尿液接近,这表明尿液代谢组学分析是开发检测方法的绝佳选择。在这里,我们获得了液相色谱-质谱(LC-MS)和核磁共振(NMR)数据,随后使用机器学习(ML)发现用于 RCC 的候选代谢组学分析面板。研究队列包括 105 名 RCC 患者和 179 名对照,分为两个亚队列:模型队列和测试队列。使用模型队列,单变量、包装器和嵌入式方法用于选择有区别的特征。使用具有不同归纳偏差的三种 ML 技术对培训和超参数调整进行训练。使用选定的生物标志物和经过最佳调整的 ML 算法评估测试队列中的 RCC 状态预测。一个由七种代谢物组成的分析面板在测试队列中预测 RCC 的准确率为 88%,灵敏度为 94%,特异性为 85%,AUC 为 0.98。代谢组学工作平台研究 ID 为 ST001705 和 ST001706。