整合组学数据和机器学习技术用于口腔鳞状细胞癌的精准检测：评估单一生物标志物

Integrating omics data and machine learning techniques for precision detection of oral squamous cell carcinoma: evaluating single biomarkers.

作者信息

Sun Yilan, Cheng Guozhen, Wei Dongliang, Luo Jiacheng, Liu Jiannan

机构信息

Department of Oral and Maxillofacial Head and Neck Oncology, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.

College of Stomatology, Shanghai Jiao Tong University, Shanghai, China.

出版信息

Front Immunol. 2024 Dec 3;15:1493377. doi: 10.3389/fimmu.2024.1493377. eCollection 2024.

DOI:10.3389/fimmu.2024.1493377

PMID:39691710

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11649677/

Abstract

INTRODUCTION

Early detection of oral squamous cell carcinoma (OSCC) is critical for improving clinical outcomes. Precision diagnostics integrating metabolomics and machine learning offer promising non-invasive solutions for identifying tumor-derived biomarkers.

METHODS

We analyzed a multicenter public dataset comprising 61 OSCC patients and 61 healthy controls. Plasma metabolomics data were processed to extract 29 numerical and 47 ratio features. The Extra Trees (ET) algorithm was applied for feature selection, and the TabPFN model was used for classification and prediction.

RESULTS

The model achieved an area under the curve (AUC) of 93% and an overall accuracy of 76.6% when using top-ranked individual biomarkers. Key metabolic features significantly differentiated OSCC patients from healthy controls, providing a detailed metabolic fingerprint of the disease.

DISCUSSION

Our findings demonstrate the utility of integrating omics data with advanced machine learning techniques to develop accurate, non-invasive diagnostic tools for OSCC. The study highlights actionable metabolic signatures that have potential applications in personalized therapeutics and early intervention strategies.

摘要

引言

早期发现口腔鳞状细胞癌（OSCC）对于改善临床结果至关重要。整合代谢组学和机器学习的精准诊断为识别肿瘤衍生生物标志物提供了有前景的非侵入性解决方案。

方法

我们分析了一个多中心公共数据集，其中包括61名OSCC患者和61名健康对照。对血浆代谢组学数据进行处理，以提取29个数值特征和47个比率特征。应用Extra Trees（ET）算法进行特征选择，并使用TabPFN模型进行分类和预测。

结果

当使用排名靠前的单个生物标志物时，该模型的曲线下面积（AUC）为93%，总体准确率为76.6%。关键代谢特征显著区分了OSCC患者与健康对照，提供了该疾病详细的代谢指纹。

讨论

我们的研究结果证明了将组学数据与先进的机器学习技术相结合，以开发用于OSCC的准确、非侵入性诊断工具的实用性。该研究突出了可操作的代谢特征，这些特征在个性化治疗和早期干预策略中具有潜在应用。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

整合组学数据和机器学习技术用于口腔鳞状细胞癌的精准检测：评估单一生物标志物

Integrating omics data and machine learning techniques for precision detection of oral squamous cell carcinoma: evaluating single biomarkers.

作者信息

机构信息

出版信息

INTRODUCTION

METHODS

RESULTS

DISCUSSION

引言

方法

结果

讨论

相似文献

本文引用的文献

整合组学数据和机器学习技术用于口腔鳞状细胞癌的精准检测：评估单一生物标志物

Integrating omics data and machine learning techniques for precision detection of oral squamous cell carcinoma: evaluating single biomarkers.

作者信息

机构信息

出版信息

INTRODUCTION

METHODS

RESULTS

DISCUSSION

引言

方法

结果

讨论

相似文献

本文引用的文献