• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于组学数据的机器学习方法预测血液分泌蛋白和肝癌潜在生物标志物

Machine learning approach to predict blood-secretory proteins and potential biomarkers for liver cancer using omics data.

机构信息

Department of Bioinformatics, Pondicherry University, Puducherry 605014, India.

Department of Bioinformatics, Pondicherry University, Puducherry 605014, India.

出版信息

J Proteomics. 2024 Oct 30;309:105298. doi: 10.1016/j.jprot.2024.105298. Epub 2024 Aug 30.

DOI:10.1016/j.jprot.2024.105298
PMID:39216516
Abstract

Identifying non-invasive blood-based biomarkers is crucial for early detection and monitoring of liver cancer (LC), thereby improving patient outcomes. This study leveraged computational approaches to predict potential blood-based biomarkers for LC. Machine learning (ML) models were developed using selected features from blood-secretory proteins collected from the curated databases. The logistic regression (LR) model demonstrated the optimal performance. Transcriptome analysis across 7 LC cohorts revealed 231 common differentially expressed genes (DEGs). The encoded proteins of these DEGs were compared with the ML dataset, revealing 29 proteins overlapping with the blood-secretory dataset. The LR model also predicted 29 additional proteins as blood-secretory with the remaining protein-coding genes. As a result, 58 potential blood-secretory proteins were obtained. Among the top 20 genes, 13 common hub genes were identified. Further, area under the receiver operating characteristic curve (ROC AUC) analysis was performed to assess the genes as potential diagnostic blood biomarkers. Six genes, ESM1, FCN2, MDK, GPC3, CTHRC1 and COL6A6, exhibited an AUC value higher than 0.85 and were predicted as blood-secretory. This study highlights the potential of an integrative computational approach for discovering non-invasive blood-based biomarkers in LC, facilitating for further validation and clinical translation. SIGNIFICANCE: Liver cancer is one of the leading causes of premature death worldwide, with its prevalence and mortality rates projected to increase. Although current diagnostic methods are highly sensitive, they are invasive and unsuitable for repeated testing. Blood biomarkers offer a promising non-invasive alternative, but their wide dynamic range of protein concentration poses experimental challenges. Therefore, utilizing available omics data to develop a diagnostic model could provide a potential solution for accurate diagnosis. This study developed a computational method integrating machine learning and bioinformatics analysis to identify potential blood biomarkers. As a result, ESM1, FCN2, MDK, GPC3, CTHRC1 and COL6A6 biomarkers were identified, holding significant promise for improving diagnosis and understanding of liver cancer. The integrated method can be applied to other cancers, offering a possible solution for early detection and improved patient outcomes.

摘要

识别非侵入性的血液生物标志物对于肝癌(LC)的早期检测和监测至关重要,从而改善患者的预后。本研究利用计算方法来预测潜在的 LC 血液生物标志物。使用从经过整理的数据库中收集的血液分泌蛋白中选择的特征来开发机器学习(ML)模型。逻辑回归(LR)模型表现出最佳性能。对 7 个 LC 队列的转录组分析显示出 231 个共同差异表达基因(DEG)。这些 DEG 编码的蛋白质与 ML 数据集进行比较,发现 29 个蛋白质与血液分泌数据集重叠。LR 模型还预测了 29 个作为血液分泌的额外蛋白质,而其余的蛋白质编码基因也是如此。结果获得了 58 个潜在的血液分泌蛋白。在排名前 20 的基因中,确定了 13 个共同的枢纽基因。此外,还进行了接收器操作特征曲线(ROC AUC)分析,以评估这些基因作为潜在的诊断性血液生物标志物的性能。6 个基因,ESM1、FCN2、MDK、GPC3、CTHRC1 和 COL6A6,表现出 AUC 值高于 0.85,并被预测为血液分泌。本研究强调了综合计算方法在发现 LC 非侵入性血液生物标志物方面的潜力,为进一步验证和临床转化提供了便利。

意义

肝癌是全球导致过早死亡的主要原因之一,其发病率和死亡率预计将上升。尽管目前的诊断方法具有很高的灵敏度,但它们是侵入性的,不适合重复测试。血液生物标志物提供了一种有前途的非侵入性替代方法,但它们的蛋白质浓度广泛的动态范围带来了实验挑战。因此,利用现有的组学数据来开发诊断模型可能为准确诊断提供一个潜在的解决方案。本研究开发了一种计算方法,将机器学习和生物信息学分析相结合,以识别潜在的血液生物标志物。结果确定了 ESM1、FCN2、MDK、GPC3、CTHRC1 和 COL6A6 等生物标志物,为改善肝癌的诊断和理解提供了重要意义。该综合方法可应用于其他癌症,为早期检测和改善患者预后提供了可能的解决方案。

相似文献

1
Machine learning approach to predict blood-secretory proteins and potential biomarkers for liver cancer using omics data.基于组学数据的机器学习方法预测血液分泌蛋白和肝癌潜在生物标志物
J Proteomics. 2024 Oct 30;309:105298. doi: 10.1016/j.jprot.2024.105298. Epub 2024 Aug 30.
2
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
3
Identification of blood-derived exosomal tumor RNA signatures as noninvasive diagnostic biomarkers for multi-cancer: a multi-phase, multi-center study.鉴定血液来源的外泌体肿瘤RNA特征作为多种癌症的非侵入性诊断生物标志物:一项多阶段、多中心研究。
Mol Cancer. 2025 Mar 1;24(1):60. doi: 10.1186/s12943-025-02271-4.
4
AI-based Hepatic Steatosis Detection and Integrated Hepatic Assessment from Cardiac CT Attenuation Scans Enhances All-cause Mortality Risk Stratification: A Multi-center Study.基于人工智能的心脏CT衰减扫描检测肝脂肪变性及综合肝脏评估可增强全因死亡风险分层:一项多中心研究
medRxiv. 2025 Jun 11:2025.06.09.25329157. doi: 10.1101/2025.06.09.25329157.
5
Blood biomarkers for the non-invasive diagnosis of endometriosis.用于子宫内膜异位症无创诊断的血液生物标志物。
Cochrane Database Syst Rev. 2016 May 1;2016(5):CD012179. doi: 10.1002/14651858.CD012179.
6
Molecular feature-based classification of retroperitoneal liposarcoma: a prospective cohort study.基于分子特征的腹膜后脂肪肉瘤分类:一项前瞻性队列研究。
Elife. 2025 May 23;14:RP100887. doi: 10.7554/eLife.100887.
7
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
8
Combining Spatial Transcriptomics, Pseudotime, and Machine Learning Enables Discovery of Biomarkers for Prostate Cancer.结合空间转录组学、伪时间分析和机器学习可发现前列腺癌生物标志物。
Cancer Res. 2025 Jul 2;85(13):2514-2526. doi: 10.1158/0008-5472.CAN-25-0269.
9
Deciphering Shared Gene Signatures and Immune Infiltration Characteristics Between Gestational Diabetes Mellitus and Preeclampsia by Integrated Bioinformatics Analysis and Machine Learning.通过综合生物信息学分析和机器学习破译妊娠期糖尿病和子痫前期之间共享的基因特征及免疫浸润特征
Reprod Sci. 2025 May 15. doi: 10.1007/s43032-025-01847-1.
10
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.