• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的乳腺浸润性癌蛋白质编码基因标志物的鉴定

[Identification of Protein-Coding Gene Markers in Breast Invasive Carcinoma Based on Machine Learning].

作者信息

Wu Yue, Min Kai-Yuan, Liu Jiang-Feng, Liang Wan-Feng, Yang Ye-Hong, Hu Gang, Yang Jun-Tao

机构信息

1 School of Statistics and Data Science,Nankai University,Tianjin 300071,China.

2 State Key Laboratory of Common Mechanism Research for Major Diseases,Institute of Basic Medical Sciences,CAMS and PUMC,Beijing 100005,China.

出版信息

Zhongguo Yi Xue Ke Xue Yuan Xue Bao. 2024 Apr;46(2):147-153. doi: 10.3881/j.issn.1000-503X.15717.

DOI:10.3881/j.issn.1000-503X.15717
PMID:38686709
Abstract

Objective To screen out the biomarkers linked to prognosis of breast invasive carcinoma based on the analysis of transcriptome data by random forest (RF),extreme gradient boosting (XGBoost),light gradient boosting machine (LightGBM),and categorical boosting (CatBoost). Methods We obtained the expression data of breast invasive carcinoma from The Cancer Genome Atlas and employed DESeq2,-test,and Cox univariate analysis to identify the differentially expressed protein-coding genes associated with survival prognosis in human breast invasive carcinoma samples.Furthermore,RF,XGBoost,LightGBM,and CatBoost models were established to mine the protein-coding gene markers related to the prognosis of breast invasive cancer and the model performance was compared.The expression data of breast cancer from the Gene Expression Omnibus was used for validation. Results A total of 151 differentially expressed protein-coding genes related to survival prognosis were screened out.The machine learning model established with C3orf80,UGP2,and SPC25 demonstrated the best performance. Conclusions Three protein-coding genes (UGP2,C3orf80,and SPC25) were screened out to identify breast invasive carcinoma.This study provides a new direction for the treatment and diagnosis of breast invasive carcinoma.

摘要

目的 基于随机森林(RF)、极端梯度提升(XGBoost)、轻量级梯度提升机(LightGBM)和分类梯度提升(CatBoost)对转录组数据的分析,筛选出与乳腺浸润癌预后相关的生物标志物。方法 我们从癌症基因组图谱获取乳腺浸润癌的表达数据,并采用DESeq2、t检验和Cox单因素分析,以鉴定人乳腺浸润癌样本中与生存预后相关的差异表达蛋白质编码基因。此外,建立RF、XGBoost、LightGBM和CatBoost模型,挖掘与乳腺浸润癌预后相关的蛋白质编码基因标志物,并比较模型性能。使用基因表达综合数据库中的乳腺癌表达数据进行验证。结果 共筛选出151个与生存预后相关的差异表达蛋白质编码基因。用C3orf80、UGP2和SPC25建立的机器学习模型表现最佳。结论 筛选出三个蛋白质编码基因(UGP2、C3orf80和SPC25)用于识别乳腺浸润癌。本研究为乳腺浸润癌的治疗和诊断提供了新方向。

相似文献

1
[Identification of Protein-Coding Gene Markers in Breast Invasive Carcinoma Based on Machine Learning].基于机器学习的乳腺浸润性癌蛋白质编码基因标志物的鉴定
Zhongguo Yi Xue Ke Xue Yuan Xue Bao. 2024 Apr;46(2):147-153. doi: 10.3881/j.issn.1000-503X.15717.
2
Using a machine learning approach to identify key prognostic molecules for esophageal squamous cell carcinoma.采用机器学习方法鉴定食管鳞癌的关键预后分子。
BMC Cancer. 2021 Aug 9;21(1):906. doi: 10.1186/s12885-021-08647-1.
3
EARN: an ensemble machine learning algorithm to predict driver genes in metastatic breast cancer.EARN:一种集成机器学习算法,用于预测转移性乳腺癌的驱动基因。
BMC Med Genomics. 2021 May 7;14(1):122. doi: 10.1186/s12920-021-00974-3.
4
Identification of a 6-gene signature for the survival prediction of breast cancer patients based on integrated multi-omics data analysis.基于整合多组学数据分析的乳腺癌患者生存预测的 6 基因标志物的鉴定。
PLoS One. 2020 Nov 10;15(11):e0241924. doi: 10.1371/journal.pone.0241924. eCollection 2020.
5
Construction and Validation of a Prognostic Model Based on mRNAsi-Related Genes in Breast Cancer.基于 mRNA 相关基因的乳腺癌预后模型的构建与验证。
Comput Math Methods Med. 2022 Oct 11;2022:6532591. doi: 10.1155/2022/6532591. eCollection 2022.
6
Machine learning-based models for the prediction of breast cancer recurrence risk.基于机器学习的乳腺癌复发风险预测模型。
BMC Med Inform Decis Mak. 2023 Nov 29;23(1):276. doi: 10.1186/s12911-023-02377-z.
7
Uncovering the potential differentially expressed miRNAs as diagnostic biomarkers for hepatocellular carcinoma based on machine learning in The Cancer Genome Atlas database.基于癌症基因组图谱数据库中的机器学习,揭示潜在的差异表达 miRNA 作为肝细胞癌的诊断生物标志物。
Oncol Rep. 2020 Jun;43(6):1771-1784. doi: 10.3892/or.2020.7551. Epub 2020 Mar 19.
8
Machine learning-based prediction model for distant metastasis of breast cancer.基于机器学习的乳腺癌远处转移预测模型。
Comput Biol Med. 2024 Feb;169:107943. doi: 10.1016/j.compbiomed.2024.107943. Epub 2024 Jan 6.
9
Machine learning model-based risk prediction of severe complications after off-pump coronary artery bypass grafting.基于机器学习模型的非体外循环冠状动脉搭桥术后严重并发症风险预测
Adv Clin Exp Med. 2023 Feb;32(2):185-194. doi: 10.17219/acem/152895.
10
Gene expression profiling revealed MCM3 to be a better marker than Ki67 in prognosis of invasive ductal breast carcinoma patients.基因表达谱分析显示,MCM3 作为标志物在预测浸润性导管乳腺癌患者预后方面优于 Ki67。
Clin Exp Med. 2020 May;20(2):249-259. doi: 10.1007/s10238-019-00604-4. Epub 2020 Jan 24.