• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于基因表达数据的 Cox 回归和随机生存森林模型预测结直肠癌生存。

Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data.

机构信息

School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, Scottsville, South Africa.

Faculty of Mathematical and Computer Sciences, University of Gezira, Wad Madani, Sudan.

出版信息

PLoS One. 2021 Dec 29;16(12):e0261625. doi: 10.1371/journal.pone.0261625. eCollection 2021.

DOI:10.1371/journal.pone.0261625
PMID:34965262
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8716055/
Abstract

Understanding and identifying the markers and clinical information that are associated with colorectal cancer (CRC) patient survival is needed for early detection and diagnosis. In this work, we aimed to build a simple model using Cox proportional hazards (PH) and random survival forest (RSF) and find a robust signature for predicting CRC overall survival. We used stepwise regression to develop Cox PH model to analyse 54 common differentially expressed genes from three mutations. RSF is applied using log-rank and log-rank-score based on 5000 survival trees, and therefore, variables important obtained to find the genes that are most influential for CRC survival. We compared the predictive performance of the Cox PH model and RSF for early CRC detection and diagnosis. The results indicate that SLC9A8, IER5, ARSJ, ANKRD27, and PIPOX genes were significantly associated with the CRC overall survival. In addition, age, sex, and stages are also affecting the CRC overall survival. The RSF model using log-rank is better than log-rank-score, while log-rank-score needed more trees to stabilize. Overall, the imputation of missing values enhanced the model's predictive performance. In addition, Cox PH predictive performance was better than RSF.

摘要

了解和识别与结直肠癌(CRC)患者生存相关的标志物和临床信息,对于早期检测和诊断至关重要。在这项工作中,我们旨在使用 Cox 比例风险(PH)和随机生存森林(RSF)构建一个简单的模型,并找到用于预测 CRC 总生存的稳健特征。我们使用逐步回归方法来开发 Cox PH 模型,以分析来自三种突变的 54 个常见差异表达基因。RSF 使用基于 5000 棵生存树的对数秩和对数秩评分进行应用,因此可以获得重要的变量来寻找对 CRC 生存最有影响的基因。我们比较了 Cox PH 模型和 RSF 用于早期 CRC 检测和诊断的预测性能。结果表明,SLC9A8、IER5、ARSJ、ANKRD27 和 PIPOX 基因与 CRC 总生存显著相关。此外,年龄、性别和分期也会影响 CRC 的总生存。使用对数秩的 RSF 模型比对数秩评分要好,而对数秩评分需要更多的树来稳定。总体而言,缺失值的插补提高了模型的预测性能。此外,Cox PH 的预测性能优于 RSF。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/0eac1c58b7e3/pone.0261625.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/beca4599a4c0/pone.0261625.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/77afa72b0d8f/pone.0261625.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/759b533f8e03/pone.0261625.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/a93136643165/pone.0261625.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/581003af83d8/pone.0261625.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/e132ecbee986/pone.0261625.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/9f72d6c6e47e/pone.0261625.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/0eac1c58b7e3/pone.0261625.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/beca4599a4c0/pone.0261625.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/77afa72b0d8f/pone.0261625.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/759b533f8e03/pone.0261625.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/a93136643165/pone.0261625.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/581003af83d8/pone.0261625.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/e132ecbee986/pone.0261625.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/9f72d6c6e47e/pone.0261625.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6f9a/8716055/0eac1c58b7e3/pone.0261625.g008.jpg

相似文献

1
Predictors of colorectal cancer survival using cox regression and random survival forests models based on gene expression data.基于基因表达数据的 Cox 回归和随机生存森林模型预测结直肠癌生存。
PLoS One. 2021 Dec 29;16(12):e0261625. doi: 10.1371/journal.pone.0261625. eCollection 2021.
2
Predicting Colorectal Cancer Survival Using Time-to-Event Machine Learning: Retrospective Cohort Study.基于生存事件的机器学习预测结直肠癌患者生存情况:回顾性队列研究。
J Med Internet Res. 2023 Oct 26;25:e44417. doi: 10.2196/44417.
3
Novel head and neck cancer survival analysis approach: random survival forests versus Cox proportional hazards regression.新型头颈部癌症生存分析方法:随机生存森林与 Cox 比例风险回归。
Head Neck. 2012 Jan;34(1):50-8. doi: 10.1002/hed.21698. Epub 2011 Feb 14.
4
Individual risk prediction: Comparing random forests with Cox proportional-hazards model by a simulation study.个体风险预测:通过模拟研究比较随机森林与 Cox 比例风险模型。
Biom J. 2023 Aug;65(6):e2100380. doi: 10.1002/bimj.202100380. Epub 2022 Sep 28.
5
Discovery of a novel six-long non-coding RNA signature predicting survival of colorectal cancer patients.发现一种新型的六长非编码 RNA 标志物,可预测结直肠癌患者的生存情况。
J Cell Biochem. 2018 Apr;119(4):3574-3585. doi: 10.1002/jcb.26548. Epub 2018 Jan 15.
6
Survival marker genes of colorectal cancer derived from consistent transcriptomic profiling.基于一致的转录组分析的结直肠癌生存标记基因。
BMC Genomics. 2018 Dec 11;19(Suppl 8):857. doi: 10.1186/s12864-018-5193-9.
7
Identification of a five-gene signature with prognostic value in colorectal cancer.鉴定结直肠癌中具有预后价值的五个基因标志物。
J Cell Physiol. 2019 Apr;234(4):3829-3836. doi: 10.1002/jcp.27154. Epub 2018 Aug 21.
8
A five-immune-related genes-based prognostic signature for colorectal cancer.一种基于五个免疫相关基因的结直肠癌预后特征。
Int Immunopharmacol. 2020 Nov;88:106866. doi: 10.1016/j.intimp.2020.106866. Epub 2020 Aug 8.
9
Exploring prognostic potential of long noncoding RNAs in colorectal cancer based on a competing endogenous RNA network.基于竞争性内源性 RNA 网络探索长链非编码 RNA 在结直肠癌中的预后潜力。
World J Gastroenterol. 2020 Mar 28;26(12):1298-1316. doi: 10.3748/wjg.v26.i12.1298.
10
Identifying important risk factors for survival in patient with systolic heart failure using random survival forests.使用随机生存森林法识别收缩性心力衰竭患者生存的重要危险因素。
Circ Cardiovasc Qual Outcomes. 2011 Jan 1;4(1):39-45. doi: 10.1161/CIRCOUTCOMES.110.939371. Epub 2010 Nov 23.

引用本文的文献

1
The development of a multimodal prediction model based on CT and MRI for the prognosis of pancreatic cancer.基于CT和MRI的胰腺癌预后多模态预测模型的开发。
BMC Gastroenterol. 2025 Aug 6;25(1):557. doi: 10.1186/s12876-025-04119-z.
2
Research on RNA modification in disease diagnosis and prognostic biomarkers: current status and challenges.疾病诊断和预后生物标志物中RNA修饰的研究:现状与挑战
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf361.
3
Development and Spatial External Validation of a Predictive Model of Survival Based on Random Survival Forest Analysis for People Living With HIV and AIDS After Highly Active Antiretroviral Therapy in China: Retrospective Cohort Study.

本文引用的文献

1
Predictors of singleton preterm birth using multinomial regression models accounting for missing data: A birth registry-based cohort study in northern Tanzania.利用多分类回归模型预测单胎早产的预测因素:坦桑尼亚北部基于出生登记的队列研究。
PLoS One. 2021 Apr 1;16(4):e0249411. doi: 10.1371/journal.pone.0249411. eCollection 2021.
2
Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries.《全球癌症统计数据 2020:全球 185 个国家和地区 36 种癌症的发病率和死亡率估计》。
CA Cancer J Clin. 2021 May;71(3):209-249. doi: 10.3322/caac.21660. Epub 2021 Feb 4.
3
基于随机生存森林分析的中国接受高效抗逆转录病毒治疗的艾滋病毒/艾滋病患者生存预测模型的开发与空间外部验证:回顾性队列研究
J Med Internet Res. 2025 Jun 2;27:e71257. doi: 10.2196/71257.
4
Integrating analysis of multi-omics summary data identifies novel plasma protein biomarkers and drug targets for bladder cancer.多组学汇总数据的整合分析鉴定出膀胱癌新的血浆蛋白生物标志物和药物靶点。
Discov Oncol. 2025 May 2;16(1):660. doi: 10.1007/s12672-025-02476-5.
5
Biomarker-driven drug repurposing for NAFLD-associated hepatocellular carcinoma using machine learning integrated ensemble feature selection.使用机器学习集成特征选择技术,基于生物标志物的非酒精性脂肪性肝病相关肝细胞癌药物再利用研究
Front Bioinform. 2025 Apr 17;5:1522401. doi: 10.3389/fbinf.2025.1522401. eCollection 2025.
6
Exploration of ANKRD27 as an immune-related prognostic factor in pan-cancer and hepatocellular carcinoma.探索ANKRD27作为泛癌和肝细胞癌中免疫相关预后因素的作用。
Front Oncol. 2025 Jan 6;14:1511240. doi: 10.3389/fonc.2024.1511240. eCollection 2024.
7
Cytokine profiles as predictors of HIV incidence using machine learning survival models and statistical interpretable techniques.使用机器学习生存模型和统计可解释技术,将细胞因子谱作为HIV发病率的预测指标
Sci Rep. 2024 Dec 2;14(1):29895. doi: 10.1038/s41598-024-81510-y.
8
Development and validation of a machine learning model to predict prognosis in liver failure patients treated with non-bioartificial liver support system.一种用于预测接受非生物人工肝支持系统治疗的肝衰竭患者预后的机器学习模型的开发与验证。
Front Med (Lausanne). 2024 Mar 13;11:1368899. doi: 10.3389/fmed.2024.1368899. eCollection 2024.
9
A novel lipid metabolism-based risk model associated with immunosuppressive mechanisms in diffuse large B-cell lymphoma.一种新型基于脂质代谢的风险模型与弥漫性大 B 细胞淋巴瘤的免疫抑制机制相关。
Lipids Health Dis. 2024 Jan 22;23(1):20. doi: 10.1186/s12944-024-02017-z.
10
Use of survival support vector machine combined with random survival forest to predict the survival of nasopharyngeal carcinoma patients.使用生存支持向量机结合随机生存森林预测鼻咽癌患者的生存情况。
Transl Cancer Res. 2023 Dec 31;12(12):3581-3590. doi: 10.21037/tcr-23-316. Epub 2023 Dec 21.
Sexual Dimorphism in Colon Cancer.
结肠癌中的性别二态性。
Front Oncol. 2020 Dec 9;10:607909. doi: 10.3389/fonc.2020.607909. eCollection 2020.
4
Predictors of perinatal death in the presence of missing data: A birth registry-based study in northern Tanzania.有缺失数据时围产儿死亡的预测因素:坦桑尼亚北部基于出生登记的研究。
PLoS One. 2020 Apr 16;15(4):e0231636. doi: 10.1371/journal.pone.0231636. eCollection 2020.
5
Identification of key genes for predicting colorectal cancer prognosis by integrated bioinformatics analysis.通过综合生物信息学分析鉴定预测结直肠癌预后的关键基因
Oncol Lett. 2020 Jan;19(1):388-398. doi: 10.3892/ol.2019.11068. Epub 2019 Nov 7.
6
Four targeted genes for predicting the prognosis of colorectal cancer: A bioinformatics analysis case.预测结直肠癌预后的四个靶向基因:一项生物信息学分析案例
Oncol Lett. 2019 Nov;18(5):5043-5054. doi: 10.3892/ol.2019.10866. Epub 2019 Sep 13.
7
Prognosis Prediction of Colorectal Cancer Using Gene Expression Profiles.利用基因表达谱预测结直肠癌的预后
Front Oncol. 2019 Apr 9;9:252. doi: 10.3389/fonc.2019.00252. eCollection 2019.
8
A Selective Review on Random Survival Forests for High Dimensional Data.高维数据随机生存森林的选择性综述
Quant Biosci. 2017;36(2):85-96. doi: 10.22283/qbs.2017.36.2.85.
9
Survival marker genes of colorectal cancer derived from consistent transcriptomic profiling.基于一致的转录组分析的结直肠癌生存标记基因。
BMC Genomics. 2018 Dec 11;19(Suppl 8):857. doi: 10.1186/s12864-018-5193-9.
10
Heterogeneity in Colorectal Cancer: A Challenge for Personalized Medicine?结直肠癌的异质性:对个体化医学的挑战?
Int J Mol Sci. 2018 Nov 23;19(12):3733. doi: 10.3390/ijms19123733.