• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用缺氧特征的集成预测早期乳腺癌患者的生存。

Prediction of early breast cancer patient survival using ensembles of hypoxia signatures.

机构信息

Ontario Institute for Cancer Research, Toronto, ON, Canada.

Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada.

出版信息

PLoS One. 2018 Sep 14;13(9):e0204123. doi: 10.1371/journal.pone.0204123. eCollection 2018.

DOI:10.1371/journal.pone.0204123
PMID:30216362
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6138385/
Abstract

BACKGROUND

Biomarkers are a key component of precision medicine. However, full clinical integration of biomarkers has been met with challenges, partly attributed to analytical difficulties. It has been shown that biomarker reproducibility is susceptible to data preprocessing approaches. Here, we systematically evaluated machine-learning ensembles of preprocessing methods as a general strategy to improve biomarker performance for prediction of survival from early breast cancer.

RESULTS

We risk stratified breast cancer patients into either low-risk or high-risk groups based on four published hypoxia signatures (Buffa, Winter, Hu, and Sorensen), using 24 different preprocessing approaches for microarray normalization. The 24 binary risk profiles determined for each hypoxia signature were combined using a random forest to evaluate the efficacy of a preprocessing ensemble classifier. We demonstrate that the best way of merging preprocessing methods varies from signature to signature, and that there is likely no 'best' preprocessing pipeline that is universal across datasets, highlighting the need to evaluate ensembles of preprocessing algorithms. Further, we developed novel signatures for each preprocessing method and the risk classifications from each were incorporated in a meta-random forest model. Interestingly, the classification of these biomarkers and its ensemble show striking consistency, demonstrating that similar intrinsic biological information are being faithfully represented. As such, these classification patterns further confirm that there is a subset of patients whose prognosis is consistently challenging to predict.

CONCLUSIONS

Performance of different prognostic signatures varies with pre-processing method. A simple classifier by unanimous voting of classifications is a reliable way of improving on single preprocessing methods. Future signatures will likely require integration of intrinsic and extrinsic clinico-pathological variables to better predict disease-related outcomes.

摘要

背景

生物标志物是精准医学的关键组成部分。然而,生物标志物的完全临床整合遇到了挑战,部分归因于分析困难。已经表明,生物标志物的可重复性容易受到数据预处理方法的影响。在这里,我们系统地评估了机器学习预处理方法的集成作为一种提高预测早期乳腺癌患者生存的生物标志物性能的通用策略。

结果

我们根据四个已发表的缺氧特征(Buffa、Winter、Hu 和 Sorensen),将乳腺癌患者分为低风险或高风险组,使用 24 种不同的微阵列归一化预处理方法。使用随机森林组合每个缺氧特征确定的 24 个二进制风险特征,以评估预处理集成分类器的效果。我们证明,合并预处理方法的最佳方法因特征而异,并且不太可能存在适用于所有数据集的“最佳”预处理管道,这突出表明需要评估预处理算法的集成。此外,我们为每种预处理方法开发了新的特征,并且将每种方法的风险分类纳入了元随机森林模型中。有趣的是,这些生物标志物的分类及其集成显示出惊人的一致性,表明正在忠实地表示类似的内在生物学信息。因此,这些分类模式进一步证实存在一组患者,其预后一直难以预测。

结论

不同预后特征的性能随预处理方法而变化。通过一致投票对分类进行简单分类是改进单一预处理方法的可靠方法。未来的特征可能需要整合内在和外在临床病理变量,以更好地预测与疾病相关的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/039c/6138385/cd3d22bf20fb/pone.0204123.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/039c/6138385/59a4bf6dde23/pone.0204123.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/039c/6138385/769da0ef9e33/pone.0204123.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/039c/6138385/cfee5639aec2/pone.0204123.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/039c/6138385/0caea98da0c9/pone.0204123.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/039c/6138385/cd3d22bf20fb/pone.0204123.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/039c/6138385/59a4bf6dde23/pone.0204123.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/039c/6138385/769da0ef9e33/pone.0204123.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/039c/6138385/cfee5639aec2/pone.0204123.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/039c/6138385/0caea98da0c9/pone.0204123.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/039c/6138385/cd3d22bf20fb/pone.0204123.g005.jpg

相似文献

1
Prediction of early breast cancer patient survival using ensembles of hypoxia signatures.利用缺氧特征的集成预测早期乳腺癌患者的生存。
PLoS One. 2018 Sep 14;13(9):e0204123. doi: 10.1371/journal.pone.0204123. eCollection 2018.
2
Mixture classification model based on clinical markers for breast cancer prognosis.基于临床标志物的乳腺癌预后混合分类模型。
Artif Intell Med. 2010 Feb-Mar;48(2-3):129-37. doi: 10.1016/j.artmed.2009.07.008. Epub 2009 Dec 14.
3
NCC-AUC: an AUC optimization method to identify multi-biomarker panel for cancer prognosis from genomic and clinical data.NCC-AUC:一种 AUC 优化方法,用于从基因组和临床数据中识别用于癌症预后的多生物标志物组。
Bioinformatics. 2015 Oct 15;31(20):3330-8. doi: 10.1093/bioinformatics/btv374. Epub 2015 Jun 18.
4
An ensemble predictive modeling framework for breast cancer classification.用于乳腺癌分类的集成预测建模框架。
Methods. 2017 Dec 1;131:128-134. doi: 10.1016/j.ymeth.2017.07.011. Epub 2017 Jul 15.
5
Fuzzy logic selection as a new reliable tool to identify molecular grade signatures in breast cancer--the INNODIAG study.模糊逻辑选择作为识别乳腺癌分子分级特征的一种新型可靠工具——INNODIAG研究
BMC Med Genomics. 2015 Feb 7;8:3. doi: 10.1186/s12920-015-0077-1.
6
Evaluation of public cancer datasets and signatures identifies TP53 mutant signatures with robust prognostic and predictive value.对公共癌症数据集和特征的评估确定了具有强大预后和预测价值的TP53突变特征。
BMC Cancer. 2015 Mar 26;15:179. doi: 10.1186/s12885-015-1102-7.
7
Tree-based machine learning algorithms identified minimal set of miRNA biomarkers for breast cancer diagnosis and molecular subtyping.基于树的机器学习算法确定了用于乳腺癌诊断和分子分型的最小 miRNA 生物标志物集。
Gene. 2018 Nov 30;677:111-118. doi: 10.1016/j.gene.2018.07.057. Epub 2018 Jul 25.
8
Design of a multi-signature ensemble classifier predicting neuroblastoma patients' outcome.多签名集成分类器设计,用于预测神经母细胞瘤患者的预后。
BMC Bioinformatics. 2012 Mar 28;13 Suppl 4(Suppl 4):S13. doi: 10.1186/1471-2105-13-S4-S13.
9
Improved breast cancer prognosis through the combination of clinical and genetic markers.通过临床和基因标志物相结合改善乳腺癌预后。
Bioinformatics. 2007 Jan 1;23(1):30-7. doi: 10.1093/bioinformatics/btl543. Epub 2006 Nov 26.
10
Prognostic gene signatures for patient stratification in breast cancer: accuracy, stability and interpretability of gene selection approaches using prior knowledge on protein-protein interactions.用于乳腺癌患者分层的预后基因特征:利用蛋白质 - 蛋白质相互作用的先验知识选择基因的方法的准确性、稳定性和可解释性。
BMC Bioinformatics. 2012 May 1;13:69. doi: 10.1186/1471-2105-13-69.

引用本文的文献

1
Identification of a gene expression signature associated with breast cancer survival and risk that improves clinical genomic platforms.鉴定与乳腺癌生存和风险相关的基因表达特征,该特征可改善临床基因组平台。
Bioinform Adv. 2023 Mar 22;3(1):vbad037. doi: 10.1093/bioadv/vbad037. eCollection 2023.
2
Proteomic discovery of non-invasive biomarkers of localized prostate cancer using mass spectrometry.使用质谱法对局限性前列腺癌进行非侵入性生物标志物的蛋白质组学发现。
Nat Rev Urol. 2021 Dec;18(12):707-724. doi: 10.1038/s41585-021-00500-1. Epub 2021 Aug 27.
3
Syndecan-1 Promotes Angiogenesis in Triple-Negative Breast Cancer through the Prognostically Relevant Tissue Factor Pathway and Additional Angiogenic Routes.

本文引用的文献

1
Comparison of pre-processing methods for Infinium HumanMethylation450 BeadChip array.比较 Infinium HumanMethylation450 BeadChip 阵列的预处理方法。
Bioinformatics. 2017 Oct 15;33(20):3151-3157. doi: 10.1093/bioinformatics/btx372.
2
The path to routine use of genomic biomarkers in the cancer clinic.癌症临床中基因组生物标志物常规应用之路。
Genome Res. 2015 Oct;25(10):1508-13. doi: 10.1101/gr.191114.115.
3
Spatial genomic heterogeneity within localized, multifocal prostate cancer.局限性、多灶性前列腺癌的空间基因组异质性。
Syndecan-1通过与预后相关的组织因子途径及其他血管生成途径促进三阴性乳腺癌血管生成。
Cancers (Basel). 2021 May 12;13(10):2318. doi: 10.3390/cancers13102318.
4
Hypoxia-Associated Prognostic Markers and Competing Endogenous RNA Co-Expression Networks in Breast Cancer.乳腺癌中缺氧相关预后标志物及竞争性内源性RNA共表达网络
Front Oncol. 2020 Dec 2;10:579868. doi: 10.3389/fonc.2020.579868. eCollection 2020.
Nat Genet. 2015 Jul;47(7):736-45. doi: 10.1038/ng.3315. Epub 2015 May 25.
4
Combining tumor genome simulation with crowdsourcing to benchmark somatic single-nucleotide-variant detection.将肿瘤基因组模拟与众包相结合,以评估体细胞单核苷酸变异检测。
Nat Methods. 2015 Jul;12(7):623-30. doi: 10.1038/nmeth.3407. Epub 2015 May 18.
5
Analysis of the genetic phylogeny of multifocal prostate cancer identifies multiple independent clonal expansions in neoplastic and morphologically normal prostate tissue.多灶性前列腺癌的遗传系统发育分析确定了肿瘤性和形态学正常前列腺组织中的多个独立克隆扩增。
Nat Genet. 2015 Apr;47(4):367-372. doi: 10.1038/ng.3221. Epub 2015 Mar 2.
6
Spatial and temporal diversity in genomic instability processes defines lung cancer evolution.基因组不稳定性过程中的空间和时间多样性定义了肺癌的进化。
Science. 2014 Oct 10;346(6206):251-6. doi: 10.1126/science.1253462.
7
The prognostic ease and difficulty of invasive breast carcinoma.浸润性乳腺癌的预后难易程度。
Cell Rep. 2014 Oct 9;9(1):129-142. doi: 10.1016/j.celrep.2014.08.073. Epub 2014 Oct 2.
8
Ensemble analyses improve signatures of tumour hypoxia and reveal inter-platform differences.整合分析改善了肿瘤缺氧特征并揭示了平台间差异。
BMC Bioinformatics. 2014 Jun 6;15:170. doi: 10.1186/1471-2105-15-170.
9
The causes and consequences of genetic heterogeneity in cancer evolution.癌症进化中遗传异质性的原因和后果。
Nature. 2013 Sep 19;501(7467):338-45. doi: 10.1038/nature12625.
10
Postdiagnosis supplement use and breast cancer prognosis in the After Breast Cancer Pooling Project.乳腺癌后汇集项目中诊断后补充剂使用与乳腺癌预后的关系
Breast Cancer Res Treat. 2013 Jun;139(2):529-37. doi: 10.1007/s10549-013-2548-4. Epub 2013 May 10.