• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

个体化患者数据融合定义了预后癌症亚型。

Patient-specific data fusion defines prognostic cancer subtypes.

机构信息

Cambridge Research Institute, Cancer Research UK, Cambridge, United Kingdom.

出版信息

PLoS Comput Biol. 2011 Oct;7(10):e1002227. doi: 10.1371/journal.pcbi.1002227. Epub 2011 Oct 20.

DOI:10.1371/journal.pcbi.1002227
PMID:22028636
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3197649/
Abstract

Different data types can offer complementary perspectives on the same biological phenomenon. In cancer studies, for example, data on copy number alterations indicate losses and amplifications of genomic regions in tumours, while transcriptomic data point to the impact of genomic and environmental events on the internal wiring of the cell. Fusing different data provides a more comprehensive model of the cancer cell than that offered by any single type. However, biological signals in different patients exhibit diverse degrees of concordance due to cancer heterogeneity and inherent noise in the measurements. This is a particularly important issue in cancer subtype discovery, where personalised strategies to guide therapy are of vital importance. We present a nonparametric Bayesian model for discovering prognostic cancer subtypes by integrating gene expression and copy number variation data. Our model is constructed from a hierarchy of Dirichlet Processes and addresses three key challenges in data fusion: (i) To separate concordant from discordant signals, (ii) to select informative features, (iii) to estimate the number of disease subtypes. Concordance of signals is assessed individually for each patient, giving us an additional level of insight into the underlying disease structure. We exemplify the power of our model in prostate cancer and breast cancer and show that it outperforms competing methods. In the prostate cancer data, we identify an entirely new subtype with extremely poor survival outcome and show how other analyses fail to detect it. In the breast cancer data, we find subtypes with superior prognostic value by using the concordant results. These discoveries were crucially dependent on our model's ability to distinguish concordant and discordant signals within each patient sample, and would otherwise have been missed. We therefore demonstrate the importance of taking a patient-specific approach, using highly-flexible nonparametric Bayesian methods.

摘要

不同的数据类型可以为同一生物现象提供互补的视角。例如,在癌症研究中,关于拷贝数改变的数据表明肿瘤中基因组区域的丢失和扩增,而转录组数据则表明基因组和环境事件对细胞内部布线的影响。融合不同的数据提供了比任何单一类型更全面的癌症细胞模型。然而,由于癌症异质性和测量中的固有噪声,不同患者的生物信号表现出不同程度的一致性。这在癌症亚型发现中是一个特别重要的问题,因为个性化的治疗策略对于指导治疗至关重要。我们提出了一种非参数贝叶斯模型,通过整合基因表达和拷贝数变异数据来发现预后癌症亚型。我们的模型是由一个狄利克雷过程层次结构构建的,解决了数据融合中的三个关键挑战:(i)分离一致和不一致的信号,(ii)选择信息丰富的特征,(iii)估计疾病亚型的数量。为每个患者单独评估信号的一致性,这使我们对潜在疾病结构有了额外的深入了解。我们在前列腺癌和乳腺癌中举例说明了我们模型的强大功能,并表明它优于竞争方法。在前列腺癌数据中,我们发现了一种具有极差生存结果的全新亚型,并展示了其他分析如何未能检测到它。在乳腺癌数据中,我们通过使用一致的结果找到了预后价值更高的亚型。这些发现都严重依赖于我们模型在每个患者样本中区分一致和不一致信号的能力,如果没有这种能力,这些发现将被忽略。因此,我们证明了采用特定于患者的方法,使用高度灵活的非参数贝叶斯方法的重要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faec/3197649/9685e6af2927/pcbi.1002227.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faec/3197649/0b9efa5870fd/pcbi.1002227.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faec/3197649/20dfa634ecc8/pcbi.1002227.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faec/3197649/57284fae4297/pcbi.1002227.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faec/3197649/01660d8ad02e/pcbi.1002227.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faec/3197649/8e7486804dce/pcbi.1002227.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faec/3197649/e2aab665a09e/pcbi.1002227.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faec/3197649/9685e6af2927/pcbi.1002227.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faec/3197649/0b9efa5870fd/pcbi.1002227.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faec/3197649/20dfa634ecc8/pcbi.1002227.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faec/3197649/57284fae4297/pcbi.1002227.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faec/3197649/01660d8ad02e/pcbi.1002227.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faec/3197649/8e7486804dce/pcbi.1002227.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faec/3197649/e2aab665a09e/pcbi.1002227.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/faec/3197649/9685e6af2927/pcbi.1002227.g007.jpg

相似文献

1
Patient-specific data fusion defines prognostic cancer subtypes.个体化患者数据融合定义了预后癌症亚型。
PLoS Comput Biol. 2011 Oct;7(10):e1002227. doi: 10.1371/journal.pcbi.1002227. Epub 2011 Oct 20.
2
Breast and prostate cancers harbor common somatic copy number alterations that consistently differ by race and are associated with survival.乳腺癌和前列腺癌存在常见的体细胞拷贝数改变,这些改变在不同种族之间存在一致性差异,并与生存相关。
BMC Med Genomics. 2020 Aug 20;13(1):116. doi: 10.1186/s12920-020-00765-2.
3
Evaluating key predictors of breast cancer through survival: a comparison of AFT frailty models with LASSO, ridge, and elastic net regularization.通过生存分析评估乳腺癌的关键预测因素:AFT脆弱模型与LASSO、岭回归和弹性网络正则化的比较
BMC Cancer. 2025 Apr 11;25(1):665. doi: 10.1186/s12885-025-14040-z.
4
Genomic subtypes of breast cancer identified by array-comparative genomic hybridization display distinct molecular and clinical characteristics.通过 array-comparative genomic hybridization 鉴定的乳腺癌基因组亚型具有不同的分子和临床特征。
Breast Cancer Res. 2010;12(3):R42. doi: 10.1186/bcr2596. Epub 2010 Jun 24.
5
Pan-cancer genomic analysis reveals FOXA1 amplification is associated with adverse outcomes in non-small cell lung, prostate, and breast cancers.泛癌基因组分析显示,FOXA1扩增与非小细胞肺癌、前列腺癌和乳腺癌的不良预后相关。
J Natl Cancer Inst. 2025 Jan 1;117(1):188-197. doi: 10.1093/jnci/djae224.
6
Novel insights into breast cancer copy number genetic heterogeneity revealed by single-cell genome sequencing.单细胞基因组测序揭示乳腺癌拷贝数遗传异质性的新见解。
Elife. 2020 May 13;9:e51480. doi: 10.7554/eLife.51480.
7
Basal-like Breast cancer DNA copy number losses identify genes involved in genomic instability, response to therapy, and patient survival.基底样乳腺癌的 DNA 拷贝数缺失鉴定了与基因组不稳定性、治疗反应和患者生存相关的基因。
Breast Cancer Res Treat. 2012 Jun;133(3):865-80. doi: 10.1007/s10549-011-1846-y. Epub 2011 Nov 3.
8
Micro-scale genomic DNA copy number aberrations as another means of mutagenesis in breast cancer.微尺度基因组 DNA 拷贝数异常作为乳腺癌的另一种诱变手段。
PLoS One. 2012;7(12):e51719. doi: 10.1371/journal.pone.0051719. Epub 2012 Dec 17.
9
Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis.基于联合潜在变量模型的多种基因组数据类型综合聚类及其在乳腺癌和肺癌亚型分析中的应用。
Bioinformatics. 2009 Nov 15;25(22):2906-12. doi: 10.1093/bioinformatics/btp543. Epub 2009 Sep 16.
10
Towards clinically more relevant dissection of patient heterogeneity via survival-based Bayesian clustering.基于生存的贝叶斯聚类,探索更具临床相关性的患者异质性解剖。
Bioinformatics. 2017 Nov 15;33(22):3558-3566. doi: 10.1093/bioinformatics/btx464.

引用本文的文献

1
BAYESIAN LEARNING OF CLINICALLY MEANINGFUL SEPSIS PHENOTYPES IN NORTHERN TANZANIA.坦桑尼亚北部临床上有意义的脓毒症表型的贝叶斯学习
Ann Appl Stat. 2025 Sep;19(3):2193-2217. doi: 10.1214/25-aoas2045. Epub 2025 Aug 28.
2
Multi-Omics Integration in Nephrology: Advances, Challenges, and Future Directions.肾脏病学中的多组学整合:进展、挑战与未来方向
Semin Nephrol. 2024 Nov;44(6):151584. doi: 10.1016/j.semnephrol.2025.151584. Epub 2025 Apr 11.
3
Patient-centric knowledge graphs: a survey of current methods, challenges, and applications.

本文引用的文献

1
HTSanalyzeR: an R/Bioconductor package for integrated network analysis of high-throughput screens.HTSanalyzeR:一个用于高通量筛选的综合网络分析的 R / Bioconductor 包。
Bioinformatics. 2011 Mar 15;27(6):879-80. doi: 10.1093/bioinformatics/btr028. Epub 2011 Jan 22.
2
Enrichment map: a network-based method for gene-set enrichment visualization and interpretation.富集图谱:一种基于网络的基因集富集可视化和解释方法。
PLoS One. 2010 Nov 15;5(11):e13984. doi: 10.1371/journal.pone.0013984.
3
Systems biology and genomics of breast cancer.
以患者为中心的知识图谱:当前方法、挑战及应用综述
Front Artif Intell. 2024 Oct 23;7:1388479. doi: 10.3389/frai.2024.1388479. eCollection 2024.
4
MoNETA: MultiOmics Network Embedding for SubType Analysis.MoNETA:用于亚型分析的多组学网络嵌入
NAR Genom Bioinform. 2024 Oct 16;6(4):lqae141. doi: 10.1093/nargab/lqae141. eCollection 2024 Sep.
5
Bioinformatics Analysis and Validation of Potential Markers Associated with Prediction and Prognosis of Gastric Cancer.生物信息学分析和验证与胃癌预测和预后相关的潜在标志物。
Int J Mol Sci. 2024 May 28;25(11):5880. doi: 10.3390/ijms25115880.
6
Integrating omics atlas in health informatics system design-an opinion article.将组学图谱整合到健康信息系统设计中——一篇观点文章。
Front Digit Health. 2024 May 9;6:1374359. doi: 10.3389/fdgth.2024.1374359. eCollection 2024.
7
Covering Hierarchical Dirichlet Mixture Models on binary data to enhance genomic stratifications in onco-hematology.覆盖二元数据的层次狄利克雷混合模型,以增强肿瘤血液学中的基因组分层。
PLoS Comput Biol. 2024 Feb 2;20(2):e1011299. doi: 10.1371/journal.pcbi.1011299. eCollection 2024 Feb.
8
Multi-omic approaches for host-microbiome data integration.基于组学的宿主-微生物组数据整合方法。
Gut Microbes. 2024 Jan-Dec;16(1):2297860. doi: 10.1080/19490976.2023.2297860. Epub 2024 Jan 2.
9
Applications of multi-omics analysis in human diseases.多组学分析在人类疾病中的应用。
MedComm (2020). 2023 Jul 31;4(4):e315. doi: 10.1002/mco2.315. eCollection 2023 Aug.
10
A Drug Repurposing Pipeline Based on Bladder Cancer Integrated Proteotranscriptomics Signatures.基于膀胱癌综合蛋白质转录组学特征的药物重新利用流程
Methods Mol Biol. 2023;2684:59-99. doi: 10.1007/978-1-0716-3291-8_4.
乳腺癌的系统生物学和基因组学。
Cold Spring Harb Perspect Biol. 2011 Feb 1;3(2):a003293. doi: 10.1101/cshperspect.a003293.
4
Comparative survival analysis of breast cancer microarray studies identifies important prognostic genetic pathways.比较乳腺癌基因芯片研究的生存分析确定了重要的预后遗传途径。
BMC Cancer. 2010 Oct 21;10:573. doi: 10.1186/1471-2407-10-573.
5
Endocytosis of titanium dioxide nanoparticles in prostate cancer PC-3M cells.二氧化钛纳米颗粒在前列腺癌细胞 PC-3M 中的内吞作用。
Nanomedicine. 2011 Apr;7(2):123-30. doi: 10.1016/j.nano.2010.09.004. Epub 2010 Sep 29.
6
Chemokine signaling in cancer: Implications on the tumor microenvironment and therapeutic targeting.癌症中的趋化因子信号传导:对肿瘤微环境和治疗靶点的影响
Cancer Ther. 2009 Apr 14;7(A):254-267.
7
Integrative genomic profiling of human prostate cancer.人类前列腺癌的综合基因组分析。
Cancer Cell. 2010 Jul 13;18(1):11-22. doi: 10.1016/j.ccr.2010.05.026. Epub 2010 Jun 24.
8
Discovering transcriptional modules by Bayesian data integration.基于贝叶斯数据整合的转录模块发现。
Bioinformatics. 2010 Jun 15;26(12):i158-67. doi: 10.1093/bioinformatics/btq210.
9
BioNet: an R-Package for the functional analysis of biological networks.BioNet:用于生物网络功能分析的 R 包。
Bioinformatics. 2010 Apr 15;26(8):1129-30. doi: 10.1093/bioinformatics/btq089. Epub 2010 Feb 25.
10
Modeling and visualizing uncertainty in gene expression clusters using dirichlet process mixtures.使用狄利克雷过程混合模型对基因表达聚类中的不确定性进行建模和可视化。
IEEE/ACM Trans Comput Biol Bioinform. 2009 Oct-Dec;6(4):615-28. doi: 10.1109/TCBB.2007.70269.