• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

贝叶斯多视图聚类,给定复杂的视图间结构。

Bayesian Multi-View Clustering given complex inter-view structure.

机构信息

Department of Computer Science, Johns Hopkins University, Baltimore, MD, 21218, USA.

Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA.

出版信息

F1000Res. 2024 Feb 29;11:1460. doi: 10.12688/f1000research.126215.2. eCollection 2022.

DOI:10.12688/f1000research.126215.2
Abstract

Multi-view datasets are becoming increasingly prevalent. These datasets consist of different modalities that provide complementary characterizations of the same underlying system. They can include heterogeneous types of information with complex relationships, varying degrees of missingness, and assorted sample sizes, as is often the case in multi-omic biological studies. Clustering multi-view data allows us to leverage different modalities to infer underlying systematic structure, but most existing approaches are limited to contexts in which entities are the same across views or have clear one-to-one relationships across data types with a common sample size. Many methods also make strong assumptions about the similarities of clusterings across views. We propose a Bayesian multi-view clustering approach (BMVC) which can handle the realities of multi-view datasets that often have complex relationships and diverse structure. BMVC incorporates known and complex many-to-many relationships between entities via a probabilistic graphical model that enables the joint inference of clusterings specific to each view, but where each view informs the others. Additionally, BMVC estimates the strength of the relationships between each pair of views, thus moderating the degree to which it imposes dependence constraints. We benchmarked BMVC on simulated data to show that it accurately estimates varying degrees of inter-view dependence when inter-view relationships are not limited to one-to-one correspondence. Next, we demonstrated its ability to capture visually interpretable inter-view structure in a public health survey of individuals and households in Puerto Rico following Hurricane Maria. Finally, we showed that BMVC clusters integrate the complex relationships between multi-omic profiles of breast cancer patient data, improving the biological homogeneity of clusters and elucidating hypotheses for functional biological mechanisms. We found that BMVC leverages complex inter-view structure to produce higher quality clusters than those generated by standard approaches. We also showed that BMVC is a valuable tool for real-world discovery and hypothesis generation.

摘要

多视图数据集越来越普遍。这些数据集由不同的模态组成,这些模态提供了对同一底层系统的互补描述。它们可以包括具有复杂关系、不同程度缺失和各种样本大小的异构类型的信息,这在多组学生物学研究中经常发生。聚类多视图数据可以让我们利用不同的模态来推断底层的系统结构,但大多数现有的方法都局限于实体在视图中是相同的或在数据类型之间具有明确的一一对应关系并且具有相同的样本大小的情况。许多方法还对视图之间的聚类相似度做出了很强的假设。我们提出了一种贝叶斯多视图聚类方法(BMVC),可以处理多视图数据集的实际情况,这些数据集通常具有复杂的关系和多样的结构。BMVC 通过概率图形模型来处理实体之间已知的和复杂的多对多关系,该模型可以联合推断特定于每个视图的聚类,但每个视图也可以为其他视图提供信息。此外,BMVC 还估计了每对视图之间的关系强度,从而适度地强加了依赖约束的程度。我们在模拟数据上对 BMVC 进行了基准测试,以表明当视图之间的关系不限于一对一对应时,它可以准确地估计不同程度的视图间依赖关系。接下来,我们展示了它在波多黎各飓风玛丽亚后对个人和家庭的公共卫生调查中捕捉直观的视图间结构的能力。最后,我们表明,BMVC 聚类可以整合乳腺癌患者数据的多组学特征之间的复杂关系,从而提高聚类的生物学同质性,并阐明功能生物学机制的假设。我们发现,BMVC 利用复杂的视图间结构生成了比标准方法生成的聚类质量更高的聚类。我们还表明,BMVC 是现实世界发现和假设生成的有价值的工具。

相似文献

1
Bayesian Multi-View Clustering given complex inter-view structure.贝叶斯多视图聚类,给定复杂的视图间结构。
F1000Res. 2024 Feb 29;11:1460. doi: 10.12688/f1000research.126215.2. eCollection 2022.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Subtype identification from heterogeneous TCGA datasets on a genomic scale by multi-view clustering with enhanced consensus.通过具有增强一致性的多视图聚类,从基因组规模的异质TCGA数据集中进行亚型识别。
BMC Med Genomics. 2017 Dec 21;10(Suppl 4):75. doi: 10.1186/s12920-017-0306-x.
4
A Bayesian two-way latent structure model for genomic data integration reveals few pan-genomic cluster subtypes in a breast cancer cohort.贝叶斯双向潜在结构模型用于基因组数据整合,揭示乳腺癌队列中很少有泛基因组聚类亚型。
Bioinformatics. 2019 Dec 1;35(23):4886-4897. doi: 10.1093/bioinformatics/btz381.
5
Multi-omic and multi-view clustering algorithms: review and cancer benchmark.多组学和多视角聚类算法:综述和癌症基准测试。
Nucleic Acids Res. 2018 Nov 16;46(20):10546-10562. doi: 10.1093/nar/gky889.
6
Clustering single-cell multi-omics data via graph regularized multi-view ensemble learning.通过图正则化多视图集成学习对单细胞多组学数据进行聚类。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae169.
7
MONET: Multi-omic module discovery by omic selection.MONET:通过组学选择进行多组学模块发现。
PLoS Comput Biol. 2020 Sep 15;16(9):e1008182. doi: 10.1371/journal.pcbi.1008182. eCollection 2020 Sep.
8
COPS: A novel platform for multi-omic disease subtype discovery via robust multi-objective evaluation of clustering algorithms.COPS:一种通过稳健的聚类算法多目标评估发现多组学疾病亚型的新平台。
PLoS Comput Biol. 2024 Aug 5;20(8):e1012275. doi: 10.1371/journal.pcbi.1012275. eCollection 2024 Aug.
9
Consensus clustering for Bayesian mixture models.贝叶斯混合模型的一致性聚类。
BMC Bioinformatics. 2022 Jul 21;23(1):290. doi: 10.1186/s12859-022-04830-8.
10
Directionally dependent multi-view clustering using copula model.基于Copula 模型的有向多视图聚类方法。
PLoS One. 2020 Oct 23;15(10):e0238996. doi: 10.1371/journal.pone.0238996. eCollection 2020.

本文引用的文献

1
The PPARα and PPARγ Epigenetic Landscape in Cancer and Immune and Metabolic Disorders.PPARα 和 PPARγ 在癌症及免疫和代谢紊乱中的表观遗传景观。
Int J Mol Sci. 2021 Sep 30;22(19):10573. doi: 10.3390/ijms221910573.
2
DNA methylation landscapes of 1538 breast cancers reveal a replication-linked clock, epigenomic instability and cis-regulation.1538 例乳腺癌的 DNA 甲基化图谱揭示了与复制相关的时钟、表观基因组不稳定性和顺式调控。
Nat Commun. 2021 Sep 13;12(1):5406. doi: 10.1038/s41467-021-25661-w.
3
Targeting Pyrimidine Metabolism in the Era of Precision Cancer Medicine.
精准癌症医学时代的嘧啶代谢靶向治疗
Front Oncol. 2021 May 28;11:684961. doi: 10.3389/fonc.2021.684961. eCollection 2021.
4
The role of lysosomes in cancer development and progression.溶酶体在癌症发生和发展中的作用。
Cell Biosci. 2020 Nov 18;10(1):131. doi: 10.1186/s13578-020-00489-x.
5
Sex differences between women and men with COPD: A new analysis of the 3CIA study.COPD 患者中女性与男性的性别差异:3CIA 研究的新分析。
Respir Med. 2020 Sep;171:106105. doi: 10.1016/j.rmed.2020.106105. Epub 2020 Aug 13.
6
Hypoxia-induced changes in intragenic DNA methylation correlate with alternative splicing in breast cancer.缺氧诱导的基因内 DNA 甲基化变化与乳腺癌中的可变剪接相关。
J Biosci. 2020;45. doi: 10.1007/s12038-019-9977-0.
7
DNA methylation loss promotes immune evasion of tumours with high mutation and copy number load.DNA 甲基化缺失促进具有高突变和拷贝数负荷的肿瘤的免疫逃逸。
Nat Commun. 2019 Sep 19;10(1):4278. doi: 10.1038/s41467-019-12159-9.
8
Epigenetic Priming in Immunodeficiencies.免疫缺陷中的表观遗传启动
Front Cell Dev Biol. 2019 Jul 10;7:125. doi: 10.3389/fcell.2019.00125. eCollection 2019.
9
GPCR Modulation in Breast Cancer.G 蛋白偶联受体在乳腺癌中的调节作用。
Int J Mol Sci. 2018 Dec 2;19(12):3840. doi: 10.3390/ijms19123840.
10
Frequent basal cell cancer development is a clinical marker for inherited cancer susceptibility.频繁发生基底细胞癌是遗传性癌症易感性的临床标志物。
JCI Insight. 2018 Aug 9;3(15). doi: 10.1172/jci.insight.122744.