Pattern fusion analysis by adaptive alignment of multiple heterogeneous omics data.

Affiliations

Key Laboratory of Systems Biology, CAS Center for Excellence in Molecular Cell Science, Innovation Center for Cell Signaling Network, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences; University of Chinese Academy of Sciences, Shanghai 200031, China.

State Key Laboratory of Software Engineering, School of Computer, Wuhan University, Wuhan 430072, China.

Publication information

Bioinformatics. 2017 Sep 1;33(17):2706-2714. doi: 10.1093/bioinformatics/btx176.

DOI: 10.1093/bioinformatics/btx176
PMID: 28520848
Abstract

MOTIVATION

Integrating different omics profiles is a challenging task, but it offers a comprehensive, multi-view way to understand complex diseases. One key to such integration is extracting intrinsic patterns that agree with the data structures, so that consistent information can be discovered across data types even in the presence of noise. We therefore propose a novel framework called 'pattern fusion analysis' (PFA), which performs automated information alignment and bias correction to fuse local sample-patterns (e.g. from each data type) into a global sample-pattern corresponding to phenotypes (e.g. across most data types). In particular, PFA can identify significant sample-patterns from different omics profiles by optimally adjusting the effect of each data type on the patterns, thereby alleviating the problems of handling heterogeneous data from different platforms and with different reliability levels.
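The idea of fusing local sample-patterns into a global one with data-driven weights can be illustrated with a toy sketch. This is not the PFA algorithm and not the API of the Matlab package: the function names `make_view` and `fuse_patterns`, the leave-one-out agreement score, and the inverse-error weighting are all assumptions chosen for illustration. Each "data type" is reduced to one score per sample, and a data type that disagrees with the others receives a smaller weight.

```python
import random


def make_view(truth, noise_sd, rng):
    """Hypothetical helper: one local sample-pattern = truth plus Gaussian noise."""
    return [t + rng.gauss(0.0, noise_sd) for t in truth]


def fuse_patterns(local_patterns, eps=1e-12):
    """Fuse per-data-type sample scores into one global pattern.

    Illustrative weighting only (an assumption, not PFA itself): each view is
    compared with the mean of the other views, and its weight is proportional
    to the inverse of that mean squared disagreement.
    """
    n_views = len(local_patterns)
    if n_views < 2:
        raise ValueError("need at least two data types to compare")
    n_samples = len(local_patterns[0])

    errors = []
    for v, view in enumerate(local_patterns):
        others = [p for u, p in enumerate(local_patterns) if u != v]
        # Leave-one-out consensus: average of all other views, per sample.
        consensus = [sum(p[i] for p in others) / len(others)
                     for i in range(n_samples)]
        errors.append(sum((a - b) ** 2 for a, b in zip(view, consensus))
                      / n_samples)

    inverse = [1.0 / (e + eps) for e in errors]
    total = sum(inverse)
    weights = [x / total for x in inverse]  # normalised to sum to 1

    # Global pattern: weighted average of the local patterns, per sample.
    fused = [sum(w * view[i] for w, view in zip(weights, local_patterns))
             for i in range(n_samples)]
    return fused, weights


if __name__ == "__main__":
    rng = random.Random(0)
    truth = [-1.0] * 20 + [1.0] * 20           # two sample groups
    views = [make_view(truth, sd, rng) for sd in (0.1, 0.1, 2.0)]
    fused, weights = fuse_patterns(views)
    print("weights per data type:", [round(w, 3) for w in weights])
```

In this toy setting the third, noisier data type is down-weighted, which mirrors the abstract's point about adjusting each data type's effect according to its reliability; the actual method operates on full omics matrices and should be taken from the paper and its package.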

RESULTS

To validate the effectiveness of our method, we first tested PFA on various synthetic datasets and found that, in contrast to state-of-the-art methods such as iClusterPlus, SNF and moCluster, PFA can not only capture the intrinsic sample clustering structures in multi-omics data but also provide an automatic weighting scheme that measures the contribution of each data type or even each sample. In addition, the computational results show that PFA can reveal shared and complementary sample-patterns across data types with distinct signal-to-noise ratios in the Cancer Cell Line Encyclopedia (CCLE) datasets, and that it outperforms other methods at identifying clinically distinct cancer subtypes in The Cancer Genome Atlas (TCGA) datasets.

AVAILABILITY AND IMPLEMENTATION

PFA has been implemented as a Matlab package, which is available at http://www.sysbio.ac.cn/cb/chenlab/images/PFApackage_0.1.rar.

CONTACT

lnchen@sibs.ac.cn, liujuan@whu.edu.cn or zengtao@sibs.ac.cn.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Similar articles

1. Pattern fusion analysis by adaptive alignment of multiple heterogeneous omics data.
   Bioinformatics. 2017 Sep 1;33(17):2706-2714. doi: 10.1093/bioinformatics/btx176.
2. Discovering personalized driver mutation profiles of single samples in cancer by network control strategy.
   Bioinformatics. 2018 Jun 1;34(11):1893-1903. doi: 10.1093/bioinformatics/bty006.
3. Clustering and variable selection evaluation of 13 unsupervised methods for multi-omics data integration.
   Brief Bioinform. 2020 Dec 1;21(6):2011-2030. doi: 10.1093/bib/bbz138.
4. Multi-view Subspace Clustering Analysis for Aggregating Multiple Heterogeneous Omics Data.
   Front Genet. 2019 Aug 20;10:744. doi: 10.3389/fgene.2019.00744. eCollection 2019.
5. Single cell clustering based on cell-pair differentiability correlation and variance analysis.
   Bioinformatics. 2018 Nov 1;34(21):3684-3694. doi: 10.1093/bioinformatics/bty390.
6. Fast dimension reduction and integrative clustering of multi-omics data using low-rank approximation: application to cancer molecular classification.
   BMC Genomics. 2015 Dec 1;16:1022. doi: 10.1186/s12864-015-2223-8.
7. MCentridFS: a tool for identifying module biomarkers for multi-phenotypes from high-throughput data.
   Mol Biosyst. 2014 Nov;10(11):2870-5. doi: 10.1039/c4mb00325j.
8. Multi-view manifold regularized compact low-rank representation for cancer samples clustering on multi-omics data.
   BMC Bioinformatics. 2022 Jan 20;22(Suppl 12):334. doi: 10.1186/s12859-021-04220-6.
9. High-Order Correlation Integration for Single-Cell or Bulk RNA-seq Data Analysis.
   Front Genet. 2019 Apr 26;10:371. doi: 10.3389/fgene.2019.00371. eCollection 2019.
10. Fuse: multiple network alignment via data fusion.
    Bioinformatics. 2016 Apr 15;32(8):1195-203. doi: 10.1093/bioinformatics/btv731. Epub 2015 Dec 14.

Cited by

1. Spatially informed graph transformers for spatially resolved transcriptomics.
   Commun Biol. 2025 Apr 6;8(1):574. doi: 10.1038/s42003-025-08015-w.
2. Spatially aligned graph transfer learning for characterizing spatial regulatory heterogeneity.
   Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf021.
3. PartIES: a disease subtyping framework with Partition-level Integration using diffusion-Enhanced Similarities from multi-omics Data.
   Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae609.
4. IPFMC: an iterative pathway fusion approach for enhanced multi-omics clustering in cancer research.
   Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae541.
5. Bioinformatics Analysis and Validation of Potential Markers Associated with Prediction and Prognosis of Gastric Cancer.
   Int J Mol Sci. 2024 May 28;25(11):5880. doi: 10.3390/ijms25115880.
6. Multi-modal domain adaptation for revealing spatial functional landscape from spatially resolved transcriptomics.
   Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae257.
7. Integrating omics atlas in health informatics system design-an opinion article.
   Front Digit Health. 2024 May 9;6:1374359. doi: 10.3389/fdgth.2024.1374359. eCollection 2024.
8. Multi-modal molecular determinants of clinically relevant osteoporosis subtypes.
   Cell Discov. 2024 Mar 12;10(1):28. doi: 10.1038/s41421-024-00652-5.
9. Deeply integrating latent consistent representations in high-noise multi-omics data for cancer subtyping.
   Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae061.
10. Spatially contrastive variational autoencoder for deciphering tissue heterogeneity from spatially resolved transcriptomics.
    Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae016.