• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从多维基因组数据中识别多层基因调控模块。

Identifying multi-layer gene regulatory modules from multi-dimensional genomic data.

机构信息

Program in Molecular and Computational Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA.

出版信息

Bioinformatics. 2012 Oct 1;28(19):2458-66. doi: 10.1093/bioinformatics/bts476. Epub 2012 Aug 3.

DOI:10.1093/bioinformatics/bts476
PMID:22863767
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3463121/
Abstract

MOTIVATION

Eukaryotic gene expression (GE) is subjected to precisely coordinated multi-layer controls, across the levels of epigenetic, transcriptional and post-transcriptional regulations. Recently, the emerging multi-dimensional genomic dataset has provided unprecedented opportunities to study the cross-layer regulatory interplay. In these datasets, the same set of samples is profiled on several layers of genomic activities, e.g. copy number variation (CNV), DNA methylation (DM), GE and microRNA expression (ME). However, suitable analysis methods for such data are currently sparse.

RESULTS

In this article, we introduced a sparse Multi-Block Partial Least Squares (sMBPLS) regression method to identify multi-dimensional regulatory modules from this new type of data. A multi-dimensional regulatory module contains sets of regulatory factors from different layers that are likely to jointly contribute to a local 'gene expression factory'. We demonstrated the performance of our method on the simulated data as well as on The Cancer Genomic Atlas Ovarian Cancer datasets including the CNV, DM, ME and GE data measured on 230 samples. We showed that majority of identified modules have significant functional and transcriptional enrichment, higher than that observed in modules identified using only a single type of genomic data. Our network analysis of the modules revealed that the CNV, DM and microRNA can have coupled impact on expression of important oncogenes and tumor suppressor genes.

AVAILABILITY AND IMPLEMENTATION

The source code implemented by MATLAB is freely available at: http://zhoulab.usc.edu/sMBPLS/.

CONTACT

xjzhou@usc.edu

SUPPLEMENTARY INFORMATION

Supplementary material are available at Bioinformatics online.

摘要

动机

真核基因表达(GE)受到精确协调的多层次控制,跨越表观遗传、转录和转录后调控的水平。最近,新兴的多维基因组数据集为研究跨层调控相互作用提供了前所未有的机会。在这些数据集中,同一组样本在几个基因组活动层面上进行了分析,例如拷贝数变异(CNV)、DNA 甲基化(DM)、GE 和 microRNA 表达(ME)。然而,目前适合此类数据的分析方法还很少。

结果

在本文中,我们引入了一种稀疏多块偏最小二乘(sMBPLS)回归方法,用于从这种新型数据中识别多维调控模块。一个多维调控模块包含来自不同层的调控因子集,这些因子可能共同促成局部的“基因表达工厂”。我们在模拟数据以及包括 230 个样本的 CNV、DM、ME 和 GE 数据的癌症基因组图谱卵巢癌数据集上演示了我们方法的性能。我们表明,大多数鉴定的模块具有显著的功能和转录丰度富集,高于仅使用单一类型基因组数据鉴定的模块。我们对模块的网络分析表明,CNV、DM 和 microRNA 可以对重要癌基因和肿瘤抑制基因的表达产生耦合影响。

可用性和实现

用 MATLAB 实现的源代码可在以下网址免费获得:http://zhoulab.usc.edu/sMBPLS/。

联系人

xjzhou@usc.edu

补充信息

补充材料可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dac1/3463121/d601d1377589/bts476f5p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dac1/3463121/acd8d08c2484/bts476f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dac1/3463121/b766b5ac0df5/bts476f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dac1/3463121/e47f6e96638b/bts476f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dac1/3463121/5e8a5eb6acea/bts476f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dac1/3463121/d601d1377589/bts476f5p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dac1/3463121/acd8d08c2484/bts476f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dac1/3463121/b766b5ac0df5/bts476f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dac1/3463121/e47f6e96638b/bts476f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dac1/3463121/5e8a5eb6acea/bts476f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dac1/3463121/d601d1377589/bts476f5p.jpg

相似文献

1
Identifying multi-layer gene regulatory modules from multi-dimensional genomic data.从多维基因组数据中识别多层基因调控模块。
Bioinformatics. 2012 Oct 1;28(19):2458-66. doi: 10.1093/bioinformatics/bts476. Epub 2012 Aug 3.
2
A novel computational framework for simultaneous integration of multiple types of genomic data to identify microRNA-gene regulatory modules.一种新型的计算框架,用于同时整合多种类型的基因组数据,以识别 microRNA-基因调控模块。
Bioinformatics. 2011 Jul 1;27(13):i401-9. doi: 10.1093/bioinformatics/btr206.
3
Discovery of multi-dimensional modules by integrative analysis of cancer genomic data.通过癌症基因组数据的综合分析发现多维模块。
Nucleic Acids Res. 2012 Oct;40(19):9379-91. doi: 10.1093/nar/gks725. Epub 2012 Aug 8.
4
Identification of mutated core cancer modules by integrating somatic mutation, copy number variation, and gene expression data.通过整合体细胞突变、拷贝数变异和基因表达数据来鉴定核心癌症突变模块。
BMC Syst Biol. 2013;7 Suppl 2(Suppl 2):S4. doi: 10.1186/1752-0509-7-S2-S4. Epub 2013 Oct 14.
5
Identification of tumor suppressors and oncogenes from genomic and epigenetic features in ovarian cancer.从卵巢癌的基因组和表观遗传特征中鉴定肿瘤抑制基因和癌基因。
PLoS One. 2011;6(12):e28503. doi: 10.1371/journal.pone.0028503. Epub 2011 Dec 8.
6
ICan: an integrated co-alteration network to identify ovarian cancer-related genes.ICan:一个用于识别卵巢癌相关基因的整合共改变网络。
PLoS One. 2015 Mar 24;10(3):e0116095. doi: 10.1371/journal.pone.0116095. eCollection 2015.
7
A non-negative matrix factorization method for detecting modules in heterogeneous omics multi-modal data.一种用于在异质组学多模态数据中检测模块的非负矩阵分解方法。
Bioinformatics. 2016 Jan 1;32(1):1-8. doi: 10.1093/bioinformatics/btv544. Epub 2015 Sep 15.
8
A multi-view genomic data simulator.一个多视图基因组数据模拟器。
BMC Bioinformatics. 2015 May 12;16:151. doi: 10.1186/s12859-015-0577-1.
9
Identification of ovarian cancer subtype-specific network modules and candidate drivers through an integrative genomics approach.通过整合基因组学方法鉴定卵巢癌亚型特异性网络模块和候选驱动因子。
Oncotarget. 2016 Jan 26;7(4):4298-309. doi: 10.18632/oncotarget.6774.
10
Integrating multiple types of data to identify microRNA-gene co-modules.整合多种类型的数据以识别微小RNA-基因共模块。
Methods Mol Biol. 2013;1049:215-29. doi: 10.1007/978-1-62703-547-7_16.

引用本文的文献

1
A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches.多组学数据整合方法的技术综述:从经典统计方法到深度生成方法
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf355.
2
Integrative Analysis of Nontargeted LC-HRMS and High-Throughput Metabarcoding Data for Aquatic Environmental Studies Using Combined Multivariate Statistical Approaches.使用组合多变量统计方法对水生环境研究的非靶向液相色谱-高分辨质谱和高通量代谢条形码数据进行综合分析
Anal Chem. 2025 Jun 10;97(22):11563-11571. doi: 10.1021/acs.analchem.5c00539. Epub 2025 May 28.
3
sTPLS: identifying common and specific correlated patterns under multiple biological conditions.

本文引用的文献

1
Construction and analysis of an integrated regulatory network derived from high-throughput sequencing data.基于高通量测序数据构建和分析综合调控网络。
PLoS Comput Biol. 2011 Nov;7(11):e1002190. doi: 10.1371/journal.pcbi.1002190. Epub 2011 Nov 17.
2
Time to recurrence and survival in serous ovarian tumors predicted from integrated genomic profiles.基于整合基因组特征预测浆液性卵巢肿瘤的复发时间和生存情况。
PLoS One. 2011;6(11):e24709. doi: 10.1371/journal.pone.0024709. Epub 2011 Nov 3.
3
Multi-species integrative biclustering.多物种综合二分聚类。
sTPLS:识别多种生物学条件下的共同和特定相关模式。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf195.
4
asmbPLS: biomarker identification and patient survival prediction with multi-omics data.asmbPLS:利用多组学数据进行生物标志物识别和患者生存预测
Front Genet. 2024 Nov 22;15:1444054. doi: 10.3389/fgene.2024.1444054. eCollection 2024.
5
Methods for multi-omic data integration in cancer research.癌症研究中的多组学数据整合方法。
Front Genet. 2024 Sep 19;15:1425456. doi: 10.3389/fgene.2024.1425456. eCollection 2024.
6
Long Non-Coding RNAs, Nuclear Receptors and Their Cross-Talks in Cancer-Implications and Perspectives.长链非编码RNA、核受体及其在癌症中的相互作用——影响与展望
Cancers (Basel). 2024 Aug 22;16(16):2920. doi: 10.3390/cancers16162920.
7
A guided network estimation approach using multi-omic information.基于多组学信息的引导网络估计方法。
BMC Bioinformatics. 2024 May 30;25(1):202. doi: 10.1186/s12859-024-05778-7.
8
Integration of Multi-Omics Data for the Classification of Glioma Types and Identification of Novel Biomarkers.整合多组学数据用于胶质瘤类型分类和新型生物标志物鉴定
Bioinform Biol Insights. 2024 May 27;18:11779322241249563. doi: 10.1177/11779322241249563. eCollection 2024.
9
Integrating omics atlas in health informatics system design-an opinion article.将组学图谱整合到健康信息系统设计中——一篇观点文章。
Front Digit Health. 2024 May 9;6:1374359. doi: 10.3389/fdgth.2024.1374359. eCollection 2024.
10
A supervised Bayesian factor model for the identification of multi-omics signatures.基于监督贝叶斯因子模型的多组学特征识别。
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae202.
Genome Biol. 2010;11(9):R96. doi: 10.1186/gb-2010-11-9-r96. Epub 2010 Sep 29.
4
L2-norm multiple kernel learning and its application to biomedical data fusion.L2-范数多核学习及其在生物医学数据融合中的应用。
BMC Bioinformatics. 2010 Jun 8;11:309. doi: 10.1186/1471-2105-11-309.
5
Small molecules with big effects: the role of the microRNAome in cancer and carcinogenesis.小分子,大作用:microRNA 组在癌症和癌变中的作用。
Mutat Res. 2011 Jun 17;722(2):94-105. doi: 10.1016/j.mrgentox.2010.05.006. Epub 2010 May 21.
6
Sparse partial least squares regression for simultaneous dimension reduction and variable selection.用于同时进行降维和变量选择的稀疏偏最小二乘回归。
J R Stat Soc Series B Stat Methodol. 2010 Jan;72(1):3-25. doi: 10.1111/j.1467-9868.2009.00723.x.
7
BRCA1 represses amphiregulin gene expression.BRCA1 抑制 Amphiregulin 基因的表达。
Cancer Res. 2010 Feb 1;70(3):996-1005. doi: 10.1158/0008-5472.CAN-09-2842. Epub 2010 Jan 26.
8
A Bayesian partition method for detecting pleiotropic and epistatic eQTL modules.基于贝叶斯的基因表达数量性状位点模块的上位性和多效性检测方法。
PLoS Comput Biol. 2010 Jan 15;6(1):e1000642. doi: 10.1371/journal.pcbi.1000642.
9
HOXA methylation in normal endometrium from premenopausal women is associated with the presence of ovarian cancer: a proof of principle study.绝经前女性正常子宫内膜中的HOXA甲基化与卵巢癌的存在相关:一项原理验证研究。
Int J Cancer. 2009 Nov 1;125(9):2214-8. doi: 10.1002/ijc.24599.
10
Extensions of sparse canonical correlation analysis with applications to genomic data.稀疏典型相关分析的扩展及其在基因组数据中的应用
Stat Appl Genet Mol Biol. 2009;8(1):Article28. doi: 10.2202/1544-6115.1470. Epub 2009 Jun 9.