• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

svdPPCS:一种基于有效奇异值分解的方法,用于鉴定保守和分歧的共表达基因模块。

svdPPCS: an effective singular value decomposition-based method for conserved and divergent co-expression gene module identification.

机构信息

Department of Computer Science, Xavier University of Louisiana, 1 Drexel Drive, New Orleans, LA 70125, USA.

出版信息

BMC Bioinformatics. 2010 Jun 22;11:338. doi: 10.1186/1471-2105-11-338.

DOI:10.1186/1471-2105-11-338
PMID:20565989
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2905369/
Abstract

BACKGROUND

Comparative analysis of gene expression profiling of multiple biological categories, such as different species of organisms or different kinds of tissue, promises to enhance the fundamental understanding of the universality as well as the specialization of mechanisms and related biological themes. Grouping genes with a similar expression pattern or exhibiting co-expression together is a starting point in understanding and analyzing gene expression data. In recent literature, gene module level analysis is advocated in order to understand biological network design and system behaviors in disease and life processes; however, practical difficulties often lie in the implementation of existing methods.

RESULTS

Using the singular value decomposition (SVD) technique, we developed a new computational tool, named svdPPCS (SVD-based Pattern Pairing and Chart Splitting), to identify conserved and divergent co-expression modules of two sets of microarray experiments. In the proposed methods, gene modules are identified by splitting the two-way chart coordinated with a pair of left singular vectors factorized from the gene expression matrices of the two biological categories. Importantly, the cutoffs are determined by a data-driven algorithm using the well-defined statistic, SVD-p. The implementation was illustrated on two time series microarray data sets generated from the samples of accessory gland (ACG) and malpighian tubule (MT) tissues of the line W118 of M. drosophila. Two conserved modules and six divergent modules, each of which has a unique characteristic profile across tissue kinds and aging processes, were identified. The number of genes contained in these models ranged from five to a few hundred. Three to over a hundred GO terms were over-represented in individual modules with FDR < 0.1. One divergent module suggested the tissue-specific relationship between the expressions of mitochondrion-related genes and the aging process. This finding, together with others, may be of biological significance. The validity of the proposed SVD-based method was further verified by a simulation study, as well as the comparisons with regression analysis and cubic spline regression analysis plus PAM based clustering.

CONCLUSIONS

svdPPCS is a novel computational tool for the comparative analysis of transcriptional profiling. It especially fits the comparison of time series data of related organisms or different tissues of the same organism under equivalent or similar experimental conditions. The general scheme can be directly extended to the comparisons of multiple data sets. It also can be applied to the integration of data sets from different platforms and of different sources.

摘要

背景

对多个生物学类别(例如不同生物体或不同组织)的基因表达谱进行比较分析,有望增强对机制和相关生物学主题普遍性和特殊性的基本理解。将具有相似表达模式或共同表达的基因分组在一起,是理解和分析基因表达数据的起点。在最近的文献中,提倡进行基因模块水平分析,以了解疾病和生命过程中的生物网络设计和系统行为;然而,实际困难往往在于现有方法的实施。

结果

我们使用奇异值分解(SVD)技术开发了一种新的计算工具,名为 svdPPCS(基于 SVD 的模式对配和图表分割),用于识别两组微阵列实验中保守和发散的共表达模块。在提出的方法中,通过分割与从两个生物类别基因表达矩阵的左奇异向量因子化得到的一对左奇异向量协调的双向图表来识别基因模块。重要的是,使用定义明确的统计量 SVD-p 的数据驱动算法确定了截止值。该实施例在来自 M. drosophila 的 W118 品系的附腺(ACG)和马氏管(MT)组织样本生成的两个时间序列微阵列数据集上进行了说明。鉴定了两个保守模块和六个发散模块,每个模块在组织种类和衰老过程中都具有独特的特征谱。这些模型中包含的基因数量从五个到几百个不等。在单个模块中,超过 3 个到 100 多个 GO 术语的过度表达,FDR<0.1。一个发散模块表明了与衰老过程相关的线粒体相关基因表达之间的组织特异性关系。这一发现与其他发现一起,可能具有生物学意义。通过模拟研究以及与回归分析和三次样条回归分析加 PAM 聚类的比较,进一步验证了所提出的基于 SVD 的方法的有效性。

结论

svdPPCS 是一种用于转录谱比较分析的新型计算工具。它特别适合于在相同实验条件下比较相关生物体的时间序列数据或同一生物体的不同组织。通用方案可以直接扩展到多个数据集的比较。它还可以应用于来自不同平台和不同来源的数据集的整合。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecd/2905369/6711eda85f29/1471-2105-11-338-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecd/2905369/9ea785971c09/1471-2105-11-338-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecd/2905369/866f60edc597/1471-2105-11-338-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecd/2905369/0f64876371b7/1471-2105-11-338-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecd/2905369/30dd9af8a6b7/1471-2105-11-338-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecd/2905369/6711eda85f29/1471-2105-11-338-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecd/2905369/9ea785971c09/1471-2105-11-338-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecd/2905369/866f60edc597/1471-2105-11-338-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecd/2905369/0f64876371b7/1471-2105-11-338-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecd/2905369/30dd9af8a6b7/1471-2105-11-338-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1ecd/2905369/6711eda85f29/1471-2105-11-338-5.jpg

相似文献

1
svdPPCS: an effective singular value decomposition-based method for conserved and divergent co-expression gene module identification.svdPPCS:一种基于有效奇异值分解的方法,用于鉴定保守和分歧的共表达基因模块。
BMC Bioinformatics. 2010 Jun 22;11:338. doi: 10.1186/1471-2105-11-338.
2
Beyond clustering of array expressions.超越阵列表达式的聚类。
Int J Bioinform Res Appl. 2009;5(3):329-48. doi: 10.1504/IJBRA.2009.026423.
3
Feature related multi-view nonnegative matrix factorization for identifying conserved functional modules in multiple biological networks.基于特征相关的多视图非负矩阵分解识别多个生物网络中的保守功能模块。
BMC Bioinformatics. 2018 Oct 29;19(1):394. doi: 10.1186/s12859-018-2434-5.
4
Motif-guided sparse decomposition of gene expression data for regulatory module identification.基于模体的基因表达数据稀疏分解用于调控模块识别。
BMC Bioinformatics. 2011 Mar 22;12:82. doi: 10.1186/1471-2105-12-82.
5
Gene regulatory network clustering for graph layout based on microarray gene expression data.基于微阵列基因表达数据的用于图形布局的基因调控网络聚类
Genome Inform. 2010;24:84-95.
6
Multi-tissue analysis of co-expression networks by higher-order generalized singular value decomposition identifies functionally coherent transcriptional modules.通过高阶广义奇异值分解对共表达网络进行多组织分析,确定功能一致的转录模块。
PLoS Genet. 2014 Jan;10(1):e1004006. doi: 10.1371/journal.pgen.1004006. Epub 2014 Jan 2.
7
Analysis of cis-Regulatory Elements in Gene Co-expression Networks in Cancer.癌症基因共表达网络中的顺式调控元件分析
Methods Mol Biol. 2017;1613:291-310. doi: 10.1007/978-1-4939-7027-8_11.
8
Identification of nutrient partitioning genes participating in rice grain filling by singular value decomposition (SVD) of genome expression data.通过基因组表达数据的奇异值分解(SVD)鉴定参与水稻籽粒灌浆的养分分配基因。
BMC Genomics. 2003 Jul 10;4(1):26. doi: 10.1186/1471-2164-4-26.
9
An effective method for network module extraction from microarray data.一种从基因表达数据中提取网络模块的有效方法。
BMC Bioinformatics. 2012;13 Suppl 13(Suppl 13):S4. doi: 10.1186/1471-2105-13-S13-S4. Epub 2012 Aug 24.
10
Combining sequence and time series expression data to learn transcriptional modules.结合序列和时间序列表达数据以学习转录模块。
IEEE/ACM Trans Comput Biol Bioinform. 2005 Jul-Sep;2(3):194-202. doi: 10.1109/TCBB.2005.34.

引用本文的文献

1
Independent component analysis based gene co-expression network inference (ICAnet) to decipher functional modules for better single-cell clustering and batch integration.基于独立成分分析的基因共表达网络推断 (ICAnet) 以破译功能模块,从而更好地进行单细胞聚类和批次整合。
Nucleic Acids Res. 2021 May 21;49(9):e54. doi: 10.1093/nar/gkab089.
2
Dysregulation of antimicrobial peptide expression distinguishes Alzheimer's disease from normal aging.抗菌肽表达失调将阿尔茨海默病与正常衰老区分开来。
Aging (Albany NY). 2020 Jan 6;12(1):690-706. doi: 10.18632/aging.102650.
3
Discovery of two-level modular organization from matched genomic data via joint matrix tri-factorization.

本文引用的文献

1
A jackknife-like method for classification and uncertainty assessment of multi-category tumor samples using gene expression information.一种使用基因表达信息对多类别肿瘤样本进行分类和不确定性评估的类似折刀的方法。
BMC Genomics. 2010 Apr 29;11:273. doi: 10.1186/1471-2164-11-273.
2
Arabidopsis gene co-expression network and its functional modules.拟南芥基因共表达网络及其功能模块。
BMC Bioinformatics. 2009 Oct 21;10:346. doi: 10.1186/1471-2105-10-346.
3
Prediction of candidate primary immunodeficiency disease genes using a support vector machine learning approach.
通过联合矩阵三因子分解从匹配的基因组数据中发现两级模块化组织。
Nucleic Acids Res. 2018 Jul 6;46(12):5967-5976. doi: 10.1093/nar/gky440.
4
Identify bilayer modules via pseudo-3D clustering: applications to miRNA-gene bilayer networks.通过伪3D聚类识别双层模块:在miRNA-基因双层网络中的应用
Nucleic Acids Res. 2016 Nov 16;44(20):e152. doi: 10.1093/nar/gkw679. Epub 2016 Aug 2.
5
The modularity and dynamicity of miRNA-mRNA interactions in high-grade serous ovarian carcinomas and the prognostic implication.高级别浆液性卵巢癌中miRNA-mRNA相互作用的模块化和动态性及其预后意义。
Comput Biol Chem. 2016 Aug;63:3-14. doi: 10.1016/j.compbiolchem.2016.02.005. Epub 2016 Feb 27.
6
Quantitative assessment of gene expression network module-validation methods.基因表达网络模块验证方法的定量评估。
Sci Rep. 2015 Oct 16;5:15258. doi: 10.1038/srep15258.
7
Spatiotemporal positioning of multipotent modules in diverse biological networks.多能模块在不同生物网络中的时空定位。
Cell Mol Life Sci. 2014 Jul;71(14):2605-24. doi: 10.1007/s00018-013-1547-2. Epub 2014 Jan 11.
8
Coordinate microRNA-mediated regulation of protein complexes in prostate cancer.协调 microRNA 介导的前列腺癌细胞蛋白复合物的调控。
PLoS One. 2013 Dec 31;8(12):e84261. doi: 10.1371/journal.pone.0084261. eCollection 2013.
9
Multi-tissue analysis of co-expression networks by higher-order generalized singular value decomposition identifies functionally coherent transcriptional modules.通过高阶广义奇异值分解对共表达网络进行多组织分析,确定功能一致的转录模块。
PLoS Genet. 2014 Jan;10(1):e1004006. doi: 10.1371/journal.pgen.1004006. Epub 2014 Jan 2.
10
Metabolic module mining based on Independent Component Analysis in Arabidopsis thaliana.基于独立成分分析的拟南芥代谢模块挖掘。
Mol Cells. 2012 Sep;34(3):295-304. doi: 10.1007/s10059-012-0117-z. Epub 2012 Sep 7.
基于支持向量机学习方法预测候选原发性免疫缺陷病基因。
DNA Res. 2009 Dec;16(6):345-51. doi: 10.1093/dnares/dsp019. Epub 2009 Oct 3.
4
Semi-supervised gene shaving method for predicting low variation biological pathways from genome-wide data.用于从全基因组数据预测低变异生物途径的半监督基因筛选方法。
BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S54. doi: 10.1186/1471-2105-10-S1-S54.
5
Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources.利用DAVID生物信息学资源对大型基因列表进行系统和综合分析。
Nat Protoc. 2009;4(1):44-57. doi: 10.1038/nprot.2008.211.
6
Evolutionarily conserved transcriptional co-expression guiding embryonic stem cell differentiation.指导胚胎干细胞分化的进化保守转录共表达
PLoS One. 2008;3(10):e3406. doi: 10.1371/journal.pone.0003406. Epub 2008 Oct 15.
7
Gene module level analysis: identification to networks and dynamics.基因模块水平分析:网络与动力学的识别
Curr Opin Biotechnol. 2008 Oct;19(5):482-91. doi: 10.1016/j.copbio.2008.07.011. Epub 2008 Sep 3.
8
Term-tissue specific models for prediction of gene ontology biological processes using transcriptional profiles of aging in drosophila melanogaster.使用黑腹果蝇衰老转录谱预测基因本体生物学过程的组织特异性模型。
BMC Bioinformatics. 2008 Feb 28;9:129. doi: 10.1186/1471-2105-9-129.
9
Relationship between mitochondrial electron transport chain dysfunction, development, and life extension in Caenorhabditis elegans.秀丽隐杆线虫中线粒体电子传递链功能障碍、发育与寿命延长之间的关系。
PLoS Biol. 2007 Oct 2;5(10):e259. doi: 10.1371/journal.pbio.0050259.
10
The discovery of transcriptional modules by a two-stage matrix decomposition approach.通过两阶段矩阵分解方法发现转录模块。
Bioinformatics. 2007 Feb 15;23(4):473-9. doi: 10.1093/bioinformatics/btl640. Epub 2006 Dec 22.