• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基因共表达的多维相关性及其在拟南芥大规模数据中的应用。

Multi-dimensional correlations for gene coexpression and application to the large-scale data of Arabidopsis.

机构信息

Human Genome Center, Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokane-dai, Minato-ku, Tokyo 108-8639, Japan.

出版信息

Bioinformatics. 2009 Oct 15;25(20):2677-84. doi: 10.1093/bioinformatics/btp442. Epub 2009 Jul 20.

DOI:10.1093/bioinformatics/btp442
PMID:19620096
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2759550/
Abstract

BACKGROUND

Recent improvements in DNA microarray techniques have made a large variety of gene expression data available in public databases. This data can be used to evaluate the strength of gene coexpression by calculating the correlation of expression patterns among different genes between many experiments. However, gene expression levels differ significantly across various tissues in higher organisms, as well as in different cellular location in eukaryotes in different cell state. Thus the usual correlation measure can only evaluate the difference of tissues or cellular localizations, and cannot adequately elucidate the functional relationship from the coexpression of genes.

METHOD

We propose a new measure of coexpression by expanding the generally used correlation into a multidimensional one. We used principal component analyses to identify the major factors of gene expression correlation, and then re-calculate the correlation by subtracting the major components in order to remove biases cased by a few experiments. The repeated subtractions of the major components yielded a set of correlation values for each pair of genes. We observed the correlation changes when the first ten principal components were subtracted step-by-step in large-scale Arabidopsis expression data.

RESULTS

We found two extreme patterns of correlation changes, corresponding to stable and fragile coexpression. Our new indexes provided a good means to determine the functional relationships of the genes, by examining a few examples, and higher performance of Gene Ontology term prediction by using the support vector machine and the multidimensional correlation.

AVAILABILITY

The results are available from the expression detail pages in ATTED-II (http://atted.jp).

摘要

背景

最近 DNA 微阵列技术的改进使得大量基因表达数据可在公共数据库中使用。通过计算不同实验中不同基因之间表达模式的相关性,可以利用这些数据来评估基因共表达的强度。然而,在高等生物的各种组织中以及真核生物的不同细胞位置和不同细胞状态中,基因表达水平存在显著差异。因此,通常的相关度量方法只能评估组织或细胞定位的差异,而不能充分阐明基因共表达的功能关系。

方法

我们通过将常用的相关性扩展为多维相关性来提出一种新的共表达度量方法。我们使用主成分分析来识别基因表达相关性的主要因素,然后通过减去主要成分来重新计算相关性,以消除少数实验引起的偏差。重复减去主要成分可得到每对基因的一组相关值。我们在大规模拟南芥表达数据中观察到当逐步减去前十个主成分时相关性的变化。

结果

我们发现了两种极端的相关性变化模式,分别对应于稳定和脆弱的共表达。我们的新指标通过检查几个例子,为确定基因的功能关系提供了一种很好的方法,并且通过使用支持向量机和多维相关性,提高了基因本体论术语预测的性能。

可用性

结果可从 ATTED-II 的表达详细信息页面获得(http://atted.jp)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2c5/2759550/903c71c0444d/btp442f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2c5/2759550/ef646e333d4b/btp442f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2c5/2759550/704241a57130/btp442f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2c5/2759550/32fea8cb0e94/btp442f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2c5/2759550/903c71c0444d/btp442f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2c5/2759550/ef646e333d4b/btp442f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2c5/2759550/704241a57130/btp442f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2c5/2759550/32fea8cb0e94/btp442f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b2c5/2759550/903c71c0444d/btp442f4.jpg

相似文献

1
Multi-dimensional correlations for gene coexpression and application to the large-scale data of Arabidopsis.基因共表达的多维相关性及其在拟南芥大规模数据中的应用。
Bioinformatics. 2009 Oct 15;25(20):2677-84. doi: 10.1093/bioinformatics/btp442. Epub 2009 Jul 20.
2
ATTED-II in 2018: A Plant Coexpression Database Based on Investigation of the Statistical Property of the Mutual Rank Index.2018 年的 ATTED-II:基于互秩指数统计特性研究的植物共表达数据库。
Plant Cell Physiol. 2018 Jan 1;59(1):e3. doi: 10.1093/pcp/pcx191.
3
ATTED-II updates: condition-specific gene coexpression to extend coexpression analyses and applications to a broad range of flowering plants.ATTED-II 更新:特定条件下的基因共表达,将共表达分析和应用扩展到广泛的开花植物。
Plant Cell Physiol. 2011 Feb;52(2):213-9. doi: 10.1093/pcp/pcq203. Epub 2011 Jan 7.
4
ATTED-II in 2016: A Plant Coexpression Database Towards Lineage-Specific Coexpression.2016年的ATTED-II:一个针对谱系特异性共表达的植物共表达数据库。
Plant Cell Physiol. 2016 Jan;57(1):e5. doi: 10.1093/pcp/pcv165. Epub 2015 Nov 6.
5
ATTED-II v11: A Plant Gene Coexpression Database Using a Sample Balancing Technique by Subagging of Principal Components.ATTED-II v11:一个通过主成分子聚类使用样本平衡技术的植物基因共表达数据库。
Plant Cell Physiol. 2022 Jun 15;63(6):869-881. doi: 10.1093/pcp/pcac041.
6
ATTED-II provides coexpressed gene networks for Arabidopsis.ATTED-II为拟南芥提供共表达基因网络。
Nucleic Acids Res. 2009 Jan;37(Database issue):D987-91. doi: 10.1093/nar/gkn807. Epub 2008 Oct 25.
7
Rank of correlation coefficient as a comparable measure for biological significance of gene coexpression.相关系数等级可作为基因共表达生物学意义的可比度量。
DNA Res. 2009 Oct;16(5):249-60. doi: 10.1093/dnares/dsp016. Epub 2009 Sep 18.
8
ATTED-II in 2014: evaluation of gene coexpression in agriculturally important plants.2014年的ATTED-II:对重要农作物中基因共表达的评估
Plant Cell Physiol. 2014 Jan;55(1):e6. doi: 10.1093/pcp/pct178. Epub 2013 Dec 10.
9
Transcriptome coexpression analysis using ATTED-II for integrated transcriptomic/metabolomic analysis.使用ATTED-II进行转录组共表达分析以进行综合转录组/代谢组分析。
Methods Mol Biol. 2013;1011:317-26. doi: 10.1007/978-1-62703-414-2_25.
10
Differential Coexpression Analysis Reveals Extensive Rewiring of Arabidopsis Gene Coexpression in Response to Pseudomonas syringae Infection.差异共表达分析揭示了拟南芥基因共表达在应对丁香假单胞菌感染时的广泛重排。
Sci Rep. 2016 Oct 10;6:35064. doi: 10.1038/srep35064.

引用本文的文献

1
Apoptotic effects of human amniotic fluid mesenchymal stem cells conditioned medium on human MCF-7 breast cancer cell line.人羊水间充质干细胞条件培养基对人MCF-7乳腺癌细胞系的凋亡作用
Bioimpacts. 2023;13(3):191-206. doi: 10.34172/bi.2022.23813. Epub 2022 Mar 30.
2
Unified Transcriptomic Signature of Arbuscular Mycorrhiza Colonization in Roots of by Integration of Machine Learning, Promoter Analysis, and Direct Merging Meta-Analysis.通过机器学习、启动子分析和直接合并荟萃分析整合,揭示丛枝菌根在根系中定殖的统一转录组特征 。 (注:原文中“Unified Transcriptomic Signature of Arbuscular Mycorrhiza Colonization in Roots of by...”表述不太完整,推测这里可能遗漏了一些内容,比如植物名称等,但按照要求仅根据现有原文翻译)
Front Plant Sci. 2018 Nov 12;9:1550. doi: 10.3389/fpls.2018.01550. eCollection 2018.
3

本文引用的文献

1
ATTED-II provides coexpressed gene networks for Arabidopsis.ATTED-II为拟南芥提供共表达基因网络。
Nucleic Acids Res. 2009 Jan;37(Database issue):D987-91. doi: 10.1093/nar/gkn807. Epub 2008 Oct 25.
2
SVD-based anatomy of gene expressions for correlation analysis in Arabidopsis thaliana.基于奇异值分解的拟南芥基因表达解剖结构用于相关性分析
DNA Res. 2008 Dec;15(6):367-74. doi: 10.1093/dnares/dsn025. Epub 2008 Oct 17.
3
CRR23/NdhL is a subunit of the chloroplast NAD(P)H dehydrogenase complex in Arabidopsis.CRR23/NdhL是拟南芥叶绿体NAD(P)H脱氢酶复合体的一个亚基。
COXPRESdb v7: a gene coexpression database for 11 animal species supported by 23 coexpression platforms for technical evaluation and evolutionary inference.COXPRESdb v7:一个支持 23 个共表达平台的 11 种动物基因共表达数据库,用于技术评估和进化推理。
Nucleic Acids Res. 2019 Jan 8;47(D1):D55-D62. doi: 10.1093/nar/gky1155.
4
ATTED-II in 2016: A Plant Coexpression Database Towards Lineage-Specific Coexpression.2016年的ATTED-II:一个针对谱系特异性共表达的植物共表达数据库。
Plant Cell Physiol. 2016 Jan;57(1):e5. doi: 10.1093/pcp/pcv165. Epub 2015 Nov 6.
5
Visualisation of the T cell differentiation programme by Canonical Correspondence Analysis of transcriptomes.通过转录组的典范对应分析对T细胞分化程序进行可视化。
BMC Genomics. 2014 Nov 27;15(1):1028. doi: 10.1186/1471-2164-15-1028.
6
COXPRESdb in 2015: coexpression database for animal species by DNA-microarray and RNAseq-based expression data with multiple quality assessment systems.2015年的COXPRESdb:基于DNA微阵列和RNA测序的表达数据、带有多个质量评估系统的动物物种共表达数据库。
Nucleic Acids Res. 2015 Jan;43(Database issue):D82-6. doi: 10.1093/nar/gku1163. Epub 2014 Nov 11.
7
Similarity from multi-dimensional scaling: solving the accuracy and diversity dilemma in information filtering.多维缩放中的相似性:解决信息过滤中的准确性和多样性困境
PLoS One. 2014 Oct 24;9(10):e111005. doi: 10.1371/journal.pone.0111005. eCollection 2014.
8
Multi-tissue analysis of co-expression networks by higher-order generalized singular value decomposition identifies functionally coherent transcriptional modules.通过高阶广义奇异值分解对共表达网络进行多组织分析,确定功能一致的转录模块。
PLoS Genet. 2014 Jan;10(1):e1004006. doi: 10.1371/journal.pgen.1004006. Epub 2014 Jan 2.
9
ATTED-II in 2014: evaluation of gene coexpression in agriculturally important plants.2014年的ATTED-II:对重要农作物中基因共表达的评估
Plant Cell Physiol. 2014 Jan;55(1):e6. doi: 10.1093/pcp/pct178. Epub 2013 Dec 10.
10
Data integration through proximity-based networks provides biological principles of organization across scales.通过基于邻近度的网络进行数据集成,提供了跨尺度的组织的生物学原理。
Plant Cell. 2013 Jun;25(6):1917-27. doi: 10.1105/tpc.113.111039. Epub 2013 Jun 7.
Plant Cell Physiol. 2008 May;49(5):835-42. doi: 10.1093/pcp/pcn058. Epub 2008 Apr 3.
4
A single gene network accurately predicts phenotypic effects of gene perturbation in Caenorhabditis elegans.一个单基因网络能准确预测秀丽隐杆线虫基因扰动的表型效应。
Nat Genet. 2008 Feb;40(2):181-8. doi: 10.1038/ng.2007.70. Epub 2008 Jan 27.
5
KEGG for linking genomes to life and the environment.京都基因与基因组百科全书,用于将基因组与生命及环境相联系。
Nucleic Acids Res. 2008 Jan;36(Database issue):D480-4. doi: 10.1093/nar/gkm882. Epub 2007 Dec 12.
6
The Arabidopsis Information Resource (TAIR): gene structure and function annotation.拟南芥信息资源库(TAIR):基因结构与功能注释
Nucleic Acids Res. 2008 Jan;36(Database issue):D1009-14. doi: 10.1093/nar/gkm965. Epub 2007 Nov 5.
7
AtPID: Arabidopsis thaliana protein interactome database--an integrative platform for plant systems biology.AtPID:拟南芥蛋白质相互作用组数据库——植物系统生物学的综合平台。
Nucleic Acids Res. 2008 Jan;36(Database issue):D999-1008. doi: 10.1093/nar/gkm844. Epub 2007 Oct 25.
8
COXPRESdb: a database of coexpressed gene networks in mammals.COXPRESdb:一个哺乳动物中共表达基因网络的数据库。
Nucleic Acids Res. 2008 Jan;36(Database issue):D77-82. doi: 10.1093/nar/gkm840. Epub 2007 Oct 11.
9
Capturing heterogeneity in gene expression studies by surrogate variable analysis.通过替代变量分析在基因表达研究中捕捉异质性。
PLoS Genet. 2007 Sep;3(9):1724-35. doi: 10.1371/journal.pgen.0030161. Epub 2007 Aug 1.
10
Distinct functions for the two PsbP-like proteins PPL1 and PPL2 in the chloroplast thylakoid lumen of Arabidopsis.拟南芥叶绿体类囊体腔中两种类PsbP蛋白PPL1和PPL2的不同功能
Plant Physiol. 2007 Nov;145(3):668-79. doi: 10.1104/pp.107.105866. Epub 2007 Sep 7.