• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相关系数等级可作为基因共表达生物学意义的可比度量。

Rank of correlation coefficient as a comparable measure for biological significance of gene coexpression.

机构信息

Human Genome Center, Institute of Medical Science, The University of Tokyo, Minato-ku, Tokyo, Japan.

出版信息

DNA Res. 2009 Oct;16(5):249-60. doi: 10.1093/dnares/dsp016. Epub 2009 Sep 18.

DOI:10.1093/dnares/dsp016
PMID:19767600
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2762411/
Abstract

Information regarding gene coexpression is useful to predict gene function. Several databases have been constructed for gene coexpression in model organisms based on a large amount of publicly available gene expression data measured by GeneChip platforms. In these databases, Pearson's correlation coefficients (PCCs) of gene expression patterns are widely used as a measure of gene coexpression. Although the coexpression measure or GeneChip summarization method affects the performance of the gene coexpression database, previous studies for these calculation procedures were tested with only a small number of samples and a particular species. To evaluate the effectiveness of coexpression measures, assessments with large-scale microarray data are required. We first examined characteristics of PCC and found that the optimal PCC threshold to retrieve functionally related genes was affected by the method of gene expression database construction and the target gene function. In addition, we found that this problem could be overcome when we used correlation ranks instead of correlation values. This observation was evaluated by large-scale gene expression data for four species: Arabidopsis, human, mouse and rat.

摘要

关于基因共表达的信息对于预测基因功能很有用。已经构建了几个基于大量公开的基因表达数据的模型生物的基因共表达数据库,这些数据是通过 GeneChip 平台测量的。在这些数据库中,基因表达模式的 Pearson 相关系数 (PCC) 被广泛用作基因共表达的度量。虽然共表达度量或 GeneChip 汇总方法会影响基因共表达数据库的性能,但之前对这些计算过程的研究仅在少数样本和特定物种上进行了测试。为了评估共表达度量的有效性,需要使用大规模的微阵列数据进行评估。我们首先检查了 PCC 的特征,发现检索功能相关基因的最佳 PCC 阈值受到基因表达数据库构建方法和目标基因功能的影响。此外,我们发现当我们使用相关等级而不是相关值时,可以克服这个问题。通过对四个物种(拟南芥、人类、小鼠和大鼠)的大规模基因表达数据进行了评估。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d43/2762411/998c72e8b125/dsp01605.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d43/2762411/fd9c584adb87/dsp01601.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d43/2762411/5e342f456a27/dsp01602.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d43/2762411/9f3a84e63eaf/dsp01603.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d43/2762411/e328d9c71845/dsp01604.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d43/2762411/998c72e8b125/dsp01605.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d43/2762411/fd9c584adb87/dsp01601.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d43/2762411/5e342f456a27/dsp01602.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d43/2762411/9f3a84e63eaf/dsp01603.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d43/2762411/e328d9c71845/dsp01604.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d43/2762411/998c72e8b125/dsp01605.jpg

相似文献

1
Rank of correlation coefficient as a comparable measure for biological significance of gene coexpression.相关系数等级可作为基因共表达生物学意义的可比度量。
DNA Res. 2009 Oct;16(5):249-60. doi: 10.1093/dnares/dsp016. Epub 2009 Sep 18.
2
ATTED-II in 2018: A Plant Coexpression Database Based on Investigation of the Statistical Property of the Mutual Rank Index.2018 年的 ATTED-II:基于互秩指数统计特性研究的植物共表达数据库。
Plant Cell Physiol. 2018 Jan 1;59(1):e3. doi: 10.1093/pcp/pcx191.
3
Gene coexpression measures in large heterogeneous samples using count statistics.使用计数统计量在大型异质样本中进行基因共表达测量。
Proc Natl Acad Sci U S A. 2014 Nov 18;111(46):16371-6. doi: 10.1073/pnas.1417128111. Epub 2014 Oct 6.
4
Multi-dimensional correlations for gene coexpression and application to the large-scale data of Arabidopsis.基因共表达的多维相关性及其在拟南芥大规模数据中的应用。
Bioinformatics. 2009 Oct 15;25(20):2677-84. doi: 10.1093/bioinformatics/btp442. Epub 2009 Jul 20.
5
Manipulating large-scale Arabidopsis microarray expression data: identifying dominant expression patterns and biological process enrichment.处理大规模拟南芥基因芯片表达数据:识别主要表达模式及生物过程富集情况
Methods Mol Biol. 2009;553:57-77. doi: 10.1007/978-1-60327-563-7_4.
6
Comparative study of RNA-seq- and microarray-derived coexpression networks in Arabidopsis thaliana.拟南芥 RNA-seq 和微阵列衍生共表达网络的比较研究。
Bioinformatics. 2013 Mar 15;29(6):717-24. doi: 10.1093/bioinformatics/btt053. Epub 2013 Feb 1.
7
Gene coexpression analysis in based on public microarray data.基于公共基因芯片数据的基因共表达分析。
STAR Protoc. 2022 Feb 26;3(1):101208. doi: 10.1016/j.xpro.2022.101208. eCollection 2022 Mar 18.
8
Coexpression of neighboring genes in the genome of Arabidopsis thaliana.拟南芥基因组中相邻基因的共表达。
Genome Res. 2004 Jun;14(6):1060-7. doi: 10.1101/gr.2131104.
9
SVD-based anatomy of gene expressions for correlation analysis in Arabidopsis thaliana.基于奇异值分解的拟南芥基因表达解剖结构用于相关性分析
DNA Res. 2008 Dec;15(6):367-74. doi: 10.1093/dnares/dsn025. Epub 2008 Oct 17.
10
COXPRESdb: a database of coexpressed gene networks in mammals.COXPRESdb:一个哺乳动物中共表达基因网络的数据库。
Nucleic Acids Res. 2008 Jan;36(Database issue):D77-82. doi: 10.1093/nar/gkm840. Epub 2007 Oct 11.

引用本文的文献

1
MEANtools integrates multi-omics data to identify metabolites and predict biosynthetic pathways.MEANtools整合多组学数据以识别代谢物并预测生物合成途径。
PLoS Biol. 2025 Jul 28;23(7):e3003307. doi: 10.1371/journal.pbio.3003307. eCollection 2025 Jul.
2
From data to discovery: leveraging big data in plant natural products biosynthesis research.从数据到发现:植物天然产物生物合成研究中大数据的利用
Plant J. 2025 Jun;122(6):e70288. doi: 10.1111/tpj.70288.
3
Comparative Transcriptome Reveals Conserved Gene Expression in Reproductive Organs in Solanaceae.

本文引用的文献

1
ATTED-II provides coexpressed gene networks for Arabidopsis.ATTED-II为拟南芥提供共表达基因网络。
Nucleic Acids Res. 2009 Jan;37(Database issue):D987-91. doi: 10.1093/nar/gkn807. Epub 2008 Oct 25.
2
GeneCAT--novel webtools that combine BLAST and co-expression analyses.GeneCAT——结合BLAST和共表达分析的新型网络工具。
Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W320-6. doi: 10.1093/nar/gkn292. Epub 2008 May 14.
3
CressExpress: a tool for large-scale mining of expression data from Arabidopsis.CressExpress:一种用于大规模挖掘拟南芥表达数据的工具。
比较转录组揭示茄科生殖器官中保守的基因表达
Int J Mol Sci. 2025 Apr 10;26(8):3568. doi: 10.3390/ijms26083568.
4
Pupylation-Based Proximity Labeling Unravels a Comprehensive Protein and Phosphoprotein Interactome of the Arabidopsis TOR Complex.基于 pupylation 的邻近标记揭示了拟南芥 TOR 复合体完整的蛋白质和磷酸化蛋白质相互作用组。
Adv Sci (Weinh). 2025 May;12(19):e2414496. doi: 10.1002/advs.202414496. Epub 2025 Mar 24.
5
Three cytochrome P450 enzymes consecutively catalyze the biosynthesis of furanoclerodane precursors in Salvia species.三种细胞色素P450酶连续催化鼠尾草属植物中呋喃克罗烷前体的生物合成。
Plant Commun. 2025 May 12;6(5):101286. doi: 10.1016/j.xplc.2025.101286. Epub 2025 Feb 18.
6
Mapping the functional network of human cancer through machine learning and pan-cancer proteogenomics.通过机器学习和泛癌蛋白质基因组学绘制人类癌症功能网络
Nat Cancer. 2025 Jan;6(1):205-222. doi: 10.1038/s43018-024-00869-z. Epub 2024 Dec 11.
7
Polyploidy and the evolution of phenotypic integration: Network analysis reveals relationships among anatomy, morphology, and physiology.多倍体与表型整合的进化:网络分析揭示解剖学、形态学和生理学之间的关系。
Appl Plant Sci. 2024 Jul 16;12(4):e11605. doi: 10.1002/aps3.11605. eCollection 2024 Jul-Aug.
8
The Rise of Hypothesis-Driven Artificial Intelligence in Oncology.肿瘤学中假设驱动型人工智能的兴起
Cancers (Basel). 2024 Feb 18;16(4):822. doi: 10.3390/cancers16040822.
9
Seasonal Stability Assessment of Reference Genes for Quantitative Real-Time Polymerase Chain Reaction Normalization in .用于……中定量实时聚合酶链反应标准化的参考基因的季节性稳定性评估
Curr Issues Mol Biol. 2024 Feb 3;46(2):1335-1347. doi: 10.3390/cimb46020085.
10
A telomere-to-telomere reference genome of ficus () provides new insights into sex determination.榕树()的端粒到端粒参考基因组为性别决定提供了新见解。
Hortic Res. 2023 Dec 13;11(1):uhad257. doi: 10.1093/hr/uhad257. eCollection 2024 Jan.
Plant Physiol. 2008 Jul;147(3):1004-16. doi: 10.1104/pp.107.115535. Epub 2008 May 8.
4
The AtGenExpress hormone and chemical treatment data set: experimental design, data evaluation, model data analysis and data access.拟南芥基因表达谱激素与化学处理数据集:实验设计、数据评估、模型数据分析及数据获取
Plant J. 2008 Aug;55(3):526-42. doi: 10.1111/j.0960-7412.2008.03510.x.
5
Prediction of human disease genes by human-mouse conserved coexpression analysis.通过人鼠保守共表达分析预测人类疾病基因。
PLoS Comput Biol. 2008 Mar 28;4(3):e1000043. doi: 10.1371/journal.pcbi.1000043.
6
Annotating genes of known and unknown function by large-scale coexpression analysis.通过大规模共表达分析对已知和未知功能的基因进行注释。
Plant Physiol. 2008 May;147(1):41-57. doi: 10.1104/pp.108.117366. Epub 2008 Mar 19.
7
The Arabidopsis Information Resource (TAIR): gene structure and function annotation.拟南芥信息资源库(TAIR):基因结构与功能注释
Nucleic Acids Res. 2008 Jan;36(Database issue):D1009-14. doi: 10.1093/nar/gkm965. Epub 2007 Nov 5.
8
COXPRESdb: a database of coexpressed gene networks in mammals.COXPRESdb:一个哺乳动物中共表达基因网络的数据库。
Nucleic Acids Res. 2008 Jan;36(Database issue):D77-82. doi: 10.1093/nar/gkm840. Epub 2007 Oct 11.
9
The utility of MAS5 expression summary and detection call algorithms.MAS5表达汇总及检测调用算法的效用。
BMC Bioinformatics. 2007 Jul 30;8:273. doi: 10.1186/1471-2105-8-273.
10
Comparative analysis of microarray normalization procedures: effects on reverse engineering gene networks.微阵列标准化程序的比较分析:对逆向工程基因网络的影响
Bioinformatics. 2007 Jul 1;23(13):i282-8. doi: 10.1093/bioinformatics/btm201.