• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类基因共表达图谱:源自组织转录组图谱的可靠网络。

Human gene coexpression landscape: confident network derived from tissue transcriptomic profiles.

作者信息

Prieto Carlos, Risueño Alberto, Fontanillo Celia, De las Rivas Javier

机构信息

Bioinformatics and Functional Genomics Research Group, Cancer Research Center (CIC-IBMCC, CSIC/USAL), Salamanca, Spain.

出版信息

PLoS One. 2008;3(12):e3911. doi: 10.1371/journal.pone.0003911. Epub 2008 Dec 15.

DOI:10.1371/journal.pone.0003911
PMID:19081792
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2597745/
Abstract

BACKGROUND

Analysis of gene expression data using genome-wide microarrays is a technique often used in genomic studies to find coexpression patterns and locate groups of co-transcribed genes. However, most studies done at global "omic" scale are not focused on human samples and when they correspond to human very often include heterogeneous datasets, mixing normal with disease-altered samples. Moreover, the technical noise present in genome-wide expression microarrays is another well reported problem that many times is not addressed with robust statistical methods, and the estimation of errors in the data is not provided.

METHODOLOGY/PRINCIPAL FINDINGS: Human genome-wide expression data from a controlled set of normal-healthy tissues is used to build a confident human gene coexpression network avoiding both pathological and technical noise. To achieve this we describe a new method that combines several statistical and computational strategies: robust normalization and expression signal calculation; correlation coefficients obtained by parametric and non-parametric methods; random cross-validations; and estimation of the statistical accuracy and coverage of the data. All these methods provide a series of coexpression datasets where the level of error is measured and can be tuned. To define the errors, the rates of true positives are calculated by assignment to biological pathways. The results provide a confident human gene coexpression network that includes 3327 gene-nodes and 15841 coexpression-links and a comparative analysis shows good improvement over previously published datasets. Further functional analysis of a subset core network, validated by two independent methods, shows coherent biological modules that share common transcription factors. The network reveals a map of coexpression clusters organized in well defined functional constellations. Two major regions in this network correspond to genes involved in nuclear and mitochondrial metabolism and investigations on their functional assignment indicate that more than 60% are house-keeping and essential genes. The network displays new non-described gene associations and it allows the placement in a functional context of some unknown non-assigned genes based on their interactions with known gene families.

CONCLUSIONS/SIGNIFICANCE: The identification of stable and reliable human gene to gene coexpression networks is essential to unravel the interactions and functional correlations between human genes at an omic scale. This work contributes to this aim, and we are making available for the scientific community the validated human gene coexpression networks obtained, to allow further analyses on the network or on some specific gene associations. The data are available free online at http://bioinfow.dep.usal.es/coexpression/.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5659/2597745/ff7444d24007/pone.0003911.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5659/2597745/2ba56293920d/pone.0003911.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5659/2597745/673f60cc6f2d/pone.0003911.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5659/2597745/84066b8cec51/pone.0003911.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5659/2597745/a17e95b4256f/pone.0003911.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5659/2597745/aacb664f1aaa/pone.0003911.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5659/2597745/8cc331ef7377/pone.0003911.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5659/2597745/ff7444d24007/pone.0003911.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5659/2597745/2ba56293920d/pone.0003911.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5659/2597745/673f60cc6f2d/pone.0003911.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5659/2597745/84066b8cec51/pone.0003911.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5659/2597745/a17e95b4256f/pone.0003911.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5659/2597745/aacb664f1aaa/pone.0003911.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5659/2597745/8cc331ef7377/pone.0003911.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5659/2597745/ff7444d24007/pone.0003911.g007.jpg
摘要

背景

使用全基因组微阵列分析基因表达数据是基因组研究中常用的技术,用于寻找共表达模式并定位共转录基因群体。然而,大多数在全球“组学”规模上进行的研究并非聚焦于人类样本,而且当研究对象是人类时,往往包含异质数据集,将正常样本与疾病改变的样本混合在一起。此外,全基因组表达微阵列中存在的技术噪声是另一个有大量报道的问题,很多时候没有用稳健的统计方法来解决,并且也没有提供数据误差的估计。

方法/主要发现:来自一组受控的正常健康组织的人类全基因组表达数据被用于构建一个可靠的人类基因共表达网络,以避免病理和技术噪声。为实现这一目标,我们描述了一种结合多种统计和计算策略的新方法:稳健归一化和表达信号计算;通过参数和非参数方法获得的相关系数;随机交叉验证;以及数据统计准确性和覆盖范围的估计。所有这些方法提供了一系列共表达数据集,其中误差水平得到测量且可以调整。为了定义误差,通过分配到生物途径来计算真阳性率。结果提供了一个可靠的人类基因共表达网络,包含3327个基因节点和15841个共表达链接,比较分析表明相较于先前发表的数据集有显著改进。通过两种独立方法验证的子集核心网络的进一步功能分析显示了共享共同转录因子的连贯生物模块。该网络揭示了以明确的功能星座形式组织的共表达簇图谱。该网络中的两个主要区域对应于参与核代谢和线粒体代谢的基因,对其功能分配的研究表明超过60%是管家基因和必需基因。该网络展示了新的未描述的基因关联,并且基于一些未知未分配基因与已知基因家族的相互作用,它能够将这些基因置于功能背景中。

结论/意义:识别稳定可靠的人类基因间共表达网络对于在组学规模上揭示人类基因之间的相互作用和功能相关性至关重要。这项工作有助于实现这一目标,并且我们正在向科学界提供所获得的经过验证的人类基因共表达网络,以便对该网络或某些特定基因关联进行进一步分析。数据可在http://bioinfow.dep.usal.es/coexpression/免费在线获取。

相似文献

1
Human gene coexpression landscape: confident network derived from tissue transcriptomic profiles.人类基因共表达图谱:源自组织转录组图谱的可靠网络。
PLoS One. 2008;3(12):e3911. doi: 10.1371/journal.pone.0003911. Epub 2008 Dec 15.
2
Evolutionary hallmarks of the human proteome: chasing the age and coregulation of protein-coding genes.人类蛋白质组的进化特征:探寻蛋白质编码基因的年龄与共调控
BMC Genomics. 2016 Oct 25;17(Suppl 8):725. doi: 10.1186/s12864-016-3062-y.
3
Exploring tomato gene functions based on coexpression modules using graph clustering and differential coexpression approaches.基于图聚类和差异共表达方法,利用共表达模块探索番茄基因功能。
Plant Physiol. 2012 Apr;158(4):1487-502. doi: 10.1104/pp.111.188367. Epub 2012 Feb 3.
4
Subspace differential coexpression analysis: problem definition and a general approach.子空间微分共表达分析:问题定义与通用方法。
Pac Symp Biocomput. 2010:145-56.
5
CoGTEx: Unscaled system-level coexpression estimation from GTEx data forecast novel functional gene partners.CoGTEx:从 GTEx 数据预测新的功能基因伙伴的无标度系统水平共表达估计。
PLoS One. 2024 Oct 4;19(10):e0309961. doi: 10.1371/journal.pone.0309961. eCollection 2024.
6
Network analysis of transcriptomic diversity amongst resident tissue macrophages and dendritic cells in the mouse mononuclear phagocyte system.在小鼠单核吞噬细胞系统中,驻留组织巨噬细胞和树突状细胞的转录组多样性的网络分析。
PLoS Biol. 2020 Oct 8;18(10):e3000859. doi: 10.1371/journal.pbio.3000859. eCollection 2020 Oct.
7
Key genes and functional coexpression modules involved in the pathogenesis of systemic lupus erythematosus.系统性红斑狼疮发病机制中的关键基因和功能共表达模块。
J Cell Physiol. 2018 Nov;233(11):8815-8825. doi: 10.1002/jcp.26795. Epub 2018 May 28.
8
Multi-tissue analysis of co-expression networks by higher-order generalized singular value decomposition identifies functionally coherent transcriptional modules.通过高阶广义奇异值分解对共表达网络进行多组织分析,确定功能一致的转录模块。
PLoS Genet. 2014 Jan;10(1):e1004006. doi: 10.1371/journal.pgen.1004006. Epub 2014 Jan 2.
9
Gene coexpression network analysis as a source of functional annotation for rice genes.基因共表达网络分析作为水稻基因功能注释的一种来源。
PLoS One. 2011;6(7):e22196. doi: 10.1371/journal.pone.0022196. Epub 2011 Jul 22.
10
Differential coexpression in human tissues and the confounding effect of mean expression levels.人类组织中的差异共表达及平均表达水平的混杂效应。
Bioinformatics. 2019 Jan 1;35(1):55-61. doi: 10.1093/bioinformatics/bty538.

引用本文的文献

1
Luteinizing Hormone Receptor Mutation (LHR) Causes Abnormal Follicular Development Revealed by Follicle Single-Cell Analysis and CRISPR/Cas9.黄体生成素受体突变(LHR)导致卵泡单细胞分析和 CRISPR/Cas9 揭示的卵泡发育异常。
Interdiscip Sci. 2024 Dec;16(4):976-989. doi: 10.1007/s12539-024-00646-7. Epub 2024 Aug 16.
2
Comparative analysis of the hypothalamus transcriptome of laying ducks with different residual feeding intake.不同剩余采食量产蛋鸭下丘脑转录组的比较分析。
Poult Sci. 2024 Mar;103(3):103355. doi: 10.1016/j.psj.2023.103355. Epub 2023 Dec 6.
3
Molecular signature to predict quality of life and survival with glioblastoma using Multiview omics model.

本文引用的文献

1
Interleukin-6 (IL-6) and/or soluble IL-6 receptor down-regulation of human type II collagen gene expression in articular chondrocytes requires a decrease of Sp1.Sp3 ratio and of the binding activity of both factors to the COL2A1 promoter.白细胞介素-6(IL-6)和/或可溶性IL-6受体下调关节软骨细胞中人类II型胶原基因的表达需要降低Sp1/Sp3比值以及这两种因子与COL2A1启动子的结合活性。
J Biol Chem. 2008 Feb 22;283(8):4850-65. doi: 10.1074/jbc.M706387200. Epub 2007 Dec 7.
2
Gene annotation and pathway mapping in KEGG.基因注释与KEGG中的通路映射。
Methods Mol Biol. 2007;396:71-91. doi: 10.1007/978-1-59745-515-2_6.
3
Filtering genes to improve sensitivity in oligonucleotide microarray data analysis.
基于多视图组学模型预测胶质母细胞瘤患者生活质量和生存情况的分子特征。
PLoS One. 2023 Nov 16;18(11):e0287448. doi: 10.1371/journal.pone.0287448. eCollection 2023.
4
Integrated bioinformatics and statistical approach to identify the common molecular mechanisms of obesity that are linked to the development of two psychiatric disorders: Schizophrenia and major depressive disorder.采用综合生物信息学和统计学方法来鉴定肥胖与两种精神疾病(精神分裂症和重度抑郁症)的发展相关的常见分子机制。
PLoS One. 2023 Jul 26;18(7):e0276820. doi: 10.1371/journal.pone.0276820. eCollection 2023.
5
LncRNA landscape and associated ceRNA network in placental villus of unexplained recurrent spontaneous abortion.不明原因复发性自然流产胎盘绒毛中的长链非编码 RNA 图谱及相关 ceRNA 网络。
Reprod Biol Endocrinol. 2023 Jun 20;21(1):57. doi: 10.1186/s12958-023-01107-4.
6
High-Throughput Sequencing Reveals That Inhibits Colorectal Cancer by Regulating Prognosis-Related Genes.高通量测序揭示了通过调控预后相关基因抑制结直肠癌。
J Pers Med. 2023 Mar 20;13(3):550. doi: 10.3390/jpm13030550.
7
Generating weighted and thresholded gene coexpression networks using signed distance correlation.使用符号距离相关性生成加权和阈值化的基因共表达网络。
Netw Sci (Camb Univ Press). 2022 Jun;10(2):131-145. doi: 10.1017/nws.2022.13. Epub 2022 Jun 16.
8
Identification of the Association Between Toll-Like Receptors and T-Cell Activation in Takayasu's Arteritis.鉴定 Takayasu 动脉炎中 Toll 样受体与 T 细胞激活之间的关联。
Front Immunol. 2022 Jan 20;12:792901. doi: 10.3389/fimmu.2021.792901. eCollection 2021.
9
Chordin-like 1 is a novel prognostic biomarker and correlative with immune cell infiltration in lung adenocarcinoma.Chordin-like 1 是肺腺癌的一种新型预后生物标志物,与免疫细胞浸润相关。
Aging (Albany NY). 2022 Jan 12;14(1):389-409. doi: 10.18632/aging.203814.
10
HOX cluster and their cofactors showed an altered expression pattern in eutopic and ectopic endometriosis tissues.HOX 基因簇及其协同因子在在位和异位子宫内膜组织中表现出改变的表达模式。
Reprod Biol Endocrinol. 2021 Sep 1;19(1):132. doi: 10.1186/s12958-021-00816-y.
在寡核苷酸微阵列数据分析中筛选基因以提高灵敏度。
Nucleic Acids Res. 2007;35(16):e102. doi: 10.1093/nar/gkm537. Epub 2007 Aug 15.
4
Human collagen Krox up-regulates type I collagen expression in normal and scleroderma fibroblasts through interaction with Sp1 and Sp3 transcription factors.人胶原蛋白Krox通过与Sp1和Sp3转录因子相互作用,上调正常和硬皮病成纤维细胞中I型胶原蛋白的表达。
J Biol Chem. 2007 Nov 2;282(44):32000-14. doi: 10.1074/jbc.M705197200. Epub 2007 Aug 13.
5
Comparative analysis of microarray normalization procedures: effects on reverse engineering gene networks.微阵列标准化程序的比较分析:对逆向工程基因网络的影响
Bioinformatics. 2007 Jul 1;23(13):i282-8. doi: 10.1093/bioinformatics/btm201.
6
Characterization of mismatch and high-signal intensity probes associated with Affymetrix genechips.与Affymetrix基因芯片相关的错配和高信号强度探针的表征
Bioinformatics. 2007 Aug 15;23(16):2088-95. doi: 10.1093/bioinformatics/btm306. Epub 2007 Jun 6.
7
PAP: a comprehensive workbench for mammalian transcriptional regulatory sequence analysis.PAP:用于哺乳动物转录调控序列分析的综合工作台。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W238-44. doi: 10.1093/nar/gkm308. Epub 2007 May 21.
8
Evaluation of clustering algorithms for protein-protein interaction networks.蛋白质-蛋白质相互作用网络聚类算法的评估
BMC Bioinformatics. 2006 Nov 6;7:488. doi: 10.1186/1471-2105-7-488.
9
A genetic signature of interspecies variations in gene expression.基因表达种间变异的遗传特征。
Nat Genet. 2006 Jul;38(7):830-4. doi: 10.1038/ng1819. Epub 2006 Jun 18.
10
Pvclust: an R package for assessing the uncertainty in hierarchical clustering.Pvclust:一个用于评估层次聚类不确定性的R语言包。
Bioinformatics. 2006 Jun 15;22(12):1540-2. doi: 10.1093/bioinformatics/btl117. Epub 2006 Apr 4.