利用表型连锁网络对基因变异进行无偏功能聚类。

Unbiased functional clustering of gene variants with a phenotypic-linkage network.

作者信息

Honti Frantisek, Meader Stephen, Webber Caleb

机构信息

MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom.

出版信息

PLoS Comput Biol. 2014 Aug 28;10(8):e1003815. doi: 10.1371/journal.pcbi.1003815. eCollection 2014 Aug.

DOI:10.1371/journal.pcbi.1003815

PMID:25166029

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4148192/

Abstract

Groupwise functional analysis of gene variants is becoming standard in next-generation sequencing studies. As the function of many genes is unknown and their classification to pathways is scant, functional associations between genes are often inferred from large-scale omics data. Such data types--including protein-protein interactions and gene co-expression networks--are used to examine the interrelations of the implicated genes. Statistical significance is assessed by comparing the interconnectedness of the mutated genes with that of random gene sets. However, interconnectedness can be affected by confounding bias, potentially resulting in false positive findings. We show that genes implicated through de novo sequence variants are biased in their coding-sequence length and longer genes tend to cluster together, which leads to exaggerated p-values in functional studies; we present here an integrative method that addresses these bias. To discern molecular pathways relevant to complex disease, we have inferred functional associations between human genes from diverse data types and assessed them with a novel phenotype-based method. Examining the functional association between de novo gene variants, we control for the heretofore unexplored confounding bias in coding-sequence length. We test different data types and networks and find that the disease-associated genes cluster more significantly in an integrated phenotypic-linkage network than in other gene networks. We present a tool of superior power to identify functional associations among genes mutated in the same disease even after accounting for significant sequencing study bias and demonstrate the suitability of this method to functionally cluster variant genes underlying polygenic disorders.

摘要

在下一代测序研究中，基因变异的分组功能分析正变得越来越标准化。由于许多基因的功能未知，且它们在通路中的分类很少，基因之间的功能关联通常是从大规模组学数据中推断出来的。这些数据类型——包括蛋白质-蛋白质相互作用和基因共表达网络——被用来研究相关基因的相互关系。通过将突变基因的连通性与随机基因集的连通性进行比较来评估统计显著性。然而，连通性可能会受到混杂偏差的影响，从而可能导致假阳性结果。我们表明，通过从头序列变异牵连到的基因在其编码序列长度上存在偏差，并且较长的基因倾向于聚集在一起，这导致功能研究中的p值被夸大；我们在此提出一种解决这些偏差的综合方法。为了识别与复杂疾病相关的分子通路，我们从不同的数据类型中推断出人类基因之间的功能关联，并用一种基于新表型的方法对它们进行评估。在研究从头基因变异之间的功能关联时，我们控制了编码序列长度方面迄今为止未被探索的混杂偏差。我们测试了不同的数据类型和网络，发现与疾病相关的基因在整合的表型连锁网络中比在其他基因网络中聚类更显著。我们提出了一种具有更高功效的工具，即使在考虑了显著的测序研究偏差之后，也能识别在同一种疾病中发生突变的基因之间的功能关联，并证明了该方法适用于对多基因疾病潜在的变异基因进行功能聚类。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d299/4148192/29e8d743043d/pcbi.1003815.g001.jpg

相似文献

Unbiased functional clustering of gene variants with a phenotypic-linkage network.

PLoS Comput Biol. 2014 Aug 28;10(8):e1003815. doi: 10.1371/journal.pcbi.1003815. eCollection 2014 Aug.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Clinical presentation of congenital hypogonadotropic hypogonadism in males with delayed puberty according to genetic etiology: a systematic review and meta-analysis after reclassification of gene variants.

Hum Reprod. 2025 May 1;40(5):904-918. doi: 10.1093/humrep/deaf041.

A random set scoring model for prioritization of disease candidate genes using protein complexes and data-mining of GeneRIF, OMIM and PubMed records.

BMC Bioinformatics. 2014 Sep 24;15(1):315. doi: 10.1186/1471-2105-15-315.

Genome analysis and knowledge-driven variant interpretation with TGex.

BMC Med Genomics. 2019 Dec 30;12(1):200. doi: 10.1186/s12920-019-0647-8.

De novo missense variants disrupting protein-protein interactions affect risk for autism through gene co-expression and protein networks in neuronal cell types.

Mol Autism. 2020 Oct 8;11(1):76. doi: 10.1186/s13229-020-00386-7.

Functional genomics complements quantitative genetics in identifying disease-gene associations.

PLoS Comput Biol. 2010 Nov 11;6(11):e1000991. doi: 10.1371/journal.pcbi.1000991.

Diverse genetic causes of polymicrogyria with epilepsy.

Epilepsia. 2021 Apr;62(4):973-983. doi: 10.1111/epi.16854. Epub 2021 Apr 5.

A systems level, functional genomics analysis of chronic epilepsy.

PLoS One. 2011;6(6):e20763. doi: 10.1371/journal.pone.0020763. Epub 2011 Jun 14.

BioBin: a bioinformatics tool for automating the binning of rare variants using publicly available biological knowledge.

BMC Med Genomics. 2013;6 Suppl 2(Suppl 2):S6. doi: 10.1186/1755-8794-6-S2-S6. Epub 2013 May 7.

引用本文的文献

Deep phenotyping for precision medicine in Parkinson's disease.

Dis Model Mech. 2022 Jun 1;15(6). doi: 10.1242/dmm.049376.

Integration of functional genomics data to uncover cell type-specific pathways affected in Parkinson's disease.

Biochem Soc Trans. 2021 Nov 1;49(5):2091-2100. doi: 10.1042/BST20210128.

Combining multiomics and drug perturbation profiles to identify muscle-specific treatments for spinal muscular atrophy.

JCI Insight. 2021 Jul 8;6(13):e149446. doi: 10.1172/jci.insight.149446.

Human-Specific Transcriptome of Ventral and Dorsal Midbrain Dopamine Neurons.

Ann Neurol. 2020 Jun;87(6):853-868. doi: 10.1002/ana.25719. Epub 2020 Mar 30.

Large-scale neuroanatomical study uncovers 198 gene associations in mouse brain morphogenesis.

Nat Commun. 2019 Aug 1;10(1):3465. doi: 10.1038/s41467-019-11431-2.

Habituation Learning Is a Widely Affected Mechanism in Drosophila Models of Intellectual Disability and Autism Spectrum Disorders.

Biol Psychiatry. 2019 Aug 15;86(4):294-305. doi: 10.1016/j.biopsych.2019.04.029. Epub 2019 May 9.

The genomic basis of mood instability: identification of 46 loci in 363,705 UK Biobank participants, genetic correlation with psychiatric disorders, and association with gene expression and function.

Mol Psychiatry. 2020 Nov;25(11):3091-3099. doi: 10.1038/s41380-019-0439-8. Epub 2019 Jun 5.

Single-Cell Sequencing of iPSC-Dopamine Neurons Reconstructs Disease Progression and Identifies HDAC4 as a Regulator of Parkinson Cell Phenotypes.

Cell Stem Cell. 2019 Jan 3;24(1):93-106.e6. doi: 10.1016/j.stem.2018.10.023. Epub 2018 Nov 29.

Diverse type 2 diabetes genetic risk factors functionally converge in a phenotype-focused gene network.

PLoS Comput Biol. 2017 Oct 23;13(10):e1005816. doi: 10.1371/journal.pcbi.1005816. eCollection 2017 Oct.

Transcriptomic profiling of purified patient-derived dopamine neurons identifies convergent perturbations and therapeutics for Parkinson's disease.

Hum Mol Genet. 2017 Feb 1;26(3):552-566. doi: 10.1093/hmg/ddw412.

本文引用的文献

The roles of FMRP-regulated genes in autism spectrum disorder: single- and multiple-hit genetic etiologies.

Am J Hum Genet. 2013 Nov 7;93(5):825-39. doi: 10.1016/j.ajhg.2013.09.013. Epub 2013 Oct 24.

Improved exome prioritization of disease genes through cross-species phenotype comparison.

Genome Res. 2014 Feb;24(2):340-8. doi: 10.1101/gr.160325.113. Epub 2013 Oct 25.

De novo mutations in epileptic encephalopathies.

Nature. 2013 Sep 12;501(7466):217-21. doi: 10.1038/nature12439. Epub 2013 Aug 11.

Spatial and temporal mapping of de novo mutations in schizophrenia to a fetal prefrontal cortical network.

Cell. 2013 Aug 1;154(3):518-29. doi: 10.1016/j.cell.2013.06.049.

Diagnostic exome sequencing in persons with severe intellectual disability.

N Engl J Med. 2012 Nov 15;367(20):1921-9. doi: 10.1056/NEJMoa1206524. Epub 2012 Oct 3.

Range of genetic mutations associated with severe non-syndromic sporadic intellectual disability: an exome sequencing study.

Lancet. 2012 Nov 10;380(9854):1674-82. doi: 10.1016/S0140-6736(12)61480-9. Epub 2012 Sep 27.

The International Mouse Phenotyping Consortium: past and future perspectives on mouse phenotyping.

Mamm Genome. 2012 Oct;23(9-10):632-40. doi: 10.1007/s00335-012-9427-x. Epub 2012 Sep 1.

Patterns and rates of exonic de novo mutations in autism spectrum disorders.

Nature. 2012 Apr 4;485(7397):242-5. doi: 10.1038/nature11011.

Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations.

Nature. 2012 Apr 4;485(7397):246-50. doi: 10.1038/nature10989.

De novo mutations revealed by whole-exome sequencing are strongly associated with autism.

Nature. 2012 Apr 4;485(7397):237-41. doi: 10.1038/nature10945.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

利用表型连锁网络对基因变异进行无偏功能聚类。

Unbiased functional clustering of gene variants with a phenotypic-linkage network.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献