多表型联合模型（MultiPhen）：联合多个表型的模型可增加 GWAS 中的发现。

MultiPhen: joint model of multiple phenotypes can increase discovery in GWAS.

机构信息

Department of Epidemiology and Biostatistics, Imperial College London, London, United Kingdom.

出版信息

PLoS One. 2012;7(5):e34861. doi: 10.1371/journal.pone.0034861. Epub 2012 May 2.

DOI:10.1371/journal.pone.0034861

PMID:22567092

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3342314/

Abstract

The genome-wide association study (GWAS) approach has discovered hundreds of genetic variants associated with diseases and quantitative traits. However, despite clinical overlap and statistical correlation between many phenotypes, GWAS are generally performed one-phenotype-at-a-time. Here we compare the performance of modelling multiple phenotypes jointly with that of the standard univariate approach. We introduce a new method and software, MultiPhen, that models multiple phenotypes simultaneously in a fast and interpretable way. By performing ordinal regression, MultiPhen tests the linear combination of phenotypes most associated with the genotypes at each SNP, and thus potentially captures effects hidden to single phenotype GWAS. We demonstrate via simulation that this approach provides a dramatic increase in power in many scenarios. There is a boost in power for variants that affect multiple phenotypes and for those that affect only one phenotype. While other multivariate methods have similar power gains, we describe several benefits of MultiPhen over these. In particular, we demonstrate that other multivariate methods that assume the genotypes are normally distributed, such as canonical correlation analysis (CCA) and MANOVA, can have highly inflated type-1 error rates when testing case-control or non-normal continuous phenotypes, while MultiPhen produces no such inflation. To test the performance of MultiPhen on real data we applied it to lipid traits in the Northern Finland Birth Cohort 1966 (NFBC1966). In these data MultiPhen discovers 21% more independent SNPs with known associations than the standard univariate GWAS approach, while applying MultiPhen in addition to the standard approach provides 37% increased discovery. The most associated linear combinations of the lipids estimated by MultiPhen at the leading SNPs accurately reflect the Friedewald Formula, suggesting that MultiPhen could be used to refine the definition of existing phenotypes or uncover novel heritable phenotypes.

摘要

全基因组关联研究（GWAS）方法已经发现了数百种与疾病和数量性状相关的遗传变异。然而，尽管许多表型之间存在临床重叠和统计学相关性，但 GWAS 通常是逐个表型进行的。在这里，我们比较了同时对多个表型进行建模的方法与标准单变量方法的性能。我们引入了一种新的方法和软件 MultiPhen，它可以快速、可解释地同时对多个表型进行建模。通过进行有序回归，MultiPhen 测试了与每个 SNP 基因型最相关的表型的线性组合，从而可能捕获到单表型 GWAS 隐藏的效应。通过模拟，我们证明了这种方法在许多情况下显著提高了功效。对于影响多个表型的变体和仅影响一个表型的变体，功效都有所提高。虽然其他多变量方法具有类似的功效增益，但我们描述了 MultiPhen 相对于这些方法的几个优势。特别是，我们证明了那些假设基因型呈正态分布的其他多变量方法，如典型相关分析（CCA）和多变量方差分析（MANOVA），在测试病例对照或非正态连续表型时，可能会产生高度膨胀的一类错误率，而 MultiPhen 则不会产生这种膨胀。为了在真实数据上测试 MultiPhen 的性能，我们将其应用于 1966 年芬兰北部出生队列（NFBC1966）中的脂质特征。在这些数据中，MultiPhen 发现了比标准单变量 GWAS 方法多 21%的具有已知关联的独立 SNP，而在标准方法之外应用 MultiPhen 则提供了 37%的发现增加。MultiPhen 在主要 SNP 上对脂质进行估计的最相关线性组合准确地反映了 Friedewald 公式，这表明 MultiPhen 可以用于细化现有表型的定义或发现新的遗传性表型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4051/3342314/367fe00d9cae/pone.0034861.g001.jpg

相似文献

MultiPhen: joint model of multiple phenotypes can increase discovery in GWAS.多表型联合模型（MultiPhen）：联合多个表型的模型可增加 GWAS 中的发现。

PLoS One. 2012;7(5):e34861. doi: 10.1371/journal.pone.0034861. Epub 2012 May 2.

Association Tests of Multiple Phenotypes: ATeMP.多表型关联测试：ATeMP

PLoS One. 2015 Oct 19;10(10):e0140348. doi: 10.1371/journal.pone.0140348. eCollection 2015.

Semiparametric Allelic Tests for Mapping Multiple Phenotypes: Binomial Regression and Mahalanobis Distance.用于定位多种表型的半参数等位基因检验：二项式回归和马氏距离

Genet Epidemiol. 2015 Dec;39(8):635-50. doi: 10.1002/gepi.21930. Epub 2015 Oct 23.

Joint Analysis of Multiple Phenotypes in Association Studies based on Cross-Validation Prediction Error.基于交叉验证预测误差的关联研究中多种表型的联合分析

Sci Rep. 2019 Jan 31;9(1):1073. doi: 10.1038/s41598-018-37538-y.

SCOPA and META-SCOPA: software for the analysis and aggregation of genome-wide association studies of multiple correlated phenotypes.SCOPA和META-SCOPA：用于分析和汇总多个相关表型的全基因组关联研究的软件。

BMC Bioinformatics. 2017 Jan 11;18(1):25. doi: 10.1186/s12859-016-1437-3.

Power Comparisons of Methods for Joint Association Analysis of Multiple Phenotypes.多表型联合关联分析方法的效能比较

Hum Hered. 2015;80(3):144-52. doi: 10.1159/000446239. Epub 2016 Jun 25.

Comparison of methods for multivariate gene-based association tests for complex diseases using common variants.利用常见变异进行复杂疾病的多元基因关联检验方法比较。

Eur J Hum Genet. 2019 May;27(5):811-823. doi: 10.1038/s41431-018-0327-8. Epub 2019 Jan 25.

An Adaptive Fisher's Combination Method for Joint Analysis of Multiple Phenotypes in Association Studies.一种用于关联研究中多种表型联合分析的自适应 Fisher 组合方法。

Sci Rep. 2016 Oct 3;6:34323. doi: 10.1038/srep34323.

A novel association test for multiple secondary phenotypes from a case-control GWAS.一种针对病例对照全基因组关联研究中多个次要表型的新型关联测试。

Genet Epidemiol. 2017 Jul;41(5):413-426. doi: 10.1002/gepi.22045. Epub 2017 Apr 10.

A hierarchical clustering method for dimension reduction in joint analysis of multiple phenotypes.一种用于多种表型联合分析中降维的层次聚类方法。

Genet Epidemiol. 2018 Jun;42(4):344-353. doi: 10.1002/gepi.22124. Epub 2018 Apr 22.

引用本文的文献

Joint modeling of mixed outcomes using a rank-based sparse neural network.使用基于秩的稀疏神经网络对混合结果进行联合建模。

J Biomed Inform. 2025 Jul 5;169:104870. doi: 10.1016/j.jbi.2025.104870.

The sequence kernel association test for the proportional odds model.比例优势模型的序列核关联检验。

Bioinformatics. 2025 Jun 2;41(6). doi: 10.1093/bioinformatics/btaf304.

Genet Epidemiol. 2025 Jul;49(5):e70012. doi: 10.1002/gepi.70012.

Plasma metabolomic signatures for copy number variants and COVID-19 risk loci in Northern Finland populations.芬兰北部人群中拷贝数变异和新冠病毒疾病风险位点的血浆代谢组学特征

Sci Rep. 2025 Apr 16;15(1):13172. doi: 10.1038/s41598-025-94839-9.

Exploring beyond diagnoses in electronic health records to improve discovery: a review of the phenome-wide association study.探索电子健康记录中的诊断之外的信息以改善发现：全表型关联研究综述

JAMIA Open. 2025 Feb 28;8(1):ooaf006. doi: 10.1093/jamiaopen/ooaf006. eCollection 2025 Feb.

Genome data based deep learning identified new genes predicting pharmacological treatment response of attention deficit hyperactivity disorder.基于基因组数据的深度学习识别出预测注意力缺陷多动障碍药物治疗反应的新基因。

Transl Psychiatry. 2025 Feb 7;15(1):46. doi: 10.1038/s41398-025-03250-5.

Pitfalls in performing genome-wide association studies on ratio traits.对比率性状进行全基因组关联研究时的陷阱。

HGG Adv. 2025 Apr 10;6(2):100406. doi: 10.1016/j.xhgg.2025.100406. Epub 2025 Jan 15.

Efficient multi-phenotype genome-wide analysis identifies genetic associations for unsupervised deep-learning-derived high-dimensional brain imaging phenotypes.高效的多表型全基因组分析确定了与无监督深度学习衍生的高维脑成像表型的遗传关联。

medRxiv. 2024 Dec 8:2024.12.06.24318618. doi: 10.1101/2024.12.06.24318618.

A novel phenotype imputation method with copula model.基于 copula 模型的新型表型推断方法。

BMC Bioinformatics. 2024 Nov 30;25(1):369. doi: 10.1186/s12859-024-05990-5.

SCAMPI: A scalable statistical framework for genome-wide interaction testing harnessing cross-trait correlations.SCAMPI：一种利用跨性状相关性进行全基因组相互作用测试的可扩展统计框架。

bioRxiv. 2024 Sep 14:2024.09.10.612314. doi: 10.1101/2024.09.10.612314.

本文引用的文献

New gene functions in megakaryopoiesis and platelet formation.新基因在巨核细胞生成和血小板形成中的功能。

Nature. 2011 Nov 30;480(7376):201-8. doi: 10.1038/nature10659.

Genetic variants in novel pathways influence blood pressure and cardiovascular disease risk.新途径中的遗传变异会影响血压和心血管疾病风险。

Nature. 2011 Sep 11;478(7367):103-9. doi: 10.1038/nature10405.

Pervasive sharing of genetic effects in autoimmune disease.自身免疫性疾病中遗传效应的普遍存在。

PLoS Genet. 2011 Aug;7(8):e1002254. doi: 10.1371/journal.pgen.1002254. Epub 2011 Aug 10.

The use of phenome-wide association studies (PheWAS) for exploration of novel genotype-phenotype relationships and pleiotropy discovery.利用表型全基因组关联研究（PheWAS）探索新的基因型-表型关系和多效性发现。

Genet Epidemiol. 2011 Jul;35(5):410-22. doi: 10.1002/gepi.20589. Epub 2011 May 18.

Identification of an imprinted master trans regulator at the KLF14 locus related to multiple metabolic phenotypes.鉴定与多种代谢表型相关的 KLF14 基因座上的印迹主转录调节因子。

Nat Genet. 2011 Jun;43(6):561-4. doi: 10.1038/ng.833. Epub 2011 May 15.

Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index.对 249796 人的关联分析揭示了 18 个与体重指数相关的新位点。

Nat Genet. 2010 Nov;42(11):937-48. doi: 10.1038/ng.686. Epub 2010 Oct 10.

Biological, clinical and population relevance of 95 loci for blood lipids.95 个与血脂相关的生物学、临床和人群相关性位点。

Nature. 2010 Aug 5;466(7307):707-13. doi: 10.1038/nature09270.

Analyze multivariate phenotypes in genetic association studies by combining univariate association tests.通过结合单变量关联测试分析遗传关联研究中的多变量表型。

Genet Epidemiol. 2010 Jul;34(5):444-54. doi: 10.1002/gepi.20497.

A genome-wide perspective of genetic variation in human metabolism.人类代谢中遗传变异的全基因组视角。

Nat Genet. 2010 Feb;42(2):137-41. doi: 10.1038/ng.507. Epub 2009 Dec 27.

An integrated phenomic approach to multivariate allelic association.一种整合表型方法用于多变量等位基因关联。

Eur J Hum Genet. 2010 Feb;18(2):233-9. doi: 10.1038/ejhg.2009.133. Epub 2009 Aug 26.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

多表型联合模型（MultiPhen）：联合多个表型的模型可增加 GWAS 中的发现。

MultiPhen: joint model of multiple phenotypes can increase discovery in GWAS.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献