A quadratically regularized functional canonical correlation analysis for identifying the global structure of pleiotropy with NGS data.

Authors

Lin Nan, Zhu Yun, Fan Ruzong, Xiong Momiao

Affiliations

Department of Biostatistics and Data Science, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, United States of America.

Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, United States of America.

Publication information

PLoS Comput Biol. 2017 Oct 17;13(10):e1005788. doi: 10.1371/journal.pcbi.1005788. eCollection 2017 Oct.

DOI:10.1371/journal.pcbi.1005788

PMID:29040274

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5659802/

Abstract

Investigating the pleiotropic effects of genetic variants can increase statistical power, provide important information for understanding the complex genetic structure of disease, and offer powerful tools for designing effective treatments with fewer side effects. However, the current paradigm for multiple-phenotype association analysis lacks breadth (the number of phenotypes and genetic variants analyzed jointly) and depth (the hierarchical structure of phenotypes and genotypes). A key issue in high-dimensional pleiotropic analysis is to effectively extract informative internal representations and features from high-dimensional genotype and phenotype data. To exploit correlation information among genetic variants, effectively reduce data dimensions, and overcome critical barriers to developing novel statistical methods and computational algorithms for genetic pleiotropic analysis, we propose a new statistical method for association analysis, referred to as quadratically regularized functional CCA (QRFCCA), which combines three approaches: (1) quadratically regularized matrix factorization, (2) functional data analysis, and (3) canonical correlation analysis (CCA). Large-scale simulations show that QRFCCA has much higher power than ten competing statistics while retaining appropriate type 1 error rates. To further evaluate performance, QRFCCA and the ten other statistics are applied to the whole-genome sequencing dataset from the TwinsUK study. Using QRFCCA, we identify a total of 79 genes with rare variants and 67 genes with common variants significantly associated with the 46 traits. The results show that QRFCCA substantially outperforms the ten other statistics.
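To make two of the three ingredients named in the abstract more concrete, the following is a minimal NumPy sketch, not the authors' implementation. It assumes the standard closed-form solution of quadratically regularized matrix factorization (soft-thresholding the leading singular values) and computes canonical correlations from orthonormal bases of the reduced data. The functional data analysis step is omitted entirely, and the function names, the rank `k`, the penalty `gamma`, and the simulated genotype and phenotype matrices are all hypothetical choices made for illustration.

```python
import numpy as np

def quad_reg_factorize(A, k, gamma):
    """Rank-k minimizer of ||A - X @ Y||_F^2 + gamma * (||X||_F^2 + ||Y||_F^2).

    Assumed closed form: shrink the k leading singular values of A by gamma
    and split the shrunken values evenly between the two factors.
    """
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    d = np.sqrt(np.maximum(s[:k] - gamma, 0.0))
    X = U[:, :k] * d          # n x k sample scores
    Y = d[:, None] * Vt[:k]   # k x p loadings
    return X, Y

def orthonormal_basis(M, tol=1e-10):
    """Orthonormal basis for the column space of column-centered M."""
    Mc = M - M.mean(axis=0)
    U, s, _ = np.linalg.svd(Mc, full_matrices=False)
    return U[:, s > tol * s.max()]

def canonical_correlations(X, Y):
    """Canonical correlations between the columns of X and the columns of Y."""
    Qx = orthonormal_basis(X)
    Qy = orthonormal_basis(Y)
    rho = np.linalg.svd(Qx.T @ Qy, compute_uv=False)
    return np.clip(rho, 0.0, 1.0)

# Hypothetical toy data: 200 samples, 30 variants coded 0/1/2, 5 traits.
rng = np.random.default_rng(0)
n = 200
G = rng.binomial(2, 0.3, size=(n, 30)).astype(float)
B = rng.normal(size=(3, 5))                       # effects of the first 3 variants
P = G[:, :3] @ B + rng.normal(size=(n, 5))

# Reduce each data set, then correlate the low-dimensional scores.
Gx, _ = quad_reg_factorize(G - G.mean(axis=0), k=5, gamma=1.0)
Px, _ = quad_reg_factorize(P - P.mean(axis=0), k=3, gamma=1.0)
rho = canonical_correlations(Gx, Px)
print(rho)
```

In this sketch the dimension reduction is applied to each data set separately before the correlation step, so the number of canonical correlations is bounded by the smaller of the two retained ranks. How the paper forms its test statistic from these quantities is not stated in the abstract and is not reproduced here.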
