A network-driven approach for genome-wide association mapping.

Authors

Lee Seunghak, Kong Soonho, Xing Eric P

Affiliation

School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA.

Publication information

Bioinformatics. 2016 Jun 15;32(12):i164-i173. doi: 10.1093/bioinformatics/btw270.

DOI:10.1093/bioinformatics/btw270

PMID:27307613

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4908354/

Abstract

MOTIVATION

It remains a challenge to detect associations between genotypes and phenotypes because of insufficient sample sizes and complex underlying mechanisms involved in associations. Fortunately, it is becoming more feasible to obtain gene expression data in addition to genotypes and phenotypes, giving us new opportunities to detect true genotype-phenotype associations while unveiling their association mechanisms.

RESULTS

In this article, we propose a novel method, NETAM, that accurately detects associations between SNPs and phenotypes, as well as gene traits involved in such associations. We take a network-driven approach: NETAM first constructs an association network, where nodes represent SNPs, gene traits or phenotypes, and edges represent the strength of association between two nodes. NETAM assigns a score to each path from an SNP to a phenotype, and then identifies significant paths based on the scores. In our simulation study, we show that NETAM finds significantly more phenotype-associated SNPs than traditional genotype-phenotype association analysis under false positive control, taking advantage of gene expression data. Furthermore, we applied NETAM on late-onset Alzheimer's disease data and identified 477 significant path associations, among which we analyzed paths related to beta-amyloid, estrogen, and nicotine pathways. We also provide hypothetical biological pathways to explain our findings.
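The network-and-path idea described above can be illustrated with a minimal sketch. This is not NETAM's implementation: the abstract does not state how edges are estimated or how paths are scored, so the sketch assumes a path score equal to the product of absolute edge weights and a fixed cutoff in place of a statistical significance test. All node names, weights, function names, and the threshold are hypothetical.

```python
# Hypothetical toy association network. Edge weights stand in for
# association strengths; names and numbers are invented for illustration.
snp_to_gene = {
    ("snp1", "geneA"): 0.8,
    ("snp1", "geneB"): 0.1,
    ("snp2", "geneB"): 0.6,
}
gene_to_pheno = {
    ("geneA", "pheno1"): 0.7,
    ("geneB", "pheno1"): 0.5,
}


def score_paths(snp_to_gene, gene_to_pheno):
    """Score every SNP -> gene trait -> phenotype path.

    Assumption: the score is the product of absolute edge weights.
    The abstract does not specify NETAM's actual scoring function.
    """
    scores = {}
    for (snp, gene), w1 in snp_to_gene.items():
        for (gene2, pheno), w2 in gene_to_pheno.items():
            if gene == gene2:  # the two edges share the intermediate gene
                scores[(snp, gene, pheno)] = abs(w1) * abs(w2)
    return scores


def select_paths(scores, threshold):
    """Keep paths scoring above a fixed, hypothetical cutoff."""
    return sorted(path for path, s in scores.items() if s > threshold)


scores = score_paths(snp_to_gene, gene_to_pheno)
significant = select_paths(scores, threshold=0.25)
print(significant)
```

In this toy example the weak snp1-geneB edge pulls its path below the cutoff, while the two paths supported by strong edges on both hops are retained. A real analysis would replace the fixed cutoff with a procedure that controls false positives, as the abstract indicates.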

AVAILABILITY AND IMPLEMENTATION

Software is available at http://www.sailing.cs.cmu.edu/

CONTACT

epxing@cs.cmu.edu

Similar articles

Backward genotype-transcript-phenotype association mapping.

Methods. 2017 Oct 1;129:18-23. doi: 10.1016/j.ymeth.2017.09.004. Epub 2017 Sep 14.

Leveraging input and output structures for joint mapping of epistatic and marginal eQTLs.

Bioinformatics. 2012 Jun 15;28(12):i137-46. doi: 10.1093/bioinformatics/bts227.

A time-varying group sparse additive model for genome-wide association studies of dynamic complex traits.

Bioinformatics. 2016 Oct 1;32(19):2903-10. doi: 10.1093/bioinformatics/btw347. Epub 2016 Jun 13.

A multivariate regression approach to association analysis of a quantitative trait network.

Bioinformatics. 2009 Jun 15;25(12):i204-12. doi: 10.1093/bioinformatics/btp218.

A Lasso multi-marker mixed model for association mapping with population structure correction.

Bioinformatics. 2013 Jan 15;29(2):206-14. doi: 10.1093/bioinformatics/bts669. Epub 2012 Nov 22.

SCOPA and META-SCOPA: software for the analysis and aggregation of genome-wide association studies of multiple correlated phenotypes.

BMC Bioinformatics. 2017 Jan 11;18(1):25. doi: 10.1186/s12859-016-1437-3.

Finding genome-transcriptome-phenome association with structured association mapping and visualization in GenAMap.

Pac Symp Biocomput. 2012:327-38.

From phenotype to genotype: an association study of longitudinal phenotypic markers to Alzheimer's disease relevant SNPs.

Bioinformatics. 2012 Sep 15;28(18):i619-i625. doi: 10.1093/bioinformatics/bts411.

iGWAS: Integrative Genome-Wide Association Studies of Genetic and Genomic Data for Disease Susceptibility Using Mediation Analysis.

Genet Epidemiol. 2015 Jul;39(5):347-56. doi: 10.1002/gepi.21905. Epub 2015 May 22.

Cited by

Using expression quantitative trait loci data and graph-embedded neural networks to uncover genotype-phenotype interactions.

Front Genet. 2022 Aug 15;13:921775. doi: 10.3389/fgene.2022.921775. eCollection 2022.

Addressing noise in co-expression network construction.

Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab495.

Linking genotype to phenotype in multi-omics data of small sample.

BMC Genomics. 2021 Jul 13;22(1):537. doi: 10.1186/s12864-021-07867-w.

A Network-guided Association Mapping Approach from DNA Methylation to Disease.

Sci Rep. 2019 Apr 3;9(1):5601. doi: 10.1038/s41598-019-42010-6.

Machine learning identifies interacting genetic variants contributing to breast cancer risk: A case study in Finnish cases and controls.

Sci Rep. 2018 Sep 3;8(1):13149. doi: 10.1038/s41598-018-31573-5.

Ensembles of Lasso Screening Rules.

IEEE Trans Pattern Anal Mach Intell. 2018 Dec;40(12):2841-2852. doi: 10.1109/TPAMI.2017.2765321. Epub 2017 Nov 24.

PlasFlow: predicting plasmid sequences in metagenomic data using genome signatures.

Nucleic Acids Res. 2018 Apr 6;46(6):e35. doi: 10.1093/nar/gkx1321.

Backward genotype-transcript-phenotype association mapping.

Methods. 2017 Oct 1;129:18-23. doi: 10.1016/j.ymeth.2017.09.004. Epub 2017 Sep 14.

References

Strong rules for discarding predictors in lasso-type problems.

J R Stat Soc Series B Stat Methodol. 2012 Mar;74(2):245-266. doi: 10.1111/j.1467-9868.2011.01004.x.

Aggrecan, link protein and tenascin-R are essential components of the perineuronal net to protect neurons against iron-induced oxidative stress.

Cell Death Dis. 2014 Mar 13;5(3):e1119. doi: 10.1038/cddis.2014.25.

CCDC62 variant rs12817488 is associated with the risk of Parkinson's disease in a Han Chinese population.

Eur Neurol. 2014;71(1-2):77-83. doi: 10.1159/000354333. Epub 2013 Dec 4.

Bridging the Gap between Genotype and Phenotype via Network Approaches.

Front Genet. 2013 May 31;3:227. doi: 10.3389/fgene.2012.00227. eCollection 2012.

Alzheimer's disease: review of hormone therapy trials and implications for treatment and prevention after menopause.

J Steroid Biochem Mol Biol. 2014 Jul;142:99-106. doi: 10.1016/j.jsbmb.2013.05.010. Epub 2013 May 28.

Integrated systems approach identifies genetic nodes and networks in late-onset Alzheimer's disease.

Cell. 2013 Apr 25;153(3):707-20. doi: 10.1016/j.cell.2013.03.030.

Wiki-pi: a web-server of annotated human protein-protein interactions to aid in discovery of protein function.

PLoS One. 2012;7(11):e49029. doi: 10.1371/journal.pone.0049029. Epub 2012 Nov 28.

Alzheimer disease: a tale of two prions.

Prion. 2013 Jan-Feb;7(1):14-9. doi: 10.4161/pri.22118. Epub 2012 Sep 10.

Genome-wide efficient mixed-model analysis for association studies.

Nat Genet. 2012 Jun 17;44(7):821-4. doi: 10.1038/ng.2310.

Leveraging input and output structures for joint mapping of epistatic and marginal eQTLs.

Bioinformatics. 2012 Jun 15;28(12):i137-46. doi: 10.1093/bioinformatics/bts227.
