生物过滤器：一种用于全基因组关联研究多位点分析的知识整合系统。

Biofilter: a knowledge-integration system for the multi-locus analysis of genome-wide association studies.

作者信息

Bush William S, Dudek Scott M, Ritchie Marylyn D

机构信息

Center for Human Genetics Research, Vanderbilt University, Nashville, TN 37232, USA.

出版信息

Pac Symp Biocomput. 2009:368-79.

PMID:19209715

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2859610/

Abstract

Genome-wide association studies provide an unprecedented opportunity to identify combinations of genetic variants that contribute to disease susceptibility. The combinatorial problem of jointly analyzing the millions of genetic variations accessible by high-throughput genotyping technologies is a difficult challenge. One approach to reducing the search space of this variable selection problem is to assess specific combinations of genetic variations based on prior statistical and biological knowledge. In this work, we provide a systematic approach to integrate multiple public databases of gene groupings and sets of disease-related genes to produce multi-SNP models that have an established biological foundation. This approach yields a collection of models which can be tested statistically in genome-wide data, along with an ordinal quantity describing the number of data sources that support any given model. Using this knowledge-driven approach reduces the computational and statistical burden of large-scale interaction analysis while simultaneously providing a biological foundation for the relevance of any significant statistical result that is found.

摘要

全基因组关联研究为识别导致疾病易感性的基因变异组合提供了前所未有的机会。通过高通量基因分型技术可获取数百万个基因变异，联合分析这些变异的组合问题是一项艰巨的挑战。减少此变量选择问题搜索空间的一种方法是基于先前的统计和生物学知识评估基因变异的特定组合。在这项工作中，我们提供了一种系统方法，整合多个关于基因分组和疾病相关基因集的公共数据库，以生成具有既定生物学基础的多单核苷酸多态性（SNP）模型。这种方法产生了一组模型，这些模型可在全基因组数据中进行统计测试，同时还产生一个序数数量，描述支持任何给定模型的数据源数量。使用这种知识驱动的方法可减轻大规模相互作用分析的计算和统计负担，同时为所发现的任何显著统计结果的相关性提供生物学基础。

相似文献

Biofilter: a knowledge-integration system for the multi-locus analysis of genome-wide association studies.生物过滤器：一种用于全基因组关联研究多位点分析的知识整合系统。

Pac Symp Biocomput. 2009:368-79.

Finding unique filter sets in PLATO: a precursor to efficient interaction analysis in GWAS data.在PLATO中寻找独特的过滤集：全基因组关联研究（GWAS）数据高效交互分析的前奏。

Pac Symp Biocomput. 2010:315-26.

Statistical epistasis networks reduce the computational complexity of searching three-locus genetic models.统计上位性网络降低了搜索三位点遗传模型的计算复杂性。

Pac Symp Biocomput. 2013:397-408.

Next-generation analysis of cataracts: determining knowledge driven gene-gene interactions using Biofilter, and gene-environment interactions using the PhenX Toolkit.白内障的下一代分析：使用生物过滤器确定知识驱动的基因-基因相互作用，以及使用PhenX工具包确定基因-环境相互作用。

Pac Symp Biocomput. 2013:147-58.

Discovering SNP-disease relationships in genome-wide SNP data using an improved harmony search based on SNP locus and genetic inheritance patterns.利用基于 SNP 位置和遗传遗传模式的改进和声搜索在全基因组 SNP 数据中发现 SNP 疾病关系。

PLoS One. 2023 Oct 13;18(10):e0292266. doi: 10.1371/journal.pone.0292266. eCollection 2023.

MegaSNPHunter: a learning approach to detect disease predisposition SNPs and high level interactions in genome wide association study.MegaSNPHunter：一种在全基因组关联研究中检测疾病易感性单核苷酸多态性和高阶相互作用的学习方法。

BMC Bioinformatics. 2009 Jan 9;10:13. doi: 10.1186/1471-2105-10-13.

Gene, pathway and network frameworks to identify epistatic interactions of single nucleotide polymorphisms derived from GWAS data.用于识别源自全基因组关联研究（GWAS）数据的单核苷酸多态性上位性相互作用的基因、通路和网络框架。

BMC Syst Biol. 2012;6 Suppl 3(Suppl 3):S15. doi: 10.1186/1752-0509-6-S3-S15. Epub 2012 Dec 17.

OPATs: Omnibus P-value association tests.OPATs：全集合 P 值关联检验。

Brief Bioinform. 2019 Jan 18;20(1):1-14. doi: 10.1093/bib/bbx068.

HiSSI: high-order SNP-SNP interactions detection based on efficient significant pattern and differential evolution.HiSSI：基于高效显著模式和差分进化的高阶 SNP-SNP 交互检测。

BMC Med Genomics. 2019 Dec 30;12(Suppl 7):139. doi: 10.1186/s12920-019-0584-6.

AprioriGWAS, a new pattern mining strategy for detecting genetic variants associated with disease through interaction effects.AprioriGWAS，一种通过交互作用效应检测与疾病相关基因变异的新模式挖掘策略。

PLoS Comput Biol. 2014 Jun 5;10(6):e1003627. doi: 10.1371/journal.pcbi.1003627. eCollection 2014 Jun.

引用本文的文献

Genome-Wide Association Study of Breast Density among Women of African Ancestry.非洲裔女性乳腺密度的全基因组关联研究。

Cancers (Basel). 2023 May 16;15(10):2776. doi: 10.3390/cancers15102776.

Evidence of epistasis in regions of long-range linkage disequilibrium across five complex diseases in the UK Biobank and eMERGE datasets.在英国生物银行和 eMERGE 数据集的五个复杂疾病的长程连锁不平衡区域中发现了上位性的证据。

Am J Hum Genet. 2023 Apr 6;110(4):575-591. doi: 10.1016/j.ajhg.2023.03.007.

Development and Trends in Artificial Intelligence in Critical Care Medicine: A Bibliometric Analysis of Related Research over the Period of 2010-2021.重症医学中人工智能的发展与趋势：2010 - 2021年相关研究的文献计量分析

J Pers Med. 2022 Dec 27;13(1):50. doi: 10.3390/jpm13010050.

Leveraging phenotypic variability to identify genetic interactions in human phenotypes.利用表型变异性鉴定人类表型中的遗传相互作用。

Am J Hum Genet. 2021 Jan 7;108(1):49-67. doi: 10.1016/j.ajhg.2020.11.016. Epub 2020 Dec 15.

Investigation of gene-gene interactions in cardiac traits and serum fatty acid levels in the LURIC Health Study.在 LURIC 健康研究中调查心脏特征和血清脂肪酸水平的基因-基因相互作用。

PLoS One. 2020 Sep 11;15(9):e0238304. doi: 10.1371/journal.pone.0238304. eCollection 2020.

A framework for pathway knowledge driven prioritization in genome-wide association studies.基于通路知识的全基因组关联研究优先级排序框架。

Genet Epidemiol. 2020 Nov;44(8):841-853. doi: 10.1002/gepi.22345. Epub 2020 Aug 10.

Application of data mining for predicting hemodynamics instability during pheochromocytoma surgery.应用数据挖掘预测嗜铬细胞瘤手术中血液动力学不稳定。

BMC Med Inform Decis Mak. 2020 Jul 20;20(1):165. doi: 10.1186/s12911-020-01180-4.

Discovering genetic interactions bridging pathways in genome-wide association studies.发现全基因组关联研究中连接途径的遗传相互作用。

Nat Commun. 2019 Sep 19;10(1):4274. doi: 10.1038/s41467-019-12131-7.

Confounding of linkage disequilibrium patterns in large scale DNA based gene-gene interaction studies.大规模基于DNA的基因-基因相互作用研究中连锁不平衡模式的混杂

BioData Min. 2019 Jun 10;12:11. doi: 10.1186/s13040-019-0199-7. eCollection 2019.

Innovative strategies for annotating the "relationSNP" between variants and molecular phenotypes.注释变异体与分子表型之间“关系单核苷酸多态性”的创新策略。

BioData Min. 2019 May 14;12:10. doi: 10.1186/s13040-019-0197-9. eCollection 2019.

本文引用的文献

Routine Discovery of Complex Genetic Models using Genetic Algorithms.使用遗传算法对复杂遗传模型进行常规发现。

Appl Soft Comput. 2004 Feb 1;4(1):79-86. doi: 10.1016/j.asoc.2003.08.003.

Gathering the gold dust: methods for assessing the aggregate impact of small effect genes in genomic scans.收集金粉：基因组扫描中评估小效应基因总体影响的方法

Pac Symp Biocomput. 2008:190-200.

KEGG for linking genomes to life and the environment.京都基因与基因组百科全书，用于将基因组与生命及环境相联系。

Nucleic Acids Res. 2008 Jan;36(Database issue):D480-4. doi: 10.1093/nar/gkm882. Epub 2007 Dec 12.

The Pfam protein families database.Pfam蛋白质家族数据库。

Nucleic Acids Res. 2008 Jan;36(Database issue):D281-8. doi: 10.1093/nar/gkm960. Epub 2007 Nov 26.

Prioritized subset analysis: improving power in genome-wide association studies.优先子集分析：提高全基因组关联研究的效能

Hum Hered. 2008;65(3):129-41. doi: 10.1159/000109730. Epub 2007 Oct 12.

Reactome: a knowledge base of biologic pathways and processes.Reactome：生物通路和过程的知识库。

Genome Biol. 2007;8(3):R39. doi: 10.1186/gb-2007-8-3-r39.

Reconstruction of a functional human gene network, with an application for prioritizing positional candidate genes.功能性人类基因网络的重建及其在定位候选基因优先级排序中的应用。

Am J Hum Genet. 2006 Jun;78(6):1011-25. doi: 10.1086/504300. Epub 2006 Apr 25.

Traversing the conceptual divide between biological and statistical epistasis: systems biology and a more modern synthesis.跨越生物学上位性与统计上位性之间的概念鸿沟：系统生物学与更现代的综合理论

Bioessays. 2005 Jun;27(6):637-46. doi: 10.1002/bies.20236.

Mapping complex disease loci in whole-genome association studies.全基因组关联研究中的复杂疾病基因座定位

Nature. 2004 May 27;429(6990):446-52. doi: 10.1038/nature02623.

The genetic association database.基因关联数据库。

Nat Genet. 2004 May;36(5):431-2. doi: 10.1038/ng0504-431.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。