• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于功能研究中 SNP 优先级排序的潜在模型。

A latent model for prioritization of SNPs for functional studies.

机构信息

Department of Health Sciences Research, Mayo Clinic College of Medicine, Rochester, Minnesota, United States of America.

出版信息

PLoS One. 2011;6(6):e20764. doi: 10.1371/journal.pone.0020764. Epub 2011 Jun 8.

DOI:10.1371/journal.pone.0020764
PMID:21687685
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3110798/
Abstract

One difficult question facing researchers is how to prioritize SNPs detected from genetic association studies for functional studies. Often a list of the top M SNPs is determined based on solely the p-value from an association analysis, where M is determined by financial/time constraints. For many studies of complex diseases, multiple analyses have been completed and integrating these multiple sets of results may be difficult. One may also wish to incorporate biological knowledge, such as whether the SNP is in the exon of a gene or a regulatory region, into the selection of markers to follow-up. In this manuscript, we propose a Bayesian latent variable model (BLVM) for incorporating "features" about a SNP to estimate a latent "quality score", with SNPs prioritized based on the posterior probability distribution of the rankings of these quality scores. We illustrate the method using data from an ovarian cancer genome-wide association study (GWAS). In addition to the application of the BLVM to the ovarian GWAS, we applied the BLVM to simulated data which mimics the setting involving the prioritization of markers across multiple GWAS for related diseases/traits. The top ranked SNP by BLVM for the ovarian GWAS, ranked 2(nd) and 7(th) based on p-values from analyses of all invasive and invasive serous cases. The top SNP based on serous case analysis p-value (which ranked 197(th) for invasive case analysis), was ranked 8(th) based on the posterior probability of being in the top 5 markers (0.13). In summary, the application of the BLVM allows for the systematic integration of multiple SNP "features" for the prioritization of loci for fine-mapping or functional studies, taking into account the uncertainty in ranking.

摘要

研究人员面临的一个难题是如何为功能研究对从遗传关联研究中检测到的 SNP 进行优先级排序。通常,根据关联分析的 p 值确定前 M 个 SNP 的列表,其中 M 由财务/时间限制决定。对于许多复杂疾病的研究,已经完成了多项分析,整合这些多组结果可能很困难。人们可能还希望将生物学知识(例如 SNP 是否位于基因的外显子或调控区域)纳入后续标记的选择中。在本文中,我们提出了一种贝叶斯潜在变量模型 (BLVM),用于将 SNP 的“特征”纳入其中,以估计潜在的“质量分数”,并根据这些质量分数排名的后验概率分布对 SNP 进行优先级排序。我们使用卵巢癌全基因组关联研究 (GWAS) 的数据说明了该方法。除了将 BLVM 应用于卵巢 GWAS 之外,我们还将 BLVM 应用于模拟数据,该模拟数据模拟了在多个与相关疾病/特征相关的 GWAS 中对标记进行优先级排序的情况。BLVM 对卵巢 GWAS 的排名最高的 SNP 排名第 2(nd)和第 7(th),基于所有侵袭性和侵袭性浆液性病例分析的 p 值。基于浆液性病例分析 p 值的排名最高的 SNP(在侵袭性病例分析中排名第 197(th)),基于排在前 5 个标记的后验概率(0.13)排名第 8(th)。总之,BLVM 的应用允许对多个 SNP“特征”进行系统整合,以对精细映射或功能研究的基因座进行优先级排序,同时考虑到排名的不确定性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1c4/3110798/8e427ad1dead/pone.0020764.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1c4/3110798/345cc49b1e8e/pone.0020764.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1c4/3110798/7f6154d39bed/pone.0020764.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1c4/3110798/49e537deb6c1/pone.0020764.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1c4/3110798/8e427ad1dead/pone.0020764.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1c4/3110798/345cc49b1e8e/pone.0020764.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1c4/3110798/7f6154d39bed/pone.0020764.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1c4/3110798/49e537deb6c1/pone.0020764.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1c4/3110798/8e427ad1dead/pone.0020764.g004.jpg

相似文献

1
A latent model for prioritization of SNPs for functional studies.用于功能研究中 SNP 优先级排序的潜在模型。
PLoS One. 2011;6(6):e20764. doi: 10.1371/journal.pone.0020764. Epub 2011 Jun 8.
2
A method combining a random forest-based technique with the modeling of linkage disequilibrium through latent variables, to run multilocus genome-wide association studies.一种结合基于随机森林的技术和通过潜在变量进行连锁不平衡建模的方法,用于进行多基因座全基因组关联研究。
BMC Bioinformatics. 2018 Mar 27;19(1):106. doi: 10.1186/s12859-018-2054-0.
3
SNP prioritization using a Bayesian probability of association.使用关联贝叶斯概率进行 SNP 优先级排序。
Genet Epidemiol. 2013 Feb;37(2):214-21. doi: 10.1002/gepi.21704. Epub 2012 Dec 26.
4
Multiple SNP Set Analysis for Genome-Wide Association Studies Through Bayesian Latent Variable Selection.通过贝叶斯潜在变量选择进行全基因组关联研究的多单核苷酸多态性集分析
Genet Epidemiol. 2015 Dec;39(8):664-77. doi: 10.1002/gepi.21932. Epub 2015 Oct 30.
5
Incorporating Functional Genomic Information in Genetic Association Studies Using an Empirical Bayes Approach.使用经验贝叶斯方法将功能基因组信息纳入基因关联研究。
Genet Epidemiol. 2016 Apr;40(3):176-87. doi: 10.1002/gepi.21956. Epub 2016 Feb 1.
6
iLOCi: a SNP interaction prioritization technique for detecting epistasis in genome-wide association studies.iLOCi:一种 SNP 相互作用优先级技术,用于检测全基因组关联研究中的上位性。
BMC Genomics. 2012;13 Suppl 7(Suppl 7):S2. doi: 10.1186/1471-2164-13-S7-S2. Epub 2012 Dec 13.
7
Association between DNA damage response and repair genes and risk of invasive serous ovarian cancer.DNA 损伤反应和修复基因与侵袭性浆液性卵巢癌风险的关联。
PLoS One. 2010 Apr 8;5(4):e10061. doi: 10.1371/journal.pone.0010061.
8
Using prior information from the medical literature in GWAS of oral cancer identifies novel susceptibility variant on chromosome 4--the AdAPT method.利用口腔癌 GWAS 中来自医学文献的先验信息,确定了染色体 4 上的新型易感性变异体——AdAPT 方法。
PLoS One. 2012;7(5):e36888. doi: 10.1371/journal.pone.0036888. Epub 2012 May 25.
9
Shared genetic factors for age at natural menopause in Iranian and European women.伊朗和欧洲女性自然绝经年龄的共享遗传因素。
Hum Reprod. 2013 Jul;28(7):1987-94. doi: 10.1093/humrep/det106. Epub 2013 Apr 16.
10
Selecting Closely-Linked SNPs Based on Local Epistatic Effects for Haplotype Construction Improves Power of Association Mapping.基于局部上位效应选择紧密连锁 SNPs 进行单倍型构建可提高关联作图的功效。
G3 (Bethesda). 2019 Dec 3;9(12):4115-4126. doi: 10.1534/g3.119.400451.

引用本文的文献

1
Genetic associations in ankylosing spondylitis: circulating proteins as drug targets and biomarkers.强直性脊柱炎的遗传相关性:循环蛋白作为药物靶点和生物标志物。
Front Immunol. 2024 May 21;15:1394438. doi: 10.3389/fimmu.2024.1394438. eCollection 2024.
2
Reaching the End-Game for GWAS: Machine Learning Approaches for the Prioritization of Complex Disease Loci.全基因组关联研究进入终局:用于复杂疾病基因座优先级排序的机器学习方法
Front Genet. 2020 Apr 15;11:350. doi: 10.3389/fgene.2020.00350. eCollection 2020.
3
Incorporating Functional Genomic Information in Genetic Association Studies Using an Empirical Bayes Approach.

本文引用的文献

1
Integrating pathway analysis and genetics of gene expression for genome-wide association studies.整合通路分析和基因表达的遗传学用于全基因组关联研究。
Am J Hum Genet. 2010 Apr 9;86(4):581-91. doi: 10.1016/j.ajhg.2010.02.020. Epub 2010 Mar 25.
2
SCAN: SNP and copy number annotation.扫描:单核苷酸多态性和拷贝数注释。
Bioinformatics. 2010 Jan 15;26(2):259-62. doi: 10.1093/bioinformatics/btp644. Epub 2009 Nov 17.
3
Genotype imputation.基因型推算
使用经验贝叶斯方法将功能基因组信息纳入基因关联研究。
Genet Epidemiol. 2016 Apr;40(3):176-87. doi: 10.1002/gepi.21956. Epub 2016 Feb 1.
4
Comparing the efficacy of SNP filtering methods for identifying a single causal SNP in a known association region.比较单核苷酸多态性(SNP)过滤方法在已知关联区域中识别单个因果SNP的功效。
Ann Hum Genet. 2014 Jan;78(1):50-61. doi: 10.1111/ahg.12043. Epub 2013 Nov 11.
5
SNP prioritization using a Bayesian probability of association.使用关联贝叶斯概率进行 SNP 优先级排序。
Genet Epidemiol. 2013 Feb;37(2):214-21. doi: 10.1002/gepi.21704. Epub 2012 Dec 26.
6
The genetic basis for interindividual immune response variation to measles vaccine: new understanding and new vaccine approaches.个体对麻疹疫苗免疫反应差异的遗传基础:新的认识和新的疫苗方法。
Expert Rev Vaccines. 2013 Jan;12(1):57-70. doi: 10.1586/erv.12.134.
Annu Rev Genomics Hum Genet. 2009;10:387-406. doi: 10.1146/annurev.genom.9.081307.164242.
4
Cancer statistics, 2009.2009年癌症统计数据。
CA Cancer J Clin. 2009 Jul-Aug;59(4):225-49. doi: 10.3322/caac.20006. Epub 2009 May 27.
5
Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.全基因组关联位点对人类疾病和性状的潜在病因学及功能影响。
Proc Natl Acad Sci U S A. 2009 Jun 9;106(23):9362-7. doi: 10.1073/pnas.0903103106. Epub 2009 May 27.
6
Validating, augmenting and refining genome-wide association signals.验证、增强和完善全基因组关联信号。
Nat Rev Genet. 2009 May;10(5):318-29. doi: 10.1038/nrg2544.
7
PLINK: a tool set for whole-genome association and population-based linkage analyses.PLINK:一个用于全基因组关联分析和基于群体的连锁分析的工具集。
Am J Hum Genet. 2007 Sep;81(3):559-75. doi: 10.1086/519795. Epub 2007 Jul 25.
8
Imputation-based analysis of association studies: candidate regions and quantitative traits.基于归因的关联研究分析:候选区域和数量性状
PLoS Genet. 2007 Jul;3(7):e114. doi: 10.1371/journal.pgen.0030114. Epub 2007 May 30.
9
Enriching the analysis of genomewide association studies with hierarchical modeling.用分层模型丰富全基因组关联研究的分析。
Am J Hum Genet. 2007 Aug;81(2):397-404. doi: 10.1086/519794. Epub 2007 Jun 26.
10
Hierarchical Bayes prioritization of marker associations from a genome-wide association scan for further investigation.基于全基因组关联扫描的标记关联分层贝叶斯优先级排序以进行进一步研究。
Genet Epidemiol. 2007 Dec;31(8):871-82. doi: 10.1002/gepi.20248.