• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

收集金粉:基因组扫描中评估小效应基因总体影响的方法

Gathering the gold dust: methods for assessing the aggregate impact of small effect genes in genomic scans.

作者信息

Province Michael A, Borecki Ingrid B

机构信息

Division of Statistical Genomics, Box 8506, Center for Genome Sciences, Washington University School of Medicine, 4444 Forest Park Blvd, St. Louis, MO 63108, USA.

出版信息

Pac Symp Biocomput. 2008:190-200.

PMID:18229686
Abstract

Genomewide association scan (GWAS) data mining has found moderate-effect "gold nugget" complex trait genes. But for many traits, much of the explanatory variance may be truly polygenic, more like gold dust, whose small marginal effects are undetectable by traditional methods. Yet, their collective effects may be quite important in advancing personalized medicine. We consider a novel approach to sift out the genetic gold dust influencing quantitative (or qualitative) traits. Out of a GWAS, we randomly grab handfuls of SNPs, modeling their effects in a multiple linear (or logistic) regression. The model's significance is used to obtain an iteratively updated pseudo-Bayesian posterior probability associated with each SNP, which is repeated over many random draws until the distribution becomes stable. A stepwise procedure culls the list of SNPs to define the final set. Results from a benchmark simulation of 5 quantitative trait genes among 1,000, in 1,000 random subjects, are contrasted with marginal tests using nominal significance, Bonferroni-corrected significance, false discovery rates, as well as with serial selection methods. Random handfuls produced the best combination of sensitivity (0.95) specificity (0.99) and true positive rate (0.71) of all methods tested and better replicability in an independent subject set. From more extensive simulations, we determine which combinations of signal to noise ratios, SNP typing densities, and sample sizes are tractable with which methods to gather the gold dust.

摘要

全基因组关联扫描(GWAS)数据挖掘已发现具有中等效应的“金块”复杂性状基因。但对于许多性状而言,大部分可解释变异可能实际上是多基因的,更像是金粉,其微小的边际效应无法用传统方法检测到。然而,它们的共同效应在推进个性化医疗方面可能相当重要。我们考虑一种新颖的方法来筛选出影响数量(或质量)性状的遗传金粉。在GWAS中,我们随机抽取一把单核苷酸多态性(SNP),在多元线性(或逻辑)回归中对其效应进行建模。利用模型的显著性来获得与每个SNP相关的迭代更新的伪贝叶斯后验概率,在多次随机抽样中重复此过程,直到分布变得稳定。一个逐步的程序会筛选SNP列表以确定最终集合。对1000名随机受试者中5个数量性状基因的基准模拟结果,与使用名义显著性、邦费罗尼校正显著性、错误发现率的边际检验以及序列选择方法进行了对比。在所有测试方法中,随机抽取产生了最佳的灵敏度(0.95)、特异性(0.99)和真阳性率(0.71)组合,并且在独立受试者组中具有更好的可重复性。通过更广泛的模拟,我们确定了信噪比、SNP分型密度和样本量的哪些组合适用于哪些方法来收集金粉。

相似文献

1
Gathering the gold dust: methods for assessing the aggregate impact of small effect genes in genomic scans.收集金粉:基因组扫描中评估小效应基因总体影响的方法
Pac Symp Biocomput. 2008:190-200.
2
Bivariate association analysis for quantitative traits using generalized estimation equation.使用广义估计方程进行数量性状的双变量关联分析。
J Genet Genomics. 2009 Dec;36(12):733-43. doi: 10.1016/S1673-8527(08)60166-6.
3
A genome scan for quantitative trait loci influencing carcass, post-natal growth and reproductive traits in commercial Angus cattle.对影响商业安格斯牛胴体、产后生长和繁殖性状的数量性状基因座进行全基因组扫描。
Anim Genet. 2010 Dec;41(6):597-607. doi: 10.1111/j.1365-2052.2010.02063.x.
4
Inferring gene transcriptional modulatory relations: a genetical genomics approach.推断基因转录调控关系:一种遗传基因组学方法。
Hum Mol Genet. 2005 May 1;14(9):1119-25. doi: 10.1093/hmg/ddi124. Epub 2005 Mar 16.
5
Comparison of association mapping methods in a complex pedigreed population.复杂家系群体中关联作图方法的比较。
Genet Epidemiol. 2010 Jul;34(5):455-62. doi: 10.1002/gepi.20499.
6
Direct analysis of unphased SNP genotype data in population-based association studies via Bayesian partition modelling of haplotypes.在基于人群的关联研究中,通过单倍型的贝叶斯分区建模对未分型的单核苷酸多态性(SNP)基因型数据进行直接分析。
Genet Epidemiol. 2005 Sep;29(2):91-107. doi: 10.1002/gepi.20080.
7
Evaluating the ability of tree-based methods and logistic regression for the detection of SNP-SNP interaction.评估基于树的方法和逻辑回归检测单核苷酸多态性(SNP)-SNP相互作用的能力。
Ann Hum Genet. 2009 May;73(Pt 3):360-9. doi: 10.1111/j.1469-1809.2009.00511.x. Epub 2009 Mar 8.
8
On the power to detect SNP/phenotype association in candidate quantitative trait loci genomic regions: a simulation study.关于在候选数量性状基因座基因组区域中检测单核苷酸多态性/表型关联的能力:一项模拟研究。
Pac Symp Biocomput. 2003:478-89.
9
Comparison between genomic predictions using daughter yield deviation and conventional estimated breeding value as response variables.基于女儿产量偏差和常规估计育种值作为反应变量的基因组预测比较。
J Anim Breed Genet. 2010 Dec;127(6):423-32. doi: 10.1111/j.1439-0388.2010.00878.x.
10
Modeling interactions with known risk loci-a Bayesian model averaging approach.对与已知风险基因座的相互作用进行建模——一种贝叶斯模型平均方法。
Ann Hum Genet. 2011 Jan;75(1):1-9. doi: 10.1111/j.1469-1809.2010.00618.x. Epub 2010 Nov 30.

引用本文的文献

1
SNP eQTL status and eQTL density in the adjacent region of the SNP are associated with its statistical significance in GWA studies.SNP 的 eQTL 状态和 SNP 相邻区域的 eQTL 密度与其在 GWAS 研究中的统计学意义相关。
BMC Genet. 2019 Nov 12;20(1):85. doi: 10.1186/s12863-019-0786-0.
2
A Bayesian Partitioning Model for the Detection of Multilocus Effects in Case-Control Studies.用于病例对照研究中多位点效应检测的贝叶斯划分模型。
Hum Hered. 2015;79(2):69-79. doi: 10.1159/000369858. Epub 2015 Jun 3.
3
BioBin: a bioinformatics tool for automating the binning of rare variants using publicly available biological knowledge.
BioBin:一个生物信息学工具,用于利用公开可用的生物知识自动对稀有变体进行分类。
BMC Med Genomics. 2013;6 Suppl 2(Suppl 2):S6. doi: 10.1186/1755-8794-6-S2-S6. Epub 2013 May 7.
4
Network models of genome-wide association studies uncover the topological centrality of protein interactions in complex diseases.全基因组关联研究的网络模型揭示了蛋白质相互作用在复杂疾病中的拓扑中心性。
J Am Med Inform Assoc. 2013 Jul-Aug;20(4):619-29. doi: 10.1136/amiajnl-2012-001519. Epub 2013 Jan 25.
5
Translating Mendelian and complex inheritance of Alzheimer's disease genes for predicting unique personal genome variants.翻译阿尔茨海默病基因的孟德尔和复杂遗传,以预测独特的个人基因组变异。
J Am Med Inform Assoc. 2012 Mar-Apr;19(2):306-16. doi: 10.1136/amiajnl-2011-000656.
6
Mining gold dust under the genome wide significance level: a two-stage approach to analysis of GWAS.在全基因组显著性水平下挖掘金尘:GWAS 分析的两阶段方法。
Genet Epidemiol. 2011 Feb;35(2):111-8. doi: 10.1002/gepi.20556. Epub 2010 Dec 31.
7
Using biological knowledge to uncover the mystery in the search for epistasis in genome-wide association studies.利用生物学知识揭示全基因组关联研究中寻找上位性的奥秘。
Ann Hum Genet. 2011 Jan;75(1):172-82. doi: 10.1111/j.1469-1809.2010.00630.x.
8
LD-spline: mapping SNPs on genotyping platforms to genomic regions using patterns of linkage disequilibrium.LD-spline:利用连锁不平衡模式将基因分型平台上的 SNPs 映射到基因组区域。
BioData Min. 2009 Dec 3;2(1):7. doi: 10.1186/1756-0381-2-7.
9
Biofilter: a knowledge-integration system for the multi-locus analysis of genome-wide association studies.生物过滤器:一种用于全基因组关联研究多位点分析的知识整合系统。
Pac Symp Biocomput. 2009:368-79.