• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

应用于遗传分析研讨会17小外显子序列数据的大规模风险预测

Large-scale risk prediction applied to Genetic Analysis Workshop 17 mini-exome sequence data.

作者信息

Li Gengxin, Ferguson John, Zheng Wei, Lee Joon Sang, Zhang Xianghua, Li Lun, Kang Jia, Yan Xiting, Zhao Hongyu

机构信息

Department of Epidemiology and Public Health, Yale University, 60 College Street, New Haven, CT 06520, USA.

出版信息

BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S46. doi: 10.1186/1753-6561-5-S9-S46.

DOI:10.1186/1753-6561-5-S9-S46
PMID:22373389
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3287883/
Abstract

We consider the application of Efron's empirical Bayes classification method to risk prediction in a genome-wide association study using the Genetic Analysis Workshop 17 (GAW17) data. A major advantage of using this method is that the effect size distribution for the set of possible features is empirically estimated and that all subsequent parameter estimation and risk prediction is guided by this distribution. Here, we generalize Efron's method to allow for some of the peculiarities of the GAW17 data. In particular, we introduce two ways to extend Efron's model: a weighted empirical Bayes model and a joint covariance model that allows the model to properly incorporate the annotation information of single-nucleotide polymorphisms (SNPs). In the course of our analysis, we examine several aspects of the possible simulation model, including the identity of the most important genes, the differing effects of synonymous and nonsynonymous SNPs, and the relative roles of covariates and genes in conferring disease risk. Finally, we compare the three methods to each other and to other classifiers (random forest and neural network).

摘要

我们考虑将埃弗龙的经验贝叶斯分类方法应用于利用遗传分析研讨会17(GAW17)数据进行的全基因组关联研究中的风险预测。使用该方法的一个主要优点是,对一组可能特征的效应大小分布进行了经验估计,并且所有后续的参数估计和风险预测都由该分布指导。在这里,我们对埃弗龙的方法进行了推广,以适应GAW17数据的一些特点。特别是,我们引入了两种扩展埃弗龙模型的方法:加权经验贝叶斯模型和联合协方差模型,该模型允许模型正确纳入单核苷酸多态性(SNP)的注释信息。在我们的分析过程中,我们研究了可能的模拟模型的几个方面,包括最重要基因的身份、同义SNP和非同义SNP的不同效应,以及协变量和基因在赋予疾病风险方面的相对作用。最后,我们将这三种方法相互比较,并与其他分类器(随机森林和神经网络)进行比较。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20b1/3287883/5e3d26b7b6f1/1753-6561-5-S9-S46-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20b1/3287883/518bd9ad2c7d/1753-6561-5-S9-S46-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20b1/3287883/5e3d26b7b6f1/1753-6561-5-S9-S46-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20b1/3287883/518bd9ad2c7d/1753-6561-5-S9-S46-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/20b1/3287883/5e3d26b7b6f1/1753-6561-5-S9-S46-2.jpg

相似文献

1
Large-scale risk prediction applied to Genetic Analysis Workshop 17 mini-exome sequence data.应用于遗传分析研讨会17小外显子序列数据的大规模风险预测
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S46. doi: 10.1186/1753-6561-5-S9-S46.
2
The application of Efron's bootstrap methods in validating feature classification using artificial neural networks for the analysis of mammographic masses.
Conf Proc IEEE Eng Med Biol Soc. 2004;2004:1553-6. doi: 10.1109/IEMBS.2004.1403474.
3
SNPs and other features as they predispose to complex disease: genome-wide predictive analysis of a quantitative phenotype for hypertension.单核苷酸多态性和其他特征,因为它们使人们易患复杂疾病:高血压定量表型的全基因组预测分析。
PLoS One. 2011;6(11):e27891. doi: 10.1371/journal.pone.0027891. Epub 2011 Nov 30.
4
An Empirical Bayes risk prediction model using multiple traits for sequencing data.一种使用多性状的经验贝叶斯风险预测模型用于测序数据。
Stat Appl Genet Mol Biol. 2015 Dec;14(6):551-73. doi: 10.1515/sagmb-2015-0060.
5
Collapsing-based and kernel-based single-gene analyses applied to Genetic Analysis Workshop 17 mini-exome data.应用于遗传分析研讨会17小外显子数据的基于塌缩法和基于核函数法的单基因分析。
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S117. doi: 10.1186/1753-6561-5-S9-S117. eCollection 2011.
6
An Empirical Bayes Mixture Model for Effect Size Distributions in Genome-Wide Association Studies.全基因组关联研究中效应大小分布的经验贝叶斯混合模型。
PLoS Genet. 2015 Dec 29;11(12):e1005717. doi: 10.1371/journal.pgen.1005717. eCollection 2015 Dec.
7
Sequentially adjusted randomization to force balance in controlled trials with unknown prevalence of covariates: application to alcoholism research.在协变量患病率未知的对照试验中进行顺序调整随机化以实现强制平衡:在酒精中毒研究中的应用。
Alcohol Alcohol. 2005 Mar-Apr;40(2):124-31. doi: 10.1093/alcalc/agh131. Epub 2005 Jan 10.
8
Pairwise shared genomic segment analysis in high-risk pedigrees: application to Genetic Analysis Workshop 17 exome-sequencing SNP data.高危家系中的成对共享基因组片段分析:应用于遗传分析研讨会17外显子组测序SNP数据
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S9. doi: 10.1186/1753-6561-5-S9-S9.
9
Introduction to genetic analysis workshop 17 summaries.遗传分析研讨会 17 总结简介。
Genet Epidemiol. 2011;35 Suppl 1:S1-4. doi: 10.1002/gepi.20641.
10
A weighted empirical Bayes risk prediction model using multiple traits.
Stat Appl Genet Mol Biol. 2020 Sep 4;19(3):/j/sagmb.2020.19.issue-3/sagmb-2019-0056/sagmb-2019-0056.xml. doi: 10.1515/sagmb-2019-0056.

引用本文的文献

1
New Empirical Bayes Models to Jointly Analyze Multiple RNA-Sequencing Data in a Hypophosphatasia Disease Study.用于低磷酸酯酶症疾病研究中联合分析多个RNA测序数据的新经验贝叶斯模型。
Genes (Basel). 2024 Mar 26;15(4):407. doi: 10.3390/genes15040407.
2
Inflated type I error rates when using aggregation methods to analyze rare variants in the 1000 Genomes Project exon sequencing data in unrelated individuals: summary results from Group 7 at Genetic Analysis Workshop 17.在分析无关个体的 1000 基因组项目外显子测序数据中的罕见变异时,使用聚合方法会导致膨胀的Ⅰ型错误率:来自第 17 届遗传分析研讨会第 7 组的总结结果。
Genet Epidemiol. 2011;35 Suppl 1(Suppl 1):S56-60. doi: 10.1002/gepi.20650.

本文引用的文献

1
Genetic Analysis Workshop 17 mini-exome simulation.遗传分析研讨会17小型外显子模拟
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S2. doi: 10.1186/1753-6561-5-S9-S2.
2
An application of Random Forests to a genome-wide association dataset: methodological considerations & new findings.随机森林在全基因组关联数据集上的应用:方法学考虑与新发现。
BMC Genet. 2010 Jun 14;11:49. doi: 10.1186/1471-2156-11-49.
3
Empirical Bayes Estimates for Large-Scale Prediction Problems.大规模预测问题的经验贝叶斯估计
J Am Stat Assoc. 2009 Sep 1;104(487):1015-1028. doi: 10.1198/jasa.2009.tm08523.
4
A groupwise association test for rare mutations using a weighted sum statistic.使用加权和统计量对罕见突变进行分组关联测试。
PLoS Genet. 2009 Feb;5(2):e1000384. doi: 10.1371/journal.pgen.1000384. Epub 2009 Feb 13.
5
Bias-reduced estimators and confidence intervals for odds ratios in genome-wide association studies.全基因组关联研究中比值比的偏差校正估计量和置信区间
Biostatistics. 2008 Oct;9(4):621-34. doi: 10.1093/biostatistics/kxn001. Epub 2008 Feb 28.
6
Gene selection and classification of microarray data using random forest.使用随机森林进行微阵列数据的基因选择与分类
BMC Bioinformatics. 2006 Jan 6;7:3. doi: 10.1186/1471-2105-7-3.