• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于病例对照关联研究的新期望最大化统计检验,该研究考虑通过高通量测序获得的罕见变异。

A new expectation-maximization statistical test for case-control association studies considering rare variants obtained by high-throughput sequencing.

作者信息

Gordon Derek, Finch Stephen J, De La Vega Francisco M

机构信息

Department of Genetics, Rutgers University, Piscataway, N.J., USA.

出版信息

Hum Hered. 2011;71(2):113-25. doi: 10.1159/000325590. Epub 2011 Jul 6.

DOI:10.1159/000325590
PMID:21734402
Abstract

Genome-wide association studies (GWAS) have been successful in identifying common genetic variation reproducibly associated with disease. However, most associated variants confer very small risk and after meta-analysis of large cohorts a large fraction of expected heritability still remains unexplained. A possible explanation is that rare variants currently undetected by GWAS with SNP arrays could contribute a large fraction of risk when present in cases. This concept has spurred great interest in exploring the role of rare variants in disease. As the cost of sequencing continue to plummet, it is becoming feasible to directly sequence case-control samples for testing disease association including rare variants. We have developed a test statistic that allows for association testing among cases and controls using data directly from sequencing reads. In addition, our method allows for random errors in reads. We determine the probability of a true genotype call based on the observed base pair reads using the expectation-maximization algorithm. We apply the SumStat procedure to obtain a single statistic for a group of multiple rare variant loci. We document the validity of our method through simulations. Our results suggest that our statistic maintains the correct type I error rate, even in the presence of differential misclassification for sequence reads, and that it has good power under a number of scenarios. Finally, our SumStat results show power at least as good as the maximum single locus results.

摘要

全基因组关联研究(GWAS)已成功鉴定出与疾病可重复相关的常见基因变异。然而,大多数相关变异带来的风险非常小,在对大型队列进行荟萃分析后,很大一部分预期遗传力仍无法解释。一种可能的解释是,目前GWAS使用SNP阵列未检测到的罕见变异在病例中出现时可能会导致很大一部分风险。这一概念激发了人们对探索罕见变异在疾病中的作用的极大兴趣。随着测序成本持续大幅下降,直接对病例对照样本进行测序以检测包括罕见变异在内的疾病关联变得可行。我们开发了一种检验统计量,可使用直接来自测序读数的数据在病例和对照之间进行关联测试。此外,我们的方法考虑了读数中的随机误差。我们使用期望最大化算法根据观察到的碱基对读数确定真实基因型判定的概率。我们应用SumStat程序为一组多个罕见变异位点获得单个统计量。我们通过模拟证明了我们方法的有效性。我们的结果表明,即使在序列读数存在差异错误分类的情况下,我们的统计量仍能保持正确的I型错误率,并且在多种情况下具有良好的功效。最后,我们的SumStat结果显示其功效至少与最大单一位点结果一样好。

相似文献

1
A new expectation-maximization statistical test for case-control association studies considering rare variants obtained by high-throughput sequencing.一种用于病例对照关联研究的新期望最大化统计检验,该研究考虑通过高通量测序获得的罕见变异。
Hum Hered. 2011;71(2):113-25. doi: 10.1159/000325590. Epub 2011 Jul 6.
2
Linear trend tests for case-control genetic association that incorporate random phenotype and genotype misclassification error.纳入随机表型和基因型错误分类误差的病例对照基因关联线性趋势检验。
Genet Epidemiol. 2007 Dec;31(8):853-70. doi: 10.1002/gepi.20246.
3
A unified approach for allele frequency estimation, SNP detection and association studies based on pooled sequencing data using EM algorithms.基于 EM 算法的基于测序数据的等位基因频率估计、SNP 检测和关联研究的统一方法。
BMC Genomics. 2013;14 Suppl 1(Suppl 1):S1. doi: 10.1186/1471-2164-14-S1-S1. Epub 2013 Jan 21.
4
Using tree-based recursive partitioning methods to group haplotypes for increased power in association studies.使用基于树的递归划分方法对单倍型进行分组,以提高关联研究的效能。
Ann Hum Genet. 2005 Sep;69(Pt 5):577-89. doi: 10.1111/j.1529-8817.2005.00193.x.
5
Identification of causal sequence variants of disease in the next generation sequencing era.下一代测序时代疾病因果序列变异的鉴定
Methods Mol Biol. 2011;700:37-46. doi: 10.1007/978-1-61737-954-3_3.
6
Large-scale detection of rare variants via pooled multiplexed next-generation sequencing: towards next-generation Ecotilling.通过汇集式多重新一代测序进行大规模稀有变异体检测:迈向新一代生态基因分型。
Plant J. 2011 Aug;67(4):736-45. doi: 10.1111/j.1365-313X.2011.04627.x. Epub 2011 Jul 11.
7
Hidden Markov models for controlling false discovery rate in genome-wide association analysis.用于全基因组关联分析中控制错误发现率的隐马尔可夫模型
Methods Mol Biol. 2012;802:337-44. doi: 10.1007/978-1-61779-400-1_22.
8
Quantifying bias due to allele misclassification in case-control studies of haplotypes.在单倍型病例对照研究中,对由于等位基因错误分类导致的偏倚进行量化。
Genet Epidemiol. 2006 Nov;30(7):590-601. doi: 10.1002/gepi.20170.
9
Evaluating rare variants under two-stage design.两阶段设计下稀有变异的评估。
J Hum Genet. 2012 Jun;57(6):352-7. doi: 10.1038/jhg.2012.33. Epub 2012 May 10.
10
Comparison of six statistics of genetic association regarding their ability to discriminate between causal variants and genetically linked markers.六种基因关联统计方法在区分因果变异和基因连锁标记能力方面的比较。
Hum Hered. 2011;72(2):142-52. doi: 10.1159/000332006. Epub 2011 Oct 19.

引用本文的文献

1
A powerful and adaptive association test for rare variants.一种针对罕见变异的强大且自适应的关联测试。
Genetics. 2014 Aug;197(4):1081-95. doi: 10.1534/genetics.114.165035. Epub 2014 May 15.
2
Assessing the impact of differential genotyping errors on rare variant tests of association.评估差异基因型错误对罕见变异关联检验的影响。
PLoS One. 2013;8(3):e56626. doi: 10.1371/journal.pone.0056626. Epub 2013 Mar 5.
3
Assessing the impact of non-differential genotyping errors on rare variant tests of association.评估非差异性基因分型错误对罕见变异关联测试的影响。
Hum Hered. 2011;72(3):153-60. doi: 10.1159/000332222. Epub 2011 Oct 15.