Suppr超能文献

基于家系的关联分析方法及其在不确定基因型数据中的应用

Family-based association tests using genotype data with uncertainty.

机构信息

Department of Statistics, University of California, Irvine, CA 92697, USA.

出版信息

Biostatistics. 2012 Apr;13(2):228-40. doi: 10.1093/biostatistics/kxr045. Epub 2011 Dec 8.

Abstract

Family-based association studies have been widely used to identify association between diseases and genetic markers. It is known that genotyping uncertainty is inherent in both directly genotyped or sequenced DNA variations and imputed data in silico. The uncertainty can lead to genotyping errors and missingness and can negatively impact the power and Type I error rates of family-based association studies even if the uncertainty is independent of disease status. Compared with studies using unrelated subjects, there are very few methods that address the issue of genotyping uncertainty for family-based designs. The limited attempts have mostly been made to correct the bias caused by genotyping errors. Without properly addressing the issue, the conventional testing strategy, i.e. family-based association tests using called genotypes, can yield invalid statistical inferences. Here, we propose a new test to address the challenges in analyzing case-parents data by using calls with high accuracy and modeling genotype-specific call rates. Our simulations show that compared with the conventional strategy and an alternative test, our new test has an improved performance in the presence of substantial uncertainty and has a similar performance when the uncertainty level is low. We also demonstrate the advantages of our new method by applying it to imputed markers from a genome-wide case-parents association study.

摘要

基于家系的关联研究已被广泛用于识别疾病与遗传标记之间的关联。众所周知,直接对 DNA 变异进行基因分型或测序以及在计算机上对数据进行推测都会存在基因分型不确定性。这种不确定性会导致基因分型错误和缺失,并可能对基于家系的关联研究的效能和 I 型错误率产生负面影响,即使这种不确定性与疾病状态无关。与使用无关个体的研究相比,针对基于家系设计的基因分型不确定性问题的方法非常少。已有的尝试大多集中在纠正基因分型错误引起的偏差上。如果不妥善解决这个问题,传统的测试策略,即使用已确定基因型的基于家系的关联测试,可能会产生无效的统计推断。在这里,我们提出了一种新的测试方法,用于通过使用高精度的调用和建模基因型特异性调用率来解决分析病例-父母数据的挑战。我们的模拟表明,与传统策略和另一种测试相比,在存在大量不确定性的情况下,我们的新测试具有更好的性能,而在不确定性水平较低时,其性能则相似。我们还通过应用于全基因组病例-父母关联研究中的推测标记,展示了我们新方法的优势。

相似文献

1
Family-based association tests using genotype data with uncertainty.
Biostatistics. 2012 Apr;13(2):228-40. doi: 10.1093/biostatistics/kxr045. Epub 2011 Dec 8.
2
Incorporating parental information into family-based association tests.
Biostatistics. 2013 Jul;14(3):556-72. doi: 10.1093/biostatistics/kxs048. Epub 2012 Dec 23.
4
Testing for association with a case-parents design in the presence of genotyping errors.
Genet Epidemiol. 2004 Feb;26(2):142-54. doi: 10.1002/gepi.10297.
5
Non-random error in genotype calling procedures: implications for family-based and case-control genome-wide association studies.
Am J Med Genet B Neuropsychiatr Genet. 2008 Dec 5;147B(8):1379-86. doi: 10.1002/ajmg.b.30836.
9
Imputation across genotyping arrays for genome-wide association studies: assessment of bias and a correction strategy.
Hum Genet. 2013 May;132(5):509-22. doi: 10.1007/s00439-013-1266-7. Epub 2013 Jan 22.
10
A generalized Kruskal-Wallis test incorporating group uncertainty with application to genetic association studies.
Biometrics. 2013 Jun;69(2):427-35. doi: 10.1111/biom.12006. Epub 2013 Feb 26.

引用本文的文献

1
Hardy-Weinberg Equilibrium in Meta-Analysis Studies and Large-Scale Genomic Sequencing Era.
Asian Pac J Cancer Prev. 2024 Jul 1;25(7):2229-2235. doi: 10.31557/APJCP.2024.25.7.2229.
2
Re-evaluating data quality of dog mitochondrial, Y chromosomal, and autosomal SNPs genotyped by SNP array.
Zool Res. 2016 Nov 18;37(6):356-360. doi: 10.13918/j.issn.2095-8137.2016.6.356.
3
A Joint Location-Scale Test Improves Power to Detect Associated SNPs, Gene Sets, and Pathways.
Am J Hum Genet. 2015 Jul 2;97(1):125-38. doi: 10.1016/j.ajhg.2015.05.015.

本文引用的文献

1
Low-coverage sequencing: implications for design of complex trait association studies.
Genome Res. 2011 Jun;21(6):940-51. doi: 10.1101/gr.117259.110. Epub 2011 Apr 1.
2
MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes.
Genet Epidemiol. 2010 Dec;34(8):816-34. doi: 10.1002/gepi.20533.
3
Fast and robust association tests for untyped SNPs in case-control studies.
Hum Hered. 2010;70(3):167-76. doi: 10.1159/000308456. Epub 2010 Jul 30.
4
Methods for testing association between uncertain genotypes and quantitative traits.
Biostatistics. 2011 Jan;12(1):1-17. doi: 10.1093/biostatistics/kxq039. Epub 2010 Jun 11.
5
Genotype imputation for genome-wide association studies.
Nat Rev Genet. 2010 Jul;11(7):499-511. doi: 10.1038/nrg2796.
8
Model-based quality assessment and base-calling for second-generation sequencing data.
Biometrics. 2010 Sep;66(3):665-74. doi: 10.1111/j.1541-0420.2009.01353.x.
9
Quantifying uncertainty in genotype calls.
Bioinformatics. 2010 Jan 15;26(2):242-9. doi: 10.1093/bioinformatics/btp624. Epub 2009 Nov 11.
10
Missing call bias in high-throughput genotyping.
BMC Genomics. 2009 Mar 13;10:106. doi: 10.1186/1471-2164-10-106.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验