Genotyping error detection in samples of unrelated individuals without replicate genotyping.

Author information

Liu Nianjun, Zhang Dabao, Zhao Hongyu

Affiliation

Department of Biostatistics, University of Alabama at Birmingham, Birmingham, Ala. 35294, USA.

Publication information

Hum Hered. 2009;67(3):154-62. doi: 10.1159/000181153. Epub 2008 Dec 15.

DOI:10.1159/000181153

PMID:19077433

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC2782542/

Abstract

OBJECTIVE

Identifying genotyping errors is an important issue in genetic research, yet it has been relatively less studied in samples consisting of unrelated individuals. In this article, we consider several models of genotyping errors, which were originally proposed for pedigree data, for unrelated population samples with single nucleotide polymorphism (SNP) genotype data. The mathematical constraints are investigated for detecting genotyping errors without resampling replicates or genotyping relatives.

METHODS

For the various proposed genotyping error models, we unveil the conditions under which the parameters are identifiable. These results are verified through applications to simulated and real SNP data.
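As a rough illustration of why identifiability conditions matter, the sketch below uses a hypothetical per-allele miscall model, which is an assumption made here and not one of the models from the paper. Under Hardy-Weinberg proportions, each allele call is flipped independently with probability `e`. The observed genotype distribution then depends on the true allele frequency `p` and on `e` only through a single quantity `q`, so different `(p, e)` pairs can produce identical data distributions.

```python
import math


def observed_genotype_probs(p, e):
    """Observed (AA, AB, BB) probabilities under a hypothetical miscall model.

    Assumptions (illustrative only, not taken from the paper):
    - true genotypes follow Hardy-Weinberg proportions with allele A frequency p
    - each allele call is flipped to the other allele independently with probability e
    """
    # Probability that a single observed allele reads as A.
    q = p * (1 - e) + (1 - p) * e
    return (q * q, 2 * q * (1 - q), (1 - q) ** 2)


# Two different parameter settings that yield the same observed distribution.
no_error = observed_genotype_probs(0.30, 0.00)
with_error = observed_genotype_probs(0.25, 0.10)
same = all(math.isclose(x, y) for x, y in zip(no_error, with_error))
print(same)
```

In this toy model no amount of unrelated-sample data can separate `p` from `e`, which is the kind of situation that constraints on an error model are meant to rule out.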

RESULTS

We show that, with constraints, two particular models provide both identifiable error rate and allele frequencies of an SNP for unrelated population data. The simulation study shows that these two models present unbiased estimates for the allele frequencies. One of the models also gives an unbiased estimate for the genotyping error rate.

CONCLUSION

While the Hardy-Weinberg equilibrium test can be used to detect genotyping errors, a key advantage of these models is the explicit estimates of genotyping error rates and allele frequencies. This work may help researchers to estimate error rates and to use the estimates in their analysis to increase power and decrease bias, without the extra work of genotyping family members or replicates.
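For reference, a minimal sketch of the Hardy-Weinberg equilibrium test mentioned above, written as a textbook chi-square goodness-of-fit test with one degree of freedom. This is not the authors' code; the function name `hwe_chi_square` and the example counts are assumptions made for illustration. It returns a test statistic and p-value only, and does not estimate an error rate.

```python
import math


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square goodness-of-fit test for Hardy-Weinberg proportions at one SNP.

    Assumes both alleles are observed in the sample, so no expected count is zero.
    Returns (statistic, p_value) using a chi-square distribution with 1 degree of freedom.
    """
    n = n_aa + n_ab + n_bb
    # Estimated frequency of allele A from the genotype counts.
    p = (2 * n_aa + n_ab) / (2 * n)
    expected = (n * p * p, 2 * n * p * (1 - p), n * (1 - p) ** 2)
    observed = (n_aa, n_ab, n_bb)
    stat = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Survival function of chi-square with 1 df: P(Z^2 > stat) = erfc(sqrt(stat / 2)).
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


# Hypothetical counts: one sample in exact HWE, one with a heterozygote deficit.
stat_ok, p_ok = hwe_chi_square(25, 50, 25)
stat_bad, p_bad = hwe_chi_square(40, 20, 40)
```

A small p-value flags departure from Hardy-Weinberg proportions, but departure can also arise from causes other than genotyping error, such as population stratification.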

