Suppr超能文献

通过家系分析估计参考基因型和非参考基因型的不同错误率。

Distinct error rates for reference and nonreference genotypes estimated by pedigree analysis.

机构信息

Department of Biology, Indiana University, Bloomington, IN 47405, USA.

Khoury College of Computer Sciences, Northeastern University, Boston, MA 02115, USA.

出版信息

Genetics. 2021 Mar 3;217(1):1-10. doi: 10.1093/genetics/iyaa014.

Abstract

Errors in genotype calling can have perverse effects on genetic analyses, confounding association studies, and obscuring rare variants. Analyses now routinely incorporate error rates to control for spurious findings. However, reliable estimates of the error rate can be difficult to obtain because of their variance between studies. Most studies also report only a single estimate of the error rate even though genotypes can be miscalled in more than one way. Here, we report a method for estimating the rates at which different types of genotyping errors occur at biallelic loci using pedigree information. Our method identifies potential genotyping errors by exploiting instances where the haplotypic phase has not been faithfully transmitted. The expected frequency of inconsistent phase depends on the combination of genotypes in a pedigree and the probability of miscalling each genotype. We develop a model that uses the differences in these frequencies to estimate rates for different types of genotype error. Simulations show that our method accurately estimates these error rates in a variety of scenarios. We apply this method to a dataset from the whole-genome sequencing of owl monkeys (Aotus nancymaae) in three-generation pedigrees. We find significant differences between estimates for different types of genotyping error, with the most common being homozygous reference sites miscalled as heterozygous and vice versa. The approach we describe is applicable to any set of genotypes where haplotypic phase can reliably be called and should prove useful in helping to control for false discoveries.

摘要

基因型调用错误会对遗传分析产生反常影响,干扰关联研究,并掩盖罕见变异。现在的分析通常会纳入错误率来控制虚假发现。然而,由于研究之间存在差异,可靠估计错误率可能很困难。大多数研究即使基因型可能以不止一种方式被误报,也只报告单一的错误率估计值。在这里,我们报告了一种使用系谱信息估计双等位基因座不同类型基因分型错误发生率的方法。我们的方法通过利用未忠实传递单倍型相位的实例来识别潜在的基因分型错误。不一致相位的预期频率取决于系谱中基因型的组合以及每种基因型误报的概率。我们开发了一种模型,该模型利用这些频率的差异来估计不同类型基因型错误的速率。模拟表明,我们的方法在各种情况下都能准确估计这些错误率。我们将此方法应用于三代系谱中全基因组测序的猫头鹰猴(Aotus nancymaae)数据集。我们发现不同类型基因分型错误的估计值存在显著差异,最常见的是纯合参考位点误报为杂合子,反之亦然。我们描述的方法适用于任何可以可靠调用单倍型相位的基因型集合,应该有助于控制假发现。

相似文献

2
Genotype phasing in pedigrees using whole-genome sequence data.利用全基因组序列数据对家系进行基因型相位分析。
Eur J Hum Genet. 2020 Jun;28(6):790-803. doi: 10.1038/s41431-020-0574-3. Epub 2020 Jan 29.
3
Detection of Mendelian consistent genotyping errors in pedigrees.家系中孟德尔一致基因分型错误的检测。
Genet Epidemiol. 2014 May;38(4):291-9. doi: 10.1002/gepi.21806. Epub 2014 Apr 9.
10
Estimating and accounting for genotyping errors in RAD-seq experiments.估算和核算 RAD-seq 实验中的基因型错误。
Mol Ecol Resour. 2020 Jul;20(4):856-870. doi: 10.1111/1755-0998.13153. Epub 2020 Apr 6.

引用本文的文献

3
PidTools: Algorithm and web tools for crop pedigree identification analysis.PidTools:用于作物系谱鉴定分析的算法和网络工具。
Comput Struct Biotechnol J. 2024 Jul 5;23:2883-2891. doi: 10.1016/j.csbj.2024.07.004. eCollection 2024 Dec.

本文引用的文献

6
Reproductive Longevity Predicts Mutation Rates in Primates.生殖寿命可预测灵长类动物的突变率。
Curr Biol. 2018 Oct 8;28(19):3193-3197.e5. doi: 10.1016/j.cub.2018.08.050. Epub 2018 Sep 27.
10
The rate of meiotic gene conversion varies by sex and age.减数分裂基因转换率因性别和年龄而异。
Nat Genet. 2016 Nov;48(11):1377-1384. doi: 10.1038/ng.3669. Epub 2016 Sep 19.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验