Evaluating the concordance between sequencing, imputation and microarray genotype calls in the GAW18 data.

Author information

Rogers Ally, Beck Andrew, Tintle Nathan L

Affiliations

Department of Mathematics, Statistics and Computer Science, Dordt College, Sioux Center, IA 51250, USA.

Department of Mathematics, Loyola University Chicago, Chicago, IL 60660, USA.

Publication information

BMC Proc. 2014 Jun 17;8(Suppl 1 Genetic Analysis Workshop 18):S22. doi: 10.1186/1753-6561-8-S1-S22. eCollection 2014.

Abstract

Genotype errors are well known to increase type I error rates and/or decrease power in tests of genotype-phenotype association, depending on whether the genotype error mechanism is associated with the phenotype. These relationships hold for both single-marker and multimarker tests of genotype-phenotype association. To assess the potential for genotype errors in the Genetic Analysis Workshop 18 (GAW18) data, where no gold standard genotype calls are available, we explored concordance rates between sequencing, imputation, and microarray genotype calls. Our analysis shows that missing data rates for sequenced individuals are high and that there is a modest amount of called genotype discordance between the two platforms, with discordance most common for single-nucleotide polymorphisms (SNPs) with lower minor allele frequency (MAF). We observed some evidence that discordance rates differed between phenotypes, and we identified a number of cases where different technologies identified different bases at the variant site. Type I errors and power loss are possible in downstream analyses of GAW18 data as a result of missing genotypes and errors in called genotypes.
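To make the quantities in the abstract concrete, the following is a minimal sketch of how a per-SNP missing data rate and discordance rate between two sets of genotype calls might be computed. It is not the authors' pipeline: the function name `concordance_summary`, the 0/1/2 minor-allele-count coding, the use of `None` for a missing call, and the toy data are all assumptions made for illustration.

```python
from typing import Dict, Optional, Sequence


def concordance_summary(
    calls_a: Sequence[Optional[int]],
    calls_b: Sequence[Optional[int]],
) -> Dict[str, Optional[float]]:
    """Compare two genotype call vectors for the same individuals at one SNP.

    Assumed coding (hypothetical): 0, 1, 2 = minor-allele count, None = missing.
    """
    if len(calls_a) != len(calls_b) or len(calls_a) == 0:
        raise ValueError("call vectors must be non-empty and of equal length")

    n = len(calls_a)
    # Only individuals called by both sources can be concordant or discordant.
    both = [(a, b) for a, b in zip(calls_a, calls_b)
            if a is not None and b is not None]
    discordant = sum(1 for a, b in both if a != b)

    return {
        "missing_rate_a": sum(a is None for a in calls_a) / n,
        "missing_rate_b": sum(b is None for b in calls_b) / n,
        "n_compared": len(both),
        # Undefined when no individual has a call from both sources.
        "discordance_rate": discordant / len(both) if both else None,
    }


# Toy data (invented for illustration, not from GAW18).
sequencing = [0, 1, None, 2, 0, None, 1, 0]
microarray = [0, 1, 1, 2, 1, 0, 1, 0]
summary = concordance_summary(sequencing, microarray)
print(summary)
```

Keeping the missing data rate separate from the discordance rate mirrors the distinction the abstract draws between missing genotypes and errors in called genotypes: a SNP can have few discordant calls and still lose power because many individuals have no call at all.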
