Suppr超能文献

测试超深度测序共扩增基因家族的基因分型策略:雀形目鸟类中的 MHC Ⅰ类。

Testing genotyping strategies for ultra-deep sequencing of a co-amplifying gene family: MHC class I in a passerine bird.

机构信息

Institute of Nature Conservation, Polish Academy of Sciences, Al. Mickiewicza 33, 31-120, Kraków, Poland.

Evolutionary Biology Group, Faculty of Biology, Adam Mickiewicz University, ul. Umultowska 89, 61-614, Poznań, Poland.

出版信息

Mol Ecol Resour. 2017 Jul;17(4):642-655. doi: 10.1111/1755-0998.12612. Epub 2016 Nov 18.

Abstract

Characterization of highly duplicated genes, such as genes of the major histocompatibility complex (MHC), where multiple loci often co-amplify, has until recently been hindered by insufficient read depths per amplicon. Here, we used ultra-deep Illumina sequencing to resolve genotypes at exon 3 of MHC class I genes in the sedge warbler (Acrocephalus schoenobaenus). We sequenced 24 individuals in two replicates and used this data, as well as a simulated data set, to test the effect of amplicon coverage (range: 500-20 000 reads per amplicon) on the repeatability of genotyping using four different genotyping approaches. A third replicate employed unique barcoding to assess the extent of tag jumping, that is swapping of individual tag identifiers, which may confound genotyping. The reliability of MHC genotyping increased with coverage and approached or exceeded 90% within-method repeatability of allele calling at coverages of >5000 reads per amplicon. We found generally high agreement between genotyping methods, especially at high coverages. High reliability of the tested genotyping approaches was further supported by our analysis of the simulated data set, although the genotyping approach relying primarily on replication of variants in independent amplicons proved sensitive to repeatable errors. According to the most repeatable genotyping method, the number of co-amplifying variants per individual ranged from 19 to 42. Tag jumping was detectable, but at such low frequencies that it did not affect the reliability of genotyping. We thus demonstrate that gene families with many co-amplifying genes can be reliably genotyped using HTS, provided that there is sufficient per amplicon coverage.

摘要

高度重复基因(如主要组织相容性复合体(MHC)基因)的特征,其中多个基因座经常共同扩增,直到最近一直受到每个扩增子的读取深度不足的阻碍。在这里,我们使用超深度 Illumina 测序来解析草鹀(Acrocephalus schoenobaenus)MHC 类 I 基因外显子 3 的基因型。我们在两个重复中对 24 个个体进行了测序,并使用这些数据以及模拟数据集,测试了扩增子覆盖度(范围:每个扩增子 500-20000 个读长)对使用四种不同基因分型方法进行基因分型的可重复性的影响。第三个重复采用独特的条形码来评估标签跳跃的程度,即个体标签标识符的交换,这可能会混淆基因分型。随着覆盖度的增加,MHC 基因分型的可靠性增加,在每个扩增子覆盖度 >5000 个读长时,等位基因调用的方法内可重复性接近或超过 90%。我们发现基因分型方法之间的一致性通常很高,尤其是在高覆盖率下。在模拟数据集的分析中,进一步支持了测试基因分型方法的高可靠性,尽管主要依赖于独立扩增子中变体重复的基因分型方法对可重复的错误敏感。根据最具可重复性的基因分型方法,每个个体共扩增的变异数从 19 到 42 不等。标签跳跃是可检测的,但频率很低,不会影响基因分型的可靠性。因此,我们证明了只要每个扩增子的覆盖度足够,使用 HTS 就可以可靠地对具有许多共扩增基因的基因家族进行基因分型。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验