Roux P F, Marthey S, Djari A, Moroldo M, Esquerré D, Estellé J, Klopp C, Lagarrigue S, Demeure O
INRA, UMR1348 PEGASE, Saint-Gilles, F-35590, France; Agrocampus Ouest, UMR1348 PEGASE, Rennes, F-35000, France; Université Européenne de Bretagne, Rennes, France.
Anim Genet. 2015 Feb;46(1):82-6. doi: 10.1111/age.12248. Epub 2014 Dec 16.
The number of polymorphisms identified with next-generation sequencing approaches depends directly on the sequencing depth and therefore on the experimental cost. Although higher levels of depth ensure more sensitive and more specific SNP calls, economic constraints limit the increase of depth for whole-genome resequencing (WGS). For this reason, capture resequencing is used for studies focusing on only some specific regions of the genome. However, several biases in capture resequencing are known to have a negative impact on the sensitivity of SNP detection. Within this framework, the aim of this study was to compare the accuracy of WGS and capture resequencing on SNP detection and genotype calling, which differ in terms of both sequencing depth and biases. Indeed, we have evaluated the SNP calling and genotyping accuracy in a WGS dataset (13X) and in a capture resequencing dataset (87X) performed on 11 individuals. The percentage of SNPs not identified due to a sevenfold sequencing depth decrease was estimated at 7.8% using a down-sampling procedure on the capture sequencing dataset. A comparison of the 87X capture sequencing dataset with the WGS dataset revealed that capture-related biases were leading with the loss of 5.2% of SNPs detected with WGS. Nevertheless, when considering the SNPs detected by both approaches, capture sequencing appears to achieve far better SNP genotyping, with about 4.4% of the WGS genotypes that can be considered as erroneous and even 10% focusing on heterozygous genotypes. In conclusion, WGS and capture deep sequencing can be considered equivalent strategies for SNP detection, as the rate of SNPs not identified because of a low sequencing depth in the former is quite similar to SNPs missed because of method biases of the latter. On the other hand, capture deep sequencing clearly appears more adapted for studies requiring great accuracy in genotyping.
通过下一代测序方法鉴定出的多态性数量直接取决于测序深度,因此也取决于实验成本。尽管更高的深度能确保更灵敏、更特异的单核苷酸多态性(SNP)检测,但经济限制阻碍了全基因组重测序(WGS)深度的增加。因此,捕获重测序用于仅聚焦于基因组某些特定区域的研究。然而,已知捕获重测序中的几种偏差会对SNP检测的灵敏度产生负面影响。在此框架下,本研究的目的是比较WGS和捕获重测序在SNP检测和基因分型方面的准确性,二者在测序深度和偏差方面均存在差异。实际上,我们评估了对11名个体进行的WGS数据集(13X)和捕获重测序数据集(87X)中的SNP检测和基因分型准确性。通过对捕获测序数据集进行下采样程序,估计因测序深度降低七倍而未鉴定出的SNP百分比为7.8%。将87X捕获测序数据集与WGS数据集进行比较发现,与捕获相关的偏差导致WGS检测到的SNP中有5.2%丢失。然而,当考虑两种方法都检测到的SNP时,捕获测序似乎能实现更好的SNP基因分型,WGS基因分型中约4.4%可被视为错误,甚至在杂合子基因分型方面高达10%。总之,WGS和捕获深度测序在SNP检测方面可被视为等效策略,因为前者因低测序深度未鉴定出的SNP比例与后者因方法偏差遗漏的SNP比例相当。另一方面,捕获深度测序显然更适用于对基因分型准确性要求较高的研究。