Trinity Genome Sequencing Laboratory, Neuropsychiatric Genetics Research Group, Department of Psychiatry, Institute of Molecular Medicine, Trinity College Dublin, Ireland.
DNA Res. 2011 Feb;18(1):31-8. doi: 10.1093/dnares/dsq029. Epub 2010 Dec 16.
Screening large numbers of target regions in multiple DNA samples for sequence variation is an important application of next-generation sequencing but an efficient method to enrich the samples in parallel has yet to be reported. We describe an advanced method that combines DNA samples using indexes or barcodes prior to target enrichment to facilitate this type of experiment. Sequencing libraries for multiple individual DNA samples, each incorporating a unique 6-bp index, are combined in equal quantities, enriched using a single in-solution target enrichment assay and sequenced in a single reaction. Sequence reads are parsed based on the index, allowing sequence analysis of individual samples. We show that the use of indexed samples does not impact on the efficiency of the enrichment reaction. For three- and nine-indexed HapMap DNA samples, the method was found to be highly accurate for SNP identification. Even with sequence coverage as low as 8x, 99% of sequence SNP calls were concordant with known genotypes. Within a single experiment, this method can sequence the exonic regions of hundreds of genes in tens of samples for sequence and structural variation using as little as 1 μg of input DNA per sample.
对多个 DNA 样本中的大量目标区域进行序列变异筛查是下一代测序的一个重要应用,但仍缺乏有效的并行样本富集方法。我们描述了一种先进的方法,即在目标富集之前使用索引或条形码对 DNA 样本进行组合,以促进此类实验。将多个个体 DNA 样本的测序文库合并,每个文库都包含独特的 6 碱基索引,以相等的量进行组合,使用单一的溶液内目标富集测定法进行富集,并在单个反应中进行测序。根据索引解析序列读取,从而可以对各个样本进行序列分析。我们表明,使用带索引的样本不会影响富集反应的效率。对于三索引和九索引的 HapMap DNA 样本,该方法在 SNP 鉴定方面具有高度的准确性。即使测序覆盖率低至 8x,99%的序列 SNP 调用与已知基因型一致。在单个实验中,该方法可以使用每个样本少至 1μg 的输入 DNA 对数十个样本的数百个基因的外显子区域进行测序,以检测序列和结构变异。