Division of Life Science and Applied Genomics Centre, Hong Kong University of Science and Technology, 1 University Road, Clear Water Bay, Kowloon, Hong Kong, China.
BMC Genomics. 2011 Nov 17;12:564. doi: 10.1186/1471-2164-12-564.
To complement next-generation sequencing technologies, there is a pressing need for efficient pre-sequencing capture methods with reduced costs and DNA requirement. The Alu family of short interspersed nucleotide elements is the most abundant type of transposable elements in the human genome and a recognized source of genome instability. With over one million Alu elements distributed throughout the genome, they are well positioned to facilitate genome-wide sequence amplification and capture of regions likely to harbor genetic variation hotspots of biological relevance.
Here we report on the use of inter-Alu PCR with an enhanced range of amplicons in conjunction with next-generation sequencing to generate an Alu-anchored scan, or 'AluScan', of DNA sequences between Alu transposons, where Alu consensus sequence-based 'H-type' PCR primers that elongate outward from the head of an Alu element are combined with 'T-type' primers elongating from the poly-A containing tail to achieve huge amplicon range. To illustrate the method, glioma DNA was compared with white blood cell control DNA of the same patient by means of AluScan. The over 10 Mb sequences obtained, derived from more than 8,000 genes spread over all the chromosomes, revealed a highly reproducible capture of genomic sequences enriched in genic sequences and cancer candidate gene regions. Requiring only sub-micrograms of sample DNA, the power of AluScan as a discovery tool for genetic variations was demonstrated by the identification of 357 instances of loss of heterozygosity, 341 somatic indels, 274 somatic SNVs, and seven potential somatic SNV hotspots between control and glioma DNA.
AluScan, implemented with just a small number of H-type and T-type inter-Alu PCR primers, provides an effective capture of a diversity of genome-wide sequences for analysis. The method, by enabling an examination of gene-enriched regions containing exons, introns, and intergenic sequences with modest capture and sequencing costs, computation workload and DNA sample requirement is particularly well suited for accelerating the discovery of somatic mutations, as well as analysis of disease-predisposing germline polymorphisms, by making possible the comparative genome-wide scanning of DNA sequences from large human cohorts.
为了补充下一代测序技术,我们迫切需要成本更低、DNA 需求更少的高效预测序捕获方法。Alu 家族是短散在核元件,是人类基因组中最丰富的转座元件类型,也是公认的基因组不稳定来源。由于超过 100 万个 Alu 元件分布在整个基因组中,它们非常适合促进全基因组序列扩增和捕获可能含有生物相关遗传变异热点的区域。
我们在此报告了使用 Alu 间 PCR 结合下一代测序来生成 Alu 锚定扫描或“AluScan”的方法,该方法可以在 Alu 转座子之间的 DNA 序列中进行扫描。在该方法中,使用基于 Alu 共有序列的“H 型”PCR 引物从 Alu 元件的头部向外延伸,并与从含有 poly-A 的尾部延伸的“T 型”引物相结合,从而获得大范围的扩增子。为了说明该方法,我们用 AluScan 比较了胶质瘤 DNA 与同一患者的白细胞对照 DNA。从超过 8000 个基因获得的超过 10 Mb 序列分布在所有染色体上,高度重现性地捕获了富含基因序列和癌症候选基因区域的基因组序列。该方法仅需亚微克级别的样本 DNA,通过鉴定胶质瘤 DNA 与白细胞对照 DNA 之间的 357 个杂合性缺失、341 个体细胞插入缺失、274 个体细胞单核苷酸变异和 7 个潜在体细胞单核苷酸变异热点,证明了 AluScan 作为遗传变异发现工具的强大功能。
AluScan 仅使用少量的 H 型和 T 型 Alu 间 PCR 引物即可有效捕获全基因组范围内的多种序列进行分析。该方法通过对富含外显子、内含子和基因间序列的基因富集区域进行适度捕获,以较低的捕获和测序成本、计算工作量和 DNA 样本需求,特别适合加速体细胞突变的发现,以及分析致病种系多态性,因为它可以实现对大量人类队列的 DNA 序列进行全基因组比较扫描。