Suppr超能文献

使用Illumina基因组分析仪对靶向DNA区域进行条形码标记的多重测序。

Bar-coded, multiplexed sequencing of targeted DNA regions using the Illumina Genome Analyzer.

作者信息

Szelinger Szabolcs, Kurdoglu Ahmet, Craig David W

机构信息

Neurogenomics Division, Translational Genomics Research Institute, Phoenix, AZ, USA.

出版信息

Methods Mol Biol. 2011;700:89-104. doi: 10.1007/978-1-61737-954-3_7.

Abstract

To date, genome-wide association (GWA) studies, in which thousands of markers throughout the genome are simultaneously genotyped, have identified hundreds of loci underlying disease susceptibility. These regions typically span 5-100 kb, and resequencing efforts to identify potential functional variants within these loci represent the next logical step in the genetic characterization pipeline. Next-generation DNA sequencing technologies are, in principle, well-suited for this task, yet despite the massive sequencing capability afforded by these platforms, the present-day reality is that it remains difficult, time-consuming, and expensive to resequence large numbers of samples across moderately sized genomic regions. To address this obstacle, we developed a generalized framework for multiplexed resequencing of targeted regions of the human genome on the Illumina Genome Analyzer using degenerate, indexed DNA sequence barcodes ligated to fragmented DNA prior to sequencing. Using this method, the DNA of multiple individuals can be simultaneously sequenced at several regions. We find that achieving adequate coverage is one of the most important factors in the design of an experiment, but other key considerations include whether the objective is to discover genetic variants for genotyping later by a separate method, to genotype all identified variants by sequencing, or to exhaustively identify all common and rare variants in the region. Given the massive bandwidth of next-generation sequencing technologies and their low inherent throughput in terms of sequencing arrays per week, multiplexed sequencing using the barcoding approach offers a clear mechanism for focusing bandwidth to a smaller region across many more individuals or samples.

摘要

迄今为止,全基因组关联(GWA)研究已鉴定出数百个疾病易感性相关位点,该研究同时对全基因组中的数千个标记进行基因分型。这些区域通常跨度为5 - 100 kb,对这些位点内潜在功能变异进行鉴定的重测序工作是基因特征分析流程中的下一步合理举措。原则上,新一代DNA测序技术非常适合这项任务,然而尽管这些平台具备强大的测序能力,但目前的实际情况是,对中等大小基因组区域内的大量样本进行重测序仍然困难、耗时且昂贵。为解决这一障碍,我们开发了一个通用框架,用于在Illumina基因组分析仪上对人类基因组的目标区域进行多重重测序,该方法是在测序之前,将简并索引DNA序列条形码连接到片段化DNA上。使用这种方法,可以同时对多个个体的DNA在多个区域进行测序。我们发现,实现足够的覆盖度是实验设计中最重要的因素之一,但其他关键考虑因素包括目标是发现遗传变异以便之后通过单独方法进行基因分型、通过测序对所有已鉴定的变异进行基因分型,还是详尽地鉴定该区域内所有常见和罕见的变异。鉴于新一代测序技术的巨大带宽以及其每周测序阵列方面较低的固有通量,使用条形码方法进行多重测序提供了一种明确的机制,可将带宽集中到更多个体或样本的较小区域。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验