从多倍体植物物种的指纹细菌人工染色体文库构建物理图谱的可行性。
Feasibility of physical map construction from fingerprinted bacterial artificial chromosome libraries of polyploid plant species.
机构信息
Department of Plant Sciences, University of California, Davis, CA 95616, USA.
出版信息
BMC Genomics. 2010 Feb 19;11:122. doi: 10.1186/1471-2164-11-122.
BACKGROUND
The presence of closely related genomes in polyploid species makes the assembly of total genomic sequence from shotgun sequence reads produced by the current sequencing platforms exceedingly difficult, if not impossible. Genomes of polyploid species could be sequenced following the ordered-clone sequencing approach employing contigs of bacterial artificial chromosome (BAC) clones and BAC-based physical maps. Although BAC contigs can currently be constructed for virtually any diploid organism with the SNaPshot high-information-content-fingerprinting (HICF) technology, it is currently unknown if this is also true for polyploid species. It is possible that BAC clones from orthologous regions of homoeologous chromosomes would share numerous restriction fragments and be therefore included into common contigs. Because of this and other concerns, physical mapping utilizing the SNaPshot HICF of BAC libraries of polyploid species has not been pursued and the possibility of doing so has not been assessed. The sole exception has been in common wheat, an allohexaploid in which it is possible to construct single-chromosome or single-chromosome-arm BAC libraries from DNA of flow-sorted chromosomes and bypass the obstacles created by polyploidy.
RESULTS
The potential of the SNaPshot HICF technology for physical mapping of polyploid plants utilizing global BAC libraries was evaluated by assembling contigs of fingerprinted clones in an in silico merged BAC library composed of single-chromosome libraries of two wheat homoeologous chromosome arms, 3AS and 3DS, and complete chromosome 3B. Because the chromosome arm origin of each clone was known, it was possible to estimate the fidelity of contig assembly. On average 97.78% or more clones, depending on the library, were from a single chromosome arm. A large portion of the remaining clones was shown to be library contamination from other chromosomes, a feature that is unavoidable during the construction of single-chromosome BAC libraries.
CONCLUSIONS
The negligibly low level of incorporation of clones from homoeologous chromosome arms into a contig during contig assembly suggested that it is feasible to construct contigs and physical maps using global BAC libraries of wheat and almost certainly also of other plant polyploid species with genome sizes comparable to that of wheat. Because of the high purity of the resulting assembled contigs, they can be directly used for genome sequencing. It is currently unknown but possible that equally good BAC contigs can be also constructed for polyploid species containing smaller, more gene-rich genomes.
背景
在多倍体物种中,密切相关的基因组的存在使得从当前测序平台产生的 shotgun 序列读取中组装总基因组序列变得极其困难,如果不是不可能的话。多倍体物种的基因组可以通过使用细菌人工染色体 (BAC) 克隆的有序克隆测序方法和基于 BAC 的物理图谱进行测序。尽管目前可以使用 SNaPshot 高信息量指纹 (HICF) 技术构建几乎任何二倍体生物的 BAC 连续体,但目前尚不清楚这是否也适用于多倍体物种。来自同源染色体同源区域的 BAC 克隆可能会共享许多限制片段,并因此包含在共同的连续体中。由于这一点和其他一些原因,利用多倍体物种的 BAC 文库进行 SNaPshot HICF 的物理图谱绘制尚未得到探索,也没有评估其可能性。唯一的例外是普通小麦,这是一种异源六倍体,其中可以从流式分选染色体的 DNA 中构建单染色体或单染色体臂 BAC 文库,并绕过多倍体带来的障碍。
结果
通过组装由两个小麦同源染色体臂 3AS 和 3DS 的单染色体文库和完整的 3B 染色体组成的虚拟合并 BAC 文库中指纹克隆的连续体,评估了 SNaPshot HICF 技术在利用全球 BAC 文库进行多倍体植物物理图谱绘制的潜力。由于每个克隆的染色体臂来源是已知的,因此可以估计连续体组装的保真度。平均而言,取决于文库,97.78%或更多的克隆来自单个染色体臂。大量剩余的克隆被证明是来自其他染色体的文库污染,这是在构建单染色体 BAC 文库时不可避免的特征。
结论
在连续体组装过程中,同源染色体臂的克隆很少被整合到连续体中,这表明使用小麦的全球 BAC 文库构建连续体和物理图谱是可行的,而且几乎可以肯定也可以用于基因组大小与小麦相当的其他植物多倍体物种。由于所得组装连续体的高纯度,它们可以直接用于基因组测序。目前尚不清楚,但有可能为含有更小、基因更丰富基因组的多倍体物种构建同样好的 BAC 连续体。