Whitehead Institute, Cambridge, Massachusetts, USA.
Howard Hughes Medical Institute, Whitehead Institute, Cambridge, Massachusetts, USA.
Nat Protoc. 2018 Apr;13(4):787-809. doi: 10.1038/nprot.2018.019. Epub 2018 Mar 22.
The reference sequences of structurally complex regions can be obtained only through highly accurate clone-based approaches. We and others have successfully used single-haplotype iterative mapping and sequencing (SHIMS) 1.0 to assemble structurally complex regions across the sex chromosomes of several vertebrate species and to allow for targeted improvements to the reference sequences of human autosomes. However, SHIMS 1.0 is expensive and time consuming, requiring resources that only a genome center can provide. Here we introduce SHIMS 2.0, an improved SHIMS protocol that allows even a small laboratory to generate high-quality reference sequence from complex genomic regions. Using a streamlined and parallelized library-preparation protocol, and taking advantage of inexpensive high-throughput short-read-sequencing technologies, a small laboratory with both molecular biology and bioinformatics experience can sequence and assemble 192 large-insert bacterial artificial chromosome (BAC) or fosmid clones in 1 week. In SHIMS 2.0, in contrast to other pooling strategies, each clone is sequenced with a unique barcode, thus enabling clones containing nearly identical sequences to be multiplexed in a single sequencing run and assembled separately. Relative to SHIMS 1.0, SHIMS 2.0 decreases the required cost and time by two orders of magnitude while preserving high sequencing accuracy.
结构复杂区域的参考序列只能通过高度精确的基于克隆的方法获得。我们和其他人已经成功地使用单倍型迭代映射和测序(SHIMS)1.0 来组装几个脊椎动物物种的性染色体上的结构复杂区域,并允许对人类常染色体的参考序列进行有针对性的改进。然而,SHIMS 1.0 既昂贵又耗时,需要只有基因组中心才能提供的资源。在这里,我们介绍了 SHIMS 2.0,这是一种改进的 SHIMS 协议,即使是一个小实验室也可以从复杂的基因组区域生成高质量的参考序列。通过简化和并行化的文库制备协议,并利用廉价的高通量短读测序技术,一个具有分子生物学和生物信息学经验的小实验室可以在 1 周内对 192 个大插入细菌人工染色体(BAC)或fosmid 克隆进行测序和组装。在 SHIMS 2.0 中,与其他池化策略相反,每个克隆都用独特的条形码进行测序,从而使包含几乎相同序列的克隆能够在单个测序运行中多路复用,并分别进行组装。与 SHIMS 1.0 相比,SHIMS 2.0 在保留高测序准确性的同时,将所需的成本和时间降低了两个数量级。