Summer Elizabeth J
Department Biochemistry and Biophysics, Texas A&M University, College Station, TX, USA.
Methods Mol Biol. 2009;502:27-46. doi: 10.1007/978-1-60327-565-1_4.
The most efficient method to determine the genomic sequence of a dsDNA phage is to use a whole genome shotgun approach (WGSA). Preparation of a library where each genomic fragment has an equal chance of being represented is critical to the success of the WGSA. For many phages, there are regions of the genome likely to be under-represented in the shotgun library, which results in more gaps in the shotgun assembly than predicted by the Poisson distribution. However, as phage genomes are relatively small, this increased number of gaps does not present an insurmountable impediment to using the WGSA. This chapter will focus on construction of a high-quality random library and sequence analysis of this library in a 96-well format. Techniques are described for the mechanical fragmentation of genomic DNA into 2 kb average size fragments, preparation of the fragmented DNA for shotgun cloning, and advice on the choice of cloning vector for library preparation. Protocols for deepwell block culture, plasmid isolation, and sequencing in 96-well format are given. The rationale for determining the total number of random clones from a library to sequence for a 50 and 150 kb genome is explained. The steps involved in going from hundreds of shotgun sequencing traces to generating contigs will be outlined as well as how to close gaps in the sequence by primer walking on phage DNA and PCR-generated templates. Finally, examples will be given of how biological information about the phage genomic termini can be derived by analysis of the organization of individual clones in the shotgun sequence assembly. Specific examples are given for the circularly permuted termini of pac type phages, the direct terminal repeats found in most T7-like phages, variable host DNA at either end as in the Mu-like phages, and the 5' and 3' overhanging ends of cos type phages. The end result of these steps is the entire DNA sequence of a novel phage, ready for gene prediction.
确定双链DNA噬菌体基因组序列的最有效方法是采用全基因组鸟枪法(WGSA)。构建一个文库,使每个基因组片段都有同等机会被代表,这对WGSA的成功至关重要。对于许多噬菌体来说,基因组中有些区域在鸟枪文库中可能代表性不足,这导致鸟枪组装中的缺口比泊松分布预测的更多。然而,由于噬菌体基因组相对较小,缺口数量的增加对使用WGSA来说并非不可克服的障碍。本章将重点介绍高质量随机文库的构建以及以96孔板形式对该文库进行序列分析。文中描述了将基因组DNA机械片段化为平均大小为2 kb片段的技术、用于鸟枪克隆的片段化DNA的制备,以及关于文库制备中克隆载体选择的建议。给出了在96孔板中进行深孔板培养、质粒分离和测序的方案。解释了确定针对50 kb和150 kb基因组进行测序所需文库中随机克隆总数的基本原理。将概述从数百个鸟枪测序痕迹到生成重叠群所涉及的步骤,以及如何通过在噬菌体DNA上进行引物步移和PCR生成的模板来封闭序列中的缺口。最后,将举例说明如何通过分析鸟枪序列组装中单个克隆的组织来推导噬菌体基因组末端的生物学信息。针对pac型噬菌体的环状排列末端、大多数T7样噬菌体中发现的直接末端重复序列、Mu样噬菌体两端可变的宿主DNA以及cos型噬菌体的5'和3'突出末端给出了具体例子。这些步骤的最终结果是获得一个新噬菌体的完整DNA序列,可用于基因预测。