Kelkar H S, Griffith J, Case M E, Covert S F, Hall R D, Keith C H, Oliver J S, Orbach M J, Sachs M S, Wagner J R, Weise M J, Wunderlich J K, Arnold J
Department of Genetics, University of Georgia, Athens, Georgia 30602, USA.
Genetics. 2001 Mar;157(3):979-90. doi: 10.1093/genetics/157.3.979.
A Neurospora crassa cosmid library of 12,000 clones (at least nine genome equivalents) has been created using an improved cosmid vector pLorist6Xh, which contains a bacteriophage lambda origin of replication for low-copy-number replication in bacteria and the hygromycin phosphotransferase marker for direct selection in fungi. The electrophoretic karyotype of the seven chromosomes comprising the 42.9-Mb N. crassa genome was resolved using two translocation strains. Using gel-purified chromosomal DNAs as probes against the new cosmid library and the commonly used medium-copy-number pMOcosX N. crassa cosmid library in two independent screenings, the cosmids were assigned to chromosomes. Assignments of cosmids to linkage groups on the basis of the genetic map vs. the electrophoretic karyotype are 93 +/- 3% concordant. The size of each chromosome-specific subcollection of cosmids was found to be linearly proportional to the size of the particular chromosome. Sequencing of an entire cosmid containing the qa gene cluster indicated a gene density of 1 gene per 4 kbp; by extrapolation, 11,000 genes would be expected to be present in the N. crassa genome. By hybridizing 79 nonoverlapping cosmids with an average insert size of 34 kbp against cDNA arrays, the density of previously characterized expressed sequence tags (ESTs) was found to be slightly <1 per cosmid (i.e., 1 per 40 kbp), and most cosmids, on average, contained an identified N. crassa gene sequence as a starting point for gene identification.
利用改良的黏粒载体pLorist6Xh构建了一个包含12000个克隆(至少九个基因组当量)的粗糙脉孢菌黏粒文库,该载体含有噬菌体λ复制起点以便在细菌中进行低拷贝数复制,以及潮霉素磷酸转移酶标记以便在真菌中进行直接筛选。使用两个易位菌株解析了构成42.9 Mb粗糙脉孢菌基因组的七条染色体的电泳核型。在两次独立筛选中,使用凝胶纯化的染色体DNA作为探针,分别与新的黏粒文库和常用的中拷贝数pMOcosX粗糙脉孢菌黏粒文库杂交,将黏粒定位到染色体上。基于遗传图谱与电泳核型将黏粒定位到连锁群的一致性为93±3%。发现每个染色体特异性黏粒亚文库的大小与特定染色体的大小呈线性比例关系。对包含qa基因簇的一个完整黏粒进行测序表明,基因密度为每4 kbp有1个基因;通过推断,预计粗糙脉孢菌基因组中存在11000个基因。通过将平均插入片段大小为34 kbp的79个非重叠黏粒与cDNA阵列杂交,发现先前已鉴定的表达序列标签(EST)的密度略低于每个黏粒1个(即每40 kbp 1个),并且平均而言,大多数黏粒包含一个已鉴定的粗糙脉孢菌基因序列作为基因鉴定的起点。