Biotechnology Institute, Dalian Jiaotong University, Dalian 116028, China.
Mol Genet Genomics. 2011 Mar;285(3):197-205. doi: 10.1007/s00438-010-0598-4. Epub 2011 Jan 5.
Taxus mairei is a critically endangered, commercially important medicinal gymnosperm cultivated in China, but its genome has not yet been studied. In this study, we constructed a T. mairei fosmid library and analyzed the fosmid end sequences to provide a preliminary assessment of the genome. The library consists of one million clones with an average insert size of about 39 kb, amounting to 3.9 genome equivalents. Fosmid stability assays indicate that T. mairei DNA was stable during propagation in the fosmid system. Sequencing both the 5' and 3' ends of 968 individual clones generated 1,923 sequences after trimming, with an average length of 839 bp. BLASTN searches of the GenBank nr and EST databases and BLASTX searches of the nr database yielded 560 (29.1%) significant hits (E < e^-5). Repeat analysis showed that 20.8% of the end sequences are repetitive elements, comprising retroelements, DNA transposons, satellites, simple repeats, and low-complexity sequences. The distribution of repeat types was more similar to that of the gymnosperms Pinus and Picea than to those of monocots and dicots. The satellites of T. mairei were significantly longer than those of Pinus taeda and Picea glauca, and its tetranucleotide repeats were much longer than those of either species. The fosmid library and its end sequences, the first reported for T. mairei, will serve as a useful resource for large-scale genome sequencing, physical mapping, SSR marker development, and positional cloning, and will improve our understanding of the Taxus genome.
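The library figures quoted in the abstract can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not the authors' code: the function names are hypothetical, and the genome size is not stated in the abstract, so it is back-calculated here from the reported clone count, insert size, and coverage.

```python
def genome_equivalents(n_clones, mean_insert_bp, genome_size_bp):
    """Library coverage: total cloned bases divided by genome size."""
    return n_clones * mean_insert_bp / genome_size_bp


def implied_genome_size_bp(n_clones, mean_insert_bp, coverage):
    """Genome size implied by a reported coverage (an inference, not a source value)."""
    return n_clones * mean_insert_bp / coverage


def hit_fraction(n_hits, n_sequences):
    """Fraction of end sequences with a significant database hit."""
    return n_hits / n_sequences


# Values reported in the abstract.
n_clones = 1_000_000
mean_insert_bp = 39_000      # about 39 kb
reported_coverage = 3.9      # genome equivalents
n_end_sequences = 1_923      # after trimming
n_hits = 560

# ASSUMPTION: genome size is derived from the numbers above, not quoted.
genome_bp = implied_genome_size_bp(n_clones, mean_insert_bp, reported_coverage)
print(f"Implied genome size: {genome_bp / 1e9:.1f} Gb")
print(f"Significant hits: {hit_fraction(n_hits, n_end_sequences):.1%}")
print(f"Total end-sequence bases: about {n_end_sequences * 839:,} bp")
```

Under these assumptions, the reported 3.9 genome equivalents corresponds to a genome of roughly 10 Gb, and 560 of 1,923 sequences reproduces the stated 29.1% hit rate.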