Joint Center for Genomics Research (JCGR), King Abdulaziz City for Science and Technology (KACST) and Chinese Academy of Sciences (CAS), Riyadh, Kingdom of Saudi Arabia.
PLoS One. 2012;7(5):e37164. doi: 10.1371/journal.pone.0037164. Epub 2012 May 24.
Based on next-generation sequencing data, we assembled the mitochondrial (mt) genome of date palm (Phoenix dactylifera L.) into a circular molecule of 715,001 bp in length. The mt genome of P. dactylifera encodes 38 proteins, 30 tRNAs, and 3 ribosomal RNAs, which constitute a gene content of 6.5% (46,770 bp) over the full length. The rest, 93.5% of the genome sequence, is comprised of cp (chloroplast)-derived (10.3% with respect to the whole genome length) and non-coding sequences. In the non-coding regions, there are 0.33% tandem and 2.3% long repeats. Our transcriptomic data from eight tissues (root, seed, bud, fruit, green leaf, yellow leaf, female flower, and male flower) showed higher gene expression levels in male flower, root, bud, and female flower, as compared to four other tissues. We identified 120 potential SNPs among three date palm cultivars (Khalas, Fahal, and Sukry), and successfully found seven SNPs in the coding sequences. A phylogenetic analysis, based on 22 conserved genes of 15 representative plant mitochondria, showed that P. dactylifera positions at the root of all sequenced monocot mt genomes. In addition, consistent with previous discoveries, there are three co-transcribed gene clusters-18S-5S rRNA, rps3-rpl16 and nad3-rps12-in P. dactylifera, which are highly conserved among all known mitochondrial genomes of angiosperms.
基于下一代测序数据,我们将椰枣树(Phoenix dactylifera L.)的线粒体(mt)基因组组装成一个长 715,001 bp 的环状分子。P. dactylifera 的 mt 基因组编码 38 种蛋白质、30 种 tRNA 和 3 种核糖体 RNA,占全长的 6.5%(46,770 bp)。其余 93.5%的基因组序列由 cp(叶绿体)衍生序列(相对于整个基因组长度的 10.3%)和非编码序列组成。在非编码区,有 0.33%的串联重复和 2.3%的长重复。我们从八个组织(根、种子、芽、果实、绿叶、黄叶、雌花和雄花)的转录组数据中发现,与其他四个组织相比,雄花、根、芽和雌花的基因表达水平更高。在三个椰枣品种(Khalas、Fahal 和 Sukry)中,我们鉴定出 120 个潜在的 SNP,并在编码序列中成功发现了 7 个 SNP。基于 15 种代表性植物线粒体的 22 个保守基因的系统发育分析表明,P. dactylifera 在所有测序的单子叶 mt 基因组中处于根部位置。此外,与先前的发现一致,在 P. dactylifera 中有三个共转录基因簇-18S-5S rRNA、rps3-rpl16 和 nad3-rps12,它们在所有已知的被子植物线粒体基因组中高度保守。