Yang Yang, Dang Yuanye, Li Qing, Lu Jinjian, Li Xiwen, Wang Yitao
State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau, Macau, China.
Department of Pharmacy, Shanghai Changzheng Hospital, Second Military Medical University, Shanghai, China.
PLoS One. 2014 Nov 3;9(11):e110656. doi: 10.1371/journal.pone.0110656. eCollection 2014.
Datura stramonium is a widely used poisonous plant with great medicinal and economic value. Its chloroplast (cp) genome is 155,871 bp in length with a typical quadripartite structure of the large (LSC, 86,302 bp) and small (SSC, 18,367 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,601 bp). The genome contains 113 unique genes, including 80 protein-coding genes, 29 tRNAs and four rRNAs. A total of 11 forward, 9 palindromic and 13 tandem repeats were detected in the D. stramonium cp genome. Most simple sequence repeats (SSR) are AT-rich and are less abundant in coding regions than in non-coding regions. Both SSRs and GC content were unevenly distributed in the entire cp genome. All preferred synonymous codons were found to use A/T ending codons. The difference in GC contents of entire genomes and of the three-codon positions suggests that the D. stramonium cp genome might possess different genomic organization, in part due to different mutational pressures. The five most divergent coding regions and four non-coding regions (trnH-psbA, rps4-trnS, ndhD-ccsA, and ndhI-ndhG) were identified using whole plastome alignment, which can be used to develop molecular markers for phylogenetics and barcoding studies within the Solanaceae. Phylogenetic analysis based on 68 protein-coding genes supported Datura as a sister to Solanum. This study provides valuable information for phylogenetic and cp genetic engineering studies of this poisonous and medicinal plant.
曼陀罗是一种广泛使用的有毒植物,具有很高的药用和经济价值。其叶绿体(cp)基因组长度为155,871 bp,具有典型的四分体结构,包括一个大的单拷贝区域(LSC,86,302 bp)和一个小的单拷贝区域(SSC,18,367 bp),由一对反向重复序列(IRs,25,601 bp)隔开。该基因组包含113个独特基因,包括80个蛋白质编码基因、29个tRNA和4个rRNA。在曼陀罗cp基因组中总共检测到11个正向重复、9个回文重复和13个串联重复。大多数简单序列重复(SSR)富含AT,且在编码区的丰度低于非编码区。SSR和GC含量在整个cp基因组中分布不均。所有偏好的同义密码子均以A/T结尾。整个基因组和三个密码子位置的GC含量差异表明,曼陀罗cp基因组可能具有不同的基因组组织,部分原因是不同的突变压力。通过全叶绿体基因组比对鉴定出五个差异最大的编码区和四个非编码区(trnH-psbA、rps4-trnS、ndhD-ccsA和ndhI-ndhG),可用于开发茄科植物系统发育和条形码研究的分子标记。基于68个蛋白质编码基因的系统发育分析支持曼陀罗是茄属的姐妹属。本研究为这种有毒药用植物的系统发育和cp基因工程研究提供了有价值的信息。