State Key Laboratory of Crop Genetics and Germplasm Enhancement, Key Laboratory of Biology and Genetic Improvement of Horticultural Crops (East China), Ministry of Agriculture and Rural Affairs of the P.R. China, Nanjing Agricultural University, Nanjing 210095, China.
Provincial Key Laboratory of Agrobiology, Institute of Crop Germplasm and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China.
Gene. 2019 May 30;699:24-36. doi: 10.1016/j.gene.2019.02.075. Epub 2019 Mar 5.
Watercress (Nasturtium officinale R. Br.), an aquatic leafy vegetable of the Brassicaceae family, is known as a nutritional powerhouse. Here, we sequenced and assembled de novo the complete chloroplast (cp) genome of watercress from combined PacBio and Illumina data. The cp genome is 155,106 bp in length and exhibits a typical quadripartite structure: a pair of inverted repeats (IRA and IRB) of 26,505 bp each, separated by a large single copy (LSC) region of 84,265 bp and a small single copy (SSC) region of 17,831 bp. The genome contains 113 unique genes, comprising 79 protein-coding genes, 30 tRNAs and 4 rRNAs, of which 20 are duplicated in the IRs. Compared with the cp genome of watercress previously deposited in GenBank, 21 single nucleotide polymorphisms (SNPs) and 27 indels were identified, located mainly in noncoding sequences. A total of 49 repeat structures and 71 simple sequence repeats (SSRs) were detected. Codon usage in the watercress cp genome showed a bias toward A/T-ending codons. Moreover, 45 RNA editing sites were predicted in 16 genes, all of them C-to-U transitions. A comparative plastome study with Cardamineae species revealed a conserved gene order and high similarity among protein-coding sequences. Analysis of Ka/Ks ratios in Cardamineae suggested that positive selection has acted on the ycf2 gene in watercress, which might reflect specific adaptations of watercress to its particular living environment. Phylogenetic analyses based on complete cp genomes and shared protein-coding genes from 56 species showed that the genus Nasturtium is sister to Cardamine in the tribe Cardamineae. Our study provides valuable resources for future evolutionary, population genetic and molecular biology studies of watercress.
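As a quick consistency check on the figures reported above, the quadripartite structure implies that the total genome length equals LSC + SSC + 2 × IR, and the unique gene count equals the sum of the three gene classes. The following is a minimal sketch of that arithmetic only; the function and variable names are illustrative assumptions and are not part of the study's pipeline.

```python
# Region lengths (bp) and gene counts as reported in the abstract.
LSC = 84_265   # large single copy region
SSC = 17_831   # small single copy region
IR = 26_505    # length of each inverted repeat (IRA and IRB)


def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a plastome with one LSC, one SSC and two IR copies."""
    return lsc + ssc + 2 * ir


# Hypothetical helper names; the values come from the abstract.
total_length = quadripartite_length(LSC, SSC, IR)
unique_genes = 79 + 30 + 4  # protein-coding genes + tRNAs + rRNAs

print(f"total length: {total_length} bp")
print(f"unique genes: {unique_genes}")
```

Both values agree with the reported totals of 155,106 bp and 113 unique genes.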