College of Landscape Architecture and Horticulture Sciences, Southwest Research Center for Engineering Technology of Landscape Architecture (State Forestry and Grassland Administration), Yunnan Engineering Research Center for Functional Flower Resources and Industrialization, Research and Development Center of Landscape Plants and Horticulture Flowers, Southwest Forestry University, Kunming, Yunnan, 650224, China.
Department of Landscape Architecture and Plant Science, University of Connecticut, Storrs, CT, 06269, USA.
BMC Genomics. 2021 Jul 24;22(1):571. doi: 10.1186/s12864-021-07807-8.
Impatiens L. is a genus of complex taxonomy that belongs to the family Balsaminaceae (Ericales) and contains approximately 1000 species. The genus is well known for its economic, medicinal, ornamental, and horticultural value. However, knowledge about its germplasm identification, molecular phylogeny, and chloroplast genomics is limited, and taxonomic uncertainties still exist due to overlapping morphological features and insufficient genomic resources.
We sequenced the chloroplast genomes of six species (Impatiens chlorosepala, Impatiens fanjingshanica, Impatiens guizhouensis, Impatiens linearisepala, Impatiens loulanensis, and Impatiens stenosepala) from the karst area of China and compared them with those of six previously published Balsaminaceae species. We compared genomic features and repeat sequences, assessed sequence divergence, and reconstructed phylogenetic relationships. The complete chloroplast genomes ranged in size from 151,366 bp (I. alpicola) to 154,189 bp (Hydrocera triflora). Except for those of I. alpicola, I. pritzelii and I. glandulifera, the genomes encoded 115 distinct genes [81 protein-coding, 30 transfer RNA (tRNA), and 4 ribosomal RNA (rRNA) genes]. The characteristics of the long repeat sequences and simple sequence repeats (SSRs) were also determined. psbK-psbI, trnT-GGU-psbD, rpl36-rps8, rpoB-trnC-GCA, trnK-UUU-rps16, trnQ-UUG, trnP-UGG-psaJ, trnT-UGU-trnL-UAA, and ycf4-cemA were identified as divergence hotspot regions and thus might be suitable for species identification and phylogenetic studies. Additionally, the phylogenetic relationships inferred from the whole chloroplast genomes by maximum likelihood (ML) and Bayesian inference (BI) indicated that the chloroplast genome structure of I. guizhouensis represents the ancestral state of the Balsaminaceae family.
Our study provides detailed information about nucleotide diversity hotspots and repeat types, which can be used to develop molecular markers applicable to Balsaminaceae species. We also reconstructed the relationships among some Impatiens species and assessed their taxonomic status based on the complete chloroplast genomes. Together, these findings might provide valuable genomic resources for studying the systematics and evolution of Balsaminaceae.