Li Yongtan, Dong Yan, Liu Yichao, Yu Xiaoyue, Yang Minsheng, Huang Yinran
Forest Department, Forestry College, Hebei Agricultural University, Baoding, China.
Hebei Key Laboratory for Tree Genetic Resources and Forest Protection, Baoding, China.
Front Plant Sci. 2021 Feb 11;11:593984. doi: 10.3389/fpls.2020.593984. eCollection 2020.
In this study, we assembled and annotated the chloroplast (cp) genome of the species , , and , and performed a series of analyses to investigate gene structure, GC content, sequence alignment, and nucleic acid diversity, with the objectives of identifying positive selection genes and understanding evolutionary relationships. The results indicated that the cp genome was 156,860-157,611bp in length and exhibited a typical circular tetrad structure. Similar to the majority of angiosperm chloroplast genomes, the results yielded a large single-copy region (LSC) (85,826-86,299bp) and a small single-copy region (SSC) (18,319-18,536bp), separated by a pair of sequences (IRA and IRB; 26,341-26,700bp) with the same encoding but in opposite directions. The chloroplast genome was annotated to 130-131 genes, including 85-86 protein coding genes, 37 tRNA genes, and eight rRNA genes, with GC contents of 37.26-37.31%. The GC content was variable among regions and was highest in the inverted repeat (IR) region. The IR boundary of happened expanding resulting that the entered into IR region and doubled completely. Such fluctuations at the border positions might be helpful in determining evolutionary relationships among . The simple-sequence repeats (SSRs) of species were composed primarily of single nucleotides (A)n and (T)n, and were mostly 10-12bp in length, with an obvious A/T bias. We identified several loci with suitable polymorphism with the potential use as molecular markers for inferring the phylogeny within the genus . Signatures of positive selection were seen in protein encoding genes. Based on data from the whole chloroplast genome, common single copy genes, and the LSC, SSC, and IR regions, we constructed an evolutionary tree of and related species, the results of which were consistent with traditional taxonomic classifications. It showed that sister to the , whereby appeared as sister to . Our study provides important genetic information to support further investigations into the phylogenetic development and adaptive evolution of species.
在本研究中,我们组装并注释了[物种名称1]、[物种名称2]和[物种名称3]的叶绿体(cp)基因组,并进行了一系列分析以研究基因结构、GC含量、序列比对和核酸多样性,目的是鉴定正选择基因并了解进化关系。结果表明,[物种名称]的cp基因组长度为156,860 - 157,611bp,呈现典型的环状四分体结构。与大多数被子植物叶绿体基因组相似,结果产生了一个大单拷贝区域(LSC)(85,826 - 86,299bp)和一个小单拷贝区域(SSC)(18,319 - 18,536bp),由一对具有相同编码但方向相反的序列(IRA和IRB;26,341 - 26,700bp)隔开。叶绿体基因组被注释为130 - 131个基因,包括85 - 86个蛋白质编码基因、37个tRNA基因和8个rRNA基因,GC含量为37.26 - 37.31%。GC含量在各区域间存在差异,在反向重复(IR)区域最高。[物种名称]的IR边界发生扩张,导致[某基因或片段名称]进入IR区域并完全加倍。边界位置的这种波动可能有助于确定[物种名称]之间的进化关系。[物种名称]的简单序列重复(SSRs)主要由单核苷酸(A)n和(T)n组成,长度大多为10 - 12bp,具有明显的A/T偏向性。我们鉴定了几个具有合适多态性的位点,有可能用作推断该属内系统发育的分子标记。在[X]个蛋白质编码基因中发现了正选择信号。基于整个叶绿体基因组、常见单拷贝基因以及LSC、SSC和IR区域的数据,我们构建了[物种名称]及其相关物种的进化树,结果与传统分类学分类一致。结果显示,[物种名称1]与[物种名称2]为姐妹关系,由此[物种名称3]表现为与[另一物种名称]为姐妹关系。我们的研究提供了重要的遗传信息,以支持对[物种名称]物种的系统发育发展和适应性进化的进一步研究。