Chew Ivy Yee Yen, Chung Hung Hui, Lim Leonard Whye Kit, Lau Melinda Mei Lin, Gan Han Ming, Wee Boon Siong, Sim Siong Fong
Faculty of Resource Science and Technology, Universiti Malaysia Sarawak, 94300 Kota Samarahan, Sarawak, Malaysia.
Patriot Biotech Sdn Bhd, 47500 Subang Jaya, Selangor, Malaysia.
Data Brief. 2023 Mar 4;47:109029. doi: 10.1016/j.dib.2023.109029. eCollection 2023 Apr.
belongs to the genus under the Dipterocarpaceae family. It is a woody tree that grows in the rainforest in Southeast Asia. The complete chloroplast (cp) genome sequence of is reported here. The genomic size of is 150,778 bp and it possesses a circular structure with conserved constitute regions of large single copy (LSC, 83,681 bp) and small single copy (SSC, 19,813 bp) regions, as well as a pair of inverted repeats with a length of 23,642 bp. It has 112 unique genes, including 78 protein-coding genes, 30 tRNA genes, and four rRNA genes. The genome exhibits a similar GC content, gene order, structure, and codon usage when compared to previously reported chloroplast genomes from other plant species. The chloroplast genome of contained 262 SSRs, the most prevalent of which was A/T, followed by AAT/ATT. Furthermore, the sequences contain 43 long repeat sequences, practically most of them are forward or palindrome type long repeats. The genome structure of was compared to the genomic structures of closely related species from the same family, and eight mutational hotspots were discovered. The phylogenetic analysis demonstrated a close relationship between and species, indicating that is not monophyletic. The complete chloroplast genome sequence analysis of reported in this paper will contribute to further studies in molecular identification, genetic diversity, and phylogenetic research.
属于龙脑香科下的属。它是一种生长在东南亚雨林中的木本树。本文报道了其完整的叶绿体(cp)基因组序列。其基因组大小为150,778 bp,具有环状结构,包含保守的大单拷贝(LSC,83,681 bp)和小单拷贝(SSC,19,813 bp)区域,以及一对长度为23,642 bp的反向重复序列。它有112个独特基因,包括78个蛋白质编码基因、30个tRNA基因和4个rRNA基因。与先前报道的其他植物物种的叶绿体基因组相比,该基因组在GC含量、基因顺序、结构和密码子使用方面表现出相似性。该植物的叶绿体基因组包含262个简单序列重复(SSR),其中最常见的是A/T,其次是AAT/ATT。此外,这些序列包含43个长重复序列,实际上大多数是正向或回文型长重复序列。将该植物的基因组结构与同科近缘物种的基因组结构进行比较,发现了8个突变热点。系统发育分析表明该植物与其他物种之间关系密切,表明该植物不是单系的。本文报道的该植物完整叶绿体基因组序列分析将有助于分子鉴定、遗传多样性和系统发育研究的进一步开展。