Panda Subhasmita, Swain Subrat Kumar, Sahu Basanta Pravas, Sarangi Rachita
Department of Pediatrics, IMS and SUM Hospital, Siksha 'O' Anusandhan (Deemed to be University), K8, Kalinga Nagar, Bhubaneswar, Odisha 751003 India.
Medical Research Laboratory, IMS and SUM Hospital, Siksha 'O' Anusandhan (Deemed to be University), K8, Kalinga Nagar, Bhubaneswar, Odisha 751003 India.
3 Biotech. 2023 Nov;13(11):366. doi: 10.1007/s13205-023-03795-6. Epub 2023 Oct 13.
Microsatellite markers are being used for molecular identification and characterization as well as estimation of evolution patterns due to their highly polymorphic nature. The repeats hold 40% of the entire genome of (OT), but not yet been characterized. Thus, we investigated the genome-wide presence of microsatellites within nine complete genomes of OT and analyzed their distribution pattern, composition, and complexity. The study revealed that the genome of OT enriched with microsatellites having a total of 126,187 SSRs and 10,374 cSSRs throughout the genome, of which 70% and 30% are represented within the coding and non-coding regions, respectively. The relative density (RD) and relative abundance (RA) of SSRs were 42-44.43/kb and 6.25-6.59/kb, while for cSSRs this value ranged from 7.06 to 8.1/kb and 0.50 to 0.55/kb, respectively. However, RA and RD were weakly correlated with genome size and incidence of microsatellites. The mononucleotide repeats (54.55%) were prevalent over di- (33.22%), tri- (11.88%), tetra- (0.27%), penta- (0.02%), hexanucleotide (0.04%) repeats, with poly (A/T) richness over poly (G/C). The motif composition of cSSRs revealed that maximum cSSRs were made up of two microsatellites having unique duplication patterns such as AT-x-AT and CG-x-CG. To our knowledge, this is the first study of microsatellites in the OT genome, where characterization of such variations in repeat sequences would be important in deciphering the origin, rate of mutation, and role of repeat sequences in the genome. More numbers of microsatellites represented within the coding region provide an insight into the genome plasticity that may interfere with gene regulation to mitigate host-pathogen interaction and evolution of the species.
微卫星标记因其高度多态性,正被用于分子鉴定、特征分析以及进化模式估计。这些重复序列占(OT)整个基因组的40%,但尚未得到表征。因此,我们研究了OT九个完整基因组中微卫星在全基因组范围内的存在情况,并分析了它们的分布模式、组成和复杂性。研究表明,OT基因组富含微卫星,全基因组共有126,187个简单序列重复(SSR)和10,374个复合简单序列重复(cSSR),其中分别有70%和30%存在于编码区和非编码区。SSR的相对密度(RD)和相对丰度(RA)分别为42 - 44.43/kb和6.25 - 6.59/kb,而cSSR的这一数值范围分别为7.06至8.1/kb和0.50至0.55/kb。然而,RA和RD与基因组大小及微卫星发生率的相关性较弱。单核苷酸重复(54.55%)比二核苷酸(33.22%)、三核苷酸(11.88%)、四核苷酸(0.27%)、五核苷酸(0.02%)、六核苷酸(0.04%)重复更为普遍,聚(A/T)比聚(G/C)更丰富。cSSR的基序组成表明,最大数量的cSSR由两个具有独特重复模式的微卫星组成,如AT - x - AT和CG - x - CG。据我们所知,这是首次对OT基因组中的微卫星进行研究,其中重复序列这种变异的表征对于解读基因组中重复序列的起源、突变率及作用具有重要意义。编码区内存在更多数量的微卫星,有助于深入了解可能干扰基因调控以减轻宿主 - 病原体相互作用及物种进化的基因组可塑性。