Colombini Lorenzo, Santoro Francesco, Tirziu Mariana, Cuppone Anna Maria, Pozzi Gianni, Iannelli Francesco
Laboratory of Molecular Microbiology and Biotechnology, Department of Medical Biotechnologies, University of Siena, 53100 Siena, Italy.
NAR Genom Bioinform. 2025 Jun 26;7(2):lqaf085. doi: 10.1093/nargab/lqaf085. eCollection 2025 Jun.
Long inverted repeats (LIRs) of DNA sequences longer than 30 kb are rare in prokaryotes. Here, we identified two 69.9-kb LIRs in the genome of M247_Siena, a derivative of strain M247. Complete genome sequence of M247_Siena was determined using Nanopore and Illumina technologies, while genome structure was analyzed using ultra-long Nanopore read mapping and polymerase chain reaction (PCR). In the parental M247 genome, there was only one copy of the 69.9-kb segment, while a 15.4-kb DNA segment was present instead of the second 69.9-kb segment copy. Both segments were delimited by the same insertion sequences (IS and IS), and PCR analysis of the M247 population revealed low rates (∼1.28 per 10 chromosomes) of chromosomal rearrangements involving these regions. In contrast, the 69.9-kb LIRs in M247_Siena increased genomic instability, as evidenced by two alternative chromosomal structures detected at frequencies of 23.3% and 76.7% (∼1 out of 5 chromosomes). Comparative analysis of genomes revealed no LIRs similar to those of M247_Siena. However, long repeats of other DNA segments and chromosomal rearrangements, mostly associated with insertion sequences, were detected in 8 and 9 out of 25 genomes, respectively, highlighting genomic instability as a trait of the species.
长度超过30 kb的DNA序列的长反向重复序列(LIRs)在原核生物中很少见。在此,我们在M247菌株的衍生物M247_Siena的基因组中鉴定出两个69.9 kb的LIRs。使用纳米孔和Illumina技术确定了M247_Siena的完整基因组序列,同时使用超长纳米孔读数映射和聚合酶链反应(PCR)分析了基因组结构。在亲本M247基因组中,69.9 kb片段只有一个拷贝,而第二个69.9 kb片段拷贝的位置存在一个15.4 kb的DNA片段。两个片段都由相同的插入序列(IS和IS)界定,对M247群体的PCR分析显示,涉及这些区域的染色体重排发生率较低(每10条染色体约1.28次)。相比之下,M247_Siena中的69.9 kb LIRs增加了基因组不稳定性,这通过检测到的两种替代染色体结构得到证明,其频率分别为23.3%和76.7%(约每5条染色体中有1条)。基因组比较分析未发现与M247_Siena类似的LIRs。然而,在25个基因组中的8个和9个基因组中分别检测到其他DNA片段的长重复序列和染色体重排,其中大部分与插入序列相关,这突出了基因组不稳定性是该物种的一个特征。