Department of Computational and Data Sciences, Indian Institute of Science, Bengaluru, 560012, Karnataka, India.
Department of Computational and Data Sciences, Indian Institute of Science, Bengaluru, 560012, Karnataka, India.
Biochimie. 2023 Nov;214(Pt B):228-236. doi: 10.1016/j.biochi.2023.07.018. Epub 2023 Jul 26.
The large-scale detection of putative intrinsic transcription terminators is limited to only a few bacteria currently. We discovered a group of hairpins, called cluster hairpins, present within 15 nucleotides from each other. These are expected to work in tandem to cause intrinsic transcription termination (ITT), while the single hairpin can do the same alone. Therefore, exploring these ITT sites and the hairpins across bacterial genomes becomes highly desirable. INTERPIN is the largest archived collection of in silico inferred ITT hairpins in bacteria, covering 12745 bacterial genomes and encompassing ten bacterial phyla for ∼25 million hairpins. Users can obtain details on operons, individual cluster, and single ITT hairpins that were screened therein. Integrated Genome Viewer (IGV) software interactively visualizes hairpin secondary and tertiary structures in the genomic context. We also discuss statistics for the occurrence of cluster or single hairpins and other termination alternatives while showing the validation of predicted hairpins against in vivo detected hairpins. The database is freely available at http://pallab.cds.iisc.ac.in/INTERPIN/. INTERPIN (database and software) can make predictions for both AT and GC-rich genomes, which has not been achieved by any other program so far. It can also be used to improve genome annotation as well as to get predictions to improve the understanding of the ITT pathway by further analysis.
目前,大规模检测推定的内在转录终止子的方法仅适用于少数几种细菌。我们发现了一组发夹,称为簇发夹,它们彼此之间的距离为 15 个核苷酸。这些发夹预计会协同作用导致内在转录终止(ITT),而单个发夹也可以单独完成相同的功能。因此,探索这些 ITT 位点和细菌基因组中的发夹变得非常有必要。INTERPIN 是细菌中最大的内在转录终止子发夹的存档集合,涵盖了 12745 个细菌基因组,包含十个细菌门,约有 2500 万个发夹。用户可以获得在那里筛选的操纵子、单个簇和单个 ITT 发夹的详细信息。集成基因组浏览器(IGV)软件可以在基因组背景下交互式地可视化发夹的二级和三级结构。我们还讨论了在展示预测发夹与体内检测到的发夹的验证结果的同时,簇发夹或单发夹以及其他终止替代物出现的统计信息。该数据库可在 http://pallab.cds.iisc.ac.in/INTERPIN/ 免费获得。INTERPIN(数据库和软件)可以对富含 AT 和 GC 的基因组进行预测,这是迄今为止任何其他程序都无法实现的。它还可以用于改进基因组注释,并通过进一步分析来获得预测,以加深对 ITT 途径的理解。