Miller H E, Montemayor D, Levy S, Sharma K, Frost B, Bishop A J R
Department of Cell Systems and Anatomy, UT Health San Antonio, San Antonio, TX, USA.
Greehey Children's Cancer Research Institute, UT Health San Antonio, San Antonio, TX, USA.
J Bioinform Syst Biol. 2023;6(4):364-378. doi: 10.26502/jbsb.5107071. Epub 2023 Dec 21.
We recently described the development of a database of 810 R-loop mapping datasets and used this data to conduct a meta-analysis of R-loops. R-loops are three-stranded nucleic acid structures containing RNA:DNA hybrids and we were able to verify that 30% of expressed genes have an associated R-loop in a location conserved manner.. Moreover, intergenic R-loops map to enhancers, super enhancers and with TAD domain boundaries. This work demonstrated that R-loop mapping via high-throughput sequencing can reveal novel insight into R-loop biology, however the analysis and quality control of these data is a non-trivial task for which few bioinformatic tools exist. Herein we describe RLSuite, an integrative R-loop bioinformatics framework for pre-processing, quality control, and downstream analysis of R-loop mapping data. RLSuite enables users to compare their data to hundreds of public datasets and generate a user-friendly analysis report for sharing with non-bioinformatician colleagues. Taken together, RLSuite is a novel analysis framework that should greatly benefit the emerging R-loop bioinformatics community in a rapidly expanding aspect of epigenetic control that is still poorly understood.
我们最近描述了一个包含810个R环映射数据集的数据库的开发,并使用这些数据对R环进行了荟萃分析。R环是包含RNA:DNA杂交体的三链核酸结构,我们能够证实30%的表达基因在一个保守的位置有一个相关的R环。此外,基因间R环映射到增强子、超级增强子以及拓扑相关结构域(TAD)边界。这项工作表明,通过高通量测序进行R环映射可以揭示R环生物学的新见解,然而,对这些数据的分析和质量控制是一项重要任务,目前几乎没有生物信息学工具可以完成。在此,我们描述了RLSuite,这是一个用于R环映射数据的预处理、质量控制和下游分析的综合R环生物信息学框架。RLSuite使用户能够将他们的数据与数百个公共数据集进行比较,并生成一份用户友好的分析报告,以便与非生物信息学领域的同事分享。综上所述,RLSuite是一个新颖的分析框架,在表观遗传控制这一仍未被充分理解且迅速扩展的领域,它将极大地造福新兴的R环生物信息学群体。