Mylarshchikov Dmitry E, Nikolskaya Arina I, Bogomaz Olesja D, Zharikova Anastasia A, Mironov Andrey A
Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskiye Gory, Moscow 119234, Russia.
Kharkevich Institute for Information Transmission Problems RAS, Bolshoy Karetny per., Moscow 127051, Russia.
NAR Genom Bioinform. 2024 May 20;6(2):lqae054. doi: 10.1093/nargab/lqae054. eCollection 2024 Jun.
Chromatin-associated non-coding RNAs play important roles in various cellular processes by targeting genomic loci. Two types of genome-wide NGS experiments exist to detect such targets: 'one-to-al', which focuses on targets of a single RNA, and 'all-to-al', which captures targets of all RNAs in a sample. As with many NGS experiments, they are prone to biases and noise, so it becomes essential to detect 'peaks'-specific interactions of an RNA with genomic targets. Here, we present BaRDIC-Binomial RNA-DNA Interaction Caller-a tailored method to detect peaks in both types of RNA-DNA interaction data. BaRDIC is the first tool to simultaneously take into account the two most prominent biases in the data: chromatin heterogeneity and distance-dependent decay of interaction frequency. Since RNAs differ in their interaction preferences, BaRDIC adapts peak sizes according to the abundances and contact patterns of individual RNAs. These features enable BaRDIC to make more robust predictions than currently applied peak-calling algorithms and better handle the characteristic sparsity of all-to-all data. The BaRDIC package is freely available at https://github.com/dmitrymyl/BaRDIC.
与染色质相关的非编码RNA通过靶向基因组位点在各种细胞过程中发挥重要作用。存在两种全基因组NGS实验来检测此类靶点:“一对一”实验,专注于单个RNA的靶点;以及“全对一”实验,捕获样本中所有RNA的靶点。与许多NGS实验一样,它们容易出现偏差和噪声,因此检测RNA与基因组靶点的“峰”——特异性相互作用变得至关重要。在这里,我们介绍了BaRDIC——二项式RNA-DNA相互作用调用器——一种用于检测两种类型RNA-DNA相互作用数据中峰的定制方法。BaRDIC是第一个同时考虑数据中两个最突出偏差的工具:染色质异质性和相互作用频率的距离依赖性衰减。由于RNA的相互作用偏好不同,BaRDIC根据单个RNA的丰度和接触模式调整峰大小。这些特性使BaRDIC能够做出比当前应用的峰调用算法更稳健的预测,并更好地处理全对全数据的特征稀疏性。BaRDIC软件包可在https://github.com/dmitrymyl/BaRDIC上免费获取。