Suppr超能文献

PATO:lncRNA-DNA 三螺旋的全基因组预测。

PATO: genome-wide prediction of lncRNA-DNA triple helices.

机构信息

Computer Architecture Group, Department of Computer Engineering, CITIC, Universidade da Coruña, Campus de Elviña, A Coruña 15071, Spain.

出版信息

Bioinformatics. 2023 Mar 1;39(3). doi: 10.1093/bioinformatics/btad134.

Abstract

MOTIVATION

Long non-coding RNA (lncRNA) plays a key role in many biological processes. For instance, lncRNA regulates chromatin using different molecular mechanisms, including direct RNA-DNA hybridization via triplexes, cotranscriptional RNA-RNA interactions, and RNA-DNA binding mediated by protein complexes. While the functional annotation of lncRNA transcripts has been widely studied over the last 20 years, barely a handful of tools have been developed with the specific purpose of detecting and evaluating lncRNA-DNA triple helices. What is worse, some of these tools have nearly grown a decade old, making new triplex-centric pipelines depend on legacy software that cannot thoroughly process all the data made available by next-generation sequencing (NGS) technologies.

RESULTS

We present PATO, a modern, fast, and efficient tool for the detection of lncRNA-DNA triplexes that matches NGS processing capabilities. PATO enables the prediction of triple helices at the genome scale and can process in as little as 1 h more than 60 GB of sequence data using a two-socket server. Moreover, PATO's efficiency allows a more exhaustive search of the triplex-forming solution space, and so PATO achieves higher levels of prediction accuracy in far less time than other tools in the state of the art.

AVAILABILITY AND IMPLEMENTATION

Source code, user manual, and tests are freely available to download under the MIT License at https://github.com/UDC-GAC/pato.

摘要

动机

长非编码 RNA(lncRNA)在许多生物过程中发挥着关键作用。例如,lncRNA 通过三链体、共转录 RNA-RNA 相互作用和 RNA-DNA 结合蛋白复合物等不同的分子机制来调节染色质。虽然 lncRNA 转录本的功能注释在过去 20 年中得到了广泛的研究,但仅有少数工具是专门为检测和评估 lncRNA-DNA 三螺旋而开发的。更糟糕的是,其中一些工具已经几乎有十年的历史了,使得新的三螺旋中心管道依赖于遗留软件,这些软件无法彻底处理下一代测序 (NGS) 技术提供的所有数据。

结果

我们提出了 PATO,这是一种用于检测 lncRNA-DNA 三螺旋的现代、快速和高效的工具,它与 NGS 处理能力相匹配。PATO 能够在基因组范围内预测三螺旋,并使用双插槽服务器在短短 1 小时内处理超过 60GB 的序列数据。此外,PATO 的效率允许对三螺旋形成解决方案空间进行更详尽的搜索,因此 PATO 可以在比其他现有工具短得多的时间内达到更高的预测准确性水平。

可用性和实现

源代码、用户手册和测试可在 MIT 许可证下免费下载,网址为 https://github.com/UDC-GAC/pato。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/23dd/10049783/f26e3bbdd9bf/btad134f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验