Suppr超能文献

ProbeAlign:将高通量测序结构探测信息纳入 ncRNA 同源搜索。

ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search.

出版信息

BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S15. doi: 10.1186/1471-2105-15-S9-S15. Epub 2014 Sep 10.

Abstract

BACKGROUND

Recent advances in RNA structure probing technologies, including the ones based on high-throughput sequencing, have improved the accuracy of thermodynamic folding with quantitative nucleotide-resolution structural information.

RESULTS

In this paper, we present a novel approach, ProbeAlign, to incorporate the reactivities from high-throughput RNA structure probing into ncRNA homology search for functional annotation. To reduce the overhead of structure alignment on large-scale data, the specific pairing patterns in the query sequences are ignored. On the other hand, the partial structural information of the target sequences embedded in probing data is retrieved to guide the alignment. Thus the structure alignment problem is transformed into a sequence alignment problem with additional reactivity information. The benchmark results show that the prediction accuracy of ProbeAlign outperforms filter-based CMsearch with high computational efficiency. The application of ProbeAlign to the FragSeq data, which is based on genome-wide structure probing, has demonstrated its capability to search ncRNAs in a large-scale dataset from high-throughput sequencing.

CONCLUSIONS

By incorporating high-throughput sequencing-based structure probing information, ProbeAlign can improve the accuracy and efficiency of ncRNA homology search. It is a promising tool for ncRNA functional annotation on genome-wide datasets.

AVAILABILITY

The source code of ProbeAlign is available at http://genome.ucf.edu/ProbeAlign.

摘要

背景

基于高通量测序的 RNA 结构探测技术的最新进展提高了热力学折叠的准确性,提供了定量核苷酸分辨率的结构信息。

结果

本文提出了一种新方法 ProbeAlign,将高通量 RNA 结构探测的反应性纳入 ncRNA 同源搜索中,以进行功能注释。为了降低大规模数据结构比对的开销,忽略了查询序列中的特定配对模式。另一方面,从探测数据中检索目标序列中嵌入的部分结构信息来指导比对。因此,结构比对问题转化为具有附加反应性信息的序列比对问题。基准测试结果表明,ProbeAlign 的预测准确性优于具有高效计算能力的基于过滤的 CMsearch。ProbeAlign 应用于 FragSeq 数据,该数据基于全基因组结构探测,证明了它在高通量测序的大规模数据集搜索 ncRNA 的能力。

结论

通过整合基于高通量测序的结构探测信息,ProbeAlign 可以提高 ncRNA 同源搜索的准确性和效率。它是一种用于全基因组数据集 ncRNA 功能注释的有前途的工具。

可用性

ProbeAlign 的源代码可在 http://genome.ucf.edu/ProbeAlign 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bf4/4168714/1b8dd7e21e69/1471-2105-15-S9-S15-1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验