• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ProbeAlign:将高通量测序结构探测信息纳入 ncRNA 同源搜索。

ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search.

出版信息

BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S15. doi: 10.1186/1471-2105-15-S9-S15. Epub 2014 Sep 10.

DOI:10.1186/1471-2105-15-S9-S15
PMID:25253206
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4168714/
Abstract

BACKGROUND

Recent advances in RNA structure probing technologies, including the ones based on high-throughput sequencing, have improved the accuracy of thermodynamic folding with quantitative nucleotide-resolution structural information.

RESULTS

In this paper, we present a novel approach, ProbeAlign, to incorporate the reactivities from high-throughput RNA structure probing into ncRNA homology search for functional annotation. To reduce the overhead of structure alignment on large-scale data, the specific pairing patterns in the query sequences are ignored. On the other hand, the partial structural information of the target sequences embedded in probing data is retrieved to guide the alignment. Thus the structure alignment problem is transformed into a sequence alignment problem with additional reactivity information. The benchmark results show that the prediction accuracy of ProbeAlign outperforms filter-based CMsearch with high computational efficiency. The application of ProbeAlign to the FragSeq data, which is based on genome-wide structure probing, has demonstrated its capability to search ncRNAs in a large-scale dataset from high-throughput sequencing.

CONCLUSIONS

By incorporating high-throughput sequencing-based structure probing information, ProbeAlign can improve the accuracy and efficiency of ncRNA homology search. It is a promising tool for ncRNA functional annotation on genome-wide datasets.

AVAILABILITY

The source code of ProbeAlign is available at http://genome.ucf.edu/ProbeAlign.

摘要

背景

基于高通量测序的 RNA 结构探测技术的最新进展提高了热力学折叠的准确性,提供了定量核苷酸分辨率的结构信息。

结果

本文提出了一种新方法 ProbeAlign,将高通量 RNA 结构探测的反应性纳入 ncRNA 同源搜索中,以进行功能注释。为了降低大规模数据结构比对的开销,忽略了查询序列中的特定配对模式。另一方面,从探测数据中检索目标序列中嵌入的部分结构信息来指导比对。因此,结构比对问题转化为具有附加反应性信息的序列比对问题。基准测试结果表明,ProbeAlign 的预测准确性优于具有高效计算能力的基于过滤的 CMsearch。ProbeAlign 应用于 FragSeq 数据,该数据基于全基因组结构探测,证明了它在高通量测序的大规模数据集搜索 ncRNA 的能力。

结论

通过整合基于高通量测序的结构探测信息,ProbeAlign 可以提高 ncRNA 同源搜索的准确性和效率。它是一种用于全基因组数据集 ncRNA 功能注释的有前途的工具。

可用性

ProbeAlign 的源代码可在 http://genome.ucf.edu/ProbeAlign 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bf4/4168714/bd48f28dc5ef/1471-2105-15-S9-S15-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bf4/4168714/1b8dd7e21e69/1471-2105-15-S9-S15-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bf4/4168714/6fcb8c64b65b/1471-2105-15-S9-S15-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bf4/4168714/5a40f56710c6/1471-2105-15-S9-S15-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bf4/4168714/605fa2e38d50/1471-2105-15-S9-S15-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bf4/4168714/bd48f28dc5ef/1471-2105-15-S9-S15-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bf4/4168714/1b8dd7e21e69/1471-2105-15-S9-S15-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bf4/4168714/6fcb8c64b65b/1471-2105-15-S9-S15-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bf4/4168714/5a40f56710c6/1471-2105-15-S9-S15-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bf4/4168714/605fa2e38d50/1471-2105-15-S9-S15-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7bf4/4168714/bd48f28dc5ef/1471-2105-15-S9-S15-5.jpg

相似文献

1
ProbeAlign: incorporating high-throughput sequencing-based structure probing information into ncRNA homology search.ProbeAlign:将高通量测序结构探测信息纳入 ncRNA 同源搜索。
BMC Bioinformatics. 2014;15 Suppl 9(Suppl 9):S15. doi: 10.1186/1471-2105-15-S9-S15. Epub 2014 Sep 10.
2
Structure-based whole-genome realignment reveals many novel noncoding RNAs.基于结构的全基因组重排揭示了许多新的非编码 RNA。
Genome Res. 2013 Jun;23(6):1018-27. doi: 10.1101/gr.137091.111. Epub 2013 Jan 7.
3
Chain-RNA: a comparative ncRNA search tool based on the two-dimensional chain algorithm.链 RNA:一种基于二维链算法的比较 ncRNA 搜索工具。
IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):274-85. doi: 10.1109/TCBB.2012.137.
4
Genome-scale NCRNA homology search using a Hamming distance-based filtration strategy.基于汉明距离过滤策略的全基因组非编码 RNA 同源搜索。
BMC Bioinformatics. 2012 Mar 21;13 Suppl 3(Suppl 3):S12. doi: 10.1186/1471-2105-13-S3-S12.
5
Designing secondary structure profiles for fast ncRNA identification.设计用于快速鉴定非编码RNA的二级结构图谱。
Comput Syst Bioinformatics Conf. 2008;7:145-56.
6
Efficient known ncRNA search including pseudoknots.高效已知 ncRNA 搜索包括假结。
BMC Bioinformatics. 2013;14 Suppl 2(Suppl 2):S25. doi: 10.1186/1471-2105-14-S2-S25. Epub 2013 Jan 21.
7
Improved accuracy of multiple ncRNA alignment by incorporating structural information into a MAFFT-based framework.通过将结构信息纳入基于MAFFT的框架提高多种非编码RNA比对的准确性。
BMC Bioinformatics. 2008 Apr 25;9:212. doi: 10.1186/1471-2105-9-212.
8
TurboFold: iterative probabilistic estimation of secondary structures for multiple RNA sequences.TurboFold:用于多个 RNA 序列的二级结构的迭代概率估计。
BMC Bioinformatics. 2011 Apr 20;12:108. doi: 10.1186/1471-2105-12-108.
9
ncRNA consensus secondary structure derivation using grammar strings.使用语法字符串推导ncRNA共有二级结构
J Bioinform Comput Biol. 2011 Apr;9(2):317-37. doi: 10.1142/s0219720011005501.
10
Structural alignment of RNA with triple helix structure.具有三螺旋结构的RNA的结构比对。
J Comput Biol. 2012 Apr;19(4):365-78. doi: 10.1089/cmb.2010.0052.

引用本文的文献

1
GraphClust2: Annotation and discovery of structured RNAs with scalable and accessible integrative clustering.GraphClust2:具有可扩展和可访问的集成聚类功能的结构化 RNA 的注释和发现。
Gigascience. 2019 Dec 1;8(12). doi: 10.1093/gigascience/giz150.
2
Probing-directed identification of novel structured RNAs.探针导向的新型结构化RNA鉴定
RNA Biol. 2016;13(2):232-42. doi: 10.1080/15476286.2015.1132140.
3
Computational analysis of RNA structures with chemical probing data.利用化学探针数据对RNA结构进行计算分析。

本文引用的文献

1
In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features.在体全基因组 RNA 二级结构谱分析揭示新的调控特征。
Nature. 2014 Jan 30;505(7485):696-700. doi: 10.1038/nature12756. Epub 2013 Nov 24.
2
Infernal 1.1: 100-fold faster RNA homology searches. Infernal 1.1:100 倍更快的 RNA 同源性搜索。
Bioinformatics. 2013 Nov 15;29(22):2933-5. doi: 10.1093/bioinformatics/btt509. Epub 2013 Sep 4.
3
Non-polyadenylated transcription in embryonic stem cells reveals novel non-coding RNA related to pluripotency and differentiation.
Methods. 2015 Jun;79-80:60-6. doi: 10.1016/j.ymeth.2015.02.003. Epub 2015 Feb 14.
胚胎干细胞中的非多聚腺苷酸化转录揭示了与多能性和分化相关的新型非编码 RNA。
Nucleic Acids Res. 2013 Jul;41(12):6300-15. doi: 10.1093/nar/gkt316. Epub 2013 Apr 29.
4
Incorporating phylogenetic-based covarying mutations into RNAalifold for RNA consensus structure prediction.将基于系统发育的共变异突变纳入 RNAalifold 以预测 RNA 共识结构。
BMC Bioinformatics. 2013 Apr 27;14:142. doi: 10.1186/1471-2105-14-142.
5
Evaluating the accuracy of SHAPE-directed RNA secondary structure predictions.评估 SHAPE 指导的 RNA 二级结构预测的准确性。
Nucleic Acids Res. 2013 Mar 1;41(5):2807-16. doi: 10.1093/nar/gks1283. Epub 2013 Jan 15.
6
Rfam 11.0: 10 years of RNA families.RFAM 11.0:10 年的 RNA 家族。
Nucleic Acids Res. 2013 Jan;41(Database issue):D226-32. doi: 10.1093/nar/gks1005. Epub 2012 Nov 3.
7
Integrating chemical footprinting data into RNA secondary structure prediction.将化学足迹数据整合到 RNA 二级结构预测中。
PLoS One. 2012;7(10):e45160. doi: 10.1371/journal.pone.0045160. Epub 2012 Oct 16.
8
An integrated encyclopedia of DNA elements in the human genome.人类基因组中 DNA 元件的综合百科全书。
Nature. 2012 Sep 6;489(7414):57-74. doi: 10.1038/nature11247.
9
Global analysis of RNA secondary structure in two metazoans.两种后生动物的 RNA 二级结构的全局分析。
Cell Rep. 2012 Jan 26;1(1):69-82. doi: 10.1016/j.celrep.2011.10.002.
10
Analysis of energy-based algorithms for RNA secondary structure prediction.基于能量的 RNA 二级结构预测算法分析。
BMC Bioinformatics. 2012 Feb 1;13:22. doi: 10.1186/1471-2105-13-22.