Improvement of structure conservation index with centroid estimators.

Author information

Okada Yohei, Sato Kengo, Sakakibara Yasubumi

Affiliation

Department of Biosciences and Informatics, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama, Kanagawa 223-8522, Japan.

Publication information

Pac Symp Biocomput. 2010:88-97. doi: 10.1142/9789814295291_0011.

DOI: 10.1142/9789814295291_0011
PMID: 19908361
Abstract

RNAz, a support vector machine (SVM) approach for identifying functional non-coding RNAs (ncRNAs), has proven to be one of the most accurate tools for this task. Among the measurements used in RNAz, the Structure Conservation Index (SCI), which evaluates the evolutionary conservation of RNA secondary structures in terms of folding energies, has been reported to have extremely high discrimination capability. In genome-wide searches with RNAz, however, a relatively high false discovery rate has been estimated. A plausible cause is that multiple alignments produced by a standard aligner that ignores secondary structure are, in some cases, unsuitable for identifying ncRNAs and thus lead to a high false discovery rate. In this study, we propose C-SCI, an improved measurement based on the SCI that applies gamma-centroid estimators to gain robustness against low-quality multiple alignments. Our experiments show that the C-SCI achieves higher accuracy than the original SCI not only on human-curated structural alignments but also on low-quality alignments produced by CLUSTAL W. Furthermore, the accuracy of the C-SCI on CLUSTAL W alignments is comparable to that of the original SCI on structural alignments generated with RAF, which takes 4.7 times as long to compute on average.
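
For readers unfamiliar with the SCI, the sketch below illustrates the original index as it is commonly defined in the RNAz literature: the consensus folding energy of the alignment divided by the mean minimum free energy of the individual sequences. That definition is not spelled out in the abstract, so treat it as background rather than as the authors' code. The function name and the energy values are hypothetical, energies are passed in as plain numbers (no folding library is called), and the C-SCI itself is not reproduced because the abstract does not give its formula.

```python
from statistics import mean


def structure_conservation_index(consensus_energy, single_energies):
    """Return SCI = E_consensus / mean(E_single).

    Energies are in kcal/mol, where more negative means more stable.
    A value near 1 suggests the sequences fold into a shared structure
    about as well as they fold individually; a value near 0 suggests
    no conserved structure.
    """
    if not single_energies:
        raise ValueError("need at least one single-sequence energy")
    avg = mean(single_energies)
    if avg == 0:
        # Assumption of this sketch: if no sequence folds stably on its
        # own, report 0 rather than dividing by zero.
        return 0.0
    return consensus_energy / avg


# Hypothetical energies for a three-sequence alignment.
single = [-30.0, -29.0, -31.0]

# Consensus structure nearly as stable as the individual folds.
sci_conserved = structure_conservation_index(-28.0, single)

# Consensus structure much weaker than the individual folds.
sci_unconserved = structure_conservation_index(-3.0, single)

print(f"conserved:   {sci_conserved:.3f}")
print(f"unconserved: {sci_unconserved:.3f}")
```

Because the consensus energy is computed on the alignment, a misaligned column can break a conserved base pair and pull the ratio toward 0, which is the sensitivity to alignment quality that the abstract describes.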

Similar articles

1
Improvement of structure conservation index with centroid estimators.
Pac Symp Biocomput. 2010:88-97. doi: 10.1142/9789814295291_0011.
2
Improved measurements of RNA structure conservation with generalized centroid estimators.
Front Genet. 2011 Aug 31;2:54. doi: 10.3389/fgene.2011.00054. eCollection 2011.
3
RNAz 2.0: improved noncoding RNA detection.
Pac Symp Biocomput. 2010:69-79.
4
Discovery of Novel ncRNA Sequences in Multiple Genome Alignments on the Basis of Conserved and Stable Secondary Structures.
PLoS One. 2015 Jun 15;10(6):e0130200. doi: 10.1371/journal.pone.0130200. eCollection 2015.
5
Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change.
BMC Bioinformatics. 2006 Mar 27;7:173. doi: 10.1186/1471-2105-7-173.
6
A local multiple alignment method for detection of non-coding RNA sequences.
Bioinformatics. 2009 Jun 15;25(12):1498-505. doi: 10.1093/bioinformatics/btp261. Epub 2009 Apr 17.
7
Identifying structural noncoding RNAs using RNAz.
Curr Protoc Bioinformatics. 2007 Sep;Chapter 12:Unit 12.7. doi: 10.1002/0471250953.bi1207s19.
8
Structure-based whole-genome realignment reveals many novel noncoding RNAs.
Genome Res. 2013 Jun;23(6):1018-27. doi: 10.1101/gr.137091.111. Epub 2013 Jan 7.
9
Robust and accurate prediction of noncoding RNAs from aligned sequences.
BMC Bioinformatics. 2010 Oct 15;11 Suppl 7(Suppl 7):S3. doi: 10.1186/1471-2105-11-S7-S3.
10
LocARNA-P: accurate boundary prediction and improved detection of structural RNAs.
RNA. 2012 May;18(5):900-14. doi: 10.1261/rna.029041.111. Epub 2012 Mar 26.

Cited by

1
Generalized centroid estimators in bioinformatics.
PLoS One. 2011 Feb 18;6(2):e16450. doi: 10.1371/journal.pone.0016450.
2
Improving the accuracy of predicting secondary structure for aligned RNA sequences.
Nucleic Acids Res. 2011 Jan;39(2):393-402. doi: 10.1093/nar/gkq792. Epub 2010 Sep 15.