• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用广义质心估计器改进RNA结构保守性的测量。

Improved measurements of RNA structure conservation with generalized centroid estimators.

作者信息

Okada Yohei, Saito Yutaka, Sato Kengo, Sakakibara Yasubumi

机构信息

Department of Biosciences and Informatics, Keio University Yokohama, Japan.

出版信息

Front Genet. 2011 Aug 31;2:54. doi: 10.3389/fgene.2011.00054. eCollection 2011.

DOI:10.3389/fgene.2011.00054
PMID:22303350
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3268607/
Abstract

Identification of non-protein-coding RNAs (ncRNAs) in genomes is a crucial task for not only molecular cell biology but also bioinformatics. Secondary structures of ncRNAs are employed as a key feature of ncRNA analysis since biological functions of ncRNAs are deeply related to their secondary structures. Although the minimum free energy (MFE) structure of an RNA sequence is regarded as the most stable structure, MFE alone could not be an appropriate measure for identifying ncRNAs since the free energy is heavily biased by the nucleotide composition. Therefore, instead of MFE itself, several alternative measures for identifying ncRNAs have been proposed such as the structure conservation index (SCI) and the base pair distance (BPD), both of which employ MFE structures. However, these measurements are unfortunately not suitable for identifying ncRNAs in some cases including the genome-wide search and incur high false discovery rate. In this study, we propose improved measurements based on SCI and BPD, applying generalized centroid estimators to incorporate the robustness against low quality multiple alignments. Our experiments show that our proposed methods achieve higher accuracy than the original SCI and BPD for not only human-curated structural alignments but also low quality alignments produced by CLUSTAL W. Furthermore, the centroid-based SCI on CLUSTAL W alignments is more accurate than or comparable with that of the original SCI on structural alignments generated with RAF, a high quality structural aligner, for which twofold expensive computational time is required on average. We conclude that our methods are more suitable for genome-wide alignments which are of low quality from the point of view on secondary structures than the original SCI and BPD.

摘要

在基因组中识别非蛋白质编码RNA(ncRNA)不仅是分子细胞生物学的一项关键任务,也是生物信息学的关键任务。ncRNA的二级结构被用作ncRNA分析的一个关键特征,因为ncRNA的生物学功能与其二级结构密切相关。尽管RNA序列的最小自由能(MFE)结构被认为是最稳定的结构,但仅MFE本身并不能作为识别ncRNA的合适指标,因为自由能受核苷酸组成的影响很大。因此,除了MFE本身,还提出了几种识别ncRNA的替代指标,如结构保守指数(SCI)和碱基对距离(BPD),这两种指标都采用MFE结构。然而,不幸的是,这些测量方法在某些情况下(包括全基因组搜索)并不适合识别ncRNA,并且会导致较高的错误发现率。在本研究中,我们提出了基于SCI和BPD的改进测量方法,应用广义质心估计器来提高对低质量多重比对的鲁棒性。我们的实验表明,我们提出的方法不仅在人工编辑的结构比对上,而且在CLUSTAL W生成的低质量比对上,都比原始的SCI和BPD具有更高的准确性。此外,基于质心的SCI在CLUSTAL W比对上比在由高质量结构比对工具RAF生成的结构比对上的原始SCI更准确或相当,而RAF平均需要两倍的计算时间。我们得出结论,从二级结构的角度来看,我们的方法比原始的SCI和BPD更适合低质量的全基因组比对。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b0/3268607/976ef17859ce/fgene-02-00054-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b0/3268607/30ab03c00acc/fgene-02-00054-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b0/3268607/9d8932843d39/fgene-02-00054-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b0/3268607/9a8b58a1d429/fgene-02-00054-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b0/3268607/976ef17859ce/fgene-02-00054-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b0/3268607/30ab03c00acc/fgene-02-00054-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b0/3268607/9d8932843d39/fgene-02-00054-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b0/3268607/9a8b58a1d429/fgene-02-00054-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c3b0/3268607/976ef17859ce/fgene-02-00054-g004.jpg

相似文献

1
Improved measurements of RNA structure conservation with generalized centroid estimators.使用广义质心估计器改进RNA结构保守性的测量。
Front Genet. 2011 Aug 31;2:54. doi: 10.3389/fgene.2011.00054. eCollection 2011.
2
Improvement of structure conservation index with centroid estimators.使用质心估计器改进结构保守指数。
Pac Symp Biocomput. 2010:88-97. doi: 10.1142/9789814295291_0011.
3
Prediction of RNA secondary structure using generalized centroid estimators.使用广义质心估计器预测RNA二级结构。
Bioinformatics. 2009 Feb 15;25(4):465-73. doi: 10.1093/bioinformatics/btn601. Epub 2008 Dec 18.
4
Considerations in the identification of functional RNA structural elements in genomic alignments.基因组比对中功能性RNA结构元件识别的考量因素。
BMC Bioinformatics. 2007 Jan 30;8:33. doi: 10.1186/1471-2105-8-33.
5
A local multiple alignment method for detection of non-coding RNA sequences.一种用于检测非编码RNA序列的局部多重比对方法。
Bioinformatics. 2009 Jun 15;25(12):1498-505. doi: 10.1093/bioinformatics/btp261. Epub 2009 Apr 17.
6
Discovery of Novel ncRNA Sequences in Multiple Genome Alignments on the Basis of Conserved and Stable Secondary Structures.基于保守且稳定的二级结构在多个基因组比对中发现新型非编码RNA序列
PLoS One. 2015 Jun 15;10(6):e0130200. doi: 10.1371/journal.pone.0130200. eCollection 2015.
7
Detection of RNA structures in porcine EST data and related mammals.在猪的EST数据及相关哺乳动物中检测RNA结构
BMC Genomics. 2007 Sep 10;8:316. doi: 10.1186/1471-2164-8-316.
8
Analysis of energy-based algorithms for RNA secondary structure prediction.基于能量的 RNA 二级结构预测算法分析。
BMC Bioinformatics. 2012 Feb 1;13:22. doi: 10.1186/1471-2105-13-22.
9
Sampled ensemble neutrality as a feature to classify potential structured RNAs.将采样总体中性作为一种特征来对潜在的结构化RNA进行分类。
BMC Genomics. 2015 Feb 5;16(1):35. doi: 10.1186/s12864-014-1203-8.
10
RNAdetect: efficient computational detection of novel non-coding RNAs.RNAdetect:高效计算新型非编码 RNA 的检测。
Bioinformatics. 2019 Apr 1;35(7):1133-1141. doi: 10.1093/bioinformatics/bty765.

本文引用的文献

1
Computational approaches for RNA energy parameter estimation.计算方法在 RNA 能量参数估计中的应用。
RNA. 2010 Dec;16(12):2304-18. doi: 10.1261/rna.1950510. Epub 2010 Oct 12.
2
Improving the accuracy of predicting secondary structure for aligned RNA sequences.提高 RNA 序列比对二级结构预测的准确性。
Nucleic Acids Res. 2011 Jan;39(2):393-402. doi: 10.1093/nar/gkq792. Epub 2010 Sep 15.
3
Computational prediction of novel non-coding RNAs in Arabidopsis thaliana.拟南芥中新型非编码RNA的计算预测
BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S36. doi: 10.1186/1471-2105-10-S1-S36.
4
Prediction of RNA secondary structure using generalized centroid estimators.使用广义质心估计器预测RNA二级结构。
Bioinformatics. 2009 Feb 15;25(4):465-73. doi: 10.1093/bioinformatics/btn601. Epub 2008 Dec 18.
5
RNAalifold: improved consensus structure prediction for RNA alignments.RNAalifold:改进的RNA比对共有结构预测
BMC Bioinformatics. 2008 Nov 11;9:474. doi: 10.1186/1471-2105-9-474.
6
A max-margin model for efficient simultaneous alignment and folding of RNA sequences.一种用于RNA序列高效同时比对和折叠的最大间隔模型。
Bioinformatics. 2008 Jul 1;24(13):i68-76. doi: 10.1093/bioinformatics/btn177.
7
Dinucleotide controlled null models for comparative RNA gene prediction.用于比较RNA基因预测的二核苷酸控制空模型
BMC Bioinformatics. 2008 May 27;9:248. doi: 10.1186/1471-2105-9-248.
8
Centroid estimation in discrete high-dimensional spaces with applications in biology.离散高维空间中的质心估计及其在生物学中的应用
Proc Natl Acad Sci U S A. 2008 Mar 4;105(9):3209-14. doi: 10.1073/pnas.0712329105. Epub 2008 Feb 27.
9
Strategies for measuring evolutionary conservation of RNA secondary structures.测量RNA二级结构进化保守性的策略。
BMC Bioinformatics. 2008 Feb 26;9:122. doi: 10.1186/1471-2105-9-122.
10
Genome-wide discovery and verification of novel structured RNAs in Plasmodium falciparum.恶性疟原虫新型结构化RNA的全基因组发现与验证
Genome Res. 2008 Feb;18(2):281-92. doi: 10.1101/gr.6836108. Epub 2007 Dec 20.