• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

设计用于快速鉴定非编码RNA的二级结构图谱。

Designing secondary structure profiles for fast ncRNA identification.

作者信息

Sun Yanni, Buhler Jeremy

机构信息

Department of Computer Science and Engineering, Washington University, St. Louis, MO 63130, USA.

出版信息

Comput Syst Bioinformatics Conf. 2008;7:145-56.

PMID:19642276
Abstract

Detecting non-coding RNAs (ncRNAs) in genomic DNA is an important part of annotation. However, the most widely used tool for modeling ncRNA families, the covariance model (CM), incurs a high computational cost when used for search. This cost can be reduced by using a filter to exclude sequence that is unlikely to contain the ncRNA of interest, applying the CM only where it is likely to match strongly. Despite recent advances, designing an efficient filter that can detect nearly all ncRNA instances while excluding most irrelevant sequences remains challenging. This work proposes a systematic procedure to convert a CM for an ncRNA family to a secondary structure profile (SSP), which augments a conservation profile with secondary structure information but can still be efficiently scanned against long sequences. We use dynamic programming to estimate an SSP's sensitivity and FP rate, yielding an efficient, fully automated filter design algorithm. Our experiments demonstrate that designed SSP filters can achieve significant speedup over unfiltered CM search while maintaining high sensitivity for various ncRNA families, including those with and without strong sequence conservation. For highly structured ncRNA families, including secondary structure conservation yields better performance than using primary sequence conservation alone.

摘要

在基因组DNA中检测非编码RNA(ncRNA)是注释工作的重要组成部分。然而,用于对ncRNA家族进行建模的最广泛使用的工具——协方差模型(CM),在用于搜索时会产生高昂的计算成本。通过使用过滤器排除不太可能包含感兴趣的ncRNA的序列,仅在可能强烈匹配的地方应用CM,可以降低这种成本。尽管最近取得了进展,但设计一种能够检测几乎所有ncRNA实例同时排除大多数无关序列的高效过滤器仍然具有挑战性。这项工作提出了一种系统的程序,将ncRNA家族的CM转换为二级结构概况(SSP),它用二级结构信息增强了保守概况,但仍能有效地针对长序列进行扫描。我们使用动态规划来估计SSP的灵敏度和假阳性率,从而产生一种高效、全自动的过滤器设计算法。我们的实验表明,设计的SSP过滤器在保持对各种ncRNA家族(包括具有和不具有强序列保守性的家族)高灵敏度的同时,与未过滤的CM搜索相比,可以实现显著的加速。对于高度结构化的ncRNA家族,纳入二级结构保守性比仅使用一级序列保守性产生更好的性能。

相似文献

1
Designing secondary structure profiles for fast ncRNA identification.设计用于快速鉴定非编码RNA的二级结构图谱。
Comput Syst Bioinformatics Conf. 2008;7:145-56.
2
Designing filters for fast-known NcRNA identification.设计用于快速已知 NcRNA 鉴定的滤波器。
IEEE/ACM Trans Comput Biol Bioinform. 2012 May-Jun;9(3):774-87. doi: 10.1109/TCBB.2011.149.
3
A sequence-based filtering method for ncRNA identification and its application to searching for riboswitch elements.一种基于序列的非编码RNA识别过滤方法及其在核糖开关元件搜索中的应用。
Bioinformatics. 2006 Jul 15;22(14):e557-65. doi: 10.1093/bioinformatics/btl232.
4
Exploiting conserved structure for faster annotation of non-coding RNAs without loss of accuracy.利用保守结构在不损失准确性的情况下更快地注释非编码RNA。
Bioinformatics. 2004 Aug 4;20 Suppl 1:i334-41. doi: 10.1093/bioinformatics/bth925.
5
Chain-RNA: a comparative ncRNA search tool based on the two-dimensional chain algorithm.链 RNA:一种基于二维链算法的比较 ncRNA 搜索工具。
IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):274-85. doi: 10.1109/TCBB.2012.137.
6
A local multiple alignment method for detection of non-coding RNA sequences.一种用于检测非编码RNA序列的局部多重比对方法。
Bioinformatics. 2009 Jun 15;25(12):1498-505. doi: 10.1093/bioinformatics/btp261. Epub 2009 Apr 17.
7
Structural alignment of pseudoknotted RNA.假结RNA的结构比对
J Comput Biol. 2008 Jun;15(5):489-504. doi: 10.1089/cmb.2007.0214.
8
Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change.基于预测的二级结构形成自由能变化检测非编码RNA。
BMC Bioinformatics. 2006 Mar 27;7:173. doi: 10.1186/1471-2105-7-173.
9
Structural alignment of RNA with triple helix structure.具有三螺旋结构的RNA的结构比对。
J Comput Biol. 2012 Apr;19(4):365-78. doi: 10.1089/cmb.2010.0052.
10
Sequence-based heuristics for faster annotation of non-coding RNA families.基于序列的启发式方法,用于更快地注释非编码RNA家族。
Bioinformatics. 2006 Jan 1;22(1):35-9. doi: 10.1093/bioinformatics/bti743. Epub 2005 Nov 2.

引用本文的文献

1
Genome-wide transcriptome analysis shows extensive alternative RNA splicing in the zoonotic parasite Schistosoma japonicum.全基因组转录组分析显示,人畜共患寄生虫日本血吸虫存在广泛的可变RNA剪接。
BMC Genomics. 2014 Aug 26;15(1):715. doi: 10.1186/1471-2164-15-715.
2
Fast filtering for RNA homology search.快速过滤 RNA 同源搜索。
Bioinformatics. 2011 Nov 15;27(22):3102-9. doi: 10.1093/bioinformatics/btr545. Epub 2011 Sep 28.
3
Rfam: Wikipedia, clans and the "decimal" release.Rfam:维基百科、家族及“十进制”版本。
Nucleic Acids Res. 2011 Jan;39(Database issue):D141-5. doi: 10.1093/nar/gkq1129. Epub 2010 Nov 9.