Local RNA structure alignment with incomplete sequence.

Authors

Kolbe Diana L, Eddy Sean R

Affiliation

HHMI Janelia Farm Research Campus, Ashburn, VA 20147, USA.

Publication

Bioinformatics. 2009 May 15;25(10):1236-43. doi: 10.1093/bioinformatics/btp154. Epub 2009 Mar 20.

DOI:10.1093/bioinformatics/btp154
PMID:19304875
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2677745/

Abstract

MOTIVATION

Accuracy of automated structural RNA alignment is improved by using models that consider not only primary sequence but also secondary structure information. However, current RNA structural alignment approaches tend to perform poorly on incomplete sequence fragments, such as single reads from metagenomic environmental surveys, because nucleotides that are expected to be base paired are missing.
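
To make the fragment problem concrete, the following minimal Python sketch counts how many positions in a sequence fragment lose their expected base-pairing partner. It is an illustration only, not the paper's method: the dot-bracket consensus structure, the function names `pair_table` and `classify_fragment`, and the fragment coordinates are all hypothetical and chosen for the example.

```python
def pair_table(dot_bracket):
    """Map each paired position to its partner (0-based) in a nested structure."""
    stack, partner = [], {}
    for i, ch in enumerate(dot_bracket):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            if not stack:
                raise ValueError("unbalanced structure: extra ')'")
            j = stack.pop()
            partner[i], partner[j] = j, i
    if stack:
        raise ValueError("unbalanced structure: extra '('")
    return partner


def classify_fragment(dot_bracket, start, end):
    """Classify positions of the fragment [start, end) by pairing status.

    intact_paired:   paired, and the partner also lies inside the fragment
    orphaned_paired: paired in the full structure, but the partner is missing
    unpaired:        single-stranded in the full structure
    """
    partner = pair_table(dot_bracket)
    counts = {"intact_paired": 0, "orphaned_paired": 0, "unpaired": 0}
    for i in range(start, end):
        if i not in partner:
            counts["unpaired"] += 1
        elif start <= partner[i] < end:
            counts["intact_paired"] += 1
        else:
            counts["orphaned_paired"] += 1
    return counts


# Hypothetical consensus structure with two hairpins (21 positions).
structure = "((((....))))..((...))"

# A "read" that starts inside the first hairpin: the 3' half of that stem
# is present, but its 5' partners are missing.
print(classify_fragment(structure, 6, 21))
```

In this toy case the four positions on the 3' side of the first stem are orphaned, which is the situation in which a model that expects both partners of a base pair has nothing to score on one side.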

RESULTS

We present a local RNA structural alignment algorithm, trCYK, for aligning and scoring incomplete sequences under a model using primary sequence conservation and secondary structure information when possible. The trCYK algorithm improves alignment accuracy and coverage of sequence fragments of structural RNAs in simulated metagenomic shotgun datasets.

AVAILABILITY

The source code for Infernal 1.0, which includes trCYK, is available at http://infernal.janelia.org.

Figures

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/491d/2677745/53abc333e527/btp154f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/491d/2677745/804ffda08e74/btp154f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/491d/2677745/c9fc5e71c4c5/btp154f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/491d/2677745/0fe3e48f5e8b/btp154f4.jpg

Similar articles

1
Local RNA structure alignment with incomplete sequence.
Bioinformatics. 2009 May 15;25(10):1236-43. doi: 10.1093/bioinformatics/btp154. Epub 2009 Mar 20.
2
Infernal 1.0: inference of RNA alignments.
Bioinformatics. 2009 May 15;25(10):1335-7. doi: 10.1093/bioinformatics/btp157. Epub 2009 Mar 23.
3
Infernal 1.1: 100-fold faster RNA homology searches.
Bioinformatics. 2013 Nov 15;29(22):2933-5. doi: 10.1093/bioinformatics/btt509. Epub 2013 Sep 4.
4
RNA Sampler: a new sampling based algorithm for common RNA secondary structure prediction and structural alignment.
Bioinformatics. 2007 Aug 1;23(15):1883-91. doi: 10.1093/bioinformatics/btm272. Epub 2007 May 30.
5
DAFS: simultaneous aligning and folding of RNA sequences via dual decomposition.
Bioinformatics. 2012 Dec 15;28(24):3218-24. doi: 10.1093/bioinformatics/bts612. Epub 2012 Oct 11.
6
Using tertiary structure for the computation of highly accurate multiple RNA alignments with the SARA-Coffee package.
Bioinformatics. 2013 May 1;29(9):1112-9. doi: 10.1093/bioinformatics/btt096. Epub 2013 Feb 28.
7
Pair stochastic tree adjoining grammars for aligning and predicting pseudoknot RNA structures.
Proc IEEE Comput Syst Bioinform Conf. 2004:290-9.
8
Murlet: a practical multiple alignment tool for structural RNA sequences.
Bioinformatics. 2007 Jul 1;23(13):1588-98. doi: 10.1093/bioinformatics/btm146. Epub 2007 Apr 25.
9
PARTS: probabilistic alignment for RNA joinT secondary structure prediction.
Nucleic Acids Res. 2008 Apr;36(7):2406-17. doi: 10.1093/nar/gkn043. Epub 2008 Feb 26.
10
MARNA: multiple alignment and consensus structure prediction of RNAs based on sequence structure comparisons.
Bioinformatics. 2005 Aug 15;21(16):3352-9. doi: 10.1093/bioinformatics/bti550. Epub 2005 Jun 21.

Cited by

1
Statistical tests for large tree-structured data.
J Am Stat Assoc. 2017;112(520):1733-1743. doi: 10.1080/01621459.2016.1240081. Epub 2017 Aug 7.
2
Ribovore: ribosomal RNA sequence analysis for GenBank submissions and database curation.
BMC Bioinformatics. 2021 Aug 12;22(1):400. doi: 10.1186/s12859-021-04316-z.
3
DRAGoM: Classification and Quantification of Noncoding RNA in Metagenomic Data.
Front Genet. 2021 May 5;12:669495. doi: 10.3389/fgene.2021.669495. eCollection 2021.
4
Reconstructing 16S rRNA genes in metagenomic data.
Bioinformatics. 2015 Jun 15;31(12):i35-43. doi: 10.1093/bioinformatics/btv231.
5
Conservation and losses of non-coding RNAs in avian genomes.
PLoS One. 2015 Mar 30;10(3):e0121797. doi: 10.1371/journal.pone.0121797. eCollection 2015.
6
Rfam 12.0: updates to the RNA families database.
Nucleic Acids Res. 2015 Jan;43(Database issue):D130-7. doi: 10.1093/nar/gku1063. Epub 2014 Nov 11.
7
RNA-CODE: a noncoding RNA classification tool for short reads in NGS data lacking reference genomes.
PLoS One. 2013 Oct 25;8(10):e77596. doi: 10.1371/journal.pone.0077596. eCollection 2013.
8
Infernal 1.1: 100-fold faster RNA homology searches.
Bioinformatics. 2013 Nov 15;29(22):2933-5. doi: 10.1093/bioinformatics/btt509. Epub 2013 Sep 4.
9
LocARNAscan: Incorporating thermodynamic stability in sequence and structure-based RNA homology search.
Algorithms Mol Biol. 2013 Apr 20;8:14. doi: 10.1186/1748-7188-8-14. eCollection 2013.
10
RNIE: genome-wide prediction of bacterial intrinsic terminators.
Nucleic Acids Res. 2011 Aug;39(14):5845-52. doi: 10.1093/nar/gkr168. Epub 2011 Apr 7.

References

1
Rfam: updates to the RNA families database.
Nucleic Acids Res. 2009 Jan;37(Database issue):D136-40. doi: 10.1093/nar/gkn766. Epub 2008 Oct 25.
2
Inferring noncoding RNA families and classes by means of genome-scale structure-based clustering.
PLoS Comput Biol. 2007 Apr 13;3(4):e65. doi: 10.1371/journal.pcbi.0030065. Epub 2007 Feb 22.
3
Query-dependent banding (QDB) for faster RNA similarity searches.
PLoS Comput Biol. 2007 Mar 30;3(3):e56. doi: 10.1371/journal.pcbi.0030056. Epub 2007 Feb 7.
4
The Sorcerer II Global Ocean Sampling expedition: northwest Atlantic through eastern tropical Pacific.
PLoS Biol. 2007 Mar;5(3):e77. doi: 10.1371/journal.pbio.0050077.
5
CMfinder--a covariance model based RNA motif finding algorithm.
Bioinformatics. 2006 Feb 15;22(4):445-52. doi: 10.1093/bioinformatics/btk008. Epub 2005 Dec 15.
6
Bioinformatics for whole-genome shotgun sequencing of microbial communities.
PLoS Comput Biol. 2005 Jul;1(2):106-12. doi: 10.1371/journal.pcbi.0010024.
7
Metagenomics for studying unculturable microorganisms: cutting the Gordian knot.
Genome Biol. 2005;6(8):229. doi: 10.1186/gb-2005-6-8-229. Epub 2005 Aug 1.
8
Local sequence-structure motifs in RNA.
J Bioinform Comput Biol. 2004 Dec;2(4):681-98. doi: 10.1142/s0219720004000818.
9
Rfam: annotating non-coding RNAs in complete genomes.
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D121-4. doi: 10.1093/nar/gki081.
10
Structure, function and evolution of multidomain proteins.
Curr Opin Struct Biol. 2004 Apr;14(2):208-16. doi: 10.1016/j.sbi.2004.03.011.