• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

RNAdetect:高效计算新型非编码 RNA 的检测。

RNAdetect: efficient computational detection of novel non-coding RNAs.

机构信息

Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX, USA.

TEES-AgriLife Center for Bioinformatics and Genomic Systems Engineering, Texas A&M University, College Station, TX, USA.

出版信息

Bioinformatics. 2019 Apr 1;35(7):1133-1141. doi: 10.1093/bioinformatics/bty765.

DOI:10.1093/bioinformatics/bty765
PMID:30169792
Abstract

MOTIVATION

Non-coding RNAs (ncRNAs) are known to play crucial roles in various biological processes, and there is a pressing need for accurate computational detection methods that could be used to efficiently scan genomes to detect novel ncRNAs. However, unlike coding genes, ncRNAs often lack distinctive sequence features that could be used for recognizing them. Although many ncRNAs are known to have a well conserved secondary structure, which provides useful cues for computational prediction, it has been also shown that a structure-based approach alone may not be sufficient for detecting ncRNAs in a single sequence. Currently, the most effective ncRNA detection methods combine structure-based techniques with a comparative genome analysis approach to improve the prediction performance.

RESULTS

In this paper, we propose RNAdetect, a computational method incorporating novel features for accurate detection of ncRNAs in combination with comparative genome analysis. Given a sequence alignment, RNAdetect can accurately detect the presence of functional ncRNAs by incorporating novel predictive features based on the concept of generalized ensemble defect (GED), which assesses the degree of structure conservation across multiple related sequences and the conformation of the individual folding structures to a common consensus structure. Furthermore, n-gram models (NGMs) are used to extract features that can effectively capture sequence homology to known ncRNA families. Utilization of NGMs can enhance the detection of ncRNAs that have sparse folding structures with many unpaired bases. Extensive performance evaluation based on the Rfam database and bacterial genomes demonstrate that RNAdetect can accurately and reliably detect novel ncRNAs, outperforming the current state-of-the-art methods.

AVAILABILITY AND IMPLEMENTATION

The source code for RNAdetect and the benchmark data used in this paper can be downloaded at https://github.com/bjyoontamu/RNAdetect.

摘要

动机

非编码 RNA(ncRNA)在各种生物过程中起着至关重要的作用,因此迫切需要准确的计算检测方法,以便有效地扫描基因组以检测新的 ncRNA。然而,与编码基因不同,ncRNA 通常缺乏可用于识别它们的独特序列特征。尽管许多 ncRNA 已知具有保守的二级结构,这为计算预测提供了有用的线索,但也表明仅基于结构的方法可能不足以在单个序列中检测 ncRNA。目前,最有效的 ncRNA 检测方法将基于结构的技术与比较基因组分析方法相结合,以提高预测性能。

结果

在本文中,我们提出了 RNAdetect,这是一种结合了比较基因组分析的计算方法,用于准确检测 ncRNA。给定序列比对,RNAdetect 可以通过结合基于广义集合缺陷(GED)概念的新预测特征来准确检测功能 ncRNA 的存在,该概念评估了多个相关序列之间结构保守程度以及单个折叠结构与常见共识结构的构象。此外,n 元模型(NGM)用于提取可有效捕获已知 ncRNA 家族序列同源性的特征。利用 NGM 可以增强对具有许多未配对碱基的稀疏折叠结构的 ncRNA 的检测。基于 Rfam 数据库和细菌基因组的广泛性能评估表明,RNAdetect 可以准确可靠地检测新的 ncRNA,优于当前最先进的方法。

可用性和实现

RNAdetect 的源代码和本文中使用的基准数据可在 https://github.com/bjyoontamu/RNAdetect 下载。

相似文献

1
RNAdetect: efficient computational detection of novel non-coding RNAs.RNAdetect:高效计算新型非编码 RNA 的检测。
Bioinformatics. 2019 Apr 1;35(7):1133-1141. doi: 10.1093/bioinformatics/bty765.
2
Discovery of Novel ncRNA Sequences in Multiple Genome Alignments on the Basis of Conserved and Stable Secondary Structures.基于保守且稳定的二级结构在多个基因组比对中发现新型非编码RNA序列
PLoS One. 2015 Jun 15;10(6):e0130200. doi: 10.1371/journal.pone.0130200. eCollection 2015.
3
Deep forest ensemble learning for classification of alignments of non-coding RNA sequences based on multi-view structure representations.基于多视图结构表示的非编码 RNA 序列比对分类的深度森林集成学习。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa354.
4
Biocomputational prediction of non-coding RNAs in model cyanobacteria.模式蓝藻中非编码RNA的生物计算预测
BMC Genomics. 2009 Mar 23;10:123. doi: 10.1186/1471-2164-10-123.
5
A local multiple alignment method for detection of non-coding RNA sequences.一种用于检测非编码RNA序列的局部多重比对方法。
Bioinformatics. 2009 Jun 15;25(12):1498-505. doi: 10.1093/bioinformatics/btp261. Epub 2009 Apr 17.
6
Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change.基于预测的二级结构形成自由能变化检测非编码RNA。
BMC Bioinformatics. 2006 Mar 27;7:173. doi: 10.1186/1471-2105-7-173.
7
Computational prediction of novel non-coding RNAs in Arabidopsis thaliana.拟南芥中新型非编码RNA的计算预测
BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S36. doi: 10.1186/1471-2105-10-S1-S36.
8
ncRFP: A Novel end-to-end Method for Non-Coding RNAs Family Prediction Based on Deep Learning.ncRFP:一种基于深度学习的新型非编码 RNA 家族预测端到端方法。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Mar-Apr;18(2):784-789. doi: 10.1109/TCBB.2020.2982873. Epub 2021 Apr 6.
9
ncRNA consensus secondary structure derivation using grammar strings.使用语法字符串推导ncRNA共有二级结构
J Bioinform Comput Biol. 2011 Apr;9(2):317-37. doi: 10.1142/s0219720011005501.
10
Identification of cyanobacterial non-coding RNAs by comparative genome analysis.通过比较基因组分析鉴定蓝藻非编码RNA
Genome Biol. 2005;6(9):R73. doi: 10.1186/gb-2005-6-9-r73. Epub 2005 Aug 17.

引用本文的文献

1
Clinical application and detection techniques of liquid biopsy in gastric cancer.液体活检在胃癌中的临床应用及检测技术。
Mol Cancer. 2023 Jan 11;22(1):7. doi: 10.1186/s12943-023-01715-z.
2
BioAutoML: automated feature engineering and metalearning to predict noncoding RNAs in bacteria.生物自动化机器学习:自动化特征工程和元学习,用于预测细菌中的非编码 RNA。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac218.
3
Posttranscriptional Regulation in Response to Different Environmental Stresses in Campylobacter jejuni.空肠弯曲菌应对不同环境胁迫的转录后调控
Microbiol Spectr. 2022 Jun 29;10(3):e0020322. doi: 10.1128/spectrum.00203-22. Epub 2022 Jun 9.
4
An Integrated Database of Small RNAs and Their Interplay With Transcriptional Gene Regulatory Networks in Corynebacteria.棒状杆菌中小RNA及其与转录基因调控网络相互作用的综合数据库
Front Microbiol. 2021 Jun 17;12:656435. doi: 10.3389/fmicb.2021.656435. eCollection 2021.
5
Competing Endogenous RNAs, Non-Coding RNAs and Diseases: An Intertwined Story.竞争内源性 RNA、非编码 RNA 与疾病:一个错综复杂的故事。
Cells. 2020 Jun 28;9(7):1574. doi: 10.3390/cells9071574.