• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用正无标记机器学习识别mRNA 5'非翻译区序列中的潜在核糖开关元件。

Identification of potential riboswitch elements inmRNA 5'UTR sequences using Positive-Unlabeled machine learning.

作者信息

Raymond William S, DeRoo Jacob, Munsky Brian

机构信息

School of Biomedical Engineering, Colorado State University Fort Collins, CO 80523, USA.

Chemical and Biological Engineering, Colorado State University Fort Collins, CO 80523, USA.

出版信息

bioRxiv. 2024 Dec 6:2023.11.23.568398. doi: 10.1101/2023.11.23.568398.

DOI:10.1101/2023.11.23.568398
PMID:39677788
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11642740/
Abstract

Riboswitches are a class of noncoding RNA structures that interact with target ligands to cause a conformational change that can then execute some regulatory purpose within the cell. Riboswitches are ubiquitous and well characterized in bacteria and prokaryotes, with additional examples also being found in fungi, plants, and yeast. To date, no purely RNA-small molecule riboswitch has been discovered in . Several analogous riboswitch-like mechanisms have been described within the translatome within the past decade, prompting the question: Is there a riboswitch dependent on only small molecule ligands? In this work, we set out to train positive unlabeled machine learning classifiers on known riboswitch sequences and apply the classifiers to mRNA 5'UTR sequences found in the 5'UTR database, UTRdb, in the hope of identifying a set of mRNAs to investigate for riboswitch functionality. 67,683 riboswitch sequences were obtained from RNAcentral and sorted for ligand type and used as positive examples and 48,031 5'UTR sequences were used as unlabeled, unknown examples. Positive examples were sorted by ligand, and 20 positive-unlabeled classifiers were trained on sequence and secondary structure features while withholding one or two ligand classes. Cross validation was then performed on the withheld ligand sets to obtain a validation accuracy range of 75%-99%. The joint sets of 5'UTRs identified as potential riboswitches by the 20 classifiers were then analyzed. 15333 sequences were identified as a riboswitch by one or more classifier(s) and 436 of the 5'UTRs were labeled as harboring potential riboswitch elements by all 20 classifiers. These 436 sequences were mapped back to the most similar riboswitches within the positive data and examined. An online database of identified and ranked 5'UTRs, their features, and their most similar matches to known riboswitches, is provided to guide future experimental efforts to identify riboswitches.

摘要

核糖开关是一类非编码RNA结构,它们与靶标配体相互作用,引起构象变化,进而在细胞内执行某种调控功能。核糖开关在细菌和原核生物中普遍存在且特征明确,在真菌、植物和酵母中也发现了其他例子。迄今为止,尚未在[具体物种]中发现纯RNA-小分子核糖开关。在过去十年中,在翻译组中描述了几种类似核糖开关的机制,这引发了一个问题:是否存在仅依赖小分子配体的核糖开关?在这项工作中,我们着手在已知核糖开关序列上训练正无标签机器学习分类器,并将这些分类器应用于5'UTR数据库UTRdb中发现的mRNA 5'UTR序列,希望识别出一组mRNA以研究其核糖开关功能。从RNAcentral获得了67683个核糖开关序列,并根据配体类型进行分类,用作正例,48031个5'UTR序列用作无标签的未知例。正例按配体分类,在保留一或两个配体类别的同时,基于序列和二级结构特征训练了20个正无标签分类器。然后对保留的配体集进行交叉验证,以获得75%-99%的验证准确率范围。接着分析了被20个分类器识别为潜在核糖开关的5'UTR联合集。15333个序列被一个或多个分类器识别为核糖开关,所有20个分类器将436个5'UTR标记为含有潜在核糖开关元件。将这436个序列映射回正数据中最相似的核糖开关并进行检查。提供了一个已识别和排名的5'UTR在线数据库,包括它们的特征以及与已知核糖开关的最相似匹配,以指导未来识别[具体物种]核糖开关的实验工作。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/14833c6e9382/nihpp-2023.11.23.568398v2-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/2418ecf2df68/nihpp-2023.11.23.568398v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/15f41d71cc28/nihpp-2023.11.23.568398v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/adf5204faa8d/nihpp-2023.11.23.568398v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/bf38f30a272e/nihpp-2023.11.23.568398v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/e3303b6f20b0/nihpp-2023.11.23.568398v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/6a54ad4fdd77/nihpp-2023.11.23.568398v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/f1e1f8efe210/nihpp-2023.11.23.568398v2-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/dbf890e4c72e/nihpp-2023.11.23.568398v2-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/a804d7044e18/nihpp-2023.11.23.568398v2-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/d4295ebc3e1c/nihpp-2023.11.23.568398v2-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/14833c6e9382/nihpp-2023.11.23.568398v2-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/2418ecf2df68/nihpp-2023.11.23.568398v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/15f41d71cc28/nihpp-2023.11.23.568398v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/adf5204faa8d/nihpp-2023.11.23.568398v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/bf38f30a272e/nihpp-2023.11.23.568398v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/e3303b6f20b0/nihpp-2023.11.23.568398v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/6a54ad4fdd77/nihpp-2023.11.23.568398v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/f1e1f8efe210/nihpp-2023.11.23.568398v2-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/dbf890e4c72e/nihpp-2023.11.23.568398v2-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/a804d7044e18/nihpp-2023.11.23.568398v2-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/d4295ebc3e1c/nihpp-2023.11.23.568398v2-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/21a6/11642740/14833c6e9382/nihpp-2023.11.23.568398v2-f0011.jpg

相似文献

1
Identification of potential riboswitch elements inmRNA 5'UTR sequences using Positive-Unlabeled machine learning.使用正无标记机器学习识别mRNA 5'非翻译区序列中的潜在核糖开关元件。
bioRxiv. 2024 Dec 6:2023.11.23.568398. doi: 10.1101/2023.11.23.568398.
2
Identification of potential riboswitch elements in Homo sapiens mRNA 5'UTR sequences using positive-unlabeled machine learning.使用正无标记机器学习方法鉴定人类mRNA 5'非翻译区序列中的潜在核糖开关元件
PLoS One. 2025 Apr 24;20(4):e0320282. doi: 10.1371/journal.pone.0320282. eCollection 2025.
3
Riboswitch detection using profile hidden Markov models.利用分布隐藏马尔可夫模型进行核糖开关检测。
BMC Bioinformatics. 2009 Oct 8;10:325. doi: 10.1186/1471-2105-10-325.
4
The dynamic nature of RNA as key to understanding riboswitch mechanisms.RNA 的动态本质是理解核酶机制的关键。
Acc Chem Res. 2011 Dec 20;44(12):1339-48. doi: 10.1021/ar200035g. Epub 2011 Jun 16.
5
Riboswitches: discovery of drugs that target bacterial gene-regulatory RNAs.核糖开关:靶向细菌基因调控 RNA 的药物的发现。
Acc Chem Res. 2011 Dec 20;44(12):1329-38. doi: 10.1021/ar200039b. Epub 2011 May 26.
6
A computational approach for the identification of distant homologs of bacterial riboswitches based on inverse RNA folding.基于反向 RNA 折叠的细菌核糖体开关远程同源物识别的计算方法。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad110.
7
Ribocentre-switch: a database of riboswitches.Ribocentre-switch:一个核糖开关数据库。
Nucleic Acids Res. 2024 Jan 5;52(D1):D265-D272. doi: 10.1093/nar/gkad891.
8
Computational prediction of riboswitches.核糖开关的计算预测
Methods Enzymol. 2015;553:287-312. doi: 10.1016/bs.mie.2014.10.063. Epub 2015 Feb 19.
9
An Efficient Minimum Free Energy Structure-Based Search Method for Riboswitch Identification Based on Inverse RNA Folding.一种基于反向RNA折叠的高效的基于最小自由能结构的核糖开关识别搜索方法。
PLoS One. 2015 Jul 31;10(7):e0134262. doi: 10.1371/journal.pone.0134262. eCollection 2015.
10
Secondary structural entropy in RNA switch (Riboswitch) identification.RNA开关(核糖开关)识别中的二级结构熵
BMC Bioinformatics. 2015 Apr 28;16:133. doi: 10.1186/s12859-015-0523-2.

本文引用的文献

1
A computational approach for the identification of distant homologs of bacterial riboswitches based on inverse RNA folding.基于反向 RNA 折叠的细菌核糖体开关远程同源物识别的计算方法。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad110.
2
Systematic Comparison and Rational Design of Theophylline Riboswitches for Effective Gene Repression.茶碱核糖开关的有效基因抑制的系统比较和合理设计。
Microbiol Spectr. 2023 Feb 14;11(1):e0275222. doi: 10.1128/spectrum.02752-22. Epub 2023 Jan 23.
3
UTRdb 2.0: a comprehensive, expert curated catalog of eukaryotic mRNAs untranslated regions.
UTRdb 2.0:一个全面的、经过专家精心策划的真核 mRNA 非翻译区目录。
Nucleic Acids Res. 2023 Jan 6;51(D1):D337-D344. doi: 10.1093/nar/gkac1016.
4
Discovering riboswitches: the past and the future.发现核糖开关:过去与未来。
Trends Biochem Sci. 2023 Feb;48(2):119-141. doi: 10.1016/j.tibs.2022.08.009. Epub 2022 Sep 20.
5
PLUS: Predicting cancer metastasis potential based on positive and unlabeled learning.PLUS:基于阳性和无标签学习预测癌症转移潜能。
PLoS Comput Biol. 2022 Mar 29;18(3):e1009956. doi: 10.1371/journal.pcbi.1009956. eCollection 2022 Mar.
6
Identification of piRNA disease associations using deep learning.使用深度学习识别piRNA与疾病的关联。
Comput Struct Biotechnol J. 2022 Mar 3;20:1208-1217. doi: 10.1016/j.csbj.2022.02.026. eCollection 2022.
7
Positive-unlabeled learning in bioinformatics and computational biology: a brief review.生物信息学和计算生物学中的正无标记学习:简要综述。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab461.
8
Metabolic Targets of Coenzyme Q10 in Mitochondria.线粒体中辅酶Q10的代谢靶点
Antioxidants (Basel). 2021 Mar 26;10(4):520. doi: 10.3390/antiox10040520.
9
Transcriptional Riboswitches Integrate Timescales for Bacterial Gene Expression Control.转录核糖开关整合时间尺度以控制细菌基因表达
Front Mol Biosci. 2021 Jan 13;7:607158. doi: 10.3389/fmolb.2020.607158. eCollection 2020.
10
The Gene Ontology resource: enriching a GOld mine.基因本体论资源:丰富一个 GOld 矿。
Nucleic Acids Res. 2021 Jan 8;49(D1):D325-D334. doi: 10.1093/nar/gkaa1113.