• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

sRNAdeep:一种基于 DistilBERT 编码模式和深度学习算法的新型细菌 sRNA 预测工具。

sRNAdeep: a novel tool for bacterial sRNA prediction based on DistilBERT encoding mode and deep learning algorithms.

机构信息

School of Artificial Intelligence, Hangzhou Dianzi University, Hangzhou, 310018, P.R. China.

School of Biomedical Sciences, The Chinese University of Hong Kong, Hong Kong SAR, China.

出版信息

BMC Genomics. 2024 Oct 31;25(1):1021. doi: 10.1186/s12864-024-10951-6.

DOI:10.1186/s12864-024-10951-6
PMID:39482572
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11526673/
Abstract

BACKGROUND

Bacterial small regulatory RNA (sRNA) plays a crucial role in cell metabolism and could be used as a new potential drug target in the treatment of pathogen-induced disease. However, experimental methods for identifying sRNAs still require a large investment of human and material resources.

METHODS

In this study, we propose a novel sRNA prediction model called sRNAdeep based on the DistilBERT feature extraction and TextCNN methods. The sRNA and non-sRNA sequences of bacteria were considered as sentences and then fed into a composite model consisting of deep learning models to evaluate classification performance.

RESULTS

By filtering sRNAs from BSRD database, we obtained a validation dataset comprised of 2438 positive and 4730 negative samples. The benchmark experiments showed that sRNAdeep displayed better performance in the various indexes compared to previous sRNA prediction tools. By applying our tool to Mycobacterium tuberculosis (MTB) genome, we have identified 21 sRNAs within the intergenic and intron regions. A set of 272 targeted genes regulated by these sRNAs were also captured in MTB. The coding proteins of two genes (lysX and icd1) are implicated in drug response, with significant active sites related to drug resistance mechanisms of MTB.

CONCLUSION

In conclusion, our newly developed sRNAdeep can help researchers identify bacterial sRNAs more precisely and can be freely available from https://github.com/pyajagod/sRNAdeep.git .

摘要

背景

细菌小调控 RNA(sRNA)在细胞代谢中起着至关重要的作用,可作为治疗病原体诱导疾病的新的潜在药物靶点。然而,鉴定 sRNA 的实验方法仍然需要大量的人力和物力投入。

方法

在这项研究中,我们提出了一种名为 sRNAdeep 的新型 sRNA 预测模型,该模型基于 DistilBERT 特征提取和 TextCNN 方法。将细菌的 sRNA 和非 sRNA 序列视为句子,并将其输入到由深度学习模型组成的组合模型中,以评估分类性能。

结果

通过从 BSRD 数据库中筛选 sRNA,我们获得了一个由 2438 个阳性和 4730 个阴性样本组成的验证数据集。基准实验表明,sRNAdeep 在各项指标上的性能均优于以前的 sRNA 预测工具。通过将我们的工具应用于结核分枝杆菌(MTB)基因组,我们在基因间和内含子区域内鉴定出了 21 个 sRNA。还捕获了这些 sRNA 调控的 272 个靶向基因。两个基因(lysX 和 icd1)的编码蛋白与 MTB 的药物反应有关,具有与 MTB 耐药机制相关的显著活性位点。

结论

总之,我们新开发的 sRNAdeep 可以帮助研究人员更准确地识别细菌 sRNA,并可从 https://github.com/pyajagod/sRNAdeep.git 免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c351/11526673/e5c0e198ef59/12864_2024_10951_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c351/11526673/13b2e4910a8c/12864_2024_10951_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c351/11526673/c4946b738e6b/12864_2024_10951_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c351/11526673/361bec2bc465/12864_2024_10951_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c351/11526673/aab81bfecd73/12864_2024_10951_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c351/11526673/6c2c6c0b226e/12864_2024_10951_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c351/11526673/e5c0e198ef59/12864_2024_10951_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c351/11526673/13b2e4910a8c/12864_2024_10951_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c351/11526673/c4946b738e6b/12864_2024_10951_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c351/11526673/361bec2bc465/12864_2024_10951_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c351/11526673/aab81bfecd73/12864_2024_10951_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c351/11526673/6c2c6c0b226e/12864_2024_10951_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c351/11526673/e5c0e198ef59/12864_2024_10951_Fig6_HTML.jpg

相似文献

1
sRNAdeep: a novel tool for bacterial sRNA prediction based on DistilBERT encoding mode and deep learning algorithms.sRNAdeep:一种基于 DistilBERT 编码模式和深度学习算法的新型细菌 sRNA 预测工具。
BMC Genomics. 2024 Oct 31;25(1):1021. doi: 10.1186/s12864-024-10951-6.
2
sRNA Target Prediction Organizing Tool (SPOT) Integrates Computational and Experimental Data To Facilitate Functional Characterization of Bacterial Small RNAs.sRNA 靶标预测组织工具 (SPOT) 整合计算和实验数据,以促进细菌小 RNA 的功能表征。
mSphere. 2019 Jan 30;4(1):e00561-18. doi: 10.1128/mSphere.00561-18.
3
High-throughput, kingdom-wide prediction and annotation of bacterial non-coding RNAs.细菌非编码RNA的高通量、全领域预测与注释
PLoS One. 2008 Sep 12;3(9):e3197. doi: 10.1371/journal.pone.0003197.
4
Exploring the transcription start sites and other genomic features facilitates the accurate identification and annotation of small RNAs across multiple stress conditions in Mycobacterium tuberculosis.探索转录起始位点和其他基因组特征有助于在结核分枝杆菌的多种应激条件下准确识别和注释小 RNA。
Funct Integr Genomics. 2024 Sep 12;24(5):160. doi: 10.1007/s10142-024-01437-5.
5
sRNARFTarget: a fast machine-learning-based approach for transcriptome-wide sRNA target prediction.sRNA 靶标预测的快速机器学习方法:基于转录组范围的 sRNA 靶标预测。
RNA Biol. 2022;19(1):44-54. doi: 10.1080/15476286.2021.2012058. Epub 2021 Dec 31.
6
Identification of 17 Pseudomonas aeruginosa sRNAs and prediction of sRNA-encoding genes in 10 diverse pathogens using the bioinformatic tool sRNAPredict2.利用生物信息学工具sRNAPredict2鉴定17种铜绿假单胞菌小RNA并预测10种不同病原体中的小RNA编码基因。
Nucleic Acids Res. 2006;34(12):3484-93. doi: 10.1093/nar/gkl453.
7
Transcriptome analysis and molecular characterization of novel small RNAs in Mycobacterium tuberculosis Lineage 1.结核分枝杆菌 1 谱系中新小 RNA 的转录组分析与分子特征
World J Microbiol Biotechnol. 2024 Jul 25;40(9):279. doi: 10.1007/s11274-024-04089-6.
8
Genome-wide Annotation, Identification, and Global Transcriptomic Analysis of Regulatory or Small RNA Gene Expression in Staphylococcus aureus.金黄色葡萄球菌中调控性或小RNA基因表达的全基因组注释、鉴定及全局转录组分析
mBio. 2016 Feb 9;7(1):e01990-15. doi: 10.1128/mBio.01990-15.
9
Identification of novel growth phase- and media-dependent small non-coding RNAs in Streptococcus pyogenes M49 using intergenic tiling arrays.利用基因间平铺芯片鉴定酿脓链球菌 M49 中新型生长阶段和培养基依赖型的小非编码 RNA。
BMC Genomics. 2012 Oct 13;13:550. doi: 10.1186/1471-2164-13-550.
10
sRNAscanner: a computational tool for intergenic small RNA detection in bacterial genomes.sRNAscanner:一种用于在细菌基因组中检测基因间小 RNA 的计算工具。
PLoS One. 2010 Aug 5;5(8):e11970. doi: 10.1371/journal.pone.0011970.

引用本文的文献

1
Tiny but Mighty: Small RNAs-The Micromanagers of Bacterial Survival, Virulence, and Host-Pathogen Interactions.微小却强大:小RNA——细菌生存、毒力及宿主-病原体相互作用的微观管理者
Noncoding RNA. 2025 May 5;11(3):36. doi: 10.3390/ncrna11030036.

本文引用的文献

1
A survey of k-mer methods and applications in bioinformatics.生物信息学中k-mer方法及其应用综述。
Comput Struct Biotechnol J. 2024 May 21;23:2289-2303. doi: 10.1016/j.csbj.2024.05.025. eCollection 2024 Dec.
2
Identification of novel single nucleotide variants in the drug resistance mechanism of Mycobacterium tuberculosis isolates by whole-genome analysis.通过全基因组分析鉴定结核分枝杆菌分离株耐药机制中的新型单核苷酸变异。
BMC Genomics. 2024 May 14;25(1):478. doi: 10.1186/s12864-024-10390-3.
3
iAMP-Attenpred: a novel antimicrobial peptide predictor based on BERT feature extraction method and CNN-BiLSTM-Attention combination model.
iAMP-Attenpred:一种基于 BERT 特征提取方法和 CNN-BiLSTM-Attention 组合模型的新型抗菌肽预测器。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad443.
4
TargetRNA3: predicting prokaryotic RNA regulatory targets with machine learning.TargetRNA3:使用机器学习预测原核 RNA 调控靶标。
Genome Biol. 2023 Dec 1;24(1):276. doi: 10.1186/s13059-023-03117-2.
5
Cytoscape.js 2023 update: a graph theory library for visualization and analysis.Cytoscape.js 2023 更新:用于可视化和分析的图论库。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btad031.
6
The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest.2023 年的 STRING 数据库:针对任何感兴趣的测序基因组的蛋白质-蛋白质关联网络和功能富集分析。
Nucleic Acids Res. 2023 Jan 6;51(D1):D638-D646. doi: 10.1093/nar/gkac1000.
7
DAVID: a web server for functional enrichment analysis and functional annotation of gene lists (2021 update).DAVID:一个用于基因列表功能富集分析和功能注释的网络服务器(2021 更新)。
Nucleic Acids Res. 2022 Jul 5;50(W1):W216-W221. doi: 10.1093/nar/gkac194.
8
- and -Encoded Small Regulatory RNAs in .- 以及 - 中的编码小调节RNA 。 你提供的原文似乎不完整,请补充完整以便我能更准确地翻译。
Microorganisms. 2021 Sep 2;9(9):1865. doi: 10.3390/microorganisms9091865.
9
Inhibitors of aminoacyl-tRNA synthetases as antimycobacterial compounds: An up-to-date review.作为抗分枝杆菌化合物的氨酰-tRNA 合成酶抑制剂:最新综述。
Bioorg Chem. 2021 May;110:104806. doi: 10.1016/j.bioorg.2021.104806. Epub 2021 Mar 6.
10
Defining new chemical space for drug penetration into Gram-negative bacteria.定义新的化学空间,以促进药物穿透革兰氏阴性菌。
Nat Chem Biol. 2020 Dec;16(12):1293-1302. doi: 10.1038/s41589-020-00674-6. Epub 2020 Nov 16.