• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

pmiRScan:一种基于LightGBM的动物前体微小RNA预测方法。

pmiRScan: a LightGBM based method for prediction of animal pre-miRNAs.

作者信息

Venkatesan Amrit, Basak Jolly, Bahadur Ranjit Prasad

机构信息

Computational Structural Biology Lab, Department of Bioscience and Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal, 721302, India.

Genomics of Plant Stress Biology Lab, Department of Biotechnology, Visva-Bharati, Santiniketan, West Bengal, 731235, India.

出版信息

Funct Integr Genomics. 2025 Jan 9;25(1):9. doi: 10.1007/s10142-025-01527-y.

DOI:10.1007/s10142-025-01527-y
PMID:39786653
Abstract

MicroRNAs (miRNA) are categorized as short endogenous non-coding RNAs, which have a significant role in post-transcriptional gene regulation. Identifying new animal precursor miRNA (pre-miRNA) and miRNA is crucial to understand the role of miRNAs in various biological processes including the development of diseases. The present study focuses on the development of a Light Gradient Boost (LGB) based method for the classification of animal pre-miRNAs using various sequence and secondary structural features. In various pre-miRNA families, distinct k-mer repeat signatures with a length of three nucleotides have been identified. Out of nine different classifiers that have been trained and tested in the present study, LGB has an overall better performance with an AUROC of 0.959. In comparison with the existing methods, our method 'pmiRScan' has an overall better performance with accuracy of 0.93, sensitivity of 0.86, specificity of 0.95 and F-score of 0.82. Moreover, pmiRScan effectively classifies pre-miRNAs from four distinct taxonomic groups: mammals, nematodes, molluscs and arthropods. We have used our classifier to predict genome-wide pre-miRNAs in human. We find a total of 313 pre-miRNA candidates using pmiRScan. A total of 180 potential mature miRNAs belonging to 60 distinct miRNA families are extracted from predicted pre-miRNAs; of which 128 were novel and are note reported in miRBase. These discoveries may enhance our current understanding of miRNAs and their targets in human. pmiRScan is freely available at http://www.csb.iitkgp.ac.in/applications/pmiRScan/index.php .

摘要

微小RNA(miRNA)被归类为短链内源性非编码RNA,其在转录后基因调控中发挥着重要作用。识别新的动物前体miRNA(pre-miRNA)和miRNA对于理解miRNA在包括疾病发展在内的各种生物学过程中的作用至关重要。本研究着重于开发一种基于轻梯度提升(LGB)的方法,利用各种序列和二级结构特征对动物pre-miRNA进行分类。在各种pre-miRNA家族中,已鉴定出长度为三个核苷酸的不同k-mer重复特征。在本研究中训练和测试的九种不同分类器中,LGB的整体性能更好,曲线下面积(AUROC)为0.959。与现有方法相比,我们的方法“pmiRScan”整体性能更好,准确率为0.93,灵敏度为0.86,特异性为0.95,F值为0.82。此外,pmiRScan能有效区分来自四个不同分类组的pre-miRNA:哺乳动物、线虫、软体动物和节肢动物。我们已使用我们的分类器预测人类全基因组中的pre-miRNA。使用pmiRScan我们共发现313个pre-miRNA候选物。从预测的pre-miRNA中总共提取了属于60个不同miRNA家族的180个潜在成熟miRNA;其中128个是新的,未在miRBase中报道。这些发现可能会增强我们目前对人类miRNA及其靶标的理解。pmiRScan可在http://www.csb.iitkgp.ac.in/applications/pmiRScan/index.php免费获取。

相似文献

1
pmiRScan: a LightGBM based method for prediction of animal pre-miRNAs.pmiRScan:一种基于LightGBM的动物前体微小RNA预测方法。
Funct Integr Genomics. 2025 Jan 9;25(1):9. doi: 10.1007/s10142-025-01527-y.
2
MaturePred: efficient identification of microRNAs within novel plant pre-miRNAs.MaturePred:新型植物前体 miRNA 中 microRNAs 的高效识别。
PLoS One. 2011;6(11):e27422. doi: 10.1371/journal.pone.0027422. Epub 2011 Nov 16.
3
MiRFinder: an improved approach and software implementation for genome-wide fast microRNA precursor scans.MiRFinder:一种用于全基因组快速微小RNA前体扫描的改进方法及软件实现
BMC Bioinformatics. 2007 Sep 17;8:341. doi: 10.1186/1471-2105-8-341.
4
miRLocator: A Python Implementation and Web Server for Predicting miRNAs from Pre-miRNA Sequences.miRLocator:一种用于从pre-miRNA序列预测miRNA的Python实现及网络服务器。
Methods Mol Biol. 2019;1932:89-97. doi: 10.1007/978-1-4939-9042-9_6.
5
MatPred: Computational Identification of Mature MicroRNAs within Novel Pre-MicroRNAs.MatPred:新型前体微小RNA中成熟微小RNA的计算识别
Biomed Res Int. 2015;2015:546763. doi: 10.1155/2015/546763. Epub 2015 Nov 23.
6
MatureBayes: a probabilistic algorithm for identifying the mature miRNA within novel precursors.MatureBayes:一种用于识别新型前体中成熟 miRNA 的概率算法。
PLoS One. 2010 Aug 6;5(8):e11843. doi: 10.1371/journal.pone.0011843.
7
Ensemble-based classification approach for micro-RNA mining applied on diverse metagenomic sequences.基于集成的微RNA挖掘分类方法应用于多样的宏基因组序列。
BMC Res Notes. 2014 May 6;7:286. doi: 10.1186/1756-0500-7-286.
8
microPred: effective classification of pre-miRNAs for human miRNA gene prediction.microPred:用于人类miRNA基因预测的前体miRNA有效分类
Bioinformatics. 2009 Apr 15;25(8):989-95. doi: 10.1093/bioinformatics/btp107. Epub 2009 Feb 20.
9
Computational prediction of the localization of microRNAs within their pre-miRNA.计算预测 microRNA 在其 pre-miRNA 中的定位。
Nucleic Acids Res. 2013 Aug;41(15):7200-11. doi: 10.1093/nar/gkt466. Epub 2013 Jun 8.
10
Prediction of plant pre-microRNAs and their microRNAs in genome-scale sequences using structure-sequence features and support vector machine.利用结构序列特征和支持向量机在基因组规模序列中预测植物前体微小RNA及其微小RNA
BMC Bioinformatics. 2014 Dec 30;15(1):423. doi: 10.1186/s12859-014-0423-x.

本文引用的文献

1
Structural basis of microRNA biogenesis by Dicer-1 and its partner protein Loqs-PB.Dicer-1 及其伙伴蛋白 Loqs-PB 生成 microRNA 的结构基础。
Mol Cell. 2022 Nov 3;82(21):4049-4063.e6. doi: 10.1016/j.molcel.2022.09.002. Epub 2022 Sep 30.
2
Method for Identifying Sequence Motifs in Pre-miRNAs for Small-Molecule Binding.小分子结合前体 miRNA 序列基序的鉴定方法
ACS Chem Biol. 2022 Oct 21;17(10):2817-2827. doi: 10.1021/acschembio.2c00452. Epub 2022 Sep 23.
3
miRe2e: a full end-to-end deep model based on transformers for prediction of pre-miRNAs.
miRe2e:一种基于转换器的端到端深度模型,用于预测前 miRNA。
Bioinformatics. 2022 Feb 7;38(5):1191-1197. doi: 10.1093/bioinformatics/btab823.
4
MirGeneDB 2.1: toward a complete sampling of all major animal phyla.MirGeneDB 2.1:全面采样所有主要动物门。
Nucleic Acids Res. 2022 Jan 7;50(D1):D204-D210. doi: 10.1093/nar/gkab1101.
5
A hybrid CNN-LSTM model for pre-miRNA classification.用于 miRNA 前体分类的混合 CNN-LSTM 模型。
Sci Rep. 2021 Jul 8;11(1):14125. doi: 10.1038/s41598-021-93656-0.
6
High precision in microRNA prediction: A novel genome-wide approach with convolutional deep residual networks.高精度 microRNA 预测:一种基于卷积深度残差网络的全基因组新方法。
Comput Biol Med. 2021 Jul;134:104448. doi: 10.1016/j.compbiomed.2021.104448. Epub 2021 May 5.
7
PIN and CCCH Zn-finger domains coordinate RNA targeting in ZC3H12 family endoribonucleases.PIN 和 CCCH Zn 指结构域协调 ZC3H12 家族内切核酸酶中的 RNA 靶向。
Nucleic Acids Res. 2021 May 21;49(9):5369-5381. doi: 10.1093/nar/gkab316.
8
Integrative approaches for analysis of mRNA and microRNA high-throughput data.用于分析mRNA和微小RNA高通量数据的综合方法。
Comput Struct Biotechnol J. 2021 Jan 26;19:1154-1162. doi: 10.1016/j.csbj.2021.01.029. eCollection 2021.
9
Validating the validation: reanalyzing a large-scale comparison of deep learning and machine learning models for bioactivity prediction.验证验证:重新分析深度学习和机器学习模型在生物活性预测方面的大规模比较。
J Comput Aided Mol Des. 2020 Jul;34(7):717-730. doi: 10.1007/s10822-019-00274-0. Epub 2020 Jan 20.
10
Deep neural networks for human microRNA precursor detection.用于人类 microRNA 前体检测的深度神经网络。
BMC Bioinformatics. 2020 Jan 13;21(1):17. doi: 10.1186/s12859-020-3339-7.