• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

NCodR:一种用于区分绿色植物中非编码RNA的多类支持向量机分类方法。

NCodR: A multi-class support vector machine classification to distinguish non-coding RNAs in Viridiplantae.

作者信息

Nithin Chandran, Mukherjee Sunandan, Basak Jolly, Bahadur Ranjit Prasad

机构信息

Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology, Kharagpur 721302, India.

Laboratory of Computational Biology, Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, 02-089 Warsaw, Poland.

出版信息

Quant Plant Biol. 2022 Oct 7;3:e23. doi: 10.1017/qpb.2022.18. eCollection 2022.

DOI:10.1017/qpb.2022.18
PMID:37077974
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10095871/
Abstract

Non-coding RNAs (ncRNAs) are major players in the regulation of gene expression. This study analyses seven classes of ncRNAs in plants using sequence and secondary structure-based RNA folding measures. We observe distinct regions in the distribution of AU content along with overlapping regions for different ncRNA classes. Additionally, we find similar averages for minimum folding energy index across various ncRNAs classes except for pre-miRNAs and lncRNAs. Various RNA folding measures show similar trends among the different ncRNA classes except for pre-miRNAs and lncRNAs. We observe different k-mer repeat signatures of length three among various ncRNA classes. However, in pre-miRs and lncRNAs, a diffuse pattern of k-mers is observed. Using these attributes, we train eight different classifiers to discriminate various ncRNA classes in plants. Support vector machines employing radial basis function show the highest accuracy (average F1 of ~96%) in discriminating ncRNAs, and the classifier is implemented as a web server, NCodR.

摘要

非编码RNA(ncRNAs)是基因表达调控中的主要参与者。本研究使用基于序列和二级结构的RNA折叠方法分析了植物中的七类ncRNAs。我们观察到不同ncRNA类别的AU含量分布存在不同区域以及重叠区域。此外,除了前体miRNA和长链非编码RNA(lncRNAs)外,我们发现不同ncRNA类别的最小折叠能量指数平均值相似。除了前体miRNA和lncRNAs外,各种RNA折叠方法在不同ncRNA类别中显示出相似的趋势。我们观察到不同ncRNA类别中长度为三的k-mer重复特征不同。然而,在前体miRNA和lncRNAs中,观察到k-mer的分散模式。利用这些属性,我们训练了八个不同的分类器来区分植物中的各种ncRNA类别。采用径向基函数的支持向量机在区分ncRNAs方面显示出最高的准确率(平均F1约为96%),并且该分类器被实现为一个网络服务器,即NCodR。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6abe/10095871/de2f8d674952/S2632882822000182_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6abe/10095871/8d921681aa9b/S2632882822000182_figAb.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6abe/10095871/8aecd9263f83/S2632882822000182_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6abe/10095871/d8371358fd9f/S2632882822000182_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6abe/10095871/de2f8d674952/S2632882822000182_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6abe/10095871/8d921681aa9b/S2632882822000182_figAb.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6abe/10095871/8aecd9263f83/S2632882822000182_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6abe/10095871/d8371358fd9f/S2632882822000182_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6abe/10095871/de2f8d674952/S2632882822000182_fig3.jpg

相似文献

1
NCodR: A multi-class support vector machine classification to distinguish non-coding RNAs in Viridiplantae.NCodR:一种用于区分绿色植物中非编码RNA的多类支持向量机分类方法。
Quant Plant Biol. 2022 Oct 7;3:e23. doi: 10.1017/qpb.2022.18. eCollection 2022.
2
Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change.基于预测的二级结构形成自由能变化检测非编码RNA。
BMC Bioinformatics. 2006 Mar 27;7:173. doi: 10.1186/1471-2105-7-173.
3
MncR: Late Integration Machine Learning Model for Classification of ncRNA Classes Using Sequence and Structural Encoding.MncR:一种基于序列和结构编码的用于 ncRNA 分类的晚期集成机器学习模型。
Int J Mol Sci. 2023 May 17;24(10):8884. doi: 10.3390/ijms24108884.
4
Classification of ncRNAs using position and size information in deep sequencing data.利用深度测序数据中的位置和大小信息对 ncRNAs 进行分类。
Bioinformatics. 2010 Sep 15;26(18):i426-32. doi: 10.1093/bioinformatics/btq363.
5
Prediction and classification of ncRNAs using structural information.利用结构信息对非编码RNA进行预测和分类。
BMC Genomics. 2014 Feb 13;15:127. doi: 10.1186/1471-2164-15-127.
6
De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures.利用全局和内在折叠度量对来自基因组假发夹结构的前体微小RNA进行从头支持向量机分类。
Bioinformatics. 2007 Jun 1;23(11):1321-30. doi: 10.1093/bioinformatics/btm026. Epub 2007 Jan 31.
7
nRC: non-coding RNA Classifier based on structural features.nRC:基于结构特征的非编码RNA分类器。
BioData Min. 2017 Aug 1;10:27. doi: 10.1186/s13040-017-0148-2. eCollection 2017.
8
Deep forest ensemble learning for classification of alignments of non-coding RNA sequences based on multi-view structure representations.基于多视图结构表示的非编码 RNA 序列比对分类的深度森林集成学习。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa354.
9
SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition.支持向量机折叠法:一种用于判别式多类别蛋白质折叠和超家族识别的工具。
BMC Bioinformatics. 2007 May 22;8 Suppl 4(Suppl 4):S2. doi: 10.1186/1471-2105-8-S4-S2.
10
Computational identification of human long intergenic non-coding RNAs using a GA-SVM algorithm.基于 GA-SVM 算法的人类长链非编码 RNA 计算识别。
Gene. 2014 Jan 1;533(1):94-9. doi: 10.1016/j.gene.2013.09.118. Epub 2013 Oct 9.

引用本文的文献

1
MncR: Late Integration Machine Learning Model for Classification of ncRNA Classes Using Sequence and Structural Encoding.MncR:一种基于序列和结构编码的用于 ncRNA 分类的晚期集成机器学习模型。
Int J Mol Sci. 2023 May 17;24(10):8884. doi: 10.3390/ijms24108884.
2
The 1972 Meadows report: A wake-up call for plant science.1972年的梅多斯报告:植物科学的警钟。
Quant Plant Biol. 2023 Mar 6;4:e3. doi: 10.1017/qpb.2023.2. eCollection 2023.

本文引用的文献

1
Development and validation of heat-responsive candidate gene and miRNA gene based SSR markers to analysis genetic diversity in wheat for heat tolerance breeding.基于热响应候选基因和 miRNA 基因的 SSR 标记的开发和验证,用于分析小麦耐热性育种的遗传多样性。
Mol Biol Rep. 2021 Jan;48(1):381-393. doi: 10.1007/s11033-020-06059-1. Epub 2021 Jan 3.
2
Classification of Long Noncoding RNAs by k-mer Content.基于 k--mer 含量的长链非编码 RNA 分类。
Methods Mol Biol. 2021;2254:41-60. doi: 10.1007/978-1-0716-1158-6_4.
3
PLncDB V2.0: a comprehensive encyclopedia of plant long noncoding RNAs.
PLncDB V2.0:一个全面的植物长非编码 RNA 百科全书。
Nucleic Acids Res. 2021 Jan 8;49(D1):D1489-D1495. doi: 10.1093/nar/gkaa910.
4
ncRDeep: Non-coding RNA classification with convolutional neural network.ncRDeep:基于卷积神经网络的非编码 RNA 分类
Comput Biol Chem. 2020 Oct;88:107364. doi: 10.1016/j.compbiolchem.2020.107364. Epub 2020 Aug 27.
5
NCBI Taxonomy: a comprehensive update on curation, resources and tools.NCBI 分类学:在管理、资源和工具方面的全面更新。
Database (Oxford). 2020 Jan 1;2020. doi: 10.1093/database/baaa062.
6
Development and characterization of non-coding RNA based simple sequence repeat markers in Capsicum species.基于非编码 RNA 的简单重复序列标记在辣椒属物种中的开发与特征分析。
Genomics. 2020 Mar;112(2):1554-1564. doi: 10.1016/j.ygeno.2019.09.005. Epub 2019 Sep 7.
7
RNAcentral: a hub of information for non-coding RNA sequences.RNAcentral:非编码RNA序列的信息中心。
Nucleic Acids Res. 2019 Jan 8;47(D1):D1250-D1251. doi: 10.1093/nar/gky1206.
8
miRBase: from microRNA sequences to function.miRBase:从 microRNA 序列到功能。
Nucleic Acids Res. 2019 Jan 8;47(D1):D155-D162. doi: 10.1093/nar/gky1141.
9
Identification and characterization of differentially expressed Phaseolus vulgaris miRNAs and their targets during mungbean yellow mosaic India virus infection reveals new insight into Phaseolus-MYMIV interaction.鉴定和表征菜豆黄花叶病毒侵染过程中差异表达的菜豆 microRNAs 及其靶标,揭示了菜豆与 MYMIV 互作的新见解。
Genomics. 2019 Dec;111(6):1333-1342. doi: 10.1016/j.ygeno.2018.09.005. Epub 2018 Sep 17.
10
Functional classification of long non-coding RNAs by k-mer content.基于 k- -mer 含量对长非编码 RNA 进行功能分类。
Nat Genet. 2018 Oct;50(10):1474-1482. doi: 10.1038/s41588-018-0207-8. Epub 2018 Sep 17.