• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过前体miRNA序列和结构的互信息改进前体miRNA识别

Improved Pre-miRNAs Identification Through Mutual Information of Pre-miRNA Sequences and Structures.

作者信息

Fu Xiangzheng, Zhu Wen, Cai Lijun, Liao Bo, Peng Lihong, Chen Yifan, Yang Jialiang

机构信息

College of Information Science and Engineering, Hunan University, Changsha, China.

School of Mathematics and Statistics, Hainan Normal University, Haikou, China.

出版信息

Front Genet. 2019 Feb 25;10:119. doi: 10.3389/fgene.2019.00119. eCollection 2019.

DOI:10.3389/fgene.2019.00119
PMID:30858864
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6397858/
Abstract

Playing critical roles as post-transcriptional regulators, microRNAs (miRNAs) are a family of short non-coding RNAs that are derived from longer transcripts called precursor miRNAs (pre-miRNAs). Experimental methods to identify pre-miRNAs are expensive and time-consuming, which presents the need for computational alternatives. In recent years, the accuracy of computational methods to predict pre-miRNAs has been increasing significantly. However, there are still several drawbacks. First, these methods usually only consider base frequencies or sequence information while ignoring the information between bases. Second, feature extraction methods based on secondary structures usually only consider the global characteristics while ignoring the mutual influence of the local structures. Third, methods integrating high-dimensional feature information is computationally inefficient. In this study, we have proposed a novel mutual information-based feature representation algorithm for pre-miRNA sequences and secondary structures, which is capable of catching the interactions between sequence bases and local features of the RNA secondary structure. In addition, the feature space is smaller than that of most popular methods, which makes our method computationally more efficient than the competitors. Finally, we applied these features to train a support vector machine model to predict pre-miRNAs and compared the results with other popular predictors. As a result, our method outperforms others based on both 5-fold cross-validation and the Jackknife test.

摘要

微小RNA(miRNA)作为转录后调节因子发挥着关键作用,它们是一类短的非编码RNA,来源于被称为前体微小RNA(pre-miRNA)的较长转录本。识别pre-miRNA的实验方法既昂贵又耗时,这就需要计算替代方法。近年来,预测pre-miRNA的计算方法的准确性有了显著提高。然而,仍然存在一些缺点。首先,这些方法通常只考虑碱基频率或序列信息,而忽略了碱基之间的信息。其次,基于二级结构的特征提取方法通常只考虑全局特征,而忽略了局部结构的相互影响。第三,整合高维特征信息的方法计算效率低下。在本研究中,我们提出了一种基于互信息的pre-miRNA序列和二级结构的新型特征表示算法,该算法能够捕捉序列碱基之间的相互作用以及RNA二级结构的局部特征。此外,特征空间比大多数流行方法的特征空间小,这使得我们的方法在计算上比竞争对手更有效。最后,我们应用这些特征训练了一个支持向量机模型来预测pre-miRNA,并将结果与其他流行的预测器进行比较。结果表明,在5折交叉验证和留一法检验中,我们的方法均优于其他方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/6397858/52d657dbe330/fgene-10-00119-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/6397858/f823e137618a/fgene-10-00119-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/6397858/63986879a981/fgene-10-00119-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/6397858/39613529af1d/fgene-10-00119-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/6397858/72eee1b13edd/fgene-10-00119-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/6397858/f73011b34f08/fgene-10-00119-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/6397858/52d657dbe330/fgene-10-00119-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/6397858/f823e137618a/fgene-10-00119-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/6397858/63986879a981/fgene-10-00119-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/6397858/39613529af1d/fgene-10-00119-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/6397858/72eee1b13edd/fgene-10-00119-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/6397858/f73011b34f08/fgene-10-00119-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/46c8/6397858/52d657dbe330/fgene-10-00119-g0006.jpg

相似文献

1
Improved Pre-miRNAs Identification Through Mutual Information of Pre-miRNA Sequences and Structures.通过前体miRNA序列和结构的互信息改进前体miRNA识别
Front Genet. 2019 Feb 25;10:119. doi: 10.3389/fgene.2019.00119. eCollection 2019.
2
Prediction of plant pre-microRNAs and their microRNAs in genome-scale sequences using structure-sequence features and support vector machine.利用结构序列特征和支持向量机在基因组规模序列中预测植物前体微小RNA及其微小RNA
BMC Bioinformatics. 2014 Dec 30;15(1):423. doi: 10.1186/s12859-014-0423-x.
3
New 3D graphical representation for RNA structure analysis and its application in the pre-miRNA identification of plants.用于RNA结构分析的新型3D图形表示及其在植物前体miRNA鉴定中的应用。
RSC Adv. 2018 Sep 3;8(54):30833-30841. doi: 10.1039/c8ra04138e. eCollection 2018 Aug 30.
4
Identification of microRNA precursors with support vector machine and string kernel.利用支持向量机和字符串核识别微小RNA前体
Genomics Proteomics Bioinformatics. 2008 Jun;6(2):121-8. doi: 10.1016/S1672-0229(08)60027-3.
5
Deep neural networks for human microRNA precursor detection.用于人类 microRNA 前体检测的深度神经网络。
BMC Bioinformatics. 2020 Jan 13;21(1):17. doi: 10.1186/s12859-020-3339-7.
6
Genetic algorithm-based efficient feature selection for classification of pre-miRNAs.基于遗传算法的用于前体微小RNA分类的高效特征选择
Genet Mol Res. 2011 Apr 12;10(2):588-603. doi: 10.4238/vol10-2gmr969.
7
MiRFinder: an improved approach and software implementation for genome-wide fast microRNA precursor scans.MiRFinder:一种用于全基因组快速微小RNA前体扫描的改进方法及软件实现
BMC Bioinformatics. 2007 Sep 17;8:341. doi: 10.1186/1471-2105-8-341.
8
PlantMiRNAPred: efficient classification of real and pseudo plant pre-miRNAs.PlantMiRNAPred:真核和拟南芥前体 miRNA 的高效分类。
Bioinformatics. 2011 May 15;27(10):1368-76. doi: 10.1093/bioinformatics/btr153. Epub 2011 Mar 26.
9
Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine.利用局部结构序列特征和支持向量机对真实和伪微小RNA前体进行分类
BMC Bioinformatics. 2005 Dec 29;6:310. doi: 10.1186/1471-2105-6-310.
10
MatPred: Computational Identification of Mature MicroRNAs within Novel Pre-MicroRNAs.MatPred:新型前体微小RNA中成熟微小RNA的计算识别
Biomed Res Int. 2015;2015:546763. doi: 10.1155/2015/546763. Epub 2015 Nov 23.

引用本文的文献

1
BreCML: identifying breast cancer cell state in scRNA-seq via machine learning.BreCML:通过机器学习在单细胞RNA测序中识别乳腺癌细胞状态。
Front Med (Lausanne). 2024 Nov 6;11:1482726. doi: 10.3389/fmed.2024.1482726. eCollection 2024.
2
Deciphering the role of miRNA-134 in the pathophysiology of depression: A comprehensive review.解读miRNA - 134在抑郁症病理生理学中的作用:一项综合综述。
Heliyon. 2024 Oct 5;10(19):e39026. doi: 10.1016/j.heliyon.2024.e39026. eCollection 2024 Oct 15.
3
ScnML models single-cell transcriptome to predict spinal cord neuronal cell status.

本文引用的文献

1
New 3D graphical representation for RNA structure analysis and its application in the pre-miRNA identification of plants.用于RNA结构分析的新型3D图形表示及其在植物前体miRNA鉴定中的应用。
RSC Adv. 2018 Sep 3;8(54):30833-30841. doi: 10.1039/c8ra04138e. eCollection 2018 Aug 30.
2
An Integrated Framework for Identifying Mutated Driver Pathway and Cancer Progression.用于识别突变驱动途径和癌症进展的综合框架。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Mar-Apr;16(2):455-464. doi: 10.1109/TCBB.2017.2788016. Epub 2017 Dec 29.
3
An efficient strategy for identifying cancer-related key genes based on graph entropy.
ScnML对单细胞转录组进行建模以预测脊髓神经元细胞状态。
Front Genet. 2024 Jun 4;15:1413484. doi: 10.3389/fgene.2024.1413484. eCollection 2024.
4
GeneAI 3.0: powerful, novel, generalized hybrid and ensemble deep learning frameworks for miRNA species classification of stationary patterns from nucleotides.GeneAI 3.0:强大的、新颖的、通用的混合和集成深度学习框架,用于对核苷酸静止模式的 miRNA 物种进行分类。
Sci Rep. 2024 Mar 26;14(1):7154. doi: 10.1038/s41598-024-56786-9.
5
NRTPredictor: identifying rice root cell state in single-cell RNA-seq via ensemble learning.NRTPredictor:通过集成学习在单细胞RNA测序中识别水稻根细胞状态
Plant Methods. 2023 Nov 4;19(1):119. doi: 10.1186/s13007-023-01092-0.
6
RNAinsecta: A tool for prediction of precursor microRNA in insects and search for their target in the model organism Drosophila melanogaster.RNAinsecta:一种用于预测昆虫前体 microRNA 的工具,并在模式生物黑腹果蝇中搜索其靶标。
PLoS One. 2023 Oct 9;18(10):e0287323. doi: 10.1371/journal.pone.0287323. eCollection 2023.
7
MncR: Late Integration Machine Learning Model for Classification of ncRNA Classes Using Sequence and Structural Encoding.MncR:一种基于序列和结构编码的用于 ncRNA 分类的晚期集成机器学习模型。
Int J Mol Sci. 2023 May 17;24(10):8884. doi: 10.3390/ijms24108884.
8
Deciphering the role of precursor miR-12136 and miR-8485 in the progression of intellectual disability (ID).解析前体miR-12136和miR-8485在智力障碍(ID)进展中的作用。
IBRO Neurosci Rep. 2022 Oct 21;13:393-401. doi: 10.1016/j.ibneur.2022.10.005. eCollection 2022 Dec.
9
De novo assembly and characterization of the draft genome of the cashew (Anacardium occidentale L.).腰果(Anacardium occidentale L.)基因组草图的从头组装与特征分析。
Sci Rep. 2022 Oct 28;12(1):18187. doi: 10.1038/s41598-022-22600-7.
10
ADH-PPI: An attention-based deep hybrid model for protein-protein interaction prediction.ADH-PPI:一种用于蛋白质-蛋白质相互作用预测的基于注意力机制的深度混合模型。
iScience. 2022 Sep 21;25(10):105169. doi: 10.1016/j.isci.2022.105169. eCollection 2022 Oct 21.
基于图熵的癌症相关关键基因识别的有效策略。
Comput Biol Chem. 2018 Jun;74:142-148. doi: 10.1016/j.compbiolchem.2018.03.022. Epub 2018 Mar 21.
4
SkipCPP-Pred: an improved and promising sequence-based predictor for predicting cell-penetrating peptides.SkipCPP-Pred:一种改进的、有前途的基于序列的细胞穿透肽预测器。
BMC Genomics. 2017 Oct 16;18(Suppl 7):742. doi: 10.1186/s12864-017-4128-1.
5
A novel method to identify pre-microRNA in various species knowledge base on various species.一种基于多种物种知识库来鉴定多种物种中前体微小RNA的新方法。
J Biomed Semantics. 2017 Sep 20;8(Suppl 1):30. doi: 10.1186/s13326-017-0143-z.
6
Sequence-based information-theoretic features for gene essentiality prediction.用于基因必需性预测的基于序列的信息论特征。
BMC Bioinformatics. 2017 Nov 9;18(1):473. doi: 10.1186/s12859-017-1884-5.
7
A graph regularized non-negative matrix factorization method for identifying microRNA-disease associations.一种用于识别微小RNA与疾病关联的图正则化非负矩阵分解方法。
Bioinformatics. 2018 Jan 15;34(2):239-248. doi: 10.1093/bioinformatics/btx545.
8
Hypothalamic stem cells control ageing speed partly through exosomal miRNAs.下丘脑干细胞部分通过外泌体微小RNA控制衰老速度。
Nature. 2017 Aug 3;548(7665):52-57. doi: 10.1038/nature23282. Epub 2017 Jul 26.
9
Identification of microRNA precursors using reduced and hybrid features.利用简化特征和混合特征识别微小RNA前体
Mol Biosyst. 2017 Jul 25;13(8):1640-1645. doi: 10.1039/c7mb00115k.
10
Delineating the impact of machine learning elements in pre-microRNA detection.描绘机器学习元素在pre-miRNA检测中的影响。
PeerJ. 2017 Mar 29;5:e3131. doi: 10.7717/peerj.3131. eCollection 2017.