• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PLEKv2:基于内在序列特征和编码网络模型预测 lncRNAs 和 mRNAs。

PLEKv2: predicting lncRNAs and mRNAs based on intrinsic sequence features and the coding-net model.

机构信息

Shaanxi Key Laboratory for Network Computing and Security Technology, School of Computer Science and Engineering, Xi'an University of Technology, Xi'an, Shaanxi, 710048, China.

Department of Information Engineering, College of Technology, Hubei Engineering University, Xiaogan, Hubei, 432000, China.

出版信息

BMC Genomics. 2024 Aug 2;25(1):756. doi: 10.1186/s12864-024-10662-y.

DOI:10.1186/s12864-024-10662-y
PMID:39095710
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11295476/
Abstract

BACKGROUND

Long non-coding RNAs (lncRNAs) are RNA transcripts of more than 200 nucleotides that do not encode canonical proteins. Their biological structure is similar to messenger RNAs (mRNAs). To distinguish between lncRNA and mRNA transcripts quickly and accurately, we upgraded the PLEK alignment-free tool to its next version, PLEKv2, and constructed models tailored for both animals and plants.

RESULTS

PLEKv2 can achieve 98.7% prediction accuracy for human datasets. Compared with classical tools and deep learning-based models, this is 8.1%, 3.7%, 16.6%, 1.4%, 4.9%, and 48.9% higher than CPC2, CNCI, Wen et al.'s CNN, LncADeep, PLEK, and NcResNet, respectively. The accuracy of PLEKv2 was > 90% for cross-species prediction. PLEKv2 is more effective and robust than CPC2, CNCI, LncADeep, PLEK, and NcResNet for primate datasets (including chimpanzees, macaques, and gorillas). Moreover, PLEKv2 is not only suitable for non-human primates that are closely related to humans, but can also predict the coding ability of RNA sequences in plants such as Arabidopsis.

CONCLUSIONS

The experimental results illustrate that the model constructed by PLEKv2 can distinguish lncRNAs and mRNAs better than PLEK. The PLEKv2 software is freely available at https://sourceforge.net/projects/plek2/ .

摘要

背景

长非编码 RNA(lncRNA)是长度超过 200 个核苷酸的 RNA 转录物,不编码规范蛋白质。它们的生物结构与信使 RNA(mRNA)相似。为了快速准确地区分 lncRNA 和 mRNA 转录物,我们将 PLEK 无比对工具升级到其下一个版本 PLEKv2,并为动物和植物构建了定制模型。

结果

PLEKv2 可以实现 98.7%的人类数据集预测准确率。与经典工具和基于深度学习的模型相比,这分别高出 8.1%、3.7%、16.6%、1.4%、4.9%和 48.9%。与 CPC2、CNCI、Wen 等人的 CNN、LncADeep、PLEK 和 NcResNet 相比,PLEKv2 的跨物种预测准确率>90%。PLEKv2 比 CPC2、CNCI、LncADeep、PLEK 和 NcResNet 对灵长类数据集(包括黑猩猩、猕猴和大猩猩)更有效和稳健。此外,PLEKv2 不仅适用于与人类密切相关的非灵长类动物,还可以预测拟南芥等植物中 RNA 序列的编码能力。

结论

实验结果表明,PLEKv2 构建的模型比 PLEK 能更好地区分 lncRNA 和 mRNA。PLEKv2 软件可在 https://sourceforge.net/projects/plek2/ 免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4492/11295476/ab556b919dac/12864_2024_10662_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4492/11295476/fcb9b688a139/12864_2024_10662_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4492/11295476/66790421103c/12864_2024_10662_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4492/11295476/849de8c5c3c0/12864_2024_10662_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4492/11295476/84a443ab9aac/12864_2024_10662_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4492/11295476/ab556b919dac/12864_2024_10662_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4492/11295476/fcb9b688a139/12864_2024_10662_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4492/11295476/66790421103c/12864_2024_10662_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4492/11295476/849de8c5c3c0/12864_2024_10662_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4492/11295476/84a443ab9aac/12864_2024_10662_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4492/11295476/ab556b919dac/12864_2024_10662_Fig5_HTML.jpg

相似文献

1
PLEKv2: predicting lncRNAs and mRNAs based on intrinsic sequence features and the coding-net model.PLEKv2:基于内在序列特征和编码网络模型预测 lncRNAs 和 mRNAs。
BMC Genomics. 2024 Aug 2;25(1):756. doi: 10.1186/s12864-024-10662-y.
2
PLEK: a tool for predicting long non-coding RNAs and messenger RNAs based on an improved k-mer scheme.PLEK:一种基于改进的k-mer方案预测长链非编码RNA和信使RNA的工具。
BMC Bioinformatics. 2014 Sep 19;15(1):311. doi: 10.1186/1471-2105-15-311.
3
PlncRNA-HDeep: plant long noncoding RNA prediction using hybrid deep learning based on two encoding styles.PlncRNA-HDeep:基于两种编码方式的混合深度学习进行植物长链非编码RNA预测
BMC Bioinformatics. 2021 May 12;22(Suppl 3):242. doi: 10.1186/s12859-020-03870-2.
4
Prediction of Long Non-Coding RNAs Based on Deep Learning.基于深度学习的长非编码 RNA 预测。
Genes (Basel). 2019 Apr 3;10(4):273. doi: 10.3390/genes10040273.
5
BASiNET-BiologicAl Sequences NETwork: a case study on coding and non-coding RNAs identification.BASiNET-生物序列 NETwork:一个关于编码和非编码 RNA 鉴定的案例研究。
Nucleic Acids Res. 2018 Sep 19;46(16):e96. doi: 10.1093/nar/gky462.
6
lncRNA-MFDL: identification of human long non-coding RNAs by fusing multiple features and using deep learning.lncRNA-MFDL:通过融合多种特征并运用深度学习来鉴定人类长链非编码RNA
Mol Biosyst. 2015 Mar;11(3):892-7. doi: 10.1039/c4mb00650j. Epub 2015 Jan 15.
7
EVlncRNA-Dpred: improved prediction of experimentally validated lncRNAs by deep learning.EVlncRNA-Dpred:通过深度学习提高实验验证的 lncRNA 预测。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac583.
8
lncScore: alignment-free identification of long noncoding RNA from assembled novel transcripts.lncScore:基于组装新转录本的长链非编码 RNA 无比对识别
Sci Rep. 2016 Oct 6;6:34838. doi: 10.1038/srep34838.
9
LncADeep: an ab initio lncRNA identification and functional annotation tool based on deep learning.LncADeep:一种基于深度学习的从头鉴定长非编码 RNA 及其功能注释工具。
Bioinformatics. 2018 Nov 15;34(22):3825-3834. doi: 10.1093/bioinformatics/bty428.
10
PLIT: An alignment-free computational tool for identification of long non-coding RNAs in plant transcriptomic datasets.PLIT:一种用于鉴定植物转录组数据中长非编码 RNA 的无比对计算工具。
Comput Biol Med. 2019 Feb;105:169-181. doi: 10.1016/j.compbiomed.2018.12.014. Epub 2019 Jan 4.

引用本文的文献

1
LMFE: A Novel Method for Predicting Plant LncRNA Based on Multi-Feature Fusion and Ensemble Learning.LMFE:一种基于多特征融合与集成学习预测植物长链非编码RNA的新方法。
Genes (Basel). 2025 Mar 31;16(4):424. doi: 10.3390/genes16040424.

本文引用的文献

1
Ensembl Genomes 2022: an expanding genome resource for non-vertebrates.Ensembl Genomes 2022:一个不断扩展的非脊椎动物基因组资源。
Nucleic Acids Res. 2022 Jan 7;50(D1):D996-D1003. doi: 10.1093/nar/gkab1007.
2
Long noncoding RNAs in cancer metastasis.长非编码 RNA 在癌症转移中的作用。
Nat Rev Cancer. 2021 Jul;21(7):446-460. doi: 10.1038/s41568-021-00353-1. Epub 2021 May 5.
3
The computational approaches of lncRNA identification based on coding potential: and challenges.基于编码潜能的lncRNA识别计算方法及挑战
Comput Struct Biotechnol J. 2020 Nov 19;18:3666-3677. doi: 10.1016/j.csbj.2020.11.030. eCollection 2020.
4
GENCODE 2021.GENCODE 2021.
Nucleic Acids Res. 2021 Jan 8;49(D1):D916-D923. doi: 10.1093/nar/gkaa1087.
5
Ensembl 2021.Ensembl 2021.
Nucleic Acids Res. 2021 Jan 8;49(D1):D884-D891. doi: 10.1093/nar/gkaa942.
6
lncRNA_Mdeep: An Alignment-Free Predictor for Distinguishing Long Non-Coding RNAs from Protein-Coding Transcripts by Multimodal Deep Learning.lncRNA_Mdeep:一种基于多模态深度学习的无比对长非编码 RNA 与蛋白编码转录本区分预测器。
Int J Mol Sci. 2020 Jul 23;21(15):5222. doi: 10.3390/ijms21155222.
7
Research Progress on Plant Long Non-Coding RNA.植物长链非编码RNA的研究进展
Plants (Basel). 2020 Mar 25;9(4):408. doi: 10.3390/plants9040408.
8
Pervasive functional translation of noncanonical human open reading frames.广泛存在的非规范人类开放阅读框的功能翻译。
Science. 2020 Mar 6;367(6482):1140-1146. doi: 10.1126/science.aay0262.
9
A classification model for lncRNA and mRNA based on k-mers and a convolutional neural network.基于 k- -mer 和卷积神经网络的 lncRNA 和 mRNA 分类模型。
BMC Bioinformatics. 2019 Sep 13;20(1):469. doi: 10.1186/s12859-019-3039-3.
10
CPPred: coding potential prediction based on the global description of RNA sequence.CPPred:基于 RNA 序列全局描述的编码潜能预测。
Nucleic Acids Res. 2019 May 7;47(8):e43. doi: 10.1093/nar/gkz087.