• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于深度学习的长非编码 RNA 预测。

Prediction of Long Non-Coding RNAs Based on Deep Learning.

机构信息

School of Mathematics and Physics, University of Science and Technology Beijing, Beijing 100083, China.

出版信息

Genes (Basel). 2019 Apr 3;10(4):273. doi: 10.3390/genes10040273.

DOI:10.3390/genes10040273
PMID:30987229
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6523782/
Abstract

With the rapid development of high-throughput sequencing technology, a large number of transcript sequences have been discovered, and how to identify long non-coding RNAs (lncRNAs) from transcripts is a challenging task. The identification and inclusion of lncRNAs not only can more clearly help us to understand life activities themselves, but can also help humans further explore and study the disease at the molecular level. At present, the detection of lncRNAs mainly includes two forms of calculation and experiment. Due to the limitations of bio sequencing technology and ineluctable errors in sequencing processes, the detection effect of these methods is  not very satisfactory. In this paper, we constructed a deep-learning model to effectively distinguish lncRNAs from mRNAs. We used k-mer embedding vectors obtained through training the GloVe algorithm as input features and set up the deep learning framework to include a bidirectional long short-term memory model (BLSTM) layer and a convolutional neural network (CNN) layer with three additional hidden layers. By testing our model, we have found that it obtained the best values of 97.9%, 96.4% and 99.0% in F1score, accuracy and auROC, respectively, which showed better classification performance than the traditional PLEK, CNCI and CPC methods for identifying lncRNAs. We hope that our model will provide effective help in distinguishing mature mRNAs from lncRNAs, and become a potential tool to help humans understand and detect the diseases associated with lncRNAs.

摘要

随着高通量测序技术的飞速发展,大量的转录本序列被发现,如何从转录本中鉴定长非编码 RNA(lncRNA)是一项具有挑战性的任务。鉴定和包含 lncRNA 不仅可以更清楚地帮助我们理解生命活动本身,还可以帮助人类在分子水平上进一步探索和研究疾病。目前,lncRNA 的检测主要包括计算和实验两种形式。由于生物测序技术的局限性和测序过程中不可避免的错误,这些方法的检测效果并不十分理想。在本文中,我们构建了一个深度学习模型,以有效地从 mRNAs 中区分 lncRNAs。我们使用通过训练 GloVe 算法获得的 k-mer 嵌入向量作为输入特征,并建立了深度学习框架,包括一个双向长短期记忆模型(BLSTM)层和一个卷积神经网络(CNN)层,其中包含三个额外的隐藏层。通过测试我们的模型,我们发现它在 F1score、准确性和 auROC 方面分别获得了 97.9%、96.4%和 99.0%的最佳值,这表明与传统的 PLEK、CNCI 和 CPC 方法相比,它在识别 lncRNA 方面具有更好的分类性能。我们希望我们的模型将为区分成熟的 mRNAs 和 lncRNAs 提供有效的帮助,并成为帮助人类理解和检测与 lncRNAs 相关疾病的潜在工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/577e/6523782/62802d7dd427/genes-10-00273-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/577e/6523782/404ef84b5533/genes-10-00273-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/577e/6523782/74804c859aad/genes-10-00273-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/577e/6523782/4d3657ae8839/genes-10-00273-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/577e/6523782/68118f858b0c/genes-10-00273-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/577e/6523782/62802d7dd427/genes-10-00273-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/577e/6523782/404ef84b5533/genes-10-00273-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/577e/6523782/74804c859aad/genes-10-00273-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/577e/6523782/4d3657ae8839/genes-10-00273-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/577e/6523782/68118f858b0c/genes-10-00273-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/577e/6523782/62802d7dd427/genes-10-00273-g005.jpg

相似文献

1
Prediction of Long Non-Coding RNAs Based on Deep Learning.基于深度学习的长非编码 RNA 预测。
Genes (Basel). 2019 Apr 3;10(4):273. doi: 10.3390/genes10040273.
2
PLEK: a tool for predicting long non-coding RNAs and messenger RNAs based on an improved k-mer scheme.PLEK:一种基于改进的k-mer方案预测长链非编码RNA和信使RNA的工具。
BMC Bioinformatics. 2014 Sep 19;15(1):311. doi: 10.1186/1471-2105-15-311.
3
lncRNA-MFDL: identification of human long non-coding RNAs by fusing multiple features and using deep learning.lncRNA-MFDL:通过融合多种特征并运用深度学习来鉴定人类长链非编码RNA
Mol Biosyst. 2015 Mar;11(3):892-7. doi: 10.1039/c4mb00650j. Epub 2015 Jan 15.
4
PlncRNA-HDeep: plant long noncoding RNA prediction using hybrid deep learning based on two encoding styles.PlncRNA-HDeep:基于两种编码方式的混合深度学习进行植物长链非编码RNA预测
BMC Bioinformatics. 2021 May 12;22(Suppl 3):242. doi: 10.1186/s12859-020-03870-2.
5
EVlncRNA-Dpred: improved prediction of experimentally validated lncRNAs by deep learning.EVlncRNA-Dpred:通过深度学习提高实验验证的 lncRNA 预测。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac583.
6
PLEKv2: predicting lncRNAs and mRNAs based on intrinsic sequence features and the coding-net model.PLEKv2:基于内在序列特征和编码网络模型预测 lncRNAs 和 mRNAs。
BMC Genomics. 2024 Aug 2;25(1):756. doi: 10.1186/s12864-024-10662-y.
7
LncRNAnet: long non-coding RNA identification using deep learning.LncRNAnet:使用深度学习进行长非编码 RNA 鉴定。
Bioinformatics. 2018 Nov 15;34(22):3889-3897. doi: 10.1093/bioinformatics/bty418.
8
A Support Vector Machine based method to distinguish long non-coding RNAs from protein coding transcripts.基于支持向量机的方法区分长非编码 RNA 与蛋白质编码转录本。
BMC Genomics. 2017 Oct 18;18(1):804. doi: 10.1186/s12864-017-4178-4.
9
LncMirNet: Predicting LncRNA-miRNA Interaction Based on Deep Learning of Ribonucleic Acid Sequences.LncMirNet:基于 RNA 序列深度学习的长非编码 RNA- miRNA 相互作用预测。
Molecules. 2020 Sep 23;25(19):4372. doi: 10.3390/molecules25194372.
10
Machine Learning-Based Annotation of Long Noncoding RNAs Using PLncPRO.基于机器学习的 PLncPRO 长非编码 RNA 注释
Methods Mol Biol. 2020;2107:253-260. doi: 10.1007/978-1-0716-0235-5_12.

引用本文的文献

1
Enhancing LncRNA-miRNA interaction prediction with multimodal contrastive representation learning.通过多模态对比表示学习增强长链非编码RNA-微小RNA相互作用预测
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf281.
2
RNA sequence analysis landscape: A comprehensive review of task types, databases, datasets, word embedding methods, and language models.RNA序列分析全景:任务类型、数据库、数据集、词嵌入方法及语言模型的全面综述
Heliyon. 2025 Jan 6;11(2):e41488. doi: 10.1016/j.heliyon.2024.e41488. eCollection 2025 Jan 30.
3
Leveraging State-of-the-Art AI Algorithms in Personalized Oncology: From Transcriptomics to Treatment.

本文引用的文献

1
Integrative analysis of mRNA and lncRNA profiles identified pathogenetic lncRNAs in esophageal squamous cell carcinoma.mRNA 和 lncRNA 谱的综合分析鉴定出食管鳞癌中的致病 lncRNA。
Gene. 2018 Jun 30;661:169-175. doi: 10.1016/j.gene.2018.03.066. Epub 2018 Mar 28.
2
Long Noncoding RNA LINC01619 Regulates MicroRNA-27a/Forkhead Box Protein O1 and Endoplasmic Reticulum Stress-Mediated Podocyte Injury in Diabetic Nephropathy.长链非编码 RNA LINC01619 通过调节微小 RNA-27a/叉头框蛋白 O1 和内质网应激介导的糖尿病肾病足细胞损伤
Antioxid Redox Signal. 2018 Aug 1;29(4):355-376. doi: 10.1089/ars.2017.7278. Epub 2018 Mar 12.
3
在个性化肿瘤学中利用最先进的人工智能算法:从转录组学到治疗
Diagnostics (Basel). 2024 Sep 29;14(19):2174. doi: 10.3390/diagnostics14192174.
4
Artificial intelligence and bioinformatics: a journey from traditional techniques to smart approaches.人工智能与生物信息学:从传统技术到智能方法的历程。
Gastroenterol Hepatol Bed Bench. 2024;17(3):241-252. doi: 10.22037/ghfbb.v17i3.2977.
5
Current understanding of functional peptides encoded by lncRNA in cancer.目前对lncRNA编码的功能性肽在癌症中的理解。
Cancer Cell Int. 2024 Jul 19;24(1):252. doi: 10.1186/s12935-024-03446-7.
6
Update on functional analysis of long non-coding RNAs in common crops.常见作物中长链非编码RNA的功能分析进展
Front Plant Sci. 2024 May 30;15:1389154. doi: 10.3389/fpls.2024.1389154. eCollection 2024.
7
BioDeepfuse: a hybrid deep learning approach with integrated feature extraction techniques for enhanced non-coding RNA classification.BioDeepfuse:一种混合深度学习方法,结合了集成特征提取技术,用于增强非编码 RNA 分类。
RNA Biol. 2024 Jan;21(1):1-12. doi: 10.1080/15476286.2024.2329451. Epub 2024 Mar 25.
8
DeepMethylation: a deep learning based framework with GloVe and Transformer encoder for DNA methylation prediction.DeepMethylation:一种基于深度学习的框架,使用 GloVe 和 Transformer 编码器进行 DNA 甲基化预测。
PeerJ. 2023 Sep 25;11:e16125. doi: 10.7717/peerj.16125. eCollection 2023.
9
Deep Learning Approaches for lncRNA-Mediated Mechanisms: A Comprehensive Review of Recent Developments.深度学习方法在 lncRNA 介导的机制研究中的应用:最新进展的综合评述。
Int J Mol Sci. 2023 Jun 18;24(12):10299. doi: 10.3390/ijms241210299.
10
High-Accuracy ncRNA Function Prediction via Deep Learning Using Global and Local Sequence Information.通过深度学习利用全局和局部序列信息进行高精度非编码RNA功能预测
Biomedicines. 2023 Jun 3;11(6):1631. doi: 10.3390/biomedicines11061631.
A deep learning method for lincRNA detection using auto-encoder algorithm.
一种使用自动编码器算法进行长链非编码RNA(lincRNA)检测的深度学习方法。
BMC Bioinformatics. 2017 Dec 6;18(Suppl 15):511. doi: 10.1186/s12859-017-1922-3.
4
Chromatin accessibility prediction via convolutional long short-term memory networks with k-mer embedding.基于 k- -mer 嵌入卷积长短期记忆网络的染色质可及性预测。
Bioinformatics. 2017 Jul 15;33(14):i92-i101. doi: 10.1093/bioinformatics/btx234.
5
The long noncoding RNA controls cardiac fibrosis and remodeling.长链非编码RNA控制心脏纤维化和重塑。
Sci Transl Med. 2017 Jun 21;9(395). doi: 10.1126/scitranslmed.aai9118.
6
Microarray profiling and co-expression network analysis of the lncRNAs and mRNAs associated with acute leukemia in adults.成人急性白血病相关lncRNA和mRNA的微阵列分析及共表达网络分析
Mol Biosyst. 2017 May 30;13(6):1102-1108. doi: 10.1039/c6mb00874g.
7
Genome-Wide lncRNA Microarray Profiling Identifies Novel Circulating lncRNAs for Detection of Gastric Cancer.全基因组lncRNA芯片分析鉴定出用于检测胃癌的新型循环lncRNA
Theranostics. 2017 Jan 1;7(1):213-227. doi: 10.7150/thno.16044. eCollection 2017.
8
LncRNApred: Classification of Long Non-Coding RNAs and Protein-Coding Transcripts by the Ensemble Algorithm with a New Hybrid Feature.LncRNApred:基于具有新型混合特征的集成算法对长链非编码RNA和蛋白质编码转录本进行分类
PLoS One. 2016 May 26;11(5):e0154567. doi: 10.1371/journal.pone.0154567. eCollection 2016.
9
Distinct Expression of Long Non-Coding RNAs in an Alzheimer's Disease Model.阿尔茨海默病模型中长链非编码RNA的差异表达
J Alzheimers Dis. 2015;45(3):837-49. doi: 10.3233/JAD-142919.
10
lncRNA-MFDL: identification of human long non-coding RNAs by fusing multiple features and using deep learning.lncRNA-MFDL:通过融合多种特征并运用深度学习来鉴定人类长链非编码RNA
Mol Biosyst. 2015 Mar;11(3):892-7. doi: 10.1039/c4mb00650j. Epub 2015 Jan 15.