• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PredLnc-GFStack:一种基于堆叠集成学习方法的全局序列特征,用于从转录本中预测 lncRNAs。

PredLnc-GFStack: A Global Sequence Feature Based on a Stacked Ensemble Learning Method for Predicting lncRNAs from Transcripts.

机构信息

College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.

School of Computer Science, Wuhan University, Wuhan 430072, China.

出版信息

Genes (Basel). 2019 Sep 3;10(9):672. doi: 10.3390/genes10090672.

DOI:10.3390/genes10090672
PMID:31484412
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6770532/
Abstract

Long non-coding RNAs (lncRNAs) are a class of RNAs with the length exceeding 200 base pairs (bps), which do not encode proteins, nevertheless, lncRNAs have many vital biological functions. A large number of novel transcripts were discovered as a result of the development of high-throughput sequencing technology. Under this circumstance, computational methods for lncRNA prediction are in great demand. In this paper, we consider global sequence features and propose a stacked ensemble learning-based method to predict lncRNAs from transcripts, abbreviated as PredLnc-GFStack. We extract the critical features from the candidate feature list using the genetic algorithm (GA) and then employ the stacked ensemble learning method to construct PredLnc-GFStack model. Computational experimental results show that PredLnc-GFStack outperforms several state-of-the-art methods for lncRNA prediction. Furthermore, PredLnc-GFStack demonstrates an outstanding ability for cross-species ncRNA prediction.

摘要

长链非编码 RNA(lncRNA)是一类长度超过 200 个碱基对(bp)的 RNA,不编码蛋白质,但 lncRNA 具有许多重要的生物学功能。高通量测序技术的发展发现了大量新的转录本。在这种情况下,对 lncRNA 预测的计算方法的需求很大。在本文中,我们考虑全局序列特征,并提出了一种基于堆叠集成学习的方法从转录本中预测 lncRNA,简称为 PredLnc-GFStack。我们使用遗传算法(GA)从候选特征列表中提取关键特征,然后使用堆叠集成学习方法构建 PredLnc-GFStack 模型。计算实验结果表明,PredLnc-GFStack 在 lncRNA 预测方面优于几种先进的方法。此外,PredLnc-GFStack 还展示了出色的跨物种 ncRNA 预测能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a455/6770532/742efaf9ecda/genes-10-00672-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a455/6770532/00b22d0b7e40/genes-10-00672-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a455/6770532/bf8d67fdf1f3/genes-10-00672-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a455/6770532/ec45792a60ac/genes-10-00672-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a455/6770532/742efaf9ecda/genes-10-00672-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a455/6770532/00b22d0b7e40/genes-10-00672-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a455/6770532/bf8d67fdf1f3/genes-10-00672-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a455/6770532/ec45792a60ac/genes-10-00672-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a455/6770532/742efaf9ecda/genes-10-00672-g004.jpg

相似文献

1
PredLnc-GFStack: A Global Sequence Feature Based on a Stacked Ensemble Learning Method for Predicting lncRNAs from Transcripts.PredLnc-GFStack:一种基于堆叠集成学习方法的全局序列特征,用于从转录本中预测 lncRNAs。
Genes (Basel). 2019 Sep 3;10(9):672. doi: 10.3390/genes10090672.
2
Predicting Long non-coding RNAs through feature ensemble learning.通过特征集成学习预测长非编码 RNA。
BMC Genomics. 2020 Dec 17;21(Suppl 13):865. doi: 10.1186/s12864-020-07237-y.
3
SFPEL-LPI: Sequence-based feature projection ensemble learning for predicting LncRNA-protein interactions.SFPEL-LPI:基于序列的特征投影集成学习预测 LncRNA-蛋白质相互作用。
PLoS Comput Biol. 2018 Dec 11;14(12):e1006616. doi: 10.1371/journal.pcbi.1006616. eCollection 2018 Dec.
4
CRlncRC: a machine learning-based method for cancer-related long noncoding RNA identification using integrated features.CRlncRC:一种基于机器学习的方法,利用整合特征识别癌症相关长链非编码RNA
BMC Med Genomics. 2018 Dec 31;11(Suppl 6):120. doi: 10.1186/s12920-018-0436-9.
5
Evolutionary analysis across mammals reveals distinct classes of long non-coding RNAs.对哺乳动物的进化分析揭示了长链非编码RNA的不同类别。
Genome Biol. 2016 Feb 2;17:19. doi: 10.1186/s13059-016-0880-9.
6
Methods to Study Long Noncoding RNA Expression and Dynamics in Zebrafish Using RNA Sequencing.利用RNA测序研究斑马鱼中长链非编码RNA表达及动态变化的方法
Methods Mol Biol. 2019;1912:77-110. doi: 10.1007/978-1-4939-8982-9_4.
7
RPITER: A Hierarchical Deep Learning Framework for ncRNA⁻Protein Interaction Prediction.RPITER:一种用于 ncRNA-蛋白质相互作用预测的分层深度学习框架。
Int J Mol Sci. 2019 Mar 1;20(5):1070. doi: 10.3390/ijms20051070.
8
Prediction of plant lncRNA by ensemble machine learning classifiers.基于集成机器学习分类器的植物 lncRNA 预测。
BMC Genomics. 2018 May 2;19(1):316. doi: 10.1186/s12864-018-4665-2.
9
MAGCNSE: predicting lncRNA-disease associations using multi-view attention graph convolutional network and stacking ensemble model.MAGCNSE:基于多视图注意力图卷积网络和堆叠集成模型预测 lncRNA-疾病关联
BMC Bioinformatics. 2022 May 19;23(1):189. doi: 10.1186/s12859-022-04715-w.
10
A systematic review of computational methods for predicting long noncoding RNAs.长非编码 RNA 预测的计算方法系统评价
Brief Funct Genomics. 2021 Jun 9;20(3):162-173. doi: 10.1093/bfgp/elab016.

引用本文的文献

1
Decoding the Non-coding: Tools and Databases Unveiling the Hidden World of "Junk" RNAs for Innovative Therapeutic Exploration.解码非编码:工具与数据库揭示“垃圾”RNA的隐秘世界以进行创新治疗探索
ACS Pharmacol Transl Sci. 2024 Jun 7;7(7):1901-1915. doi: 10.1021/acsptsci.3c00388. eCollection 2024 Jul 12.
2
RNAincoder: a deep learning-based encoder for RNA and RNA-associated interaction.RNAincoder:一种基于深度学习的 RNA 及其相关相互作用的编码器。
Nucleic Acids Res. 2023 Jul 5;51(W1):W509-W519. doi: 10.1093/nar/gkad404.
3
Computational prediction of disease related lncRNAs using machine learning.

本文引用的文献

1
Coding or Noncoding, the Converging Concepts of RNAs.编码或非编码,RNA的趋同概念
Front Genet. 2019 May 22;10:496. doi: 10.3389/fgene.2019.00496. eCollection 2019.
2
CPPred: coding potential prediction based on the global description of RNA sequence.CPPred:基于 RNA 序列全局描述的编码潜能预测。
Nucleic Acids Res. 2019 May 7;47(8):e43. doi: 10.1093/nar/gkz087.
3
SFPEL-LPI: Sequence-based feature projection ensemble learning for predicting LncRNA-protein interactions.SFPEL-LPI:基于序列的特征投影集成学习预测 LncRNA-蛋白质相互作用。
基于机器学习的疾病相关长非编码 RNA 的计算预测。
Sci Rep. 2023 Jan 16;13(1):806. doi: 10.1038/s41598-023-27680-7.
4
A large-scale benchmark study of tools for the classification of protein-coding and non-coding RNAs.大规模基准研究工具用于蛋白质编码和非编码 RNA 的分类。
Nucleic Acids Res. 2022 Nov 28;50(21):12094-12111. doi: 10.1093/nar/gkac1092.
5
Opportunities and Challenges of Predictive Approaches for the Non-coding RNA in Plants.植物非编码RNA预测方法的机遇与挑战
Front Plant Sci. 2022 Apr 14;13:890663. doi: 10.3389/fpls.2022.890663. eCollection 2022.
6
Common Features in lncRNA Annotation and Classification: A Survey.长链非编码RNA注释与分类的共同特征:一项综述。
Noncoding RNA. 2021 Dec 13;7(4):77. doi: 10.3390/ncrna7040077.
7
Advances in Computational Methodologies for Classification and Sub-Cellular Locality Prediction of Non-Coding RNAs.计算方法在非编码 RNA 分类和亚细胞定位预测中的研究进展。
Int J Mol Sci. 2021 Aug 13;22(16):8719. doi: 10.3390/ijms22168719.
8
The computational approaches of lncRNA identification based on coding potential: and challenges.基于编码潜能的lncRNA识别计算方法及挑战
Comput Struct Biotechnol J. 2020 Nov 19;18:3666-3677. doi: 10.1016/j.csbj.2020.11.030. eCollection 2020.
9
A Bioinformatics Tool for the Prediction of DNA N6-Methyladenine Modifications Based on Feature Fusion and Optimization Protocol.一种基于特征融合与优化协议的DNA N6-甲基腺嘌呤修饰预测的生物信息学工具。
Front Bioeng Biotechnol. 2020 Jun 4;8:502. doi: 10.3389/fbioe.2020.00502. eCollection 2020.
PLoS Comput Biol. 2018 Dec 11;14(12):e1006616. doi: 10.1371/journal.pcbi.1006616. eCollection 2018 Dec.
4
LncADeep: an ab initio lncRNA identification and functional annotation tool based on deep learning.LncADeep:一种基于深度学习的从头鉴定长非编码 RNA 及其功能注释工具。
Bioinformatics. 2018 Nov 15;34(22):3825-3834. doi: 10.1093/bioinformatics/bty428.
5
LncRNAnet: long non-coding RNA identification using deep learning.LncRNAnet:使用深度学习进行长非编码 RNA 鉴定。
Bioinformatics. 2018 Nov 15;34(22):3889-3897. doi: 10.1093/bioinformatics/bty418.
6
Prediction of plant lncRNA by ensemble machine learning classifiers.基于集成机器学习分类器的植物 lncRNA 预测。
BMC Genomics. 2018 May 2;19(1):316. doi: 10.1186/s12864-018-4665-2.
7
Predicting human splicing branchpoints by combining sequence-derived features and multi-label learning methods.通过结合序列衍生特征和多标签学习方法预测人类剪接分支点
BMC Bioinformatics. 2017 Dec 1;18(Suppl 13):464. doi: 10.1186/s12859-017-1875-6.
8
A Support Vector Machine based method to distinguish long non-coding RNAs from protein coding transcripts.基于支持向量机的方法区分长非编码 RNA 与蛋白质编码转录本。
BMC Genomics. 2017 Oct 18;18(1):804. doi: 10.1186/s12864-017-4178-4.
9
Technological Developments in lncRNA Biology.长链非编码RNA生物学的技术发展
Adv Exp Med Biol. 2017;1008:283-323. doi: 10.1007/978-981-10-5203-3_10.
10
CPC2: a fast and accurate coding potential calculator based on sequence intrinsic features.CPC2:一种基于序列固有特征的快速准确编码潜能计算器。
Nucleic Acids Res. 2017 Jul 3;45(W1):W12-W16. doi: 10.1093/nar/gkx428.