Suppr超能文献

深度学习在生物序列数据上的应用:实例与解决方案。

An introduction to deep learning on biological sequence data: examples and solutions.

机构信息

Department of Bio and Health Informatics.

Department of Applied Mathematics and Computer Science, Technical University of Denmark, Lyngby, Denmark.

出版信息

Bioinformatics. 2017 Nov 15;33(22):3685-3690. doi: 10.1093/bioinformatics/btx531.

Abstract

MOTIVATION

Deep neural network architectures such as convolutional and long short-term memory networks have become increasingly popular as machine learning tools during the recent years. The availability of greater computational resources, more data, new algorithms for training deep models and easy to use libraries for implementation and training of neural networks are the drivers of this development. The use of deep learning has been especially successful in image recognition; and the development of tools, applications and code examples are in most cases centered within this field rather than within biology.

RESULTS

Here, we aim to further the development of deep learning methods within biology by providing application examples and ready to apply and adapt code templates. Given such examples, we illustrate how architectures consisting of convolutional and long short-term memory neural networks can relatively easily be designed and trained to state-of-the-art performance on three biological sequence problems: prediction of subcellular localization, protein secondary structure and the binding of peptides to MHC Class II molecules.

AVAILABILITY AND IMPLEMENTATION

All implementations and datasets are available online to the scientific community at https://github.com/vanessajurtz/lasagne4bio.

CONTACT

skaaesonderby@gmail.com.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

近年来,卷积神经网络和长短期记忆网络等深度神经网络架构作为机器学习工具变得越来越流行。更大的计算资源、更多的数据、用于训练深度模型的新算法以及易于使用的神经网络实现和训练库是推动这一发展的因素。深度学习在图像识别中特别成功;并且工具、应用程序和代码示例的开发通常集中在这个领域内,而不是生物学领域内。

结果

在这里,我们旨在通过提供应用示例和可立即应用和改编的代码模板,在生物学中进一步开发深度学习方法。有了这样的例子,我们说明了如何相对容易地设计和训练由卷积和长短期记忆神经网络组成的架构,以达到三个生物学序列问题的最新性能:亚细胞定位、蛋白质二级结构和肽与 MHC 类 II 分子结合的预测。

可用性和实现

所有实现和数据集都可在网上向科学界提供,网址为 https://github.com/vanessajurtz/lasagne4bio。

联系人

skaaesonderby@gmail.com

补充信息

补充数据可在《生物信息学》在线获得。

相似文献

5
HLA class I binding prediction via convolutional neural networks.基于卷积神经网络的 HLA 类 I 结合预测。
Bioinformatics. 2017 Sep 1;33(17):2658-2665. doi: 10.1093/bioinformatics/btx264.
7
Deep learning improves antimicrobial peptide recognition.深度学习提高抗菌肽识别能力。
Bioinformatics. 2018 Aug 15;34(16):2740-2747. doi: 10.1093/bioinformatics/bty179.
9
Deep Learning and Its Applications in Biomedicine.深度学习及其在生物医学中的应用。
Genomics Proteomics Bioinformatics. 2018 Feb;16(1):17-32. doi: 10.1016/j.gpb.2017.07.003. Epub 2018 Mar 6.

引用本文的文献

1
Machine and deep learning to predict viral fusion peptides.用于预测病毒融合肽的机器学习与深度学习
Comput Struct Biotechnol J. 2025 Feb 18;27:692-704. doi: 10.1016/j.csbj.2025.02.011. eCollection 2025.
2
Reliable estimation of tree branch lengths using deep neural networks.利用深度神经网络可靠估计树枝长度。
PLoS Comput Biol. 2024 Aug 5;20(8):e1012337. doi: 10.1371/journal.pcbi.1012337. eCollection 2024 Aug.
4
Protein sequence analysis in the context of drug repurposing.药物再利用背景下的蛋白质序列分析。
BMC Med Inform Decis Mak. 2024 May 13;24(1):122. doi: 10.1186/s12911-024-02531-1.
7
Artificial intelligence for dementia genetics and omics.人工智能在痴呆症遗传学和组学中的应用。
Alzheimers Dement. 2023 Dec;19(12):5905-5921. doi: 10.1002/alz.13427. Epub 2023 Aug 22.
8
Immunoinformatics: Predicting Peptide-MHC Binding.免疫信息学:预测肽-MHC结合
Annu Rev Biomed Data Sci. 2020 Jul;3:191-215. doi: 10.1146/annurev-biodatasci-021920-100259. Epub 2020 Apr 27.
10
Nucleotide augmentation for machine learning-guided protein engineering.用于机器学习引导蛋白质工程的核苷酸增强
Bioinform Adv. 2022 Dec 9;3(1):vbac094. doi: 10.1093/bioadv/vbac094. eCollection 2023.

本文引用的文献

1
Automatic Segmentation of MR Brain Images With a Convolutional Neural Network.基于卷积神经网络的磁共振脑图像自动分割。
IEEE Trans Med Imaging. 2016 May;35(5):1252-1261. doi: 10.1109/TMI.2016.2548501. Epub 2016 Mar 30.
5
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.
7
Deep learning in neural networks: an overview.神经网络中的深度学习:综述。
Neural Netw. 2015 Jan;61:85-117. doi: 10.1016/j.neunet.2014.09.003. Epub 2014 Oct 13.
8
Deep learning of the tissue-regulated splicing code.深度学习组织调控的剪接代码。
Bioinformatics. 2014 Jun 15;30(12):i121-9. doi: 10.1093/bioinformatics/btu277.
10
The protein-folding problem, 50 years on.蛋白质折叠问题:50 年的探索
Science. 2012 Nov 23;338(6110):1042-6. doi: 10.1126/science.1219021.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验