High-Performance Deep Learning Toolbox for Genome-Scale Prediction of Protein Structure and Function.

Author information

Gao Mu, Lund-Andersen Peik, Morehead Alex, Mahmud Sajid, Chen Chen, Chen Xiao, Giri Nabin, Roy Raj S, Quadir Farhan, Effler T Chad, Prout Ryan, Abraham Subil, Elwasif Wael, Haas N Quentin, Skolnick Jeffrey, Cheng Jianlin, Sedova Ada

Affiliations

Georgia Institute of Technology, Atlanta, GA.

University of Idaho, Moscow, ID.

Publication information

Workshop Mach Learn HPC Environ. 2021 Nov;2021:46-57. doi: 10.1109/mlhpc54614.2021.00010. Epub 2021 Dec 27.

Abstract

Computational biology is one of many scientific disciplines ripe for innovation and acceleration with the advent of high-performance computing (HPC). In recent years, the field of machine learning has also seen significant benefits from adopting HPC practices. In this work, we present a novel HPC pipeline that incorporates various machine-learning approaches for structure-based functional annotation of proteins on the scale of whole genomes. Our pipeline makes extensive use of deep learning and provides computational insights into best practices for training advanced deep-learning models for high-throughput data such as proteomics data. We showcase methodologies our pipeline currently supports and detail future tasks for our pipeline to envelop, including large-scale sequence comparison using SAdLSA and prediction of protein tertiary structures using AlphaFold2.
