Suppr超能文献

基于二重深度学习模型的残基间距离预测

Inter-Residue Distance Prediction From Duet Deep Learning Models.

作者信息

Zhang Huiling, Huang Ying, Bei Zhendong, Ju Zhen, Meng Jintao, Hao Min, Zhang Jingjing, Zhang Haiping, Xi Wenhui

机构信息

Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.

University of Chinese Academy of Sciences, Beijing, China.

出版信息

Front Genet. 2022 May 16;13:887491. doi: 10.3389/fgene.2022.887491. eCollection 2022.

Abstract

Residue distance prediction from the sequence is critical for many biological applications such as protein structure reconstruction, protein-protein interaction prediction, and protein design. However, prediction of fine-grained distances between residues with long sequence separations still remains challenging. In this study, we propose DuetDis, a method based on duet feature sets and deep residual network with squeeze-and-excitation (SE), for protein inter-residue distance prediction. DuetDis embraces the ability to learn and fuse features directly or indirectly extracted from the whole-genome/metagenomic databases and, therefore, minimize the information loss through ensembling models trained on different feature sets. We evaluate DuetDis and 11 widely used peer methods on a large-scale test set (610 proteins chains). The experimental results suggest that 1) prediction results from different feature sets show obvious differences; 2) ensembling different feature sets can improve the prediction performance; 3) high-quality multiple sequence alignment (MSA) used for both training and testing can greatly improve the prediction performance; and 4) DuetDis is more accurate than peer methods for the overall prediction, more reliable in terms of model prediction score, and more robust against shallow multiple sequence alignment (MSA).

摘要

从序列预测残基距离对于许多生物学应用至关重要,如蛋白质结构重建、蛋白质-蛋白质相互作用预测和蛋白质设计。然而,预测具有长序列间隔的残基之间的细粒度距离仍然具有挑战性。在本研究中,我们提出了DuetDis,一种基于对偶特征集和带有挤压激励(SE)的深度残差网络的方法,用于蛋白质残基间距离预测。DuetDis具备直接或间接从全基因组/宏基因组数据库中学习和融合特征的能力,因此,通过在不同特征集上训练的集成模型将信息损失降至最低。我们在一个大规模测试集(610个蛋白质链)上评估了DuetDis和11种广泛使用的同类方法。实验结果表明:1)不同特征集的预测结果存在明显差异;2)集成不同特征集可以提高预测性能;3)用于训练和测试的高质量多序列比对(MSA)可以极大地提高预测性能;4)在整体预测方面,DuetDis比同类方法更准确,在模型预测得分方面更可靠,并且对浅层多序列比对(MSA)更具鲁棒性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5887/9148999/8bbce41079f7/fgene-13-887491-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验