Suppr超能文献

使用机器学习进行序列比对以实现基于模板的准确蛋白质结构预测。

Sequence Alignment Using Machine Learning for Accurate Template-based Protein Structure Prediction.

作者信息

Makigaki Shuichiro, Ishida Takashi

机构信息

School of Computing, Tokyo Institute of Technology, Tokyo, Japan.

出版信息

Bio Protoc. 2020 May 5;10(9):e3600. doi: 10.21769/BioProtoc.3600.

Abstract

Template-based modeling, the process of predicting the tertiary structure of a protein by using homologous protein structures, is useful when good templates can be available. Indeed, modern homology detection methods can find remote homologs with high sensitivity. However, the accuracy of template-based models generated from the homology-detection-based alignments is often lower than that from ideal alignments. In this study, we propose a new method that generates pairwise sequence alignments for more accurate template-based modeling. Our method trains a machine learning model using the structural alignment of known homologs. When calculating sequence alignments, instead of a fixed substitution matrix, this method dynamically predicts a substitution score from the trained model.

摘要

基于模板的建模,即通过使用同源蛋白质结构预测蛋白质三级结构的过程,在有良好模板可用时非常有用。实际上,现代同源性检测方法能够以高灵敏度找到远缘同源物。然而,基于同源性检测比对生成的基于模板的模型的准确性通常低于理想比对生成的模型。在本研究中,我们提出了一种新方法,该方法生成成对序列比对以进行更准确的基于模板的建模。我们的方法使用已知同源物的结构比对来训练机器学习模型。在计算序列比对时,该方法不是使用固定的替换矩阵,而是从训练模型动态预测替换分数。

相似文献

3
Sequence alignment generation using intermediate sequence search for homology modeling.使用中间序列搜索进行同源建模的序列比对生成。
Comput Struct Biotechnol J. 2020 Jul 25;18:2043-2050. doi: 10.1016/j.csbj.2020.07.012. eCollection 2020.
4
Using structure to explore the sequence alignment space of remote homologs.利用结构探索远程同源物序列比对空间。
PLoS Comput Biol. 2011 Oct;7(10):e1002175. doi: 10.1371/journal.pcbi.1002175. Epub 2011 Oct 6.

本文引用的文献

6
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.
9
UniProt: the universal protein knowledgebase.通用蛋白质知识库:UniProt
Nucleic Acids Res. 2017 Jan 4;45(D1):D158-D169. doi: 10.1093/nar/gkw1099. Epub 2016 Nov 29.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验