• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用中间序列搜索进行同源建模的序列比对生成。

Sequence alignment generation using intermediate sequence search for homology modeling.

作者信息

Makigaki Shuichiro, Ishida Takashi

机构信息

Department of Computer Science, School of Computing, Tokyo Institute of Technology Ookayama, Meguro-ku, Tokyo 152-8550, Japan.

出版信息

Comput Struct Biotechnol J. 2020 Jul 25;18:2043-2050. doi: 10.1016/j.csbj.2020.07.012. eCollection 2020.

DOI:10.1016/j.csbj.2020.07.012
PMID:32802276
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7415839/
Abstract

Protein tertiary structure is important information in various areas of biological research, however, the experimental cost associated with structure determination is high, and computational prediction methods have been developed to facilitate a more economical approach. Currently, template-based modeling methods are considered to be the most practical because the resulting predicted structures are often accurate, provided an appropriate template protein is available. During the first stage of template-based modeling, sensitive homology detection is essential for accurate structure prediction. However, sufficient structural models cannot always be obtained due to a lack of quality in the sequence alignment generated by a homology detection program. Therefore, an automated method that detects remote homologs accurately and generates appropriate alignments for accurate structure prediction is needed. In this paper, we propose an algorithm for suitable alignment generation using an intermediate sequence search for use with template-based modeling. We used intermediate sequence search for remote homology detection and intermediate sequences for alignment generation of remote homologs. We then evaluated the proposed method by comparing the sensitivity and selectivity of homology detection. Furthermore, based on the accuracy of the predicted structure model, we verify the accuracy of the alignments generated by our method. We demonstrate that our method generates more appropriate alignments for template-based modeling, especially for remote homologs. All source codes are available at https://github.com/shuichiro-makigaki/agora.

摘要

蛋白质三级结构在生物学研究的各个领域都是重要信息,然而,与结构测定相关的实验成本很高,因此已经开发了计算预测方法以促进采用更经济的方法。目前,基于模板的建模方法被认为是最实用的,因为只要有合适的模板蛋白,所得到的预测结构通常是准确的。在基于模板的建模的第一阶段,灵敏的同源性检测对于准确的结构预测至关重要。然而,由于同源性检测程序生成的序列比对质量欠佳,往往无法获得足够的结构模型。因此,需要一种能够准确检测远源同源物并生成合适比对以进行准确结构预测的自动化方法。在本文中,我们提出了一种算法,用于使用中间序列搜索来生成合适的比对,以用于基于模板的建模。我们使用中间序列搜索进行远源同源性检测,并使用中间序列来生成远源同源物的比对。然后,我们通过比较同源性检测的灵敏度和选择性来评估所提出的方法。此外,基于预测结构模型的准确性,我们验证了我们的方法生成的比对的准确性。我们证明,我们的方法为基于模板的建模生成了更合适的比对,特别是对于远源同源物。所有源代码可在https://github.com/shuichiro-makigaki/agora获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/3413f23ba9c2/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/d82ea4e2f28a/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/488aba7b6e64/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/6be8a0caffea/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/1e89f6cd7dbf/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/8303687fc73d/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/57aba0b282bb/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/dfaf88dfaef0/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/59b6d7d437ad/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/6e3562940bcf/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/3413f23ba9c2/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/d82ea4e2f28a/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/488aba7b6e64/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/6be8a0caffea/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/1e89f6cd7dbf/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/8303687fc73d/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/57aba0b282bb/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/dfaf88dfaef0/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/59b6d7d437ad/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/6e3562940bcf/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c6dc/7415839/3413f23ba9c2/gr9.jpg

相似文献

1
Sequence alignment generation using intermediate sequence search for homology modeling.使用中间序列搜索进行同源建模的序列比对生成。
Comput Struct Biotechnol J. 2020 Jul 25;18:2043-2050. doi: 10.1016/j.csbj.2020.07.012. eCollection 2020.
2
Sequence alignment using machine learning for accurate template-based protein structure prediction.基于机器学习的序列比对在准确的基于模板的蛋白质结构预测中的应用。
Bioinformatics. 2020 Jan 1;36(1):104-111. doi: 10.1093/bioinformatics/btz483.
3
Sequence Alignment Using Machine Learning for Accurate Template-based Protein Structure Prediction.使用机器学习进行序列比对以实现基于模板的准确蛋白质结构预测。
Bio Protoc. 2020 May 5;10(9):e3600. doi: 10.21769/BioProtoc.3600.
4
MULTICOM2 open-source protein structure prediction system powered by deep learning and distance prediction.基于深度学习和距离预测的 MULTICOM2 开源蛋白质结构预测系统。
Sci Rep. 2021 Jun 23;11(1):13155. doi: 10.1038/s41598-021-92395-6.
5
Using structure to explore the sequence alignment space of remote homologs.利用结构探索远程同源物序列比对空间。
PLoS Comput Biol. 2011 Oct;7(10):e1002175. doi: 10.1371/journal.pcbi.1002175. Epub 2011 Oct 6.
6
A low-complexity add-on score for protein remote homology search with COMER.COMER 辅助的蛋白质远程同源搜索的低复杂度附加评分。
Bioinformatics. 2018 Jun 15;34(12):2037-2045. doi: 10.1093/bioinformatics/bty048.
7
Homology-based modeling of 3D structures of protein-protein complexes using alignments of modified sequence profiles.利用修饰序列谱比对进行蛋白质-蛋白质复合物三维结构的基于同源性的建模。
Int J Biol Macromol. 2008 Aug 15;43(2):198-208. doi: 10.1016/j.ijbiomac.2008.05.004. Epub 2008 May 21.
8
Large-scale comparison of protein sequence alignment algorithms with structure alignments.蛋白质序列比对算法与结构比对的大规模比较。
Proteins. 2000 Jul 1;40(1):6-22. doi: 10.1002/(sici)1097-0134(20000701)40:1<6::aid-prot30>3.0.co;2-7.
9
On the role of structural information in remote homology detection and sequence alignment: new methods using hybrid sequence profiles.论结构信息在远程同源性检测和序列比对中的作用:使用混合序列谱的新方法
J Mol Biol. 2003 Dec 12;334(5):1043-62. doi: 10.1016/j.jmb.2003.10.025.
10
Application of protein structure alignments to iterated hidden Markov model protocols for structure prediction.蛋白质结构比对在用于结构预测的迭代隐马尔可夫模型协议中的应用。
BMC Bioinformatics. 2006 Sep 14;7:410. doi: 10.1186/1471-2105-7-410.

本文引用的文献

1
HITS-PR-HHblits: protein remote homology detection by combining PageRank and Hyperlink-Induced Topic Search.HITS-PR-HHblits:结合PageRank和超链接诱导主题搜索进行蛋白质远程同源性检测。
Brief Bioinform. 2020 Jan 17;21(1):298-308. doi: 10.1093/bib/bby104.
2
Protein Data Bank: the single global archive for 3D macromolecular structure data.蛋白质数据库:用于存储大分子三维结构数据的全球单一档案库。
Nucleic Acids Res. 2019 Jan 8;47(D1):D520-D528. doi: 10.1093/nar/gky949.
3
A Completely Reimplemented MPI Bioinformatics Toolkit with a New HHpred Server at its Core.
一个完全重新实现的 MPI 生物信息学工具包,其核心是一个新的 HHpred 服务器。
J Mol Biol. 2018 Jul 20;430(15):2237-2243. doi: 10.1016/j.jmb.2017.12.007. Epub 2017 Dec 16.
4
Database resources of the National Center for Biotechnology Information.国家生物技术信息中心数据库资源。
Nucleic Acids Res. 2018 Jan 4;46(D1):D8-D13. doi: 10.1093/nar/gkx1095.
5
UniProt: the universal protein knowledgebase.通用蛋白质知识库:UniProt
Nucleic Acids Res. 2017 Jan 4;45(D1):D158-D169. doi: 10.1093/nar/gkw1099. Epub 2016 Nov 29.
6
Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling.基于概率多模板同源建模的蛋白质三维结构自动预测
PLoS Comput Biol. 2015 Oct 23;11(10):e1004343. doi: 10.1371/journal.pcbi.1004343. eCollection 2015 Oct.
7
SCOPe: Structural Classification of Proteins--extended, integrating SCOP and ASTRAL data and classification of new structures.SCOPe:蛋白质结构分类——扩展版,整合了 SCOP 和 ASTRAL 数据以及新结构的分类。
Nucleic Acids Res. 2014 Jan;42(Database issue):D304-9. doi: 10.1093/nar/gkt1240. Epub 2013 Dec 3.
8
Domain enhanced lookup time accelerated BLAST.基于域名的快速检索 BLAST。
Biol Direct. 2012 Apr 17;7:12. doi: 10.1186/1745-6150-7-12.
9
Fast and accurate automatic structure prediction with HHpred.HHpred:快速准确的自动结构预测。
Proteins. 2009;77 Suppl 9:128-32. doi: 10.1002/prot.22499.
10
Homology modeling in drug discovery: current trends and applications.药物发现中的同源建模:当前趋势与应用。
Drug Discov Today. 2009 Jul;14(13-14):676-83. doi: 10.1016/j.drudis.2009.04.006. Epub 2009 May 5.