• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

EigenTHREADER:通过高效的接触图穿线实现类似蛋白质折叠的识别。

EigenTHREADER: analogous protein fold recognition by efficient contact map threading.

机构信息

Department of Computer Science, University College London, Gower Street, London, WC1E 6BT, UK.

出版信息

Bioinformatics. 2017 Sep 1;33(17):2684-2690. doi: 10.1093/bioinformatics/btx217.

DOI:10.1093/bioinformatics/btx217
PMID:28419258
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5860056/
Abstract

MOTIVATION

Protein fold recognition when appropriate, evolutionarily-related, structural templates can be identified is often trivial and may even be viewed as a solved problem. However in cases where no homologous structural templates can be detected, fold recognition is a notoriously difficult problem ( Moult et al., 2014 ). Here we present EigenTHREADER, a novel fold recognition method capable of identifying folds where no homologous structures can be identified. EigenTHREADER takes a query amino acid sequence, generates a map of intra-residue contacts, and then searches a library of contact maps of known structures. To allow the contact maps to be compared, we use eigenvector decomposition to resolve the principal eigenvectors these can then be aligned using standard dynamic programming algorithms. The approach is similar to the Al-Eigen approach of Di Lena et al. (2010) , but with improvements made both to speed and accuracy. With this search strategy, EigenTHREADER does not depend directly on sequence homology between the target protein and entries in the fold library to generate models. This in turn enables EigenTHREADER to correctly identify analogous folds where little or no sequence homology information is.

RESULTS

EigenTHREADER outperforms well-established fold recognition methods such as pGenTHREADER and HHSearch in terms of True Positive Rate in the difficult task of analogous fold recognition. This should allow template-based modelling to be extended to many new protein families that were previously intractable to homology based fold recognition methods.

AVAILABILITY AND IMPLEMENTATION

All code used to generate these results and the computational protocol can be downloaded from https://github.com/DanBuchan/eigen_scripts . EigenTHREADER, the benchmark code and the data this paper is based on can be downloaded from: http://bioinfadmin.cs.ucl.ac.uk/downloads/eigenTHREADER/ .

CONTACT

d.t.jones@ucl.ac.uk.

摘要

动机

当可以识别出适当的、进化相关的结构模板时,蛋白质折叠识别通常很简单,甚至可以被视为一个已解决的问题。然而,在无法检测到同源结构模板的情况下,折叠识别是一个众所周知的难题(Moult 等人,2014 年)。在这里,我们提出了 EigenTHREADER,这是一种新的折叠识别方法,能够识别无法识别同源结构的折叠。EigenTHREADER 采用查询氨基酸序列,生成残基内接触图,然后搜索已知结构的接触图库。为了允许比较接触图,我们使用特征向量分解来解析主要特征向量,然后使用标准动态规划算法对齐这些特征向量。这种方法类似于 Di Lena 等人的 Al-Eigen 方法(2010 年),但在速度和准确性方面都有所改进。使用这种搜索策略,EigenTHREADER 不会直接依赖目标蛋白和折叠库中条目的序列同源性来生成模型。这反过来又使 EigenTHREADER 能够正确识别几乎没有或没有序列同源性信息的类似折叠。

结果

在类似折叠识别这一困难任务中,EigenTHREADER 在真阳性率方面优于 pGenTHREADER 和 HHSearch 等成熟的折叠识别方法。这应该允许基于模板的建模扩展到许多以前难以使用基于同源折叠识别方法的新蛋白质家族。

可用性和实现

生成这些结果的所有代码和计算协议都可以从 https://github.com/DanBuchan/eigen_scripts 下载。EigenTHREADER、基准代码和本文所依据的数据可以从 http://bioinfadmin.cs.ucl.ac.uk/downloads/eigenTHREADER/ 下载。

联系

d.t.jones@ucl.ac.uk。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b62b/5860056/3016c17d6b65/btx217f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b62b/5860056/440c7588b36d/btx217f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b62b/5860056/3f3278488f47/btx217f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b62b/5860056/9022fa66c877/btx217f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b62b/5860056/463f2feb30d6/btx217f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b62b/5860056/195f250e38a1/btx217f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b62b/5860056/3016c17d6b65/btx217f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b62b/5860056/440c7588b36d/btx217f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b62b/5860056/3f3278488f47/btx217f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b62b/5860056/9022fa66c877/btx217f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b62b/5860056/463f2feb30d6/btx217f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b62b/5860056/195f250e38a1/btx217f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b62b/5860056/3016c17d6b65/btx217f6.jpg

相似文献

1
EigenTHREADER: analogous protein fold recognition by efficient contact map threading.EigenTHREADER:通过高效的接触图穿线实现类似蛋白质折叠的识别。
Bioinformatics. 2017 Sep 1;33(17):2684-2690. doi: 10.1093/bioinformatics/btx217.
2
Does inclusion of residue-residue contact information boost protein threading?残基-残基接触信息的包含是否能提高蛋白质结构预测的准确性?
Proteins. 2019 Jul;87(7):596-606. doi: 10.1002/prot.25684. Epub 2019 Mar 28.
3
Detecting distant-homology protein structures by aligning deep neural-network based contact maps.通过对齐基于深度神经网络的接触图来检测远程同源蛋白结构。
PLoS Comput Biol. 2019 Oct 17;15(10):e1007411. doi: 10.1371/journal.pcbi.1007411. eCollection 2019 Oct.
4
DeepSF: deep convolutional neural network for mapping protein sequences to folds.DeepSF:一种将蛋白质序列映射到折叠结构的深度卷积神经网络。
Bioinformatics. 2018 Apr 15;34(8):1295-1303. doi: 10.1093/bioinformatics/btx780.
5
MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins.MetaPSICOV:结合协同进化方法用于精确预测蛋白质中的接触和长程氢键
Bioinformatics. 2015 Apr 1;31(7):999-1006. doi: 10.1093/bioinformatics/btu791. Epub 2014 Nov 26.
6
SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition.支持向量机折叠法:一种用于判别式多类别蛋白质折叠和超家族识别的工具。
BMC Bioinformatics. 2007 May 22;8 Suppl 4(Suppl 4):S2. doi: 10.1186/1471-2105-8-S4-S2.
7
Improving protein fold recognition by extracting fold-specific features from predicted residue-residue contacts.通过从预测的残基-残基接触中提取折叠特异性特征来提高蛋白质折叠识别。
Bioinformatics. 2017 Dec 1;33(23):3749-3757. doi: 10.1093/bioinformatics/btx514.
8
Protein threading using residue co-variation and deep learning.使用残基共变和深度学习进行蛋白质穿线。
Bioinformatics. 2018 Jul 1;34(13):i263-i273. doi: 10.1093/bioinformatics/bty278.
9
Fold recognition by scoring protein maps using the congruence coefficient.使用一致性系数对蛋白质图谱进行折叠识别。
Bioinformatics. 2021 May 1;37(4):506-513. doi: 10.1093/bioinformatics/btaa833.
10
Effective inter-residue contact definitions for accurate protein fold recognition.用于准确蛋白质折叠识别的有效残基间接触定义。
BMC Bioinformatics. 2012 Nov 9;13:292. doi: 10.1186/1471-2105-13-292.

引用本文的文献

1
Deep-learning-based single-domain and multidomain protein structure prediction with D-I-TASSER.基于深度学习的单域和多域蛋白质结构预测与D-I-TASSER
Nat Biotechnol. 2025 May 23. doi: 10.1038/s41587-025-02654-4.
2
PCP-GC-LM: single-sequence-based protein contact prediction using dual graph convolutional neural network and convolutional neural network.PCP-GC-LM:基于双图卷积神经网络和卷积神经网络的单序列蛋白质接触预测。
BMC Bioinformatics. 2024 Sep 2;25(1):287. doi: 10.1186/s12859-024-05914-3.
3
An interactive visualization tool for educational outreach in protein contact map overlap analysis.

本文引用的文献

1
Assessment of CASP11 contact-assisted predictions.半胱天冬酶11接触辅助预测的评估
Proteins. 2016 Sep;84 Suppl 1(Suppl 1):164-80. doi: 10.1002/prot.25020. Epub 2016 Mar 9.
2
MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins.MetaPSICOV:结合协同进化方法用于精确预测蛋白质中的接触和长程氢键
Bioinformatics. 2015 Apr 1;31(7):999-1006. doi: 10.1093/bioinformatics/btu791. Epub 2014 Nov 26.
3
UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches.
一种用于蛋白质接触图重叠分析教育推广的交互式可视化工具。
Front Bioinform. 2024 Mar 15;4:1358550. doi: 10.3389/fbinf.2024.1358550. eCollection 2024.
4
PLMSearch: Protein language model powers accurate and fast sequence search for remote homology.PLMSearch:蛋白质语言模型为远程同源性的准确快速序列搜索提供动力。
Nat Commun. 2024 Mar 30;15(1):2775. doi: 10.1038/s41467-024-46808-5.
5
Recent Progress of Protein Tertiary Structure Prediction.蛋白质三级结构预测的最新进展。
Molecules. 2024 Feb 13;29(4):832. doi: 10.3390/molecules29040832.
6
Integrating deep learning, threading alignments, and a multi-MSA strategy for high-quality protein monomer and complex structure prediction in CASP15.在 CASP15 中,通过深度学习、线程比对和多 MSAs 策略,实现高质量蛋白质单体和复合物结构预测。
Proteins. 2023 Dec;91(12):1684-1703. doi: 10.1002/prot.26585. Epub 2023 Aug 31.
7
Contact-Assisted Threading in Low-Homology Protein Modeling.接触辅助线程在低同源性蛋白质建模中的应用。
Methods Mol Biol. 2023;2627:41-59. doi: 10.1007/978-1-0716-2974-1_3.
8
Protein structure and folding pathway prediction based on remote homologs recognition using PAthreader.基于远程同源物识别的 PAthreader 进行蛋白质结构和折叠途径预测。
Commun Biol. 2023 Mar 4;6(1):243. doi: 10.1038/s42003-023-04605-8.
9
ProALIGN: Directly Learning Alignments for Protein Structure Prediction via Exploiting Context-Specific Alignment Motifs.ProALIGN:通过利用上下文特定的对齐基序直接学习蛋白质结构预测的对齐。
J Comput Biol. 2022 Feb;29(2):92-105. doi: 10.1089/cmb.2021.0430. Epub 2022 Jan 21.
10
FoldHSphere: deep hyperspherical embeddings for protein fold recognition.FoldHSphere:用于蛋白质折叠识别的深度超球嵌入。
BMC Bioinformatics. 2021 Oct 12;22(1):490. doi: 10.1186/s12859-021-04419-7.
UniRef聚类:一种用于改进序列相似性搜索的全面且可扩展的替代方法。
Bioinformatics. 2015 Mar 15;31(6):926-32. doi: 10.1093/bioinformatics/btu739. Epub 2014 Nov 13.
4
CCMpred--fast and precise prediction of protein residue-residue contacts from correlated mutations.CCMpred--快速准确地预测蛋白质残基-残基接触的相关突变。
Bioinformatics. 2014 Nov 1;30(21):3128-30. doi: 10.1093/bioinformatics/btu500. Epub 2014 Jul 26.
5
MRFalign: protein homology detection through alignment of Markov random fields.MRFalign:通过马尔可夫随机场比对进行蛋白质同源性检测。
PLoS Comput Biol. 2014 Mar 27;10(3):e1003500. doi: 10.1371/journal.pcbi.1003500. eCollection 2014 Mar.
6
FreeContact: fast and free software for protein contact prediction from residue co-evolution.FreeContact:用于基于残基共进化预测蛋白质接触的快速免费软件。
BMC Bioinformatics. 2014 Mar 26;15:85. doi: 10.1186/1471-2105-15-85.
7
De novo structure prediction of globular proteins aided by sequence variation-derived contacts.基于序列变异衍生接触辅助的球状蛋白质从头结构预测。
PLoS One. 2014 Mar 17;9(3):e92197. doi: 10.1371/journal.pone.0092197. eCollection 2014.
8
BioShell-Threading: versatile Monte Carlo package for protein 3D threading.BioShell-Threading:用于蛋白质 3D 结构预测的多功能蒙特卡罗包。
BMC Bioinformatics. 2014 Jan 20;15:22. doi: 10.1186/1471-2105-15-22.
9
Critical assessment of methods of protein structure prediction (CASP)--round x.蛋白质结构预测方法的关键评估(CASP)——第x轮
Proteins. 2014 Feb;82 Suppl 2(0 2):1-6. doi: 10.1002/prot.24452. Epub 2013 Dec 17.
10
Protein threading using context-specific alignment potential.使用上下文特定对齐势的蛋白质穿线。
Bioinformatics. 2013 Jul 1;29(13):i257-65. doi: 10.1093/bioinformatics/btt210.