• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
FUpred: detecting protein domains through deep-learning-based contact map prediction.FUpred:基于深度学习的接触图预测的蛋白质结构域检测。
Bioinformatics. 2020 Jun 1;36(12):3749-3757. doi: 10.1093/bioinformatics/btaa217.
2
ThreaDom: extracting protein domain boundary information from multiple threading alignments.ThreaDom:从多重序列比对中提取蛋白质结构域边界信息。
Bioinformatics. 2013 Jul 1;29(13):i247-56. doi: 10.1093/bioinformatics/btt209.
3
ResPRE: high-accuracy protein contact prediction by coupling precision matrix with deep residual neural networks.ResPRE:通过结合精度矩阵和深度残差神经网络进行高精度蛋白质接触预测。
Bioinformatics. 2019 Nov 1;35(22):4647-4655. doi: 10.1093/bioinformatics/btz291.
4
DeepMSA: constructing deep multiple sequence alignment to improve contact prediction and fold-recognition for distant-homology proteins.DeepMSA:构建深度多重序列比对以改进远距离同源蛋白质的接触预测和折叠识别。
Bioinformatics. 2020 Apr 1;36(7):2105-2112. doi: 10.1093/bioinformatics/btz863.
5
Deep-learning contact-map guided protein structure prediction in CASP13.深度学习接触图指导的 CASP13 蛋白质结构预测。
Proteins. 2019 Dec;87(12):1149-1164. doi: 10.1002/prot.25792. Epub 2019 Aug 14.
6
DomBpred: Protein Domain Boundary Prediction Based on Domain-Residue Clustering Using Inter-Residue Distance.DomBpred:基于使用残基间距离的结构域-残基聚类的蛋白质结构域边界预测
IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):912-922. doi: 10.1109/TCBB.2022.3175905. Epub 2023 Apr 3.
7
ThreaDomEx: a unified platform for predicting continuous and discontinuous protein domains by multiple-threading and segment assembly.ThreaDomEx:一个通过多线程和片段组装预测连续和不连续蛋白质结构域的统一平台。
Nucleic Acids Res. 2017 Jul 3;45(W1):W400-W407. doi: 10.1093/nar/gkx410.
8
DNN-Dom: predicting protein domain boundary from sequence alone by deep neural network.DNN-Dom:通过深度神经网络仅从序列预测蛋白质结构域边界。
Bioinformatics. 2019 Dec 15;35(24):5128-5136. doi: 10.1093/bioinformatics/btz464.
9
Protein structure prediction using deep learning distance and hydrogen-bonding restraints in CASP14.使用深度学习距离和氢键约束进行 CASP14 中的蛋白质结构预测。
Proteins. 2021 Dec;89(12):1734-1751. doi: 10.1002/prot.26193. Epub 2021 Aug 7.
10
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测
PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.

引用本文的文献

1
Deep-learning-based single-domain and multidomain protein structure prediction with D-I-TASSER.基于深度学习的单域和多域蛋白质结构预测与D-I-TASSER
Nat Biotechnol. 2025 May 23. doi: 10.1038/s41587-025-02654-4.
2
Treponema denticola major surface protein (Msp): a key player in periodontal pathogenicity and immune evasion.齿垢密螺旋体主要表面蛋白(Msp):牙周致病性和免疫逃逸中的关键因子。
Arch Microbiol. 2025 Jan 18;207(2):36. doi: 10.1007/s00203-024-04223-w.
3
Machine Learning Techniques to Infer Protein Structure and Function from Sequences: A Comprehensive Review.基于序列推断蛋白质结构和功能的机器学习技术:全面综述。
Methods Mol Biol. 2025;2867:79-104. doi: 10.1007/978-1-0716-4196-5_5.
4
Hierarchical Analysis of Protein Structures: From Secondary Structures to Protein Units and Domains.蛋白质结构的层次分析:从二级结构到蛋白质单元和结构域。
Methods Mol Biol. 2025;2870:357-370. doi: 10.1007/978-1-0716-4213-9_18.
5
Protein domain embeddings for fast and accurate similarity search.蛋白质结构域嵌入用于快速准确的相似性搜索。
Genome Res. 2024 Oct 11;34(9):1434-1444. doi: 10.1101/gr.279127.124.
6
Chainsaw: protein domain segmentation with fully convolutional neural networks.链锯:基于全卷积神经网络的蛋白质结构域分割。
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae296.
7
DEMO-EM2: assembling protein complex structures from cryo-EM maps through intertwined chain and domain fitting.DEMO-EM2:通过链和结构域的交错拟合从冷冻电镜密度图组装蛋白质复合物结构。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae113.
8
Importance of Inter-residue Contacts for Understanding Protein Folding and Unfolding Rates, Remote Homology, and Drug Design.残基间接触对于理解蛋白质折叠与解折叠速率、远程同源性及药物设计的重要性。
Mol Biotechnol. 2025 Mar;67(3):862-884. doi: 10.1007/s12033-024-01119-4. Epub 2024 Mar 18.
9
Recent Progress of Protein Tertiary Structure Prediction.蛋白质三级结构预测的最新进展。
Molecules. 2024 Feb 13;29(4):832. doi: 10.3390/molecules29040832.
10
Merizo: a rapid and accurate protein domain segmentation method using invariant point attention.Merizo:一种使用不变点注意力的快速准确的蛋白质结构域分割方法。
Nat Commun. 2023 Dec 19;14(1):8445. doi: 10.1038/s41467-023-43934-4.

本文引用的文献

1
DeepMSA: constructing deep multiple sequence alignment to improve contact prediction and fold-recognition for distant-homology proteins.DeepMSA:构建深度多重序列比对以改进远距离同源蛋白质的接触预测和折叠识别。
Bioinformatics. 2020 Apr 1;36(7):2105-2112. doi: 10.1093/bioinformatics/btz863.
2
Ensembling multiple raw coevolutionary features with deep residual neural networks for contact-map prediction in CASP13.基于深度残差神经网络的原始共进化特征集成方法在 CASP13 中用于接触图预测。
Proteins. 2019 Dec;87(12):1082-1091. doi: 10.1002/prot.25798. Epub 2019 Aug 22.
3
DNN-Dom: predicting protein domain boundary from sequence alone by deep neural network.DNN-Dom:通过深度神经网络仅从序列预测蛋白质结构域边界。
Bioinformatics. 2019 Dec 15;35(24):5128-5136. doi: 10.1093/bioinformatics/btz464.
4
ResPRE: high-accuracy protein contact prediction by coupling precision matrix with deep residual neural networks.ResPRE:通过结合精度矩阵和深度残差神经网络进行高精度蛋白质接触预测。
Bioinformatics. 2019 Nov 1;35(22):4647-4655. doi: 10.1093/bioinformatics/btz291.
5
SCOPe: classification of large macromolecular structures in the structural classification of proteins-extended database.SCOPe:蛋白质结构分类扩展数据库中大分子结构的分类。
Nucleic Acids Res. 2019 Jan 8;47(D1):D475-D481. doi: 10.1093/nar/gky1134.
6
ConDo: protein domain boundary prediction using coevolutionary information.ConDo:利用共进化信息进行蛋白质结构域边界预测。
Bioinformatics. 2019 Jul 15;35(14):2411-2417. doi: 10.1093/bioinformatics/bty973.
7
ThreaDomEx: a unified platform for predicting continuous and discontinuous protein domains by multiple-threading and segment assembly.ThreaDomEx:一个通过多线程和片段组装预测连续和不连续蛋白质结构域的统一平台。
Nucleic Acids Res. 2017 Jul 3;45(W1):W400-W407. doi: 10.1093/nar/gkx410.
8
An ambiguity principle for assigning protein structural domains.一种用于分配蛋白质结构域的不明确性原理。
Sci Adv. 2017 Jan 13;3(1):e1600552. doi: 10.1126/sciadv.1600552. eCollection 2017 Jan.
9
SCOPe: Manual Curation and Artifact Removal in the Structural Classification of Proteins - extended Database.SCOPe:蛋白质结构分类中的人工整理与伪迹去除——扩展数据库
J Mol Biol. 2017 Feb 3;429(3):348-355. doi: 10.1016/j.jmb.2016.11.023. Epub 2016 Nov 30.
10
SCOPe: Structural Classification of Proteins--extended, integrating SCOP and ASTRAL data and classification of new structures.SCOPe:蛋白质结构分类——扩展版,整合了 SCOP 和 ASTRAL 数据以及新结构的分类。
Nucleic Acids Res. 2014 Jan;42(Database issue):D304-9. doi: 10.1093/nar/gkt1240. Epub 2013 Dec 3.

FUpred:基于深度学习的接触图预测的蛋白质结构域检测。

FUpred: detecting protein domains through deep-learning-based contact map prediction.

机构信息

Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109.

Computer Science and Engineering Department, Michigan State University, East Lansing, MI 48824, USA.

出版信息

Bioinformatics. 2020 Jun 1;36(12):3749-3757. doi: 10.1093/bioinformatics/btaa217.

DOI:10.1093/bioinformatics/btaa217
PMID:32227201
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7320627/
Abstract

MOTIVATION

Protein domains are subunits that can fold and function independently. Correct domain boundary assignment is thus a critical step toward accurate protein structure and function analyses. There is, however, no efficient algorithm available for accurate domain prediction from sequence. The problem is particularly challenging for proteins with discontinuous domains, which consist of domain segments that are separated along the sequence.

RESULTS

We developed a new algorithm, FUpred, which predicts protein domain boundaries utilizing contact maps created by deep residual neural networks coupled with coevolutionary precision matrices. The core idea of the algorithm is to retrieve domain boundary locations by maximizing the number of intra-domain contacts, while minimizing the number of inter-domain contacts from the contact maps. FUpred was tested on a large-scale dataset consisting of 2549 proteins and generated correct single- and multi-domain classifications with a Matthew's correlation coefficient of 0.799, which was 19.1% (or 5.3%) higher than the best machine learning (or threading)-based method. For proteins with discontinuous domains, the domain boundary detection and normalized domain overlapping scores of FUpred were 0.788 and 0.521, respectively, which were 17.3% and 23.8% higher than the best control method. The results demonstrate a new avenue to accurately detect domain composition from sequence alone, especially for discontinuous, multi-domain proteins.

AVAILABILITY AND IMPLEMENTATION

https://zhanglab.ccmb.med.umich.edu/FUpred.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

蛋白质结构域是可以独立折叠和发挥功能的亚基。因此,正确的结构域边界分配是准确分析蛋白质结构和功能的关键步骤。然而,目前还没有有效的算法可以从序列中准确预测结构域。对于具有不连续结构域的蛋白质,该问题尤其具有挑战性,因为这些蛋白质的结构域由沿着序列分离的结构域片段组成。

结果

我们开发了一种新算法 FUpred,该算法利用深度残差神经网络与共进化精确矩阵创建的接触图来预测蛋白质结构域边界。该算法的核心思想是通过最大化结构域内接触的数量,同时最小化接触图中外结构域接触的数量来检索结构域边界位置。FUpred 在一个包含 2549 个蛋白质的大规模数据集上进行了测试,生成的单结构域和多结构域分类的马修斯相关系数为 0.799,比最佳机器学习(或线程)方法高 19.1%(或 5.3%)。对于具有不连续结构域的蛋白质,FUpred 的结构域边界检测和归一化结构域重叠得分分别为 0.788 和 0.521,比最佳对照方法高 17.3%和 23.8%。结果表明了一种从序列中准确检测结构域组成的新途径,特别是对于不连续的多结构域蛋白质。

可用性和实施情况

https://zhanglab.ccmb.med.umich.edu/FUpred。

补充信息

补充数据可在 Bioinformatics 在线获取。