• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过评估阿尔法折叠在预测蛋白质中缺失残基和结构无序方面的功效来处理非结构化问题。

Navigating the unstructured by evaluating alphafold's efficacy in predicting missing residues and structural disorder in proteins.

作者信息

Zheng Sen

机构信息

Bio-Electron Microscopy Facility, iHuman Institution, ShanghaiTech University, Shanghai, China.

出版信息

PLoS One. 2025 Mar 25;20(3):e0313812. doi: 10.1371/journal.pone.0313812. eCollection 2025.

DOI:10.1371/journal.pone.0313812
PMID:40131945
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11936262/
Abstract

The study investigated regions with undefined structures, known as "missing" segments in X-ray crystallography and cryo-electron microscopy (Cryo-EM) data, by assessing their predicted structural confidence and disorder scores. Utilizing a comprehensive dataset from the Protein Data Bank (PDB), residues were categorized as "modeled", "hard missing" and "soft missing" based on their visibility in structural datasets. Key features were determined, including a confidence score predicted local distance difference test (pLDDT) from AlphaFold2, an advanced structural prediction tool, and a disorder score from IUPred, a traditional disorder prediction method. To enhance prediction performance for unstructured residues, we employed a Long Short-Term Memory (LSTM) model, integrating both scores with amino acid sequences. Notable patterns such as composition, region lengths and prediction scores were observed in unstructured residues and regions identified through structural experiments over our studied period. Our findings also indicate that "hard missing" residues often align with low confidence scores, whereas "soft missing" residues exhibit dynamic behavior that can complicate predictions. The incorporation of pLDDT, IUPred scores, and sequence data into the LSTM model has improved the differentiation between structured and unstructured residues, particularly for shorter unstructured regions. This research elucidates the relationship between established computational predictions and experimental structural data, enhancing our ability to target structurally significant areas for research and guiding experimental designs toward functionally relevant regions.

摘要

该研究通过评估X射线晶体学和冷冻电子显微镜(Cryo-EM)数据中结构未定义的区域(即所谓的“缺失”片段)的预测结构置信度和无序分数,对这些区域展开了调查。利用蛋白质数据库(PDB)的综合数据集,根据残基在结构数据集中的可见性,将其分类为“已建模”、“硬缺失”和“软缺失”。确定了关键特征,包括来自先进结构预测工具AlphaFold2的预测局部距离差异测试(pLDDT)置信分数,以及来自传统无序预测方法IUPred的无序分数。为了提高对非结构化残基的预测性能,我们采用了长短期记忆(LSTM)模型,将这两个分数与氨基酸序列相结合。在我们研究期间,通过结构实验确定的非结构化残基和区域中观察到了诸如组成、区域长度和预测分数等显著模式。我们的研究结果还表明,“硬缺失”残基通常与低置信分数相关,而“软缺失”残基表现出动态行为,这可能会使预测变得复杂。将pLDDT、IUPred分数和序列数据纳入LSTM模型,改善了结构化和非结构化残基之间的区分,特别是对于较短的非结构化区域。这项研究阐明了既定计算预测与实验结构数据之间的关系,增强了我们针对结构上重要区域进行研究的能力,并指导实验设计朝着功能相关区域发展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0117/11936262/24dbe2ee2435/pone.0313812.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0117/11936262/ec5a1b6fc2fe/pone.0313812.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0117/11936262/d24c4bf709be/pone.0313812.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0117/11936262/24dbe2ee2435/pone.0313812.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0117/11936262/ec5a1b6fc2fe/pone.0313812.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0117/11936262/d24c4bf709be/pone.0313812.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0117/11936262/24dbe2ee2435/pone.0313812.g003.jpg

相似文献

1
Navigating the unstructured by evaluating alphafold's efficacy in predicting missing residues and structural disorder in proteins.通过评估阿尔法折叠在预测蛋白质中缺失残基和结构无序方面的功效来处理非结构化问题。
PLoS One. 2025 Mar 25;20(3):e0313812. doi: 10.1371/journal.pone.0313812. eCollection 2025.
2
Intrinsic disorder in the Protein Data Bank.蛋白质数据库中的内在无序状态。
J Biomol Struct Dyn. 2007 Feb;24(4):325-42. doi: 10.1080/07391102.2007.10507123.
3
The complexity of protein interactions unravelled from structural disorder.从结构无序中揭示蛋白质相互作用的复杂性。
PLoS Comput Biol. 2021 Jan 8;17(1):e1008546. doi: 10.1371/journal.pcbi.1008546. eCollection 2021 Jan.
4
Random, de novo, and conserved proteins: How structure and disorder predictors perform differently.随机、从头开始和保守的蛋白质:结构和无序预测器的表现有何不同。
Proteins. 2024 Jun;92(6):757-767. doi: 10.1002/prot.26652. Epub 2024 Jan 16.
5
Disordered Residues and Patterns in the Protein Data Bank.蛋白质数据库中的无序残基和模式。
Molecules. 2020 Mar 27;25(7):1522. doi: 10.3390/molecules25071522.
6
Experimentally Determined Long Intrinsically Disordered Protein Regions Are Now Abundant in the Protein Data Bank.实验确定的长固有无序蛋白质区域现在在蛋白质数据库中大量存在。
Int J Mol Sci. 2020 Jun 24;21(12):4496. doi: 10.3390/ijms21124496.
7
Predicting Conformational Disorder.预测构象无序。
Methods Mol Biol. 2016;1415:265-99. doi: 10.1007/978-1-4939-3572-7_14.
8
Revealing Missing Protein-Ligand Interactions Using AlphaFold Predictions.利用 AlphaFold 预测揭示缺失的蛋白配体相互作用。
J Mol Biol. 2024 Dec 1;436(23):168852. doi: 10.1016/j.jmb.2024.168852. Epub 2024 Nov 6.
9
Resolving the ambiguity: Making sense of intrinsic disorder when PDB structures disagree.解决歧义:当蛋白质数据银行(PDB)结构不一致时理解内在无序性。
Protein Sci. 2016 Mar;25(3):676-88. doi: 10.1002/pro.2864. Epub 2016 Jan 9.
10
Structure Determination by Single-Particle Cryo-Electron Microscopy: Only the Sky (and Intrinsic Disorder) is the Limit.利用单颗粒冷冻电子显微镜进行结构测定:只有天空(和固有无序)是极限。
Int J Mol Sci. 2019 Aug 27;20(17):4186. doi: 10.3390/ijms20174186.

本文引用的文献

1
The power and pitfalls of AlphaFold2 for structure prediction beyond rigid globular proteins.AlphaFold2 在刚性球状蛋白以外的结构预测中的优势和陷阱。
Nat Chem Biol. 2024 Aug;20(8):950-959. doi: 10.1038/s41589-024-01638-w. Epub 2024 Jun 21.
2
AlphaFold2 modeling and molecular dynamics simulations of an intrinsically disordered protein.AlphaFold2 对一个固有无序蛋白的建模和分子动力学模拟。
PLoS One. 2024 May 13;19(5):e0301866. doi: 10.1371/journal.pone.0301866. eCollection 2024.
3
Conformational ensembles of the human intrinsically disordered proteome.
人类内在无序蛋白质组的构象集合
Nature. 2024 Feb;626(8000):897-904. doi: 10.1038/s41586-023-07004-5. Epub 2024 Jan 31.
4
AlphaFold predictions are valuable hypotheses and accelerate but do not replace experimental structure determination.AlphaFold 的预测结果是有价值的假说,可以加速但不能替代实验结构确定。
Nat Methods. 2024 Jan;21(1):110-116. doi: 10.1038/s41592-023-02087-4. Epub 2023 Nov 30.
5
DisProt in 2024: improving function annotation of intrinsically disordered proteins.2024 年的 DisProt:改善无序蛋白质的功能注释。
Nucleic Acids Res. 2024 Jan 5;52(D1):D434-D441. doi: 10.1093/nar/gkad928.
6
The Origin of Discrepancies between Predictions and Annotations in Intrinsically Disordered Proteins.固有无序蛋白质中预测和注释差异的起源。
Biomolecules. 2023 Sep 25;13(10):1442. doi: 10.3390/biom13101442.
7
Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2.利用 AlphaFold2 系统识别条件折叠的固有无序区域。
Proc Natl Acad Sci U S A. 2023 Oct 31;120(44):e2304302120. doi: 10.1073/pnas.2304302120. Epub 2023 Oct 25.
8
Tutorial: a guide for the selection of fast and accurate computational tools for the prediction of intrinsic disorder in proteins.教程:用于选择快速准确的计算工具预测蛋白质内无序性的指南。
Nat Protoc. 2023 Nov;18(11):3157-3172. doi: 10.1038/s41596-023-00876-x. Epub 2023 Sep 22.
9
Clustering predicted structures at the scale of the known protein universe.对已知蛋白质宇宙尺度的预测结构进行聚类。
Nature. 2023 Oct;622(7983):637-645. doi: 10.1038/s41586-023-06510-w. Epub 2023 Sep 13.
10
Uncovering new families and folds in the natural protein universe.揭示自然蛋白质宇宙中的新家族和新折叠。
Nature. 2023 Oct;622(7983):646-653. doi: 10.1038/s41586-023-06622-3. Epub 2023 Sep 13.