Suppr超能文献

基于同位素图谱识别的蛋白质核磁共振波谱解析。

Protein NMR assignment by isotope pattern recognition.

机构信息

School of Chemistry, University of Southampton, University Road, Southampton SO17 1BJ, UK.

Department of Biochemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA.

出版信息

Sci Adv. 2024 Sep 6;10(36):eado0403. doi: 10.1126/sciadv.ado0403. Epub 2024 Sep 4.

Abstract

The current standard method for amino acid signal identification in protein NMR spectra is sequential assignment using triple-resonance experiments. Good software and elaborate heuristics exist, but the process remains laboriously manual. Machine learning does help, but its training databases need millions of samples that cover all relevant physics and every kind of instrumental artifact. In this communication, we offer a solution to this problem. We propose polyadic decompositions to store millions of simulated three-dimensional NMR spectra, on-the-fly generation of artifacts during training, a probabilistic way to incorporate prior and posterior information, and integration with the industry standard CcpNmr software framework. The resulting neural nets take [H,C] slices of mixed pyruvate-labeled HNCA spectra (different CA signal shapes for different residue types) and return an amino acid probability table. In combination with primary sequence information, backbones of common proteins (GB1, MBP, and INMT) are rapidly assigned from just the HNCA spectrum.

摘要

目前,在蛋白质 NMR 谱中识别氨基酸信号的标准方法是使用三共振实验进行顺序赋值。有很好的软件和精心设计的启发式方法,但这个过程仍然是繁琐的手动操作。机器学习确实有所帮助,但它的训练数据库需要覆盖所有相关物理和各种仪器伪影的数百万个样本。在本通讯中,我们提供了一种解决此问题的方法。我们提出了多线性分解来存储数百万个模拟的三维 NMR 谱,在训练过程中实时生成伪影,一种概率方法来合并先验和后验信息,并与行业标准的 CcpNmr 软件框架集成。生成的神经网络采用混合丙酮酸标记的 HNCA 谱的 [H,C] 切片(不同残基类型的 CA 信号形状不同),并返回氨基酸概率表。结合原始序列信息,仅从 HNCA 谱即可快速分配常见蛋白质(GB1、MBP 和 INMT)的骨架。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b1d2/11373586/d42de2881f12/sciadv.ado0403-f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验