Suppr超能文献

CoSpred: Machine Learning Workflow to Predict Tandem Mass Spectrum in Proteomics.

作者信息

Xue Liang, Tiwary Shivani, Bordyuh Mykola, Stanton Robert

机构信息

Machine Learning and Computational Sciences, Pfizer Worldwide R&D, Cambridge, Massachusetts, USA.

Machine Learning and Computational Sciences, Pfizer Worldwide R&D, Berlin, Germany.

出版信息

Proteomics. 2025 Aug;25(15):27-41. doi: 10.1002/pmic.70004. Epub 2025 Jun 30.

Abstract

In mass spectrometry-based proteomics, the use of deep learning algorithms can help improve the identification rates of peptides and proteins through the generation of high-fidelity theoretical spectrum which can be used as the basis of a more complete spectral library than those presently available, especially for unobserved protein/genetic variants. Here we focus on providing an end-to-end user-friendly machine learning workflow, which we call Complete Spectrum Predictor (CoSpred). Using CoSpred users can create their own machine learning compatible training dataset and then train a machine learning model to predict both backbone and non-backbone ions. For the model a transformer encoder architecture is used to predict the complete MS/MS spectrum from a given peptide sequence. In addition to the transformer model provided in the package, the code is built modularly to allow for alternate ML models to be easily "plugged in," allowing for spectrum prediction optimization given different experimental conditions. The CoSpred workflow (preprocessing→training→inference) provides a path for state-of-art ML capabilities to be more accessible to proteomics scientists.

摘要

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验