Ionmob：用于预测肽段碰撞截面值的 Python 包。

Ionmob: a Python package for prediction of peptide collisional cross-section values.

机构信息

Institute of Computer Science, Johannes Gutenberg University, 55128 Mainz, Germany.

Institute for Immunology, University Medical Center of the Johannes Gutenberg University, 55128 Mainz, Germany.

出版信息

Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad486.

DOI:10.1093/bioinformatics/btad486

PMID:37540201

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10521631/

Abstract

MOTIVATION

Including ion mobility separation (IMS) into mass spectrometry proteomics experiments is useful to improve coverage and throughput. Many IMS devices enable linking experimentally derived mobility of an ion to its collisional cross-section (CCS), a highly reproducible physicochemical property dependent on the ion's mass, charge and conformation in the gas phase. Thus, known peptide ion mobilities can be used to tailor acquisition methods or to refine database search results. The large space of potential peptide sequences, driven also by posttranslational modifications of amino acids, motivates an in silico predictor for peptide CCS. Recent studies explored the general performance of varying machine-learning techniques, however, the workflow engineering part was of secondary importance. For the sake of applicability, such a tool should be generic, data driven, and offer the possibility to be easily adapted to individual workflows for experimental design and data processing.

RESULTS

We created ionmob, a Python-based framework for data preparation, training, and prediction of collisional cross-section values of peptides. It is easily customizable and includes a set of pretrained, ready-to-use models and preprocessing routines for training and inference. Using a set of ≈21 000 unique phosphorylated peptides and ≈17 000 MHC ligand sequences and charge state pairs, we expand upon the space of peptides that can be integrated into CCS prediction. Lastly, we investigate the applicability of in silico predicted CCS to increase confidence in identified peptides by applying methods of re-scoring and demonstrate that predicted CCS values complement existing predictors for that task.

AVAILABILITY AND IMPLEMENTATION

The Python package is available at github: https://github.com/theGreatHerrLebert/ionmob.

摘要

动机

在质谱蛋白质组学实验中纳入离子淌度分离（IMS）有助于提高覆盖率和通量。许多 IMS 设备能够将实验中获得的离子淌度与其碰撞截面（CCS）相关联，CCS 是一种高度可重现的物理化学性质，取决于离子在气相中的质量、电荷和构象。因此，可以使用已知的肽离子淌度来定制采集方法或改进数据库搜索结果。氨基酸的翻译后修饰也会推动潜在肽序列的大量产生，这就需要开发一个用于预测肽 CCS 的计算工具。最近的研究探索了不同机器学习技术的一般性能，然而，工作流程工程部分相对次要。为了适用性，这样的工具应该是通用的、基于数据的，并提供易于适应个体实验设计和数据处理工作流程的可能性。

结果

我们创建了 ionmob，这是一个基于 Python 的框架，用于肽的 CCS 值的准备、训练和预测。它易于定制，并且包含一套预训练的、可立即使用的模型和预处理例程，用于训练和推理。使用一组约 21000 个独特的磷酸化肽和约 17000 个 MHC 配体序列和电荷状态对，我们扩展了可以集成到 CCS 预测中的肽的范围。最后，我们通过应用重新评分方法来研究计算预测的 CCS 值在提高鉴定肽的置信度方面的适用性，并证明预测的 CCS 值补充了现有预测器在该任务中的应用。

可用性和实现

该 Python 包可在以下网址获得：https://github.com/theGreatHerrLebert/ionmob。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36eb/10521631/5aec7f0d770b/btad486f1.jpg

相似文献

Ionmob: a Python package for prediction of peptide collisional cross-section values.Ionmob：用于预测肽段碰撞截面值的 Python 包。

Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad486.

Accurate Prediction of Ion Mobility Collision Cross-Section Using Ion's Polarizability and Molecular Mass with Limited Data.利用离子极化率和分子量并基于有限数据准确预测离子迁移率碰撞截面

J Chem Inf Model. 2024 Mar 11;64(5):1533-1542. doi: 10.1021/acs.jcim.3c01491. Epub 2024 Feb 23.

AutoCCS: automated collision cross-section calculation software for ion mobility spectrometry-mass spectrometry.AutoCCS：用于离子淌度谱-质谱联用的自动碰撞截面计算软件。

Bioinformatics. 2021 Nov 18;37(22):4193-4201. doi: 10.1093/bioinformatics/btab429.

Evaluation of a Reference-Free Collision Cross Section Calibration Strategy for Proteomics Using SLIM-Based High-Resolution Ion Mobility Spectrometry-Mass Spectrometry.基于 SLIM 的高分辨离子淌度质谱联用技术在蛋白质组学中无参比碰撞截面校准策略的评估。

J Am Soc Mass Spectrom. 2024 Jul 3;35(7):1539-1549. doi: 10.1021/jasms.4c00141. Epub 2024 Jun 12.

Artificial neural networks for the prediction of peptide drift time in ion mobility mass spectrometry.人工神经网络在离子淌度质谱中预测肽漂移时间的应用。

BMC Bioinformatics. 2010 Apr 11;11:182. doi: 10.1186/1471-2105-11-182.

Peptide collision cross sections of 22 post-translational modifications.22 种翻译后修饰的肽碰撞截面。

Anal Bioanal Chem. 2023 Nov;415(27):6633-6645. doi: 10.1007/s00216-023-04957-4. Epub 2023 Sep 28.

Deep learning the collisional cross sections of the peptide universe from a million experimental values.从一百万个实验值中深度学习肽宇宙的碰撞截面。

Nat Commun. 2021 Feb 19;12(1):1185. doi: 10.1038/s41467-021-21352-8.

AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics.AlphaPeptDeep：用于蛋白质组学的模块化深度学习框架，用于预测肽性质。

Nat Commun. 2022 Nov 24;13(1):7238. doi: 10.1038/s41467-022-34904-3.

High-Throughput Measurement and Machine Learning-Based Prediction of Collision Cross Sections for Drugs and Drug Metabolites.高通量测量和基于机器学习的药物和药物代谢物碰撞截面预测。

J Am Soc Mass Spectrom. 2022 Jun 1;33(6):1061-1072. doi: 10.1021/jasms.2c00111. Epub 2022 May 11.

PIXiE: an algorithm for automated ion mobility arrival time extraction and collision cross section calculation using global data association.PIXiE：一种使用全局数据关联进行自动离子迁移到达时间提取和碰撞截面计算的算法。

Bioinformatics. 2017 Sep 1;33(17):2715-2722. doi: 10.1093/bioinformatics/btx305.

引用本文的文献

Collisional Cross-Section Prediction for Multiconformational Peptide Ions with IM2Deep.使用IM2Deep预测多构象肽离子的碰撞截面

Anal Chem. 2025 Jul 22;97(28):15113-15121. doi: 10.1021/acs.analchem.5c01142. Epub 2025 Jul 8.

Leveraging pretrained deep protein language model to predict peptide collision cross section.利用预训练的深度蛋白质语言模型预测肽段的碰撞截面。

Commun Chem. 2025 May 6;8(1):137. doi: 10.1038/s42004-025-01540-z.

Rustims: An Open-Source Framework for Rapid Development and Processing of timsTOF Data-Dependent Acquisition Data.Rustims：一个用于快速开发和处理timsTOF数据依赖型采集数据的开源框架。

J Proteome Res. 2025 May 2;24(5):2358-2368. doi: 10.1021/acs.jproteome.4c00966. Epub 2025 Apr 22.

Peptide Property Prediction for Mass Spectrometry Using AI: An Introduction to State of the Art Models.使用人工智能进行质谱肽特性预测：最新模型介绍

Proteomics. 2025 May;25(9-10):e202400398. doi: 10.1002/pmic.202400398. Epub 2025 Apr 10.

Maximizing Immunopeptidomics-Based Bacterial Epitope Discovery by Multiple Search Engines and Rescoring.通过多搜索引擎和重新评分最大化基于免疫肽组学的细菌表位发现

J Proteome Res. 2025 Apr 4;24(4):2141-2151. doi: 10.1021/acs.jproteome.4c00864. Epub 2025 Mar 13.

SWAPS: A Modular Deep-Learning Empowered Peptide Identity Propagation Framework Beyond Match-Between-Run.SWAPS：一种模块化的深度学习赋能的肽段身份传播框架，超越了批次间匹配。

J Proteome Res. 2025 Apr 4;24(4):1926-1940. doi: 10.1021/acs.jproteome.4c00972. Epub 2025 Mar 7.

diaPASEF Analysis for HLA-I Peptides Enables Quantification of Common Cancer Neoantigens.用于HLA-I肽段的diaPASEF分析能够对常见癌症新抗原进行定量分析。

Mol Cell Proteomics. 2025 Apr;24(4):100938. doi: 10.1016/j.mcpro.2025.100938. Epub 2025 Mar 3.

TIMSRescore: A Data Dependent Acquisition-Parallel Accumulation and Serial Fragmentation-Optimized Data-Driven Rescoring Pipeline Based on MSRescore.TIMS重评分：一种基于MSRescore的数据依赖采集-并行累积与串行碎片化优化的数据驱动重评分流程。

J Proteome Res. 2025 Mar 7;24(3):1067-1076. doi: 10.1021/acs.jproteome.4c00609. Epub 2025 Feb 6.

ProPept-MT: A Multi-Task Learning Model for Peptide Feature Prediction.ProPept-MT：用于肽段特征预测的多任务学习模型。

Int J Mol Sci. 2024 Jun 30;25(13):7237. doi: 10.3390/ijms25137237.

Liquid-phase separations coupled with ion mobility-mass spectrometry for next-generation biopharmaceutical analysis.液相反相分离与离子淌度-质谱联用在新一代生物制药分析中的应用。

Expert Rev Proteomics. 2024 May-Jun;21(5-6):259-270. doi: 10.1080/14789450.2024.2373707. Epub 2024 Jul 1.

本文引用的文献

AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics.AlphaPeptDeep：用于蛋白质组学的模块化深度学习框架，用于预测肽性质。

Nat Commun. 2022 Nov 24;13(1):7238. doi: 10.1038/s41467-022-34904-3.

MSRescore: Data-Driven Rescoring Dramatically Boosts Immunopeptide Identification Rates.MSRescore：数据驱动的重新评分极大地提高了免疫肽识别率。

Mol Cell Proteomics. 2022 Aug;21(8):100266. doi: 10.1016/j.mcpro.2022.100266. Epub 2022 Jul 6.

A novel immunopeptidomic-based pipeline for the generation of personalized oncolytic cancer vaccines.一种新型免疫肽组学为基础的流水线，用于生成个体化溶瘤癌症疫苗。

Elife. 2022 Mar 22;11:e71156. doi: 10.7554/eLife.71156.

Ultra-high sensitivity mass spectrometry quantifies single-cell proteome changes upon perturbation.超高灵敏度质谱定量分析扰动后单细胞蛋白质组的变化。

Mol Syst Biol. 2022 Mar;18(3):e10798. doi: 10.15252/msb.202110798.

A Deep Convolutional Neural Network for Prediction of Peptide Collision Cross Sections in Ion Mobility Spectrometry.用于预测离子淌度谱中肽段碰撞截面的深度卷积神经网络。

Biomolecules. 2021 Dec 19;11(12):1904. doi: 10.3390/biom11121904.

Trapped Ion Mobility Spectrometry and Parallel Accumulation-Serial Fragmentation in Proteomics.离子阱淌度质谱技术及其在蛋白质组学中的平行累积-串联碎裂。

Mol Cell Proteomics. 2021;20:100138. doi: 10.1016/j.mcpro.2021.100138. Epub 2021 Aug 17.

MaxDIA enables library-based and library-free data-independent acquisition proteomics.MaxDIA支持基于文库和无文库的数据非依赖型采集蛋白质组学。

Nat Biotechnol. 2021 Dec;39(12):1563-1573. doi: 10.1038/s41587-021-00968-7. Epub 2021 Jul 8.

Sequence-Specific Model for Predicting Peptide Collision Cross Section Values in Proteomic Ion Mobility Spectrometry.蛋白质组学离子淌度谱中预测肽段碰撞截面值的序列特异性模型

J Proteome Res. 2021 Jun 16. doi: 10.1021/acs.jproteome.1c00185.

OpenTIMS, TimsPy, and TimsR: Open and Easy Access to timsTOF Raw Data.OpenTIMS、TimsPy 和 TimsR：轻松访问 timsTOF 原始数据

J Proteome Res. 2021 Apr 2;20(4):2122-2129. doi: 10.1021/acs.jproteome.0c00962. Epub 2021 Mar 16.

Deep learning the collisional cross sections of the peptide universe from a million experimental values.从一百万个实验值中深度学习肽宇宙的碰撞截面。

Nat Commun. 2021 Feb 19;12(1):1185. doi: 10.1038/s41467-021-21352-8.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

Ionmob：用于预测肽段碰撞截面值的 Python 包。

Ionmob: a Python package for prediction of peptide collisional cross-section values.

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY AND IMPLEMENTATION

动机

结果

可用性和实现

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献