Python workflow for the selection and identification of marker peptides-proof-of-principle study with heated milk.

Affiliations

GALAB Laboratories GmbH, Am Schleusengraben 7, 21029, Hamburg, Germany.

Department of Food Chemistry and Analysis, Institute of Food Technology and Food Chemistry, Technical University Berlin, Gustav Meyer Allee 25, 13355, Berlin, Germany.

Publication information

Anal Bioanal Chem. 2024 Jun;416(14):3349-3360. doi: 10.1007/s00216-024-05286-w. Epub 2024 Apr 12.

DOI: 10.1007/s00216-024-05286-w

PMID: 38607384

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11106092/

Abstract

The analysis of near-holistic food profiles has developed considerably in recent years. This has led to larger amounts of data and the ability to obtain more information than ever before about health-beneficial and adverse constituents in food. In proteomics especially, evaluation relies on software that does not provide specific approaches for unique monitoring questions. The programming language Python offers an additional and more comprehensive way of evaluating such data. Its large ecosystem for mass spectrometric data analysis opens up broad possibilities, but it needs to be tailored to the specific set of features and the research question behind them. It also allows various machine-learning approaches to be applied. The aim of the present study was to develop an algorithm for selecting and identifying potential marker peptides from mass spectrometric data. The workflow is divided into three steps: (I) feature engineering, (II) chemometric data analysis, and (III) feature identification. The first step transforms the mass spectrometric data into a structure that enables the application of existing data analysis packages in Python. The second step is the data analysis used to select single features. These features are processed further in the third step, feature identification. The data used as an example in this proof-of-principle approach came from a study on the influence of heat treatment on the milk proteome/peptidome.
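
The three steps named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the peak lists, the bin widths, the candidate labels and m/z values, and all function names are made up for illustration. A plain log2 fold change stands in for the chemometric analysis of step II, and step III is reduced to a ppm-window match against a hypothetical candidate list.

```python
import math
from collections import defaultdict
from statistics import mean

# Made-up toy peak lists: sample -> [(m/z, retention time in min, intensity)].
PEAKS = {
    "raw_1": [(500.27, 10.1, 900.0), (650.32, 15.2, 100.0)],
    "raw_2": [(500.28, 10.2, 1100.0), (650.33, 15.1, 120.0)],
    "heated_1": [(500.27, 10.1, 1000.0), (650.32, 15.2, 800.0)],
    "heated_2": [(500.28, 10.3, 950.0), (650.33, 15.3, 900.0)],
}
GROUP_OF = {"raw_1": "raw", "raw_2": "raw", "heated_1": "heated", "heated_2": "heated"}

# Hypothetical candidate list: label -> theoretical m/z (not real peptides).
CANDIDATES = {"candidate_A": 650.327, "candidate_B": 712.410}


def build_feature_table(peaks, mz_width=0.1, rt_width=1.0):
    """Step I: bin peaks by m/z and retention time into a feature x sample table."""
    intensities = defaultdict(lambda: defaultdict(float))
    observed_mz = defaultdict(list)
    for sample, peak_list in peaks.items():
        for mz, rt, intensity in peak_list:
            feature = (math.floor(mz / mz_width), math.floor(rt / rt_width))
            intensities[feature][sample] += intensity
            observed_mz[feature].append(mz)
    mean_mz = {feature: mean(values) for feature, values in observed_mz.items()}
    return intensities, mean_mz


def select_features(intensities, group_of, min_abs_log2fc=1.0):
    """Step II (stand-in): keep features with a large heated/raw log2 fold change."""
    selected = {}
    for feature, by_sample in intensities.items():
        raw = [by_sample.get(s, 0.0) for s, g in group_of.items() if g == "raw"]
        heated = [by_sample.get(s, 0.0) for s, g in group_of.items() if g == "heated"]
        # A pseudocount of 1 avoids log(0) for features missing in one group.
        log2fc = math.log2((mean(heated) + 1.0) / (mean(raw) + 1.0))
        if abs(log2fc) >= min_abs_log2fc:
            selected[feature] = log2fc
    return selected


def identify(selected, mean_mz, candidates, tol_ppm=10.0):
    """Step III: match selected features to candidate m/z values within a ppm window."""
    hits = {}
    for feature in selected:
        for label, theo_mz in candidates.items():
            ppm = abs(mean_mz[feature] - theo_mz) / theo_mz * 1e6
            if ppm <= tol_ppm:
                hits[feature] = label
    return hits


intensities, mean_mz = build_feature_table(PEAKS)
selected = select_features(intensities, GROUP_OF)
hits = identify(selected, mean_mz, CANDIDATES)
print(hits)
```

On this toy data, only the feature near m/z 650.3 passes the fold-change filter, and it matches the hypothetical `candidate_A`. A real workflow would presumably read vendor or mzML files and use multivariate methods in place of the fold-change filter.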


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd82/11106092/7d93dadfd570/216_2024_5286_Fig1_HTML.jpg

Similar articles

Python workflow for the selection and identification of marker peptides-proof-of-principle study with heated milk.

Anal Bioanal Chem. 2024 Jun;416(14):3349-3360. doi: 10.1007/s00216-024-05286-w. Epub 2024 Apr 12.

PeptidePicker: a scientific workflow with web interface for selecting appropriate peptides for targeted proteomics experiments.

J Proteomics. 2014 Jun 25;106:151-61. doi: 10.1016/j.jprot.2014.04.018. Epub 2014 Apr 22.

The Shelf Life of Milk-A Novel Concept for the Identification of Marker Peptides Using Multivariate Analysis.

Foods. 2024 Mar 8;13(6):831. doi: 10.3390/foods13060831.

CoreFlow: a computational platform for integration, analysis and modeling of complex biological data.

J Proteomics. 2014 Apr 4;100:167-73. doi: 10.1016/j.jprot.2014.01.023. Epub 2014 Feb 3.

Scientific workflow optimization for improved peptide and protein identification.

BMC Bioinformatics. 2015 Sep 3;16(1):284. doi: 10.1186/s12859-015-0714-x.

Peptidomics as a tool for characterizing bioactive milk peptides.

Food Chem. 2017 Sep 1;230:91-98. doi: 10.1016/j.foodchem.2017.03.016. Epub 2017 Mar 8.

Purple: A Computational Workflow for Strategic Selection of Peptides for Viral Diagnostics Using MS-Based Targeted Proteomics.

Viruses. 2019 Jun 8;11(6):536. doi: 10.3390/v11060536.

Methods and Algorithms for Quantitative Proteomics by Mass Spectrometry.

Methods Mol Biol. 2020;2051:161-197. doi: 10.1007/978-1-4939-9744-2_7.

2DB: a Proteomics database for storage, analysis, presentation, and retrieval of information from mass spectrometric experiments.

BMC Bioinformatics. 2008 Jul 7;9:302. doi: 10.1186/1471-2105-9-302.

Discrimination of overheated pasteurized milk using mass spectrometry-based proteomics.

J Chromatogr B Analyt Technol Biomed Life Sci. 2024 Aug 1;1243:124236. doi: 10.1016/j.jchromb.2024.124236. Epub 2024 Jul 7.

Cited by

Accumulated seizure burden predicts neurodevelopmental outcome at 36 months of age in patients with tuberous sclerosis complex.

Epilepsia. 2025 Jan;66(1):117-133. doi: 10.1111/epi.18172. Epub 2024 Oct 29.

Marker Peptides for Indicating the Spoilage of Milk-Sample Preparation and Chemometric Approaches for Yielding Potential Peptides in a Raw Milk Model.

Foods. 2024 Oct 18;13(20):3315. doi: 10.3390/foods13203315.

