Department of Computer Science, Applied Bioinformatics, University of Tübingen, Tübingen, Germany.
Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada.
Methods Mol Biol. 2024;2758:77-88. doi: 10.1007/978-1-0716-3646-6_4.
In recent years, data-independent acquisition (DIA) has emerged as a powerful analysis method in biological mass spectrometry (MS). Compared to the previously predominant data-dependent acquisition (DDA), it offers a way to achieve greater reproducibility, sensitivity, and dynamic range in MS measurements. To make DIA accessible to non-expert users, a multifunctional, automated high-throughput pipeline DIAproteomics was implemented in the computational workflow framework "Nextflow" ( https://nextflow.io ). This allows high-throughput processing of proteomics and peptidomics DIA datasets on diverse computing infrastructures. This chapter provides a short summary and usage protocol guide for the most important modes of operation of this pipeline regarding the analysis of peptidomics datasets using the command line. In brief, DIAproteomics is a wrapper around the OpenSwathWorkflow and relies on either existing or ad-hoc generated spectral libraries from matching DDA runs. The OpenSwathWorkflow extracts chromatograms from the DIA runs and performs chromatographic peak-picking. Further downstream of the pipeline, these peaks are scored, aligned, and statistically evaluated for qualitative and quantitative differences across conditions depending on the user's interest. DIAproteomics is open-source and available under a permissive license. We encourage the scientific community to use or modify the pipeline to meet their specific requirements.
近年来,数据非依赖采集(DIA)已成为生物质谱分析领域的一种强大分析方法。与先前占主导地位的数据依赖采集(DDA)相比,它提供了一种在 MS 测量中实现更高重现性、灵敏度和动态范围的方法。为了使非专业用户能够使用 DIA,我们在计算工作流程框架“Nextflow”(https://nextflow.io)中实现了多功能、自动化的高通量 DIA 蛋白质组学管道 DIAproteomics。这允许在各种计算基础设施上对蛋白质组学和肽组学 DIA 数据集进行高通量处理。本章提供了该管道在使用命令行分析肽组学数据集方面的最重要操作模式的简短摘要和使用协议指南。简而言之,DIAproteomics 是 OpenSwathWorkflow 的包装器,依赖于来自匹配 DDA 运行的现有或临时生成的光谱库。OpenSwathWorkflow 从 DIA 运行中提取色谱图,并执行色谱峰提取。在管道的下游,这些峰根据用户的兴趣进行评分、对齐和统计评估,以确定条件之间的定性和定量差异。DIAproteomics 是开源的,并根据许可协议提供。我们鼓励科学界使用或修改该管道以满足他们的特定要求。