Data-Independent Acquisition Peptidomics.

Affiliations

Department of Computer Science, Applied Bioinformatics, University of Tübingen, Tübingen, Germany.

Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada.

Publication information

Methods Mol Biol. 2024;2758:77-88. doi: 10.1007/978-1-0716-3646-6_4.

DOI:10.1007/978-1-0716-3646-6_4

PMID:38549009

Abstract

In recent years, data-independent acquisition (DIA) has emerged as a powerful analysis method in biological mass spectrometry (MS). Compared to the previously predominant data-dependent acquisition (DDA), it offers a way to achieve greater reproducibility, sensitivity, and dynamic range in MS measurements. To make DIA accessible to non-expert users, a multifunctional, automated high-throughput pipeline DIAproteomics was implemented in the computational workflow framework "Nextflow" ( https://nextflow.io ). This allows high-throughput processing of proteomics and peptidomics DIA datasets on diverse computing infrastructures. This chapter provides a short summary and usage protocol guide for the most important modes of operation of this pipeline regarding the analysis of peptidomics datasets using the command line. In brief, DIAproteomics is a wrapper around the OpenSwathWorkflow and relies on either existing or ad-hoc generated spectral libraries from matching DDA runs. The OpenSwathWorkflow extracts chromatograms from the DIA runs and performs chromatographic peak-picking. Further downstream of the pipeline, these peaks are scored, aligned, and statistically evaluated for qualitative and quantitative differences across conditions depending on the user's interest. DIAproteomics is open-source and available under a permissive license. We encourage the scientific community to use or modify the pipeline to meet their specific requirements.
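The chromatogram extraction and peak-picking step described in the abstract can be illustrated with a minimal sketch. This is not the OpenSwathWorkflow implementation or its API: the function names, the toy spectra, and the 20 ppm tolerance are all assumptions chosen for illustration. It only shows the general idea of tracing library fragment ions across DIA scans and locating the retention time where they co-elute.

```python
from typing import List, Tuple

# One toy DIA MS2 scan: (retention time in seconds, [(m/z, intensity), ...]).
# Hypothetical data structure, not an OpenMS type.
Spectrum = Tuple[float, List[Tuple[float, float]]]


def extract_chromatogram(spectra: List[Spectrum], target_mz: float,
                         tol_ppm: float = 20.0) -> List[Tuple[float, float]]:
    """Sum the intensity found within +/- tol_ppm of target_mz in every scan."""
    tol = target_mz * tol_ppm * 1e-6
    trace = []
    for rt, peaks in spectra:
        intensity = sum(i for mz, i in peaks if abs(mz - target_mz) <= tol)
        trace.append((rt, intensity))
    return trace


def find_coeluting_peak(spectra: List[Spectrum], fragment_mzs: List[float],
                        tol_ppm: float = 20.0) -> Tuple[float, float]:
    """Sum the traces of all library fragments and return the apex of the sum."""
    traces = [extract_chromatogram(spectra, mz, tol_ppm) for mz in fragment_mzs]
    # zip(*traces) groups the points that belong to the same scan.
    summed = [(pts[0][0], sum(p[1] for p in pts)) for pts in zip(*traces)]
    return max(summed, key=lambda point: point[1])


# Toy example: two library fragments that co-elute at 11.0 s.
# The peak at m/z 700.0 is an unrelated interference and is ignored.
spectra = [
    (10.0, [(500.2500, 5.0), (600.3000, 2.0)]),
    (11.0, [(500.2501, 50.0), (600.3001, 40.0), (700.0, 300.0)]),
    (12.0, [(500.2499, 20.0), (600.2999, 10.0)]),
]
fragments = [500.25, 600.30]

print(find_coeluting_peak(spectra, fragments))  # (11.0, 90.0)
```

A real workflow would additionally calibrate retention times, smooth the traces, and score candidate peak groups against decoys before any statistical evaluation; those steps are omitted here.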
