Managing a Large-Scale Multiomics Project: A Team Science Case Study in Proteogenomics.

Author affiliations

H. Lee Moffitt Cancer Center & Research Institute, Tampa, FL, USA.

Department of Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center & Research Institute, Tampa, FL, USA.

Publication information

Methods Mol Biol. 2021;2194:187-221. doi: 10.1007/978-1-0716-0849-4_11.

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7771375/

Abstract

Highly collaborative scientists are often called on to extend their expertise to different types of projects and to expand the scope and scale of projects well beyond their previous experience. For a large-scale project involving "big data" to be successful, several different aspects of the research plan need to be developed and tested, which include but are not limited to the experimental design, sample collection, sample preparation, metadata recording, technical capability, data acquisition, approaches for data analysis, methods for integration of different data types, recruitment of additional expertise as needed to guide the project, and strategies for clear communication throughout the project. To capture this process, we describe an example project in proteogenomics that built on our collective expertise and experience. Key steps included definition of hypotheses, identification of an appropriate clinical cohort, pilot projects to assess feasibility, refinement of experimental designs, and extensive discussions involving the research team throughout the process. The goal of this chapter is to provide the reader with a set of guidelines to support development of other large-scale multiomics projects.
