H. Lee Moffitt Cancer Center & Research Institute, Tampa, FL, USA.
Department of Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center & Research Institute, Tampa, FL, USA.
Methods Mol Biol. 2021;2194:187-221. doi: 10.1007/978-1-0716-0849-4_11.
Highly collaborative scientists are often called on to extend their expertise to different types of projects and to expand the scope and scale of projects well beyond their previous experience. For a large-scale project involving "big data" to be successful, several different aspects of the research plan need to be developed and tested, which include but are not limited to the experimental design, sample collection, sample preparation, metadata recording, technical capability, data acquisition, approaches for data analysis, methods for integration of different data types, recruitment of additional expertise as needed to guide the project, and strategies for clear communication throughout the project. To capture this process, we describe an example project in proteogenomics that built on our collective expertise and experience. Key steps included definition of hypotheses, identification of an appropriate clinical cohort, pilot projects to assess feasibility, refinement of experimental designs, and extensive discussions involving the research team throughout the process. The goal of this chapter is to provide the reader with a set of guidelines to support development of other large-scale multiomics projects.
高度协作的科学家经常被要求将其专业知识扩展到不同类型的项目中,并将项目的范围和规模大大超出其以往的经验。为了使涉及“大数据”的大型项目取得成功,需要开发和测试研究计划的几个不同方面,其中包括但不限于实验设计、样本收集、样本准备、元数据记录、技术能力、数据采集、数据分析方法、整合不同类型数据的方法、根据需要招募额外的专业知识来指导项目,以及整个项目中明确沟通的策略。为了捕捉这个过程,我们描述了一个基于我们集体专业知识和经验的蛋白质基因组学示例项目。关键步骤包括假设的定义、合适临床队列的确定、评估可行性的试点项目、实验设计的改进,以及研究团队在整个过程中的广泛讨论。本章的目标是为读者提供一套指导方针,以支持其他大规模多组学项目的开发。