Skinnider Michael A
Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA.
Ludwig Institute for Cancer Research, Princeton University, Princeton, NJ 08540, USA.
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae097.
High-throughput techniques that measure thousands of analytes at once have become ubiquitous features of biological research. The increasing expectation that the raw data generated by these techniques be deposited to public repositories creates rich opportunities for secondary analysis of these datasets. Such opportunities can take multiple forms. As the recipient of the 2023 Junior Research Parasite Award, I was asked to comment on the role of so-called research parasites within the ecosystem of secondary data analysis. Drawing on my own experiences, I discuss mechanisms by which reanalysis of published datasets can catalyze biological discoveries, produce resources that would be impossible to generate within a single laboratory, and drive the refinement of computational methods.
能够一次性测量数千种分析物的高通量技术已成为生物学研究中普遍存在的特征。人们越来越期望将这些技术产生的原始数据存入公共数据库,这为这些数据集的二次分析创造了丰富的机会。此类机会可以有多种形式。作为2023年初级研究寄生虫奖的获得者,我受邀就所谓的研究寄生虫在二次数据分析生态系统中的作用发表评论。借鉴我自己的经验,我讨论了对已发表数据集进行重新分析能够促进生物学发现、产生单个实验室无法生成的资源以及推动计算方法完善的机制。