IBM Research Europe, The Hartree Centre STFC Laboratory, Warrington WA4 4AD, UK.
Genome. 2021 Apr;64(4):467-475. doi: 10.1139/gen-2020-0096. Epub 2020 Nov 20.
Genomics is both a data- and compute-intensive discipline. Its success depends on an adequate informatics infrastructure that can address growing data demands and enable a diverse range of resource-intensive computational activities. Designing a suitable infrastructure is a challenging task, and its success largely depends on its adoption by users. In this article, we take a user-centric view of genomics, where the users are bioinformaticians, computational biologists, and data scientists. From their point of view, we examine how traditional computational activities for genomics are expanding due to data growth and the introduction of big data and cloud technologies. The changing landscape of computational activities and new user requirements will influence the design of future genomics infrastructures.