School of Computing, Ulster University, Newtownabbey, UK.
MRC Integrative Epidemiology Unit, Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK.
Brief Bioinform. 2019 Sep 27;20(5):1795-1811. doi: 10.1093/bib/bby051.
There has been an exponential growth in the performance and output of sequencing technologies (omics data) with full genome sequencing now producing gigabases of reads on a daily basis. These data may hold the promise of personalized medicine, leading to routinely available sequencing tests that can guide patient treatment decisions. In the era of high-throughput sequencing (HTS), computational considerations, data governance and clinical translation are the greatest rate-limiting steps. To ensure that the analysis, management and interpretation of such extensive omics data is exploited to its full potential, key factors, including sample sourcing, technology selection and computational expertise and resources, need to be considered, leading to an integrated set of high-performance tools and systems. This article provides an up-to-date overview of the evolution of HTS and the accompanying tools, infrastructure and data management approaches that are emerging in this space, which, if used within in a multidisciplinary context, may ultimately facilitate the development of personalized medicine.
测序技术(组学数据)的性能和产量呈指数级增长,全基因组测序现在每天产生千兆字节的读取量。这些数据可能有望实现个性化医疗,从而可以常规进行测序测试,以指导患者的治疗决策。在高通量测序(HTS)时代,计算考虑、数据治理和临床转化是最大的限速步骤。为了确保充分利用此类广泛的组学数据的分析、管理和解释,需要考虑关键因素,包括样本来源、技术选择以及计算专业知识和资源,从而形成一套集成的高性能工具和系统。本文提供了对 HTS 及其伴随工具、基础设施和数据管理方法的最新概述,这些工具、基础设施和数据管理方法正在这一领域中出现,如果在多学科背景下使用,最终可能会促进个性化医疗的发展。