Lin Ke, Smit Sandra, Bonnema Guusje, Sanchez-Perez Gabino, de Ridder Dick
Brief Bioinform. 2015 Sep;16(5):852-64. doi: 10.1093/bib/bbu047. Epub 2014 Dec 12.
From prokaryotes to eukaryotes, phenotypic variation, adaptation and speciation has been associated with structural variation between genomes of individuals within the same species. Many computer algorithms detecting such variations (callers) have recently been developed, spurred by the advent of the next-generation sequencing technology. Such callers mainly exploit split-read mapping or paired-end read mapping. However, as different callers are geared towards different types of structural variation, there is still no single caller that can be considered a community standard; instead, increasingly the various callers are combined in integrated pipelines. In this article, we review a wide range of callers, discuss challenges in the integration step and present a survey of pipelines used in population genomics studies. Based on our findings, we provide general recommendations on how to set-up such pipelines. Finally, we present an outlook on future challenges in structural variation detection.
从原核生物到真核生物,表型变异、适应和物种形成一直与同一物种内个体基因组之间的结构变异相关。受下一代测序技术出现的推动,最近开发了许多检测此类变异的计算机算法(调用程序)。此类调用程序主要利用分裂读段映射或双末端读段映射。然而,由于不同的调用程序针对不同类型的结构变异,仍然没有一个调用程序可以被视为社区标准;相反,越来越多的各种调用程序被组合在集成流程中。在本文中,我们回顾了广泛的调用程序,讨论了集成步骤中的挑战,并对群体基因组学研究中使用的流程进行了调查。基于我们的发现,我们就如何设置此类流程提供了一般性建议。最后,我们对结构变异检测未来的挑战进行了展望。