Weller Mathias, Chateau Annie, Giroudeau Rodolphe
BMC Bioinformatics. 2015;16 Suppl 14(Suppl 14):S2. doi: 10.1186/1471-2105-16-S14-S2. Epub 2015 Oct 2.
This paper presents new structural and algorithmic results around the scaffolding problem, which occurs prominently in next generation sequencing. The problem can be formalized as an optimization problem on a special graph, the "scaffold graph". We prove that the problem is polynomial if this graph is a tree by providing a dynamic programming algorithm for this case. This algorithm serves as a basis to deduce an exact algorithm for general graphs using a tree decomposition of the input. We explore other structural parameters, proving a linear-size problem kernel with respect to the size of a feedback-edge set on a restricted version of Scaffolding. Finally, we examine some parameters of scaffold graphs, which are based on real-world genomes, revealing that the feedback edge set is significantly smaller than the input size.
本文围绕支架问题给出了新的结构和算法结果,该问题在下一代测序中显著出现。此问题可形式化为关于一种特殊图即“支架图”的优化问题。我们通过为该情况提供一种动态规划算法,证明了如果此图是一棵树,那么该问题是多项式时间可解的。该算法作为使用输入的树分解来推导一般图的精确算法的基础。我们探索了其他结构参数,证明了在支架问题的一个受限版本上相对于反馈边集大小的线性规模问题核。最后,我们研究了基于真实世界基因组的支架图的一些参数,发现反馈边集明显小于输入大小。