Kruglyak S
School of Operations Research and Industrial Engineering, Cornell University, Ithaca, New York 14853-3801, USA.
J Comput Biol. 1998 Spring;5(1):165-71. doi: 10.1089/cmb.1998.5.165.
The sequencing of DNA is an important and difficult problem. Many interesting algorithms combine various technologies in an attempt to sequence long regions of DNA. One such algorithm is sequencing by hybridization (SBH). We briefly review SBH and mention the drawbacks that prevent it from being used in practice. We then present a theoretical algorithm that uniquely determines a sequence of length n through hybridization experiments that require the examination of only O(n2log(n)) subsequences. The key idea is to double subsequence length in each iteration of the algorithm. There are various problems associated with transforming the theoretical algorithm into a practical biological procedure. However, the general strategy of increasing subsequence length may be used to develop algorithms that are feasible given the current state of technology. Combining this strategy with a computer processing phase leads to a novel method of extending the resolving power of standard SBH techniques.
DNA测序是一个重要且困难的问题。许多有趣的算法结合了各种技术,试图对长片段DNA进行测序。其中一种算法是杂交测序法(SBH)。我们简要回顾一下SBH,并提及阻碍其在实际中应用的缺点。然后,我们提出一种理论算法,该算法通过仅需检查O(n²log(n))个子序列的杂交实验来唯一确定长度为n的序列。关键思想是在算法的每次迭代中使子序列长度翻倍。将理论算法转化为实际生物学程序存在各种问题。然而,增加子序列长度的总体策略可用于开发在当前技术状态下可行的算法。将该策略与计算机处理阶段相结合,可得到一种扩展标准SBH技术分辨能力的新方法。