Department of Computer Science, Colorado State University, 1873 Campus Delivery, Fort Collins, CO 80523-1873, USA.
IEEE/ACM Trans Comput Biol Bioinform. 2012 Nov-Dec;9(6):1843-6. doi: 10.1109/TCBB.2012.84.
Given a set S of n strings, each of length l, and a nonnegative value d, we define a center string as a string of length l that has Hamming distance at most d from each string in S. The #CLOSEST STRING problem aims to determine the number of center strings for a given set of strings S and input parameters n, l, and d. We show #CLOSEST STRING is impossible to solve exactly or even approximately in polynomial time, and that restricting #CLOSEST STRING so that any one of the parameters n, ‘, or d is fixed leads to a fully polynomial-time randomized approximation scheme (FPRAS). We show equivalent results for the problem of efficiently sampling center strings uniformly at random (u.a.r.).
给定一个由 n 个长度为 l 的字符串组成的集合 S,以及一个非负值 d,我们将中心字符串定义为与 S 中的每个字符串的汉明距离不超过 d 的长度为 l 的字符串。最近字符串问题旨在确定给定字符串集合 S 和输入参数 n、l 和 d 的中心字符串的数量。我们表明,最近字符串问题既不可能精确求解,也不可能在多项式时间内近似求解,并且将最近字符串问题限制为使得参数 n、'或 d 中的任何一个固定,会导致完全多项式时间随机逼近方案(FPRAS)。我们还展示了该问题的等效结果,即高效地随机均匀抽样中心字符串(u.a.r.)。