Torney D C
Theoretical Biology and Biophysics, Los Alamos National Laboratory, NM 87545.
J Mol Biol. 1991 Jan 20;217(2):259-64. doi: 10.1016/0022-2836(91)90540-m.
Theoretical predictions are given for the progress expected, when mapping DNA by identifying clones containing specific unique sequences. Progress is measured in three ways; however, all results depend on (dimensionless counterparts of) the number of clones and the number of unique sequences used. Furthermore, the effects of clone length dispersion are included in the theoretical predictions. Both the clones in the library and the unique sequences are assumed to be generated randomly, with uniform probability of originating at any base in the region to be mapped. The first measure of progress is the expected length fraction of the region to be mapped covered by at least one clone, when clones containing at least one unique sequence are included in the map. The second measure of progress is the expected length fraction of the region to be mapped in "covered intervals", an interval being the region between adjacent unique sequences. Alternative definitions for clones covering an interval are analyzed. The third measure of progress is the expected number of clone islands generated; an island covers successive intervals. Finally, using these measures of progress, we compare the efficiency of this new mapping strategy with conventional clone mapping strategies.
针对通过识别包含特定独特序列的克隆来绘制DNA图谱时预期取得的进展给出了理论预测。进展通过三种方式衡量;然而,所有结果都取决于克隆数量和所使用的独特序列数量(无量纲对应物)。此外,克隆长度分散的影响也包含在理论预测中。假设文库中的克隆和独特序列都是随机产生的,在待绘制区域的任何碱基处起始的概率均等。进展的第一个衡量标准是当包含至少一个独特序列的克隆被纳入图谱时,待绘制区域中至少被一个克隆覆盖的预期长度比例。进展的第二个衡量标准是待绘制区域在“覆盖区间”中的预期长度比例,一个区间是相邻独特序列之间的区域。分析了覆盖一个区间的克隆的替代定义。进展的第三个衡量标准是产生的克隆岛的预期数量;一个岛覆盖连续的区间。最后,使用这些进展衡量标准,我们将这种新的图谱绘制策略的效率与传统克隆图谱绘制策略进行比较。