Huang W, Yin Z, Fuhrmann D R, States D J, Thomas L J
Biomedical Computer Laboratory, Washington University, St. Louis, MO 63130, USA.
Electrophoresis. 1997 Jan;18(1):23-5. doi: 10.1002/elps.1150180106.
In a previous paper (Yin et al., Electrophoresis 1996, 17, 1143-1150), an automated method for matrix determination in four-dye fluorescence-based DNA sequencing was presented. As a continuation of that work, we have developed an alternative method to estimate the matrix from raw sequence data. The method uses an iterative clustering technique to associate each 4 x 1 data vector with one column of the desired filter matrix, using Kullback's I-divergence as a distance measure. The method requires less preprocessing of the data and less computation than the approach described by Yin et al. (Electrophoresis 1996, 17, 1143-1150). An example demonstrating applicability of the proposed method to Applied Biosystems sequencer data is given.
在之前的一篇论文中(Yin等人,《电泳》,1996年,第17卷,第1143 - 1150页),提出了一种用于基于四染料荧光的DNA测序中矩阵测定的自动化方法。作为该项工作的延续,我们开发了一种从原始序列数据估计矩阵的替代方法。该方法使用迭代聚类技术,以库尔贝克信息散度作为距离度量,将每个4×1数据向量与所需滤波矩阵的一列相关联。与Yin等人(《电泳》,1996年,第17卷,第1143 - 1150页)所描述的方法相比,该方法对数据的预处理要求更低,计算量也更少。给出了一个示例,展示了所提方法对应用生物系统公司测序仪数据的适用性。