Cottingham R W, Idury R M, Schäffer A A
Department of Cell Biology, Baylor College of Medicine, Houston, TX 77030.
Am J Hum Genet. 1993 Jul;53(1):252-63.
Linkage analysis using maximum-likelihood estimation is a powerful tool for locating genes. As available data sets have grown, the computation required for analysis has grown exponentially and become a significant impediment. Others have previously shown that parallel computation is applicable to linkage analysis and can yield order-of-magnitude improvements in speed. In this paper, we demonstrate that algorithmic modifications can also yield order-of-magnitude improvements, and sometimes much more. Using the software package LINKAGE, we describe a variety of algorithmic improvements that we have implemented, demonstrating both how these techniques are applied and their power. Experiments show that these improvements speed up the programs by an order of magnitude, on problems of moderate and large size. All improvements were made only in the combinatorial part of the code, without restoring to parallel computers. These improvements synthesize biological principles with computer science techniques, to effectively restructure the time-consuming computations in genetic linkage analysis.
使用最大似然估计的连锁分析是定位基因的强大工具。随着可用数据集的增长,分析所需的计算呈指数级增长,成为一个重大障碍。其他人此前已表明并行计算适用于连锁分析,并且可以在速度上实现数量级的提升。在本文中,我们证明算法修改也可以带来数量级的提升,有时甚至更多。使用软件包LINKAGE,我们描述了我们实现的各种算法改进,展示了这些技术的应用方式及其强大功能。实验表明,在中等和大型问题上,这些改进将程序速度提高了一个数量级。所有改进仅在代码的组合部分进行,而无需使用并行计算机。这些改进将生物学原理与计算机科学技术相结合,以有效重组遗传连锁分析中耗时的计算。