Rouchka E C, States D J
Institute for Biomedical Computing, Washington University, St. Louis, MO 63110-1012, USA.
Proc Int Conf Intell Syst Mol Biol. 1998;6:140-7.
DNA sequence analysis depends on the accurate assembly of fragment reads for the determination of a consensus sequence. This report examines the possibility of analyzing multiple, independent restriction digests as a method for testing the fidelity of sequence assembly. A dynamic programming algorithm to determine the maximum likelihood alignment of error prone electrophoretic mobility data to the expected fragment mobilities given the consensus sequence and restriction enzymes is derived and used to assess the likelihood of detecting rearrangements in genomic sequencing projects. The method is shown to reliably detect errors in sequence fragment assembly without the necessity of making reference to an overlying physical map. An html form-based interface is available at http:/(/)www.ibc.wustl.edu/services/validate. html.
DNA序列分析依赖于对片段读数进行准确组装以确定共有序列。本报告探讨了分析多个独立限制性酶切消化作为测试序列组装保真度方法的可能性。推导了一种动态规划算法,用于在给定共有序列和限制性酶的情况下,确定易出错的电泳迁移率数据与预期片段迁移率的最大似然比对,并用于评估在基因组测序项目中检测重排的可能性。结果表明,该方法能够可靠地检测序列片段组装中的错误,而无需参考上层物理图谱。可通过http:/(/)www.ibc.wustl.edu/services/validate.html获取基于html表单的界面。