Staden R
Nucleic Acids Res. 1982 Aug 11;10(15):4731-51. doi: 10.1093/nar/10.15.4731.
This paper describes a computer method for handling gel reading data produced by the shotgun method of DNA sequencing. The method greatly reduces the time the sequencer needs to spend checking and editing his data and yet it produces a consensus sequence for which the accuracy of determination of every base can be clearly shown. The program can take a batch of new gel readings, screen them against vector sequences removing any that match, and then compare and align all the sequences to produce a final consensus. No information is lost in this process as alignments are achieved by making only insertions and because all the individual gel readings are added to a database from which they can be retrieved and displayed lined up one above the other. This allows the user to check on the alignments achieved by the program and if necessary change them. As each gel reading is added to the database the consensus is automatically updated accordingly and used for the next comparisons. This is a much faster process than comparing each new gel against every individual gel in the database.
本文描述了一种处理由DNA测序鸟枪法产生的凝胶读数数据的计算机方法。该方法大大减少了测序仪检查和编辑数据所需的时间,并且能生成一个共有序列,每个碱基的测定准确性都能清晰显示。该程序可以接收一批新的凝胶读数,根据载体序列对其进行筛选,去除任何匹配的序列,然后比较并比对所有序列以生成最终的共有序列。在此过程中不会丢失任何信息,因为比对是通过仅进行插入来实现的,并且所有单独的凝胶读数都被添加到一个数据库中,从中可以检索并将它们一个接一个地排列显示。这允许用户检查程序实现的比对情况,并在必要时进行更改。随着每个凝胶读数被添加到数据库中,共有序列会相应地自动更新,并用于下一次比较。这比将每个新凝胶与数据库中的每个单独凝胶进行比较要快得多。