Gordon Paul M K, Sensen Christoph W
Faculty of Medicine, Department of Biochemistry and Molecular Biology, Sun Center of Excellence for Visual Genomics, University of Calgary, 3330 Hospital Drive NW, Calgary, Alberta T2N 4N1, Canada.
Nucleic Acids Res. 2004 Sep 29;32(17):e133. doi: 10.1093/nar/gnh127.
We have developed a software package called Osprey for the calculation of optimal oligonucleotides for DNA sequencing and the creation of microarrays based on either PCR-products or directly spotted oligomers. It incorporates a novel use of position-specific scoring matrices, for the sensitive and specific identification of secondary binding sites anywhere in the target sequence. Using accelerated hardware is faster and more efficient than the traditional pairwise alignments used in most oligo-design software. Osprey consists of a module for target site selection based on user input, novel utilities for dealing with problematic sequences such as repeats, and a common code base for the identification of optimal oligonucleotides from the target list. Overall, these improvements provide a program that, without major increases in run time, reflects current DNA thermodynamics models, improves specificity and reduces the user's data preprocessing and parameterization requirements. Using a TimeLogic hardware accelerator, we report up to 50-fold reduction in search time versus a linear search strategy. Target sites may be derived from computer analysis of DNA sequence assemblies in the case of sequencing efforts, or genome or EST analysis in the case of microarray development in both prokaryotes and eukaryotes.
我们开发了一个名为Osprey的软件包,用于计算DNA测序的最佳寡核苷酸,并基于PCR产物或直接点样的寡聚物创建微阵列。它采用了位置特异性评分矩阵的新用法,用于灵敏且特异地识别靶序列中任何位置的二级结合位点。使用加速硬件比大多数寡核苷酸设计软件中使用的传统成对比对更快、更高效。Osprey由一个基于用户输入的靶位点选择模块、处理重复等问题序列的新实用程序,以及从靶标列表中识别最佳寡核苷酸的通用代码库组成。总体而言,这些改进提供了一个程序,在运行时间没有大幅增加的情况下,反映了当前的DNA热力学模型,提高了特异性,并减少了用户的数据预处理和参数设置要求。使用TimeLogic硬件加速器,我们报告与线性搜索策略相比,搜索时间减少了高达50倍。在测序工作中,靶位点可以从DNA序列组装的计算机分析中获得;在原核生物和真核生物的微阵列开发中,靶位点可以从基因组或EST分析中获得。