Aylor David L, Price Eric W, Carbone Ignazio
Bioinformatics Research Center, North Carolina State University, Raleigh, NC 27695, USA.
Bioinformatics. 2006 Jun 1;22(11):1399-401. doi: 10.1093/bioinformatics/btl136. Epub 2006 Apr 6.
We have added two software tools to our Suite of Nucleotide Analysis Programs (SNAP) for working with DNA sequences sampled from populations. SNAP Map collapses DNA sequence data into unique haplotypes, extracts variable sites and manipulates output into multiple formats for input into existing software packages for evolutionary analyses. Map collapses DNA sequence data into unique haplotypes, extracts variable sites and manipulates output into multiple formats for input into existing software packages for evolutionary analyses. Map includes novel features such as recoding insertions or deletions, including or excluding variable sites that violate an infinite-sites model and the option of collapsing sequences with corresponding phenotypic information, important in testing for significant haplotype-phenotype associations. SNAP Combine merges multiple DNA sequence alignments into a single multiple alignment file. The resulting file can be the union or intersection of the input files. SNAP Combine currently reads from and writes to several sequence alignment file formats including both sequential and interleaved formats. Combine also keeps track of the start and end positions of each separate alignment file allowing the user to exclude variable sites or taxa, important in creating input files for multilocus analyses.
我们已在核苷酸分析程序套件(SNAP)中添加了两个软件工具,用于处理从群体中采样的DNA序列。SNAP Map将DNA序列数据归纳为独特的单倍型,提取可变位点,并将输出处理为多种格式,以便输入到现有的用于进化分析的软件包中。Map将DNA序列数据归纳为独特的单倍型,提取可变位点,并将输出处理为多种格式,以便输入到现有的用于进化分析的软件包中。Map具有一些新特性,如对插入或缺失进行重新编码,包括或排除违反无限位点模型的可变位点,以及将序列与相应表型信息合并的选项,这在检测显著的单倍型-表型关联时很重要。SNAP Combine将多个DNA序列比对合并为一个单一的多序列比对文件。生成的文件可以是输入文件的并集或交集。SNAP Combine目前支持读取和写入多种序列比对文件格式,包括连续格式和交错格式。Combine还会记录每个单独比对文件的起始和结束位置,允许用户排除可变位点或分类单元,这在创建多位点分析的输入文件时很重要。