Department of Biochemistry, State University of New York at Buffalo, Buffalo, NY 14203, USA.
Bioinformatics. 2012 Apr 1;28(7):1021-3. doi: 10.1093/bioinformatics/bts063. Epub 2012 Feb 2.
Extending mapped sequence tags is a common step in the analysis of single-end next-generation sequencing (NGS) data from protein localization and chromatin studies. The optimal extension can vary with experimental and technical conditions, and improper extension of sequence tags can obscure NGS results or lead to their misinterpretation. We present ArchTEx (Architectural Tag Extender), an algorithm that identifies the optimal extension of sequence tags from the maximum correlation between forward and reverse tags, then uses the predicted extension to extract and visualize sites of interest.
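To make the idea of choosing an extension from the maximum forward/reverse correlation concrete, the following is a minimal sketch and not ArchTEx's actual code. It assumes tag 5' ends have already been counted per base position into two arrays, one per strand, and it swaps in a plain strand cross-correlation: the Pearson correlation is computed between the forward counts and the reverse counts at each candidate offset, and the offset with the highest correlation is returned. The class name `TagExtensionSketch`, the methods `correlationAtShift` and `bestShift`, and the `maxShift` parameter are all hypothetical names introduced for illustration.

```java
// Illustrative sketch only: a generic strand cross-correlation, NOT the ArchTEx implementation.
// All names here (TagExtensionSketch, correlationAtShift, bestShift, maxShift) are hypothetical.
final class TagExtensionSketch {

    /**
     * Pearson correlation between forward[i] and reverse[i + shift]
     * over the window where both indices are valid.
     * Returns NaN if the window is too short or either series is constant.
     */
    static double correlationAtShift(int[] forward, int[] reverse, int shift) {
        int n = Math.min(forward.length, reverse.length) - shift;
        if (n < 2) {
            return Double.NaN;
        }
        double meanF = 0.0;
        double meanR = 0.0;
        for (int i = 0; i < n; i++) {
            meanF += forward[i];
            meanR += reverse[i + shift];
        }
        meanF /= n;
        meanR /= n;

        double cov = 0.0;
        double varF = 0.0;
        double varR = 0.0;
        for (int i = 0; i < n; i++) {
            double df = forward[i] - meanF;
            double dr = reverse[i + shift] - meanR;
            cov += df * dr;
            varF += df * df;
            varR += dr * dr;
        }
        if (varF == 0.0 || varR == 0.0) {
            return Double.NaN;
        }
        return cov / Math.sqrt(varF * varR);
    }

    /**
     * Scans shifts 0..maxShift and returns the one with the highest
     * forward/reverse correlation, or -1 if no shift gives a defined value.
     */
    static int bestShift(int[] forward, int[] reverse, int maxShift) {
        int best = -1;
        double bestCorr = Double.NEGATIVE_INFINITY;
        for (int shift = 0; shift <= maxShift; shift++) {
            double c = correlationAtShift(forward, reverse, shift);
            if (!Double.isNaN(c) && c > bestCorr) {
                bestCorr = c;
                best = shift;
            }
        }
        return best;
    }
}
```

Calling `TagExtensionSketch.bestShift(forwardCounts, reverseCounts, 60)` on two per-position count arrays returns the offset at which the strands agree best. How that offset maps onto the extension applied to each tag depends on read length and on whether counts are taken at 5' ends, so treating it directly as the extension is an assumption of this sketch.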
ArchTEx requires Java 1.6 or newer. Source code and the compiled program are freely available at http://sourceforge.net/projects/archtex/.
Supplementary data are available at Bioinformatics online.