Institute of Plant Biology and Zürich-Basel Plant Science Center, University of Zurich, 8008 Zürich, Switzerland.
Bioinformatics. 2015 Feb 1;31(3):436-7. doi: 10.1093/bioinformatics/btu680. Epub 2014 Oct 15.
Analysis of differential gene expression by RNA sequencing (RNA-Seq) is frequently done using feature counts, i.e. the number of reads mapping to a gene. However, commonly used count algorithms (e.g. HTSeq) do not address the problem of reads aligning with multiple locations in the genome (multireads) or reads aligning with positions where two or more genes overlap (ambiguous reads). Rcount specifically addresses these issues. Furthermore, Rcount allows the user to assign priorities to certain feature types (e.g. higher priority for protein-coding genes compared to rRNA-coding genes) or to add flanking regions.
Rcount provides a fast and easy-to-use graphical user interface requiring no command line or programming skills. It is implemented in C++ using the SeqAn (www.seqan.de) and the Qt libraries (qt-project.org). Source code and 64 bit binaries for (Ubuntu) Linux, Windows (7) and MacOSX are released under the GPLv3 license and are freely available on github.com/MWSchmid/Rcount.
Test data, genome annotation files, useful Python and R scripts and a step-by-step user guide (including run-time and memory usage tests) are available on github.com/MWSchmid/Rcount.
通过 RNA 测序(RNA-Seq)进行差异基因表达分析时,通常使用特征计数,即映射到基因的读取数。然而,常用的计数算法(例如 HTSeq)并未解决读取与基因组中多个位置(多读取)或与两个或多个基因重叠的位置(模糊读取)对齐的问题。Rcount 专门解决了这些问题。此外,Rcount 允许用户为某些特征类型分配优先级(例如,与 rRNA 编码基因相比,对蛋白质编码基因的优先级更高)或添加侧翼区域。
Rcount 提供了一个快速易用的图形用户界面,无需命令行或编程技能。它使用 C++ 实现,使用 SeqAn(www.seqan.de)和 Qt 库(qt-project.org)。(Ubuntu)Linux、Windows(7)和 MacOSX 的源代码和 64 位二进制文件根据 GPLv3 许可证发布,可在 github.com/MWSchmid/Rcount 上免费获得。
测试数据、基因组注释文件、有用的 Python 和 R 脚本以及逐步用户指南(包括运行时和内存使用测试)可在 github.com/MWSchmid/Rcount 上获得。