Department of Biomedical Engineering and Institute for Computational Medicine, Johns Hopkins University, Baltimore, MD, USA.
Bioinformatics. 2011 Aug 1;27(15):2147-8. doi: 10.1093/bioinformatics/btr357. Epub 2011 Jun 17.
Thousands of cancer exomes are currently being sequenced, yielding millions of non-synonymous single nucleotide variants (SNVs) of possible relevance to disease etiology. Here, we provide a software toolkit to prioritize SNVs based on their predicted contribution to tumorigenesis. It includes a database of precomputed, predictive features covering all positions in the annotated human exome and can be used either stand-alone or as part of a larger variant discovery pipeline.
MySQL database, source code and binaries freely available for academic/government use at http://wiki.chasmsoftware.org, Source in Python and C++. Requires 32 or 64-bit Linux system (tested on Fedora Core 8,10,11 and Ubuntu 10), 2.5*≤ Python <3.0*, MySQL server >5.0, 60 GB available hard disk space (50 MB for software and data files, 40 GB for MySQL database dump when uncompressed), 2 GB of RAM.
目前正在对数千个人类癌症外显子进行测序,产生了可能与疾病病因学相关的数百万个非同义单核苷酸变异(SNV)。在这里,我们提供了一个软件工具包,用于根据其对肿瘤发生的预测贡献对 SNV 进行优先级排序。它包括一个预计算的、可预测的特征数据库,涵盖了注释人类外显子中的所有位置,可单独使用,也可作为更大的变异发现管道的一部分。
MySQL 数据库、源代码和二进制文件可在 http://wiki.chasmsoftware.org 上免费供学术/政府使用,Python 和 C++中的源代码。需要 32 位或 64 位 Linux 系统(在 Fedora Core 8、10、11 和 Ubuntu 10 上进行了测试),Python < 3.0*,MySQL 服务器 > 5.0,2.5*≤,60GB 可用硬盘空间(软件和数据文件占用 50MB,未压缩时 MySQL 数据库转储占用 40GB),2GB RAM。