Servant Nicolas, Varoquaux Nelle, Lajoie Bryan R, Viara Eric, Chen Chong-Jian, Vert Jean-Philippe, Heard Edith, Dekker Job, Barillot Emmanuel
Institut Curie, Paris, France.
INSERM, U900, Paris, France.
Genome Biol. 2015 Dec 1;16:259. doi: 10.1186/s13059-015-0831-x.
HiC-Pro is an optimized and flexible pipeline for processing Hi-C data from raw reads to normalized contact maps. HiC-Pro maps reads, detects valid ligation products, performs quality controls and generates intra- and inter-chromosomal contact maps. It includes a fast implementation of the iterative correction method and is based on a memory-efficient data format for Hi-C contact maps. In addition, HiC-Pro can use phased genotype data to build allele-specific contact maps. We applied HiC-Pro to different Hi-C datasets, demonstrating its ability to easily process large data in a reasonable time. Source code and documentation are available at http://github.com/nservant/HiC-Pro .
HiC-Pro是一个经过优化且灵活的流程,用于处理从原始读段到标准化接触图谱的Hi-C数据。HiC-Pro对读段进行定位,检测有效的连接产物,执行质量控制,并生成染色体内和染色体间的接触图谱。它包含迭代校正方法的快速实现,并基于一种内存高效的数据格式来存储Hi-C接触图谱。此外,HiC-Pro可以使用分阶段的基因型数据来构建等位基因特异性接触图谱。我们将HiC-Pro应用于不同的Hi-C数据集,证明了它能够在合理的时间内轻松处理大数据。源代码和文档可在http://github.com/nservant/HiC-Pro获取。