MOX-Department of Mathematics, Politecnico di Milano, Milan 20133, Italy.
Center for Genomic Science of IIT@SEMM, Fondazione Istituto Italiano di Tecnologia, Milan 20139, Italy.
Bioinformatics. 2017 Aug 15;33(16):2570-2572. doi: 10.1093/bioinformatics/btx201.
Chromatin Immunoprecipitation followed by sequencing (ChIP-seq) generates local accumulations of sequencing reads on the genome ("peaks"), which correspond to specific protein-DNA interactions or chromatin modifications. Peaks are detected by considering their total area above a background signal, usually neglecting their shapes, which instead may convey additional biological information. We present FunChIP, an R/Bioconductor package for clustering peaks according to a functional representation of their shapes: after approximating their profiles with cubic B-splines, FunChIP minimizes their functional distance and classifies the peaks applying a k-mean alignment and clustering algorithm. The whole pipeline is user-friendly and provides visualization functions for a quick inspection of the results. An application to the transcription factor Myc in 3T9 murine fibroblasts shows that clusters of peaks with different shapes are associated with different genomic locations and different transcriptional regulatory activity.
The package is implemented in R and is available under Artistic Licence 2.0 from the Bioconductor website (http://bioconductor.org/packages/FunChIP).
Supplementary data are available at Bioinformatics online.
染色质免疫沉淀测序(ChIP-seq)在基因组上产生测序reads 的局部聚集("峰"),这些峰对应于特定的蛋白质-DNA 相互作用或染色质修饰。通过考虑其在背景信号之上的总区域来检测峰,通常忽略其形状,而形状可能传递额外的生物学信息。我们提出了 FunChIP,这是一个用于根据其形状的功能表示对峰进行聚类的 R/Bioconductor 包:在用三次 B 样条近似其轮廓后,FunChIP 最小化它们的功能距离,并应用 k-均值对齐和聚类算法对峰进行分类。整个管道用户友好,并提供可视化功能,可快速检查结果。在 3T9 鼠成纤维细胞中的转录因子 Myc 的应用表明,具有不同形状的峰簇与不同的基因组位置和不同的转录调控活性相关。
该软件包是用 R 语言实现的,根据 Artistic Licence 2.0 版可在 Bioconductor 网站(http://bioconductor.org/packages/FunChIP)获得。
补充数据可在 Bioinformatics 在线获得。