Institute of Computational Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany.
TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany.
Nat Commun. 2021 Sep 1;12(1):5228. doi: 10.1038/s41467-021-25131-3.
EpiScanpy is a toolkit for the analysis of single-cell epigenomic data, namely single-cell DNA methylation and single-cell ATAC-seq data. To address the modality specific challenges from epigenomics data, epiScanpy quantifies the epigenome using multiple feature space constructions and builds a nearest neighbour graph using epigenomic distance between cells. EpiScanpy makes the many existing scRNA-seq workflows from scanpy available to large-scale single-cell data from other -omics modalities, including methods for common clustering, dimension reduction, cell type identification and trajectory learning techniques, as well as an atlas integration tool for scATAC-seq datasets. The toolkit also features numerous useful downstream functions, such as differential methylation and differential openness calling, mapping epigenomic features of interest to their nearest gene, or constructing gene activity matrices using chromatin openness. We successfully benchmark epiScanpy against other scATAC-seq analysis tools and show its outperformance at discriminating cell types.
EpiScanpy 是一个用于分析单细胞表观基因组数据的工具包,即单细胞 DNA 甲基化和单细胞 ATAC-seq 数据。为了解决表观基因组数据的特定模式挑战,epiScanpy 使用多种特征空间构建来量化表观基因组,并使用细胞之间的表观基因组距离构建最近邻图。EpiScanpy 使来自 scanpy 的许多现有的 scRNA-seq 工作流程可用于来自其他 -omics 模式的大规模单细胞数据,包括用于常见聚类、降维、细胞类型识别和轨迹学习技术的方法,以及用于 scATAC-seq 数据集的图谱集成工具。该工具包还具有许多有用的下游功能,例如差异甲基化和差异开放性调用,将感兴趣的表观基因组特征映射到其最近的基因,或使用染色质开放性构建基因活性矩阵。我们成功地对 epiScanpy 进行了基准测试,以评估其与其他 scATAC-seq 分析工具的性能,并展示了其在区分细胞类型方面的优异表现。