Department of Biomedical Informatics, University of Cincinnati, Cincinnati, OH, USA.
Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA.
Nucleic Acids Res. 2019 Dec 2;47(21):e138. doi: 10.1093/nar/gkz789.
To understand the molecular pathogenesis of human disease, precision analyses to define alterations within and between disease-associated cell populations are desperately needed. Single-cell genomics represents an ideal platform to enable the identification and comparison of normal and diseased transcriptional cell populations. We created cellHarmony, an integrated solution for the unsupervised analysis, classification, and comparison of cell types from diverse single-cell RNA-Seq datasets. cellHarmony efficiently and accurately matches single-cell transcriptomes using a community-clustering and alignment strategy to compute differences in cell-type specific gene expression over potentially dozens of cell populations. Such transcriptional differences are used to automatically identify distinct and shared gene programs among cell-types and identify impacted pathways and transcriptional regulatory networks to understand the impact of perturbations at a systems level. cellHarmony is implemented as a python package and as an integrated workflow within the software AltAnalyze. We demonstrate that cellHarmony has improved or equivalent performance to alternative label projection methods, is able to identify the likely cellular origins of malignant states, stratify patients into clinical disease subtypes from identified gene programs, resolve discrete disease networks impacting specific cell-types, and illuminate therapeutic mechanisms. Thus, this approach holds tremendous promise in revealing the molecular and cellular origins of complex disease.
为了理解人类疾病的分子发病机制,非常需要精确的分析来定义与疾病相关的细胞群体内部和之间的变化。单细胞基因组学代表了一种理想的平台,可以实现正常和患病转录细胞群体的识别和比较。我们创建了 cellHarmony,这是一种用于对来自不同单细胞 RNA-Seq 数据集的细胞类型进行无监督分析、分类和比较的集成解决方案。cellHarmony 使用社区聚类和对齐策略来高效、准确地匹配单细胞转录组,以计算潜在数十个细胞群体中细胞类型特异性基因表达的差异。这些转录差异用于自动识别细胞类型之间独特和共享的基因程序,并识别受影响的途径和转录调控网络,以从系统水平理解干扰的影响。cellHarmony 被实现为一个 Python 包,并作为软件 AltAnalyze 中的一个集成工作流程。我们证明,cellHarmony 的性能优于替代的标签投影方法,能够识别恶性状态的可能细胞起源,根据鉴定的基因程序将患者分层为临床疾病亚型,解决影响特定细胞类型的离散疾病网络,并阐明治疗机制。因此,这种方法在揭示复杂疾病的分子和细胞起源方面具有巨大的潜力。