Li Ye Henry, Li Dangna, Samusik Nikolay, Wang Xiaowei, Guan Leying, Nolan Garry P, Wong Wing Hung
Structural Biology Department and Public Policy Program, Stanford University, Stanford, United States of America.
Institute for Computational and Mathematical Engineering, Stanford University, Stanford, United States of America.
PLoS Comput Biol. 2017 Dec 27;13(12):e1005875. doi: 10.1371/journal.pcbi.1005875. eCollection 2017 Dec.
Mass cytometry (CyTOF) has greatly expanded the capability of cytometry. It is now easy to generate multiple CyTOF samples in a single study, with each sample containing single-cell measurement on 50 markers for more than hundreds of thousands of cells. Current methods do not adequately address the issues concerning combining multiple samples for subpopulation discovery, and these issues can be quickly and dramatically amplified with increasing number of samples. To overcome this limitation, we developed Partition-Assisted Clustering and Multiple Alignments of Networks (PAC-MAN) for the fast automatic identification of cell populations in CyTOF data closely matching that of expert manual-discovery, and for alignments between subpopulations across samples to define dataset-level cellular states. PAC-MAN is computationally efficient, allowing the management of very large CyTOF datasets, which are increasingly common in clinical studies and cancer studies that monitor various tissue samples for each subject.
质谱流式细胞术(CyTOF)极大地扩展了细胞计数的能力。现在,在一项研究中轻松生成多个CyTOF样本变得很容易,每个样本包含对超过数十万个细胞的50个标志物进行单细胞测量。当前方法无法充分解决与组合多个样本以发现亚群相关的问题,并且随着样本数量的增加,这些问题会迅速且显著地放大。为了克服这一限制,我们开发了网络分区辅助聚类和多重比对(PAC-MAN)方法,用于在CyTOF数据中快速自动识别与专家手动发现紧密匹配的细胞群体,并用于跨样本的亚群之间的比对,以定义数据集水平的细胞状态。PAC-MAN计算效率高,能够管理非常大的CyTOF数据集,这在监测每个受试者的各种组织样本的临床研究和癌症研究中越来越常见。