Wang Pingzhang, Han Wenling, Ma Dalong
Department of Immunology, School of Basic Medical Sciences, Peking University Health Science Center, Beijing 100191, China;
Peking University Center for Human Disease Genomics, Beijing 100191, China; and
J Immunol. 2017 Dec 15;199(12):4155-4164. doi: 10.4049/jimmunol.1700946. Epub 2017 Nov 1.
Immune cells are highly plastic in both gene expression and cell phenotype. We have established a method of gene expressional plasticity and virtual sorting to evaluate immune cell subpopulations and their characteristic genes in human CD4 T cells. In this study, we continued to investigate the informatics mechanism on the effectiveness of virtual sorting. We found that virtual sorting had an overall positive correlation to the Pearson correlation in the identification of positively correlated genes. However, owing to nonlinear biological anticorrelation, virtual sorting showed a distinctive advantage for anticorrelated genes, suggesting an important role in the identification of negative regulators. In addition, based on virtual sorting results, we identified two basic gene sets among highly plastic genes, i.e., highly plastic cell cycle-associated molecules and highly plastic immune and defense response-associated molecules. Genes within each set tended to be positively connected, but genes between two sets were often anticorrelated. Further analysis revealed preferential transcription factor binding motifs existed between highly plastic cell cycle-associated molecules and highly plastic immune and defense response-associated molecules. Our results strongly suggested predetermined regulation, which was called an immune cell internal phenotype, should exist and could be mined by virtual sorting analysis. This provided efficient functional clues to study immune cell phenotypes and their regulation. Moreover, the current substantial virtual sorting results in both CD4 T cells and B cells provide a useful resource for big-data-driven experimental studies and knowledge discoveries.
免疫细胞在基因表达和细胞表型方面具有高度可塑性。我们建立了一种基因表达可塑性和虚拟分选方法,以评估人类CD4 T细胞中的免疫细胞亚群及其特征基因。在本研究中,我们继续探究虚拟分选有效性的信息学机制。我们发现,在识别正相关基因方面,虚拟分选与皮尔逊相关性总体呈正相关。然而,由于非线性生物反相关性,虚拟分选在反相关基因方面显示出独特优势,表明其在识别负调控因子中发挥重要作用。此外,基于虚拟分选结果,我们在高度可塑性基因中鉴定出两个基本基因集,即高度可塑性细胞周期相关分子和高度可塑性免疫与防御反应相关分子。每个基因集内的基因倾向于正向连接,但两个基因集之间的基因通常呈反相关。进一步分析揭示,高度可塑性细胞周期相关分子和高度可塑性免疫与防御反应相关分子之间存在优先转录因子结合基序。我们的结果有力地表明,应该存在一种预先确定的调控,即免疫细胞内部表型,并且可以通过虚拟分选分析挖掘出来。这为研究免疫细胞表型及其调控提供了有效的功能线索。此外,目前在CD4 T细胞和B细胞中大量的虚拟分选结果为大数据驱动的实验研究和知识发现提供了有用资源。