Li Zhandong, Wang Deling, Guo Wei, Zhang Shiqi, Chen Lei, Zhang Yu-Hang, Lu Lin, Pan XiaoYong, Huang Tao, Cai Yu-Dong
College of Biological and Food Engineering, Jilin Engineering Normal University, Changchun, China.
State Key Laboratory of Oncology in South China, Department of Radiology, Collaborative Innovation Center for Cancer Medicine, Sun Yat-sen University Cancer Center, Guangzhou, China.
Front Neurosci. 2022 Jul 15;16:841145. doi: 10.3389/fnins.2022.841145. eCollection 2022.
Mammalian cortical interneurons (CINs) could be classified into more than two dozen cell types that possess diverse electrophysiological and molecular characteristics, and participate in various essential biological processes in the human neural system. However, the mechanism to generate diversity in CINs remains controversial. This study aims to predict CIN diversity in mouse embryo by using single-cell transcriptomics and the machine learning methods. Data of 2,669 single-cell transcriptome sequencing results are employed. The 2,669 cells are classified into three categories, caudal ganglionic eminence (CGE) cells, dorsal medial ganglionic eminence (dMGE) cells, and ventral medial ganglionic eminence (vMGE) cells, corresponding to the three regions in the mouse subpallium where the cells are collected. Such transcriptomic profiles were first analyzed by the minimum redundancy and maximum relevance method. A feature list was obtained, which was further fed into the incremental feature selection, incorporating two classification algorithms (random forest and repeated incremental pruning to produce error reduction), to extract key genes and construct powerful classifiers and classification rules. The optimal classifier could achieve an MCC of 0.725, and category-specified prediction accuracies of 0.958, 0.760, and 0.737 for the CGE, dMGE, and vMGE cells, respectively. The related genes and rules may provide helpful information for deepening the understanding of CIN diversity.
哺乳动物的皮层中间神经元(CINs)可分为二十多种细胞类型,它们具有多样的电生理和分子特征,并参与人类神经系统中的各种重要生物学过程。然而,CINs产生多样性的机制仍存在争议。本研究旨在通过使用单细胞转录组学和机器学习方法来预测小鼠胚胎中的CIN多样性。采用了2669个单细胞转录组测序结果的数据。这2669个细胞被分为三类,即尾侧神经节隆起(CGE)细胞、背内侧神经节隆起(dMGE)细胞和腹内侧神经节隆起(vMGE)细胞,它们对应于从小鼠大脑皮质下收集细胞的三个区域。此类转录组图谱首先通过最小冗余最大相关方法进行分析。获得了一个特征列表,该列表进一步输入到增量特征选择中,并结合两种分类算法(随机森林和重复增量剪枝以减少错误),以提取关键基因并构建强大的分类器和分类规则。最优分类器的马修斯相关系数(MCC)可达0.725,对CGE、dMGE和vMGE细胞的类别特异性预测准确率分别为0.958、0.760和0.737。相关基因和规则可能为深化对CIN多样性的理解提供有用信息。