Department of Industrial Engineering, University of Houston, Houston, TX, 77204, USA.
Department of Psychiatry and Behavioral Sciences, Johns Hopkins School of Medicine, Baltimore, MD, 21287, USA.
BMC Bioinformatics. 2023 Feb 14;24(1):47. doi: 10.1186/s12859-023-05146-x.
Functional gene networks (FGNs) capture functional relationships among genes that vary across tissues and cell types. Construction of cell-type-specific FGNs enables the understanding of cell-type-specific functional gene relationships and insights into genetic mechanisms of human diseases in disease-relevant cell types. However, most existing FGNs were developed without consideration of specific cell types within tissues.
In this study, we created a multimodal deep learning model (MDLCN) to predict cell-type-specific FGNs in the human brain by integrating single-nuclei gene expression data with global protein interaction networks. We systematically evaluated the prediction performance of the MDLCN and showed its superior performance compared to two baseline models (boosting tree and convolutional neural network). Based on the predicted cell-type-specific FGNs, we observed that cell-type marker genes had a higher level of hubness than non-marker genes in their corresponding cell type. Furthermore, we showed that risk genes underlying autism and Alzheimer's disease were more strongly connected in disease-relevant cell types, supporting the cellular context of predicted cell-type-specific FGNs.
Our study proposes a powerful deep learning approach (MDLCN) to predict FGNs underlying a diverse set of cell types in human brain. The MDLCN model enhances prediction accuracy of cell-type-specific FGNs compared to single modality convolutional neural network (CNN) and boosting tree models, as shown by higher areas under both receiver operating characteristic (ROC) and precision-recall curves for different levels of independent test datasets. The predicted FGNs also show evidence for the cellular context and distinct topological features (i.e. higher hubness and topological score) of cell-type marker genes. Moreover, we observed stronger modularity among disease-associated risk genes in FGNs of disease-relevant cell types. For example, the strength of connectivity among autism risk genes was stronger in neurons, but risk genes underlying Alzheimer's disease were more connected in microglia.
功能基因网络(FGN)捕捉了在组织和细胞类型之间变化的基因之间的功能关系。构建细胞类型特异性 FGN 能够理解细胞类型特异性的功能基因关系,并深入了解与疾病相关的细胞类型中人类疾病的遗传机制。然而,大多数现有的 FGN 是在没有考虑组织内特定细胞类型的情况下开发的。
在这项研究中,我们创建了一种多模态深度学习模型(MDLCN),通过整合单核基因表达数据和全局蛋白质相互作用网络,预测人类大脑中的细胞类型特异性 FGN。我们系统地评估了 MDLCN 的预测性能,并表明其优于两个基线模型(提升树和卷积神经网络)。基于预测的细胞类型特异性 FGN,我们观察到细胞类型标记基因在其相应的细胞类型中比非标记基因具有更高的枢纽度。此外,我们还表明,自闭症和阿尔茨海默病的风险基因在与疾病相关的细胞类型中连接更为紧密,支持预测的细胞类型特异性 FGN 的细胞上下文。
我们的研究提出了一种强大的深度学习方法(MDLCN),可以预测人类大脑中多种细胞类型的 FGN。与单模态卷积神经网络(CNN)和提升树模型相比,MDLCN 模型提高了细胞类型特异性 FGN 的预测准确性,不同独立测试数据集的接收者操作特征(ROC)和精度-召回曲线的面积更高。预测的 FGN 还显示了细胞类型标记基因的细胞上下文和独特拓扑特征(即更高的枢纽度和拓扑得分)的证据。此外,我们观察到在与疾病相关的细胞类型的 FGN 中,疾病相关风险基因之间的模块性更强。例如,自闭症风险基因之间的连接强度在神经元中更强,但阿尔茨海默病的风险基因在小胶质细胞中连接更紧密。