Huan Jia-Ming, Wang Xiao-Jie, Li Yuan, Zhang Shi-Jun, Hu Yuan-Long, Li Yun-Lun
First School of Clinical Medicine, Shandong University of Traditional Chinese Medicine, Jinan, 250355, China.
Department of Cardiovascular, Affiliated Hospital of Shandong University of Traditional Chinese Medicine, Jinan, 250014, China.
BioData Min. 2024 May 21;17(1):13. doi: 10.1186/s13040-024-00365-1.
A knowledge graph can effectively showcase the essential characteristics of data and is increasingly emerging as a significant means of integrating information in the field of artificial intelligence. Coronary artery plaque represents a significant etiology of cardiovascular events, posing a diagnostic challenge for clinicians who are confronted with a multitude of nonspecific symptoms. To visualize the hierarchical relationship network graph of the molecular mechanisms underlying plaque properties and symptom phenotypes, patient symptomatology was extracted from electronic health record data from real-world clinical settings. Phenotypic networks were constructed utilizing clinical data and protein‒protein interaction networks. Machine learning techniques, including convolutional neural networks, Dijkstra's algorithm, and gene ontology semantic similarity, were employed to quantify clinical and biological features within the network. The resulting features were then utilized to train a K-nearest neighbor model, yielding 23 symptoms, 41 association rules, and 61 hub genes across the three types of plaques studied, achieving an area under the curve of 92.5%. Weighted correlation network analysis and pathway enrichment were subsequently utilized to identify lipid status-related genes and inflammation-associated pathways that could help explain the differences in plaque properties. To confirm the validity of the network graph model, we conducted coexpression analysis of the hub genes to evaluate their potential diagnostic value. Additionally, we investigated immune cell infiltration, examined the correlations between hub genes and immune cells, and validated the reliability of the identified biological pathways. By integrating clinical data and molecular network information, this biomedical knowledge graph model effectively elucidated the potential molecular mechanisms that collude symptoms, diseases, and molecules.
知识图谱能够有效地展示数据的基本特征,并且日益成为人工智能领域中整合信息的重要手段。冠状动脉斑块是心血管事件的重要病因,给面临众多非特异性症状的临床医生带来了诊断挑战。为了可视化斑块特性和症状表型背后分子机制的层次关系网络图,从真实临床环境的电子健康记录数据中提取了患者症状。利用临床数据和蛋白质-蛋白质相互作用网络构建了表型网络。采用包括卷积神经网络、迪杰斯特拉算法和基因本体语义相似性在内的机器学习技术,对网络中的临床和生物学特征进行量化。然后利用所得特征训练一个K近邻模型,在所研究的三种类型斑块中得出23种症状、41条关联规则和61个枢纽基因,曲线下面积达到92.5%。随后利用加权相关网络分析和通路富集来识别与脂质状态相关的基因和与炎症相关的通路,这些基因和通路有助于解释斑块特性的差异。为了证实网络图模型的有效性,我们对枢纽基因进行了共表达分析,以评估它们的潜在诊断价值。此外,我们研究了免疫细胞浸润,检查了枢纽基因与免疫细胞之间的相关性,并验证了所确定生物学通路的可靠性。通过整合临床数据和分子网络信息,这个生物医学知识图谱模型有效地阐明了勾结症状、疾病和分子的潜在分子机制。