Hubei Provincial Hospital of Traditional Chinese Medicine, Wuhan 430061, China; The clinical medical college of Traditional Chinese Medicine, Hubei University of Traditional Chinese Medicine, Wuhan 430065, China.
School of Computer and Information Technology and Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing 100044, China.
Comput Methods Programs Biomed. 2019 Jun;174:41-50. doi: 10.1016/j.cmpb.2018.02.014. Epub 2018 Feb 22.
Liver disease is a multifactorial complex disease with high global prevalence and poor long-term clinical efficacy and liver disease patients with different comorbidities often incorporate multiple phenotypes in the clinic. Thus, there is a pressing need to improve understanding of the complexity of clinical liver population to help gain more accurate disease subtypes for personalized treatment.
Individualized treatment of the traditional Chinese medicine (TCM) provides a theoretical basis to the study of personalized classification of complex diseases. Utilizing the TCM clinical electronic medical records (EMRs) of 6475 liver inpatient cases, we built a liver disease comorbidity network (LDCN) to show the complicated associations between liver diseases and their comorbidities, and then constructed a patient similarity network with shared symptoms (PSN). Finally, we identified liver patient subgroups using community detection methods and performed enrichment analyses to find both distinct clinical and molecular characteristics (with the phenotype-genotype associations and interactome networks) of these patient subgroups.
From the comorbidity network, we found that clinical liver patients have a wide range of disease comorbidities, in which the basic liver diseases (e.g. hepatitis b, decompensated liver cirrhosis), and the common chronic diseases (e.g. hypertension, type 2 diabetes), have high degree of disease comorbidities. In addition, we identified 303 patient modules (representing the liver patient subgroups) from the PSN, in which the top 6 modules with large number of cases include 51.68% of the whole cases and 251 modules contain only 10 or fewer cases, which indicates the manifestation diversity of liver diseases. Finally, we found that the patient subgroups actually have distinct symptom phenotypes, disease comorbidity characteristics and their underlying molecular pathways, which could be used for understanding the novel disease subtypes of liver conditions. For example, three patient subgroups, namely Module 6 (M6, n = 638), M2 (n = 623) and M1 (n = 488) were associated to common chronic liver disease conditions (hepatitis, cirrhosis, hepatocellular carcinoma). Meanwhile, patient subgroups of M30 (n = 36) and M36 (n = 37) were mostly related to acute gastroenteritis and upper respiratory infection, respectively, which reflected the individual comorbidity characteristics of liver subgroups. Furthermore, we identified the distinct genes and pathways of patient subgroups and the basic liver diseases (hepatitis b and cirrhosis), respectively. The high degree of overlapping pathways between them (e.g. M36 with 93.33% shared enriched pathways) indicates the underlying molecular network mechanisms of each patient subgroup.
Our results demonstrate the utility and comprehensiveness of disease classification study based on community detection of patient network using shared TCM symptom phenotypes and it can be used to other more complex diseases.
肝脏疾病是一种具有全球高发率、临床疗效差的多因素复杂疾病,且患有不同合并症的肝脏疾病患者在临床上常表现出多种表型。因此,迫切需要提高对临床肝脏人群复杂性的认识,以帮助获得更准确的疾病亚型,从而实现个体化治疗。
中药个体化治疗为复杂疾病的个体化分类研究提供了理论依据。我们利用 6475 例肝脏住院患者的中医临床电子病历(EMR)构建了肝脏疾病合并症网络(LDCN),以显示肝脏疾病及其合并症之间复杂的关联,然后构建了具有共享症状的患者相似网络(PSN)。最后,我们使用社区检测方法识别肝脏患者亚组,并进行富集分析,以发现这些患者亚组的独特临床和分子特征(包括表型-基因型关联和互作网络)。
从合并症网络中,我们发现临床肝脏患者存在广泛的疾病合并症,其中基本肝脏疾病(如乙型肝炎、失代偿性肝硬化)和常见慢性疾病(如高血压、2 型糖尿病)具有较高的疾病合并症程度。此外,我们从 PSN 中识别出 303 个患者模块(代表肝脏患者亚组),其中前 6 个模块包含大量病例,占总病例的 51.68%,而 251 个模块仅包含 10 个或更少的病例,这表明肝脏疾病的表现多样性。最后,我们发现患者亚组实际上具有不同的症状表型、疾病合并症特征及其潜在的分子途径,可用于理解肝脏疾病的新型疾病亚型。例如,三个患者亚组,即模块 6(M6,n=638)、M2(n=623)和 M1(n=488)与常见慢性肝病(肝炎、肝硬化、肝癌)有关。同时,M30(n=36)和 M36(n=37)患者亚组分别主要与急性胃肠炎和上呼吸道感染有关,这反映了肝脏亚组的个体合并症特征。此外,我们鉴定了患者亚组和基本肝脏疾病(乙型肝炎和肝硬化)的不同基因和途径。它们之间高度重叠的途径(例如,M36 有 93.33%的共享富集途径)表明了每个患者亚组的潜在分子网络机制。
我们的研究结果表明,基于共享中医症状表型的患者网络的社区检测进行疾病分类研究具有实用性和全面性,可用于其他更复杂的疾病。