Center for the Study of Systems Biology, School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, 30332, USA.
Commun Biol. 2022 Aug 25;5(1):870. doi: 10.1038/s42003-022-03816-9.
To understand the origin of disease comorbidity and to identify the essential proteins and pathways underlying comorbid diseases, we developed LeMeDISCO (Large-Scale Molecular Interpretation of Disease Comorbidity), an algorithm that predicts disease comorbidities from shared mode of action proteins predicted by the artificial intelligence-based MEDICASCY algorithm. LeMeDISCO was applied to predict the occurrence of comorbid diseases for 3608 distinct diseases. Benchmarking shows that LeMeDISCO has much better comorbidity recall than the two molecular methods XD-score (44.5% vs. 6.4%) and the S score (68.6% vs. 8.0%). Its performance is somewhat comparable to the phenotype method-based Symptom Similarity Score, 63.7% vs. 100%, but LeMeDISCO works for far more cases and its large comorbidity recall is attributed to shared proteins that can help provide an understanding of the molecular mechanism(s) underlying disease comorbidity. The LeMeDISCO web server is available for academic users at: http://sites.gatech.edu/cssb/LeMeDISCO .
为了理解疾病共病的起源,并确定共病的基本蛋白质和途径,我们开发了 LeMeDISCO(疾病共病的大规模分子解释),这是一种从基于人工智能的 MEDICASCY 算法预测的共享作用模式蛋白质预测疾病共病的算法。LeMeDISCO 被应用于预测 3608 种不同疾病的共病发生。基准测试表明,LeMeDISCO 的共病召回率远高于 XD-score(44.5% 对 6.4%)和 S 评分(68.6% 对 8.0%)两种分子方法。它的性能与基于表型的症状相似性评分有些相似,63.7% 对 100%,但 LeMeDISCO 适用于更多的病例,其大的共病召回率归因于共享蛋白质,这些蛋白质有助于理解疾病共病的分子机制。LeMeDISCO 网络服务器可在以下网址供学术用户使用:http://sites.gatech.edu/cssb/LeMeDISCO。