Suresh Nikhila T, E R Vimina, Krishnakumar U
IEEE/ACM Trans Comput Biol Bioinform. 2023 Jan-Feb;20(1):742-751. doi: 10.1109/TCBB.2022.3140388. Epub 2023 Feb 3.
In gene-based therapies, local perturbations associated with one disease can lead to comorbidity as it influences the pathways involved with the other diseases. The key genes orchestrating the common biological mechanisms are need to be prioritized for addressing the challenges introduced by the cross talks between disease modules. Here, a local centrality measure named Sub graph based Average Path length Double Specific Betweenness centrality (SAPDSB) for prioritizing the comorbid genes via Protein-Protein Interaction Network (PPIN) analysis is presented. This approach can be used to identify putative biomarkers which can be repurposed for the management of comorbidity. Proposed network based topological measure is designed specifically to prioritize the comorbid genes that are most likely to be present in the overlap of disease modules. In order to attain this, the estimated average path length of the seed network which holds Protein-Protein Interactions (PPIs) of the disease genes is exploited. Prioritized comorbid genes are further pruned using centrality-based cut-off values and specificity scores. The biological significance of the resultant genes is corroborated with connectivity analysis using leave-one-out method, pathway enrichment analysis and a comparative analysis using single disease-based gene prioritization tools. For performance analysis, proposed approach is tested using case studies involving common diseases and rare neurodegenerative diseases. For case study1, diseases such as Diabetes, Carcinoma and Alzheimer's are considered in a pairwise manner while for case study2, Amyotrophic Lateral Sclerosis (ALS) and Spinal Muscular Atrophy (SMA) are considered. As outcome, prioritized candidate genes and biological pathways associated with respective disease pairs have been found. The associations from top 10 candidate genes in different disease pair combinations of Diabetes-Carcinoma-Alzheimer's revealed common genes like CREBBP, TP53, HSP90AA1 and the common pathway namely p53 pathway feedback loops 2. Out of the pathways retrieved from the top 10 genes associated with ALS-SMA disease pair, 60% of unique pathways are found to be leading to both diseases and its comorbidities. Comparative analysis of the proposed method with recent similar approach also reported a clear degree of benefits in performance.
在基于基因的疗法中,与一种疾病相关的局部扰动可能会导致共病,因为它会影响与其他疾病相关的通路。为应对疾病模块间相互作用带来的挑战,需要对协调共同生物学机制的关键基因进行优先级排序。在此,提出了一种基于子图的平均路径长度双特异性介数中心性(SAPDSB)的局部中心性度量方法,用于通过蛋白质-蛋白质相互作用网络(PPIN)分析对共病基因进行优先级排序。该方法可用于识别可重新用于共病管理的潜在生物标志物。所提出的基于网络的拓扑度量方法专门设计用于对最有可能存在于疾病模块重叠部分的共病基因进行优先级排序。为实现这一点,利用了包含疾病基因蛋白质-蛋白质相互作用(PPI)的种子网络的估计平均路径长度。使用基于中心性的截止值和特异性分数对优先级共病基因进行进一步筛选。通过留一法连接性分析、通路富集分析以及使用基于单一疾病的基因优先级工具进行的比较分析,证实了所得基因具有生物学意义。为进行性能分析,使用涉及常见疾病和罕见神经退行性疾病的案例研究对所提出的方法进行测试。对于案例研究1,以两两组合的方式考虑糖尿病、癌症和阿尔茨海默病等疾病,而对于案例研究2,考虑肌萎缩侧索硬化症(ALS)和脊髓性肌萎缩症(SMA)。结果发现了与各疾病对相关的优先级候选基因和生物学通路。糖尿病-癌症-阿尔茨海默病不同疾病对组合中前10个候选基因的关联揭示了CREBBP、TP53、HSP90AA1等共同基因以及名为p53通路反馈环2的共同通路。在与ALS-SMA疾病对相关的前10个基因检索到的通路中,发现60%的独特通路会导致两种疾病及其共病情况。将所提出的方法与近期类似方法进行比较分析,也报告了在性能方面有明显的优势。