School of Life Sciences, Shanghai University, Shanghai, China.
Department of Radiology, Columbia University Medical Center, New York.
J Cell Biochem. 2019 Jan;120(1):405-416. doi: 10.1002/jcb.27395. Epub 2018 Aug 20.
Synthetic lethality is the synthesis of mutations leading to cell death. Tumor-specific synthetic lethality has been targeted in research to improve cancer therapy. With the advances of techniques in molecular biology, such as RNAi and CRISPR/Cas9 gene editing, efforts have been made to systematically identify synthetic lethal interactions, especially for frequently mutated genes in cancers. However, elucidating the mechanism of synthetic lethality remains a challenge because of the complexity of its influencing conditions. In this study, we proposed a new computational method to identify critical functional features that can accurately predict synthetic lethal interactions. This method incorporates several machine learning algorithms and encodes protein-coding genes by an enrichment system derived from gene ontology terms and Kyoto Encyclopedia of Genes and Genomes pathways to represent their functional features. We built a random forest-based prediction engine by using 2120 selected features and obtained a Matthews correlation coefficient of 0.532. We examined the top 15 features and found that most of them have potential roles in synthetic lethality according to previous studies. These results demonstrate the ability of our proposed method to predict synthetic lethal interactions and provide a basis for further characterization of these particular genetic combinations.
合成致死性是导致细胞死亡的突变的综合。肿瘤特异性合成致死性已成为研究的目标,以改善癌症治疗。随着分子生物学技术的进步,如 RNAi 和 CRISPR/Cas9 基因编辑,人们已经努力系统地识别合成致死性相互作用,特别是针对癌症中经常发生突变的基因。然而,由于其影响条件的复杂性,阐明合成致死性的机制仍然是一个挑战。在这项研究中,我们提出了一种新的计算方法来识别关键的功能特征,这些特征可以准确预测合成致死性相互作用。该方法结合了几种机器学习算法,并通过从基因本体论术语和京都基因与基因组百科全书途径中得出的富集系统对蛋白质编码基因进行编码,以表示它们的功能特征。我们使用 2120 个选定的特征构建了一个基于随机森林的预测引擎,并获得了 0.532 的马修斯相关系数。我们检查了前 15 个特征,发现根据先前的研究,其中大多数在合成致死性中具有潜在作用。这些结果表明了我们提出的方法预测合成致死性相互作用的能力,并为进一步表征这些特定的遗传组合提供了基础。