Department of Biomedical Informatics, College of Medicine, The Ohio State University, 1800 Cannon Drive, Columbus, OH 43210, United States.
Department of Biomedical Informatics, College of Pharmacy, The Ohio State University, 500 W. 12 ave, Columbus, OH 43210, United States.
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae425.
Synthetic lethality (SL) has shown great promise for the discovery of novel targets in cancer. CRISPR double-knockout (CDKO) technologies can only screen several hundred genes and their combinations, but not genome-wide. Therefore, good SL prediction models are highly needed for genes and gene pairs selection in CDKO experiments. However, lack of scalable SL properties prevents generalizability of SL interactions to out-of-sample data, thereby hindering modeling efforts. In this paper, we recognize that SL connectivity is a scalable and generalizable SL property. We develop a novel two-step multilayer encoder for individual sample-specific SL prediction model (MLEC-iSL), which predicts SL connectivity first and SL interactions subsequently. MLEC-iSL has three encoders, namely, gene, graph, and transformer encoders. MLEC-iSL achieves high SL prediction performance in K562 (AUPR, 0.73; AUC, 0.72) and Jurkat (AUPR, 0.73; AUC, 0.71) cells, while no existing methods exceed 0.62 AUPR and AUC. The prediction performance of MLEC-iSL is validated in a CDKO experiment in 22Rv1 cells, yielding a 46.8% SL rate among 987 selected gene pairs. The screen also reveals SL dependency between apoptosis and mitosis cell death pathways.
合成致死性 (SL) 在癌症新靶点的发现方面显示出巨大的潜力。CRISPR 双敲除 (CDKO) 技术只能筛选几百个基因及其组合,而不是全基因组。因此,CDKO 实验中需要良好的 SL 预测模型来筛选基因和基因对。然而,缺乏可扩展的 SL 特性会阻止 SL 相互作用对样本外数据的泛化,从而阻碍建模工作。在本文中,我们认识到 SL 连接性是一种可扩展和可泛化的 SL 特性。我们开发了一种新颖的两步多层编码器,用于个体样本特异性 SL 预测模型 (MLEC-iSL),该模型首先预测 SL 连接性,然后预测 SL 相互作用。MLEC-iSL 有三个编码器,分别是基因、图和转换器编码器。MLEC-iSL 在 K562(AUPR,0.73;AUC,0.72)和 Jurkat(AUPR,0.73;AUC,0.71)细胞中实现了高 SL 预测性能,而没有现有的方法超过 0.62 AUPR 和 AUC。在 22Rv1 细胞中的 CDKO 实验中验证了 MLEC-iSL 的预测性能,在 987 对选定的基因对中产生了 46.8%的 SL 率。该筛选还揭示了凋亡和有丝分裂细胞死亡途径之间的 SL 依赖性。