Ben Cohen Gil, Yaacov Adar, Ben Zvi Yishai, Loutati Ranel, Lishinsky Natan, Landau Jakob, Hope Tom, Popovzter Aron, Rosenberg Shai
Gaffin Center for Neuro-Oncology, Sharett Institute for Oncology, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel; The Wohl Institute for Translational Medicine, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel.
Gaffin Center for Neuro-Oncology, Sharett Institute for Oncology, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel; The Wohl Institute for Translational Medicine, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel.
Comput Biol Med. 2025 Feb;185:109491. doi: 10.1016/j.compbiomed.2024.109491. Epub 2024 Dec 19.
The identification and drug targeting of cancer causing (driver) genetic alterations has seen immense improvement in recent years, with many new targeted therapies developed. However, identifying, prioritizing, and treating genetic alterations is insufficient for most cancer patients. Current clinical practices rely mainly on DNA level mutational analyses, which in many cases fail to identify treatable driver events. Arguably, signal strength may determine cell fate more than the mutational status that initiated it. The use of transcriptomics, a complex and highly informative representation of cellular and tumor state, had been suggested to enhance diagnostics and treatment successes. A gene-expression based model trained over known genetic alterations could improve identification and quantification of cancer related biological aberrations' signal strength.
We present STAMP (Signatures in Transcriptome Associated with Mutated Protein), a Graph Convolution Networks (GCN) based framework for the identification of gene expression signatures related to cancer driver events. STAMP was trained to identify the p53 dysfunction of cancer samples from gene expression, utilizing comprehensive curated graph structures of gene interactions. Predictions were modified for generating a quantitative score to rank the severity of a driver event in each sample. STAMP was then extended to almost 300 tumor type-specific predictive models for important cancer genes/pathways, by training to identify well-established driver events' annotations from the literature.
STAMP achieved very high AUC on unseen data across several tumor types and on an independent cohort. The framework was validated on p53 related genetic and clinical characteristics, including the effect of Variants of Unknown Significance, and showed strong correlation with protein function. For genes and tumor types where targeted therapy is available, STAMP showed correlation with drugs sensitivity (IC50) in an independent cell line database. It managed to stratify drug effect on samples with similar mutational profiles. STAMP was validated for drug-response prediction in clinical patients' cohorts, improving over a state-of-the-art method and suggesting potential biomarkers for cancer treatments.
The STAMP models provide a learning framework that successfully identifies and quantifies driver events' signal strength, showing utility in portraying the molecular landscape of tumors based on transcriptomics. Importantly, STAMP manifested the ability to improve targeted therapy selection and hence can contribute to better treatment.
近年来,致癌(驱动)基因改变的识别和药物靶向治疗取得了巨大进展,开发了许多新的靶向治疗方法。然而,对于大多数癌症患者来说,识别、排序和治疗基因改变是不够的。目前的临床实践主要依赖于DNA水平的突变分析,而在许多情况下,这种分析无法识别可治疗的驱动事件。可以说,信号强度可能比引发它的突变状态更能决定细胞命运。有人建议使用转录组学,它是细胞和肿瘤状态的一种复杂且信息丰富的表征,以提高诊断和治疗的成功率。基于已知基因改变训练的基于基因表达的模型可以改善癌症相关生物学异常信号强度的识别和量化。
我们提出了STAMP(与突变蛋白相关的转录组特征),这是一个基于图卷积网络(GCN)的框架,用于识别与癌症驱动事件相关的基因表达特征。STAMP经过训练,利用全面整理的基因相互作用图结构,从基因表达中识别癌症样本的p53功能障碍。对预测结果进行修改,以生成定量分数,对每个样本中驱动事件的严重程度进行排名。然后,通过训练从文献中识别公认的驱动事件注释,将STAMP扩展到近300个针对重要癌症基因/通路的肿瘤类型特异性预测模型。
STAMP在多种肿瘤类型的未见数据和一个独立队列上取得了非常高的AUC。该框架在与p53相关的基因和临床特征上得到了验证,包括未知意义变异的影响,并与蛋白质功能显示出强相关性。对于有靶向治疗可用的基因和肿瘤类型,STAMP在一个独立的细胞系数据库中显示出与药物敏感性(IC50)的相关性。它能够对具有相似突变谱的样本进行药物效应分层。STAMP在临床患者队列中的药物反应预测得到了验证,优于一种先进方法,并为癌症治疗提出了潜在的生物标志物。
STAMP模型提供了一个学习框架,成功地识别和量化了驱动事件的信号强度,显示了基于转录组学描绘肿瘤分子景观的效用。重要的是,STAMP表现出改善靶向治疗选择的能力,因此可以有助于更好的治疗。