Yang Shuangyu, He Dan, Li Ling, Lu Zhiya, Li Shaoying, Lan Tianjun, Liu Feiyi, Zhang Huasong, Cooper David N, Zhao Huiying
Department of Medical Research Center, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, 107 Yan Jiang West Road, Guangzhou, 510000, People's Republic of China.
Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Guangzhou, 510006, People's Republic of China.
Hum Genet. 2025 Jun 14. doi: 10.1007/s00439-025-02755-9.
Discovering cancer driver genes is critical for improving survival rates. Current methods often overlook the varying functional impacts of mutations. It is necessary to develop a method integrating mutation pathogenicity and gene expression data, enhancing the identification of novel cancer drivers. To predict cancer drivers, we have developed a framework (DGAT-cancer) that integrates the pathogenicity of somatic mutation in tumors and germline variants in the healthy population, with topological networks of gene expression in tumors, and the gene expressions in tumor and paracancerous tissues. This integration overcomes the limitations of current methods that assume a uniform impact of all mutations by leveraging a comprehensive view of mutation function within its biological context. These features were filtered by an unsupervised approach, Laplacian selection, and combined by Hotelling and Box-Cox transformations to score genes. By using gene scores as weights, Gibbs sampling was performed to identify cancer drivers. DGAT-cancer was applied to seven types of cancer cohorts, and achieved the best area under the precision-recall curve (AUPRC ranging from 0.646 to 0.862) compared to five commonly used methods (AUPRC ranging from 0.357 to 0.629). DGAT-cancer has identified 505 cancer drivers. Knockdown of the top ranked gene, EEF1A1 indicated a ~ 41-50% decrease in glioma size and improved the temozolomide sensitivity of glioma cells. By combining heterogeneous genomics and transcriptomics data, DGAT-cancer has significantly improved our ability to detect novel cancer drivers, and is an innovative approach revealing cancer therapeutic targets, thereby advancing the development of more precise and effective cancer treatments.
发现癌症驱动基因对于提高生存率至关重要。当前方法常常忽略突变的不同功能影响。有必要开发一种整合突变致病性和基因表达数据的方法,以增强对新型癌症驱动基因的识别。为了预测癌症驱动基因,我们开发了一个框架(DGAT-cancer),该框架整合了肿瘤中体细胞突变和健康人群种系变异的致病性、肿瘤中基因表达的拓扑网络以及肿瘤和癌旁组织中的基因表达。这种整合克服了当前方法的局限性,即通过在生物学背景下全面了解突变功能,假设所有突变具有统一影响。这些特征通过无监督方法拉普拉斯选择进行筛选,并通过霍特林变换和Box-Cox变换进行组合以对基因进行评分。以基因分数作为权重,进行吉布斯采样以识别癌症驱动基因。DGAT-cancer应用于七种癌症队列,与五种常用方法相比(精确召回曲线下面积范围为0.357至0.629),其在精确召回曲线下面积方面表现最佳(范围为0.646至0.862)。DGAT-cancer已识别出505个癌症驱动基因。敲低排名最高的基因EEF1A1可使胶质瘤大小降低约41%-50%,并提高胶质瘤细胞对替莫唑胺的敏感性。通过结合异质基因组学和转录组学数据,DGAT-cancer显著提高了我们检测新型癌症驱动基因的能力,是一种揭示癌症治疗靶点的创新方法,从而推动更精确有效的癌症治疗的发展。