Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain.
Centro de Investigación Biomédica en Red en Cáncer (CIBERONC), Instituto de Salud Carlos III, Madrid, Spain.
Cancer Discov. 2024 Sep 4;14(9):1717-1731. doi: 10.1158/2159-8290.CD-23-1416.
Clonal hematopoiesis (CH) is a phenomenon of clonal expansion of hematopoietic stem cells driven by somatic mutations affecting certain genes. Recently, CH has been linked to the development of hematologic malignancies, cardiovascular diseases, and other conditions. Although the most frequently mutated CH driver genes have been identified, a systematic landscape of the mutations capable of initiating this phenomenon is still lacking. In this study, we trained machine learning models for 12 of the most recurrent CH genes to identify their driver mutations. These models outperform expert-curated rules based on prior knowledge of the function of these genes. Moreover, their application to identify CH driver mutations across almost half a million donors of the UK Biobank reproduces known associations between CH driver mutations and age, and the prevalence of several diseases and conditions. We thus propose that these models support the accurate identification of CH across healthy individuals. Significance: We developed and validated gene-specific machine learning models to identify CH driver mutations, showing their advantage with respect to expert-curated rules. These models can support the identification and clinical interpretation of CH mutations in newly sequenced individuals. See related commentary by Arends and Jaiswal, p. 1581.
克隆性造血 (CH) 是一种由影响某些基因的体细胞突变驱动的造血干细胞克隆性扩增的现象。最近,CH 与血液系统恶性肿瘤、心血管疾病和其他疾病的发展有关。尽管已经确定了最常突变的 CH 驱动基因,但能够引发这种现象的突变的系统景观仍然缺乏。在这项研究中,我们为 12 个最常见的 CH 基因训练了机器学习模型,以识别其驱动突变。这些模型优于基于这些基因功能的先验知识的专家编纂规则。此外,它们在 UK Biobank 的近 50 万名供体中应用,以识别 CH 驱动突变,再现了 CH 驱动突变与年龄之间以及几种疾病和病症的患病率之间的已知关联。因此,我们提出这些模型支持在健康个体中准确识别 CH。意义:我们开发并验证了针对特定基因的机器学习模型,以识别 CH 驱动突变,与专家编纂规则相比,这些模型具有优势。这些模型可以支持在新测序个体中对 CH 突变的识别和临床解释。请参阅 Arends 和 Jaiswal 的相关评论,第 1581 页。