Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, 69978, Israel.
Department of Computer Science and Center for Bioinformatics and Computational Biology, University of Maryland, College Park, 20742, MD, USA.
Genome Med. 2021 Nov 1;13(1):173. doi: 10.1186/s13073-021-00988-7.
Mutational signatures are key to understanding the processes that shape cancer genomes, yet their analysis requires relatively rich whole-genome or whole-exome mutation data. Recently, orders-of-magnitude sparser gene-panel-sequencing data have become increasingly available in the clinic. To deal with such sparse data, we suggest a novel mixture model, Mix. In application to simulated and real gene-panel sequences, Mix is shown to outperform current approaches and yield mutational signatures and patient stratifications that are in higher agreement with the literature. We further demonstrate its utility in several clinical settings, successfully predicting therapy benefit and patient groupings from MSK-IMPACT pan-cancer data. Availability: https://github.com/itaysason/Mix-MMM .
突变特征是理解塑造癌症基因组的过程的关键,但它们的分析需要相对丰富的全基因组或全外显子组突变数据。最近,在临床上越来越多地获得了数量级上稀疏得多的基因面板测序数据。为了处理这种稀疏数据,我们提出了一种新的混合模型 Mix。在应用于模拟和真实的基因面板序列时,Mix 被证明优于当前的方法,并产生与文献更一致的突变特征和患者分层。我们进一步在几个临床环境中证明了它的实用性,成功地从 MSK-IMPACT 泛癌数据中预测了治疗效果和患者分组。可获取性:https://github.com/itaysason/Mix-MMM 。