Schill Rudolf, Solbrig Stefan, Wettig Tilo, Spang Rainer
Department of Statistical Bioinformatics, Institute of Functional Genomics, Regensburg 93040, Germany.
Department of Physics, University of Regensburg, Regensburg 93040, Germany.
Bioinformatics. 2020 Jan 1;36(1):241-249. doi: 10.1093/bioinformatics/btz513.
Cancer progresses by accumulating genomic events, such as mutations and copy number alterations, whose chronological order is key to understanding the disease but difficult to observe. Instead, cancer progression models use co-occurrence patterns in cross-sectional data to infer epistatic interactions between events and thereby uncover their most likely order of occurrence. State-of-the-art progression models, however, are limited by mathematical tractability and only allow events to interact in directed acyclic graphs, to promote but not inhibit subsequent events, or to be mutually exclusive in distinct groups that cannot overlap.
Here we propose Mutual Hazard Networks (MHN), a new Machine Learning algorithm to infer cyclic progression models from cross-sectional data. MHN model events by their spontaneous rate of fixation and by multiplicative effects they exert on the rates of successive events. MHN compared favourably to acyclic models in cross-validated model fit on four datasets tested. In application to the glioblastoma dataset from The Cancer Genome Atlas, MHN proposed a novel interaction in line with consecutive biopsies: IDH1 mutations are early events that promote subsequent fixation of TP53 mutations.
Implementation and data are available at https://github.com/RudiSchill/MHN.
Supplementary data are available at Bioinformatics online.
癌症通过积累基因组事件(如突变和拷贝数改变)而进展,这些事件的时间顺序对于理解该疾病至关重要,但却难以观察到。相反,癌症进展模型利用横断面数据中的共现模式来推断事件之间的上位性相互作用,从而揭示它们最可能的发生顺序。然而,当前最先进的进展模型受到数学可处理性的限制,仅允许事件在有向无环图中相互作用,以促进而非抑制后续事件,或者在不能重叠的不同组中相互排斥。
在此,我们提出了互险网络(MHN),这是一种从横断面数据推断循环进展模型的新型机器学习算法。MHN通过事件的自发固定率以及它们对后续事件发生率施加的乘法效应来对事件进行建模。在对四个测试数据集进行的交叉验证模型拟合中,MHN与无环模型相比表现良好。在应用于来自癌症基因组图谱的胶质母细胞瘤数据集时,MHN提出了一种与连续活检结果相符的新型相互作用:异柠檬酸脱氢酶1(IDH1)突变是促进随后TP53突变固定的早期事件。
可在https://github.com/RudiSchill/MHN获取实现代码和数据。
补充数据可在《生物信息学》在线版获取。