Razi Abolfazl, Afghah Fatemeh, Varadan Vinay
Annu Int Conf IEEE Eng Med Biol Soc. 2015;2015:6509-13. doi: 10.1109/EMBC.2015.7319884.
The problem of identifying interacting genes that jointly are associated with a phenotype is considered. When the number of features are extremely large compared to the number of samples, there may be several subsets of features that provide acceptable levels of predictability. This is particularly true in cancer genomics, where we are interested in finding functionally related gene sets likely to jointly drive cancer phenotypes. In this paper, a novel game theoretic solution is proposed by modeling genes as players of a Coalition Game. This method discovers and develops informative gene subnetworks by integrating gene expression profiling of cancer tissues with protein-protein interaction (PPI) networks. These subnetworks are gradually developed by selective addition of candidate genes that present maximal Shapely values in coalition with subnetworks of genes. We applied the proposed algorithm to an ovarian cancer dataset (N = 201), in order to identify optimal subnetworks that can predict cancer progression risk in response to platinum-based therapy. We show improved predictive power of the proposed method when compared to state-of-the-art feature selection methods, with the added advantage of identifying potentially functional gene subnetworks that may provide insights into the mechanisms underlying cancer progression.
我们考虑了识别与一种表型相关的相互作用基因的问题。当特征数量与样本数量相比极大时,可能存在几个能提供可接受预测水平的特征子集。这在癌症基因组学中尤为如此,在癌症基因组学中,我们感兴趣的是找到可能共同驱动癌症表型的功能相关基因集。在本文中,通过将基因建模为联盟博弈的参与者,提出了一种新颖的博弈论解决方案。该方法通过整合癌症组织的基因表达谱与蛋白质 - 蛋白质相互作用(PPI)网络来发现和开发信息丰富的基因子网。这些子网通过选择性添加与基因子网联合时具有最大夏普利值的候选基因逐步发展。我们将所提出的算法应用于一个卵巢癌数据集(N = 201),以识别能够预测基于铂类疗法的癌症进展风险的最佳子网。与最先进的特征选择方法相比,我们展示了所提出方法的更高预测能力,其额外优势在于识别潜在的功能基因子网,这可能为癌症进展的潜在机制提供见解。