School of Information Science and Technology, University of Science and Technology of China, Hefei AH230027, People's Republic of China.
Centers for Biomedical Engineering, University of Science and Technology of China, Hefei AH230027, People's Republic of China.
IET Syst Biol. 2014 Apr;8(2):24-32. doi: 10.1049/iet-syb.2013.0027.
The authors describe an integrated method for analysing cancer driver aberrations and disrupted pathways by using tumour single nucleotide polymorphism (SNP) arrays. The authors new method adopts a novel statistical model to explicitly quantify the SNP signals, and therefore infers the genomic aberrations, including copy number alteration and loss of heterozygosity. Examination on the dilution series dataset shows that this method can correctly identify the genomic aberrations even with the existence of severe normal cell contamination in tumour sample. Furthermore, with the results of the aberration identification obtained from multiple tumour samples, a permutation-based approach is proposed for identifying the statistically significant driver aberrations, which are further incorporated with the known signalling pathways for pathway enrichment analysis. By applying the approach to 286 hepatocellular tumour samples, they successfully uncover numerous driver aberration regions across the cancer genome, for example, chromosomes 4p and 5q, which harbour many known hepatocellular cancer related genes such as alpha-fetoprotein (AFP) and ectodermal-neural cortex (ENC1). In addition, they identify nine disrupted pathways that are highly enriched by the driver aberrations, including the systemic lupus erythematosus pathway, the vascular endothelial growth factor (VEGF) signalling pathway and so on. These results support the feasibility and the utility of the proposed method on the characterisation of the cancer genome and the downstream analysis of the driver aberrations and the disrupted signalling pathways.
作者描述了一种通过肿瘤单核苷酸多态性 (SNP) 阵列分析癌症驱动因子异常和信号通路失调的综合方法。该方法采用一种新的统计模型来明确量化 SNP 信号,从而推断基因组异常,包括拷贝数改变和杂合性丢失。对稀释系列数据集的检验表明,即使在肿瘤样本中存在严重的正常细胞污染,该方法也可以正确识别基因组异常。此外,通过对多个肿瘤样本的异常识别结果,提出了一种基于排列的方法来识别统计学上显著的驱动异常,这些异常进一步与已知的信号通路结合进行通路富集分析。作者将该方法应用于 286 个肝细胞肿瘤样本,成功地揭示了癌症基因组中许多驱动异常区域,例如染色体 4p 和 5q,其中包含许多已知的肝细胞癌相关基因,如甲胎蛋白 (AFP) 和外胚层-神经皮质 (ENC1)。此外,作者还鉴定了 9 个被驱动异常高度富集的失调通路,包括系统性红斑狼疮通路、血管内皮生长因子 (VEGF) 信号通路等。这些结果支持了所提出的方法在癌症基因组特征描述和驱动异常以及失调信号通路下游分析中的可行性和实用性。