Babenko Vladimir N, Basu Malay K, Kondrashov Fyodor A, Rogozin Igor B, Koonin Eugene V
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD, USA.
BMC Cancer. 2006 Feb 9;6:36. doi: 10.1186/1471-2407-6-36.
Carcinogenesis typically involves multiple somatic mutations in caretaker (DNA repair) and gatekeeper (tumor suppressors and oncogenes) genes. Analysis of mutation spectra of the tumor suppressor that is most commonly mutated in human cancers, p53, unexpectedly suggested that somatic evolution of the p53 gene during tumorigenesis is dominated by positive selection for gain of function. This conclusion is supported by accumulating experimental evidence of evolution of new functions of p53 in tumors. These findings prompted a genome-wide analysis of possible positive selection during tumor evolution.
A comprehensive analysis of probable somatic mutations in the sequences of Expressed Sequence Tags (ESTs) from malignant tumors and normal tissues was performed in order to access the prevalence of positive selection in cancer evolution. For each EST, the numbers of synonymous and non-synonymous substitutions were calculated. In order to identify genes with a signature of positive selection in cancers, these numbers were compared to: i) expected numbers and ii) the numbers for the respective genes in the ESTs from normal tissues.
We identified 112 genes with a signature of positive selection in cancers, i.e., a significantly elevated ratio of non-synonymous to synonymous substitutions, in tumors as compared to 37 such genes in an approximately equal-sized EST collection from normal tissues. A substantial fraction of the tumor-specific positive-selection candidates have experimentally demonstrated or strongly predicted links to cancer.
The results of EST analysis should be interpreted with extreme caution given the noise introduced by sequencing errors and undetected polymorphisms. Furthermore, an inherent limitation of EST analysis is that multiple mutations amenable to statistical analysis can be detected only in relatively highly expressed genes. Nevertheless, the present results suggest that positive selection might affect a substantial number of genes during tumorigenic somatic evolution.
癌症发生通常涉及维持基因组稳定性基因(DNA修复基因)和守门基因(肿瘤抑制基因和癌基因)中的多个体细胞突变。对人类癌症中最常发生突变的肿瘤抑制基因p53的突变谱分析意外地表明,肿瘤发生过程中p53基因的体细胞进化主要由功能获得的正选择主导。这一结论得到了肿瘤中p53新功能进化的越来越多实验证据的支持。这些发现促使对肿瘤进化过程中可能的正选择进行全基因组分析。
对来自恶性肿瘤和正常组织的表达序列标签(EST)序列中的可能体细胞突变进行了全面分析,以了解癌症进化中正选择的普遍性。对于每个EST,计算同义替换和非同义替换的数量。为了鉴定在癌症中具有正选择特征的基因,将这些数量与以下两者进行比较:i)预期数量,ii)来自正常组织的EST中相应基因的数量。
我们鉴定出112个在癌症中具有正选择特征的基因,即与来自正常组织的大小大致相同的EST集合中的37个此类基因相比,肿瘤中非同义替换与同义替换的比率显著升高。很大一部分肿瘤特异性正选择候选基因在实验上已证明或强烈预测与癌症有关。
考虑到测序错误和未检测到的多态性引入的噪声,EST分析结果应极其谨慎地解释。此外,EST分析的一个固有局限性是,只有在相对高表达的基因中才能检测到适合进行统计分析的多个突变。尽管如此,目前的结果表明,在致瘤体细胞进化过程中,正选择可能会影响大量基因。