School of Physics and Astronomy, Raymond & Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, 69978 Tel Aviv, Israel;
Center for Bioinformatics and Computational Biology, Institute of Advanced Computer Studies, University of Maryland, College Park, MD 20742.
Proc Natl Acad Sci U S A. 2019 Aug 20;116(34):16987-16996. doi: 10.1073/pnas.1908790116. Epub 2019 Aug 6.
Repetitive sequences are hotspots of evolution at multiple levels. However, due to difficulties involved in their assembly and analysis, the role of repeats in tumor evolution is poorly understood. We developed a rigorous motif-based methodology to quantify variations in the repeat content, beyond microsatellites, in proteomes and genomes directly from proteomic and genomic raw data. This method was applied to a wide range of tumors and normal tissues. We identify high similarity between repeat instability patterns in tumors and their patient-matched adjacent normal tissues. Nonetheless, tumor-specific signatures both in protein expression and in the genome strongly correlate with cancer progression and robustly predict the tumorigenic state. In a patient, the hierarchy of genomic repeat instability signatures accurately reconstructs tumor evolution, with primary tumors differentiated from metastases. We observe an inverse relationship between repeat instability and point mutation load within and across patients independent of other somatic aberrations. Thus, repeat instability is a distinct, transient, and compensatory adaptive mechanism in tumor evolution and a potential signal for early detection.
重复序列是多个层面进化的热点。然而,由于其组装和分析的困难,重复在肿瘤进化中的作用仍不清楚。我们开发了一种严格的基于模体的方法,可从蛋白质组学和基因组学的原始数据中直接定量分析蛋白质组和基因组中微卫星以外的重复含量的变化。该方法应用于广泛的肿瘤和正常组织。我们发现肿瘤与患者匹配的相邻正常组织中的重复不稳定性模式非常相似。尽管如此,肿瘤中蛋白质表达和基因组中的特异性特征与癌症进展强烈相关,并能可靠地预测肿瘤发生状态。在一个患者中,基因组重复不稳定性特征的层次结构准确地重建了肿瘤的进化,原发肿瘤与转移瘤不同。我们观察到,在患者内部和之间,重复不稳定性与点突变负荷之间存在反比关系,而与其他体细胞异常无关。因此,重复不稳定性是肿瘤进化中的一种独特的、短暂的和代偿性的适应机制,也是早期检测的潜在信号。