Lin Cheng-Yu, Shukla Ankit, Grady John P, Fink J Lynn, Dray Eloise, Duijf Pascal H G
University of Queensland Diamantina Institute, The University of Queensland, Translational Research Institute, 37 Kent Street, Brisbane, QLD 4102, Australia.
Institute of Health and Biomedical Innovation, Queensland University of Technology, Translational Research Institute, 37 Kent Street, Brisbane, QLD 4102, Australia.
Cancers (Basel). 2018 Jan 8;10(1):13. doi: 10.3390/cancers10010013.
Chromosomal translocations drive the development of many hematological and some solid cancers. Several factors have been identified to explain the non-random occurrence of translocation breakpoints in the genome. These include chromatin density, gene density and CCCTC-binding factor (CTCF)/cohesin binding site density. However, such factors are at least partially interdependent. Using 13,844 and 1563 karyotypes from human blood and solid cancers, respectively, our multiple regression analysis only identified chromatin density as the primary statistically significant predictor. Specifically, translocation breakpoints preferentially occur in open chromatin. Also, blood and solid tumors show markedly distinct translocation signatures. Strikingly, translocation breakpoints occur significantly more frequently in acrocentric chromosomes than in non-acrocentric chromosomes. Thus, translocations are probably often generated around nucleoli in the inner nucleoplasm, away from the nuclear envelope. Importantly, our findings remain true both in multivariate analyses and after removal of highly recurrent translocations. Finally, we applied pairwise probabilistic co-occurrence modeling. In addition to well-known highly prevalent translocations, such as those resulting in - (-) and - (-) fusion genes, we identified significantly underrepresented translocations with putative fusion genes, which are probably subject to strong negative selection during tumor evolution. Taken together, our findings provide novel insights into the generation and selection of translocations during cancer development.
染色体易位驱动许多血液系统癌症和一些实体癌的发展。已经确定了几个因素来解释基因组中易位断点的非随机出现。这些因素包括染色质密度、基因密度和CCCTC结合因子(CTCF)/黏连蛋白结合位点密度。然而,这些因素至少部分相互依赖。分别使用来自人类血液和实体癌的13844个和1563个核型,我们的多元回归分析仅将染色质密度确定为主要的统计学显著预测因子。具体而言,易位断点优先出现在开放染色质中。此外,血液肿瘤和实体肿瘤显示出明显不同的易位特征。引人注目的是,易位断点在近端着丝粒染色体上出现的频率明显高于非近端着丝粒染色体。因此,易位可能经常在核仁周围的核内体中产生,远离核膜。重要的是,我们的发现在多变量分析中以及在去除高度复发的易位后仍然成立。最后,我们应用了成对概率共现建模。除了众所周知的高度普遍的易位,例如那些导致-(-)和-(-)融合基因的易位,我们还鉴定出了具有推定融合基因的明显代表性不足的易位,这些易位在肿瘤进化过程中可能受到强烈的负选择。综上所述,我们的发现为癌症发展过程中易位的产生和选择提供了新的见解。