Marotta Michael, Chen Xiongfong, Inoshita Ayako, Stephens Robert, Budd G Thomas, Crowe Joseph P, Lyons Joanne, Kondratova Anna, Tubbs Raymond, Tanaka Hisashi
Breast Cancer Res. 2012 Nov 26;14(6):R150. doi: 10.1186/bcr3362.
Segmental duplications (low-copy repeats) are the recently duplicated genomic segments in the human genome that display nearly identical (> 90%) sequences and account for about 5% of euchromatic regions. In germline, duplicated segments mediate nonallelic homologous recombination and thus cause both non-disease-causing copy-number variants and genomic disorders. To what extent duplicated segments play a role in somatic DNA rearrangements in cancer remains elusive. Duplicated segments often cluster and form genomic blocks enriched with both direct and inverted repeats (complex genomic regions). Such complex regions could be fragile and play a mechanistic role in the amplification of the ERBB2 gene in breast tumors, because repeated sequences are known to initiate gene amplification in model systems.
We conducted polymerase chain reaction (PCR)-based assays for primary breast tumors and analyzed publically available array-comparative genomic hybridization data to map a common copy-number breakpoint in ERBB2-amplified primary breast tumors. We further used molecular, bioinformatics, and population-genetics approaches to define duplication contents, structural variants, and haplotypes within the common breakpoint.
We found a large (> 300-kb) block of duplicated segments that was colocalized with a common-copy number breakpoint for ERBB2 amplification. The breakpoint that potentially initiated ERBB2 amplification localized in a region 1.5 megabases (Mb) on the telomeric side of ERBB2. The region is very complex, with extensive duplications of KRTAP genes, structural variants, and, as a result, a paucity of single-nucleotide polymorphism (SNP) markers. Duplicated segments are varied in size and degree of sequence homology, indicating that duplications have occurred recurrently during genome evolution.
Amplification of the ERBB2 gene in breast tumors is potentially initiated by a complex region that has unusual genomic features and thus requires rigorous, labor-intensive investigation. The haplotypes we provide could be useful to identify the potential association between the complex region and ERBB2 amplification.
片段重复(低拷贝重复序列)是人类基因组中近期复制的基因组片段,其序列相似度接近100%(>90%),约占常染色质区域的5%。在生殖细胞系中,重复片段介导非等位基因同源重组,从而导致非致病的拷贝数变异和基因组疾病。重复片段在癌症体细胞DNA重排中发挥何种程度的作用仍不清楚。重复片段常聚集形成富含正向和反向重复序列的基因组区域(复杂基因组区域)。由于已知重复序列可在模型系统中引发基因扩增,因此这类复杂区域可能很脆弱,并在乳腺肿瘤中ERBB2基因的扩增中发挥机制性作用。
我们对原发性乳腺肿瘤进行了基于聚合酶链反应(PCR)的检测,并分析了公开可用的阵列比较基因组杂交数据,以确定ERBB2扩增的原发性乳腺肿瘤中常见的拷贝数断点。我们进一步使用分子、生物信息学和群体遗传学方法来确定常见断点内的重复内容、结构变异和单倍型。
我们发现了一个大的(>300kb)重复片段区域,该区域与ERBB2扩增的一个常见拷贝数断点共定位。可能引发ERBB2扩增的断点位于ERBB2端粒侧1.5兆碱基(Mb)处的一个区域。该区域非常复杂,有大量角蛋白相关蛋白(KRTAP)基因重复、结构变异,因此单核苷酸多态性(SNP)标记很少。重复片段的大小和序列同源性程度各不相同,表明在基因组进化过程中重复事件反复发生。
乳腺肿瘤中ERBB2基因的扩增可能由一个具有异常基因组特征的复杂区域引发,因此需要进行严格且耗时的研究。我们提供的单倍型可能有助于识别该复杂区域与ERBB2扩增之间的潜在关联。