Institute of Informatics, University of Warsaw, Warsaw, Poland.
Hum Mutat. 2013 Jan;34(1):210-20. doi: 10.1002/humu.22217. Epub 2012 Oct 11.
Inverse paralogous low-copy repeats (IP-LCRs) can cause genome instability by nonallelic homologous recombination (NAHR)-mediated balanced inversions. When disrupting a dosage-sensitive gene(s), balanced inversions can lead to abnormal phenotypes. We delineated the genome-wide distribution of IP-LCRs >1 kB in size with >95% sequence identity and mapped the genes, potentially intersected by an inversion, that overlap at least one of the IP-LCRs. Remarkably, our results show that 12.0% of the human genome is potentially susceptible to such inversions and 942 genes, 99 of which are on the X chromosome, are predicted to be disrupted secondary to such an inversion! In addition, IP-LCRs larger than 800 bp with at least 98% sequence identity (duplication/triplication facilitating IP-LCRs, DTIP-LCRs) were recently implicated in the formation of complex genomic rearrangements with a duplication-inverted triplication-duplication (DUP-TRP/INV-DUP) structure by a replication-based mechanism involving a template switch between such inverted repeats. We identified 1,551 DTIP-LCRs that could facilitate DUP-TRP/INV-DUP formation. Remarkably, 1,445 disease-associated genes are at risk of undergoing copy-number gain as they map to genomic intervals susceptible to the formation of DUP-TRP/INV-DUP complex rearrangements. We implicate inverted LCRs as a human genome architectural feature that could potentially be responsible for genomic instability associated with many human disease traits.
反向平行低拷贝重复序列(IP-LCRs)可通过非等位基因同源重组(NAHR)介导的平衡倒位引起基因组不稳定。当破坏一个剂量敏感基因(s)时,平衡倒位可导致异常表型。我们描绘了大小大于 1 kB、序列同一性大于 95%的 IP-LCR 的全基因组分布,并绘制了至少与一个 IP-LCR 重叠的基因,这些基因可能被倒位所交叉。值得注意的是,我们的结果表明,12.0%的人类基因组可能容易受到这种倒位的影响,并且有 942 个基因,其中 99 个位于 X 染色体上,预计会因这种倒位而受到干扰!此外,最近的研究表明,大于 800 bp 且具有至少 98%序列同一性的 IP-LCR(复制/三倍体促进 IP-LCR,DTIP-LCR)通过涉及模板转换的复制机制参与了具有复制-倒位-三倍体-复制(DUP-TRP/INV-DUP)结构的复杂基因组重排的形成。我们确定了 1551 个可以促进 DUP-TRP/INV-DUP 形成的 DTIP-LCR。值得注意的是,由于它们映射到易受 DUP-TRP/INV-DUP 复杂重排形成影响的基因组区间,因此 1445 个与疾病相关的基因存在发生拷贝数增益的风险。我们暗示,反向 LCR 是人类基因组结构特征的一部分,它可能是与许多人类疾病特征相关的基因组不稳定性的原因。