通过群体规模的基因组测序来绘制拷贝数变异图谱。
Mapping copy number variation by population-scale genome sequencing.
机构信息
Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA.
出版信息
Nature. 2011 Feb 3;470(7332):59-65. doi: 10.1038/nature09708.
Genomic structural variants (SVs) are abundant in humans, differing from other forms of variation in extent, origin and functional impact. Despite progress in SV characterization, the nucleotide resolution architecture of most SVs remains unknown. We constructed a map of unbalanced SVs (that is, copy number variants) based on whole genome DNA sequencing data from 185 human genomes, integrating evidence from complementary SV discovery approaches with extensive experimental validations. Our map encompassed 22,025 deletions and 6,000 additional SVs, including insertions and tandem duplications. Most SVs (53%) were mapped to nucleotide resolution, which facilitated analysing their origin and functional impact. We examined numerous whole and partial gene deletions with a genotyping approach and observed a depletion of gene disruptions amongst high frequency deletions. Furthermore, we observed differences in the size spectra of SVs originating from distinct formation mechanisms, and constructed a map of SV hotspots formed by common mechanisms. Our analytical framework and SV map serves as a resource for sequencing-based association studies.
基因组结构变异(SVs)在人类中大量存在,与其他形式的变异在程度、起源和功能影响上有所不同。尽管在 SV 特征描述方面取得了进展,但大多数 SV 的核苷酸分辨率结构仍然未知。我们基于来自 185 个人类基因组的全基因组 DNA 测序数据构建了不平衡 SV(即拷贝数变异)图谱,整合了来自互补 SV 发现方法的证据,并进行了广泛的实验验证。我们的图谱包含了 22025 个缺失和 6000 个其他 SV,包括插入和串联重复。大多数 SV(53%)被映射到核苷酸分辨率,这便于分析它们的起源和功能影响。我们使用基因分型方法研究了许多完整和部分基因缺失,并观察到高频缺失中基因缺失的减少。此外,我们观察到来自不同形成机制的 SV 的大小谱的差异,并构建了由常见机制形成的 SV 热点图谱。我们的分析框架和 SV 图谱为基于测序的关联研究提供了资源。