J. Craig Venter Institute, Rockville, Maryland 20850, USA.
Genome Res. 2013 May;23(5):826-32. doi: 10.1101/gr.144600.112. Epub 2013 Jan 2.
There is increasing evidence that the phenotypic effects of genomic sequence variants are best understood in terms of variant haplotypes rather than as isolated polymorphisms. Haplotype analysis is also critically important for uncovering population histories and for the study of evolutionary genetics. Although the sequencing of individual human genomes to reveal personal collections of sequence variants is now well established, there has been slower progress in the phasing of these variants into pairs of haplotypes along each pair of chromosomes. Here, we have developed a distinct approach to haplotyping that can yield chromosome-length haplotypes, including the vast majority of heterozygous single-nucleotide polymorphisms (SNPs) in an individual human genome. This approach exploits the haploid nature of sperm cells and employs a combination of genotyping and low-coverage sequencing on a short-read platform. In addition to generating chromosome-length haplotypes, the approach can directly identify recombination events (averaging 1.1 per chromosome) with a median resolution of <100 kb.
越来越多的证据表明,基因组序列变异的表型效应最好根据变异单倍型来理解,而不是作为孤立的多态性。单倍型分析对于揭示种群历史和进化遗传学研究也至关重要。虽然对个体人类基因组进行测序以揭示个人的序列变异集合现在已经得到很好的建立,但将这些变异相位成每对染色体上的一对单倍型的进展较慢。在这里,我们开发了一种独特的单倍型分析方法,可以产生染色体长度的单倍型,包括个体人类基因组中绝大多数杂合单核苷酸多态性(SNP)。该方法利用精子细胞的单倍体性质,并结合短读长平台上的基因分型和低覆盖度测序。除了生成染色体长度的单倍型外,该方法还可以直接识别重组事件(平均每个染色体 1.1 个),分辨率中位数<100kb。