Pendleton Matthew, Sebra Robert, Pang Andy Wing Chun, Ummat Ajay, Franzen Oscar, Rausch Tobias, Stütz Adrian M, Stedman William, Anantharaman Thomas, Hastie Alex, Dai Heng, Fritz Markus Hsi-Yang, Cao Han, Cohain Ariella, Deikus Gintaras, Durrett Russell E, Blanchard Scott C, Altman Roger, Chin Chen-Shan, Guo Yan, Paxinos Ellen E, Korbel Jan O, Darnell Robert B, McCombie W Richard, Kwok Pui-Yan, Mason Christopher E, Schadt Eric E, Bashir Ali
Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, USA.
BioNano Genomics, San Diego, California, USA.
Nat Methods. 2015 Aug;12(8):780-6. doi: 10.1038/nmeth.3454. Epub 2015 Jun 29.
We present the first comprehensive analysis of a diploid human genome that combines single-molecule sequencing with single-molecule genome maps. Our hybrid assembly markedly improves upon the contiguity observed from traditional shotgun sequencing approaches, with scaffold N50 values approaching 30 Mb, and we identified complex structural variants (SVs) missed by other high-throughput approaches. Furthermore, by combining Illumina short-read data with long reads, we phased both single-nucleotide variants and SVs, generating haplotypes with over 99% consistency with previous trio-based studies. Our work shows that it is now possible to integrate single-molecule and high-throughput sequence data to generate de novo assembled genomes that approach reference quality.
我们展示了对二倍体人类基因组的首次全面分析,该分析将单分子测序与单分子基因组图谱相结合。我们的混合组装显著改善了传统鸟枪法测序方法所观察到的连续性,支架N50值接近30 Mb,并且我们鉴定出了其他高通量方法遗漏的复杂结构变异(SV)。此外,通过将Illumina短读长数据与长读长数据相结合,我们对单核苷酸变异和SV进行了定相,生成了与先前基于三人组的研究一致性超过99%的单倍型。我们的工作表明,现在有可能整合单分子和高通量序列数据以生成接近参考质量的从头组装基因组。