Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, 2-1, Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan.
Tohoku Medical Megabank Organization, Tohoku University, 2-1, Seiryo-machi, Aoba-ku, Sendai, Miyagi, 980-8573, Japan.
Nat Commun. 2021 Jan 11;12(1):226. doi: 10.1038/s41467-020-20146-8.
The complete human genome sequence is used as a reference for next-generation sequencing analyses. However, some ethnic ancestries are under-represented in the reference genome (e.g., GRCh37) due to its bias toward European and African ancestries. Here, we perform de novo assembly of three Japanese male genomes using > 100× Pacific Biosciences long reads and Bionano Genomics optical maps per sample. We integrate the genomes using the major allele for consensus and anchor the scaffolds using genetic and radiation hybrid maps to reconstruct each chromosome. The resulting genome sequence, JG1, is contiguous, accurate, and carries the Japanese major allele at most loci. We adopt JG1 as the reference for confirmatory exome re-analyses of seven rare-disease Japanese families and find that re-analysis using JG1 reduces total candidate variant calls versus GRCh37 while retaining disease-causing variants. These results suggest that integrating multiple genomes from a single population can aid genome analyses of that population.
完整的人类基因组序列被用作下一代测序分析的参考。然而,由于参考基因组(例如 GRCh37)偏向于欧洲和非洲血统,一些种族血统的代表性不足。在这里,我们使用超过 100× 的太平洋生物科学长读长和每个样本的 Bionano 基因组学光学图谱,对三个日本男性基因组进行从头组装。我们使用主要等位基因对共识进行整合,并使用遗传和辐射杂种图谱将支架锚定,以重建每条染色体。得到的基因组序列 JG1 是连续的、准确的,并且大多数位点都带有日本主要等位基因。我们采用 JG1 作为七个罕见疾病日本家系确认外显子重新分析的参考,发现使用 JG1 进行重新分析相对于 GRCh37 减少了总候选变异调用,同时保留了致病变异。这些结果表明,整合来自单一人群的多个基因组可以辅助该人群的基因组分析。