Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.
Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.
Am J Hum Genet. 2022 Jun 2;109(6):1175-1181. doi: 10.1016/j.ajhg.2022.04.006. Epub 2022 May 2.
Current publicly available tools that allow rapid exploration of linkage disequilibrium (LD) between markers (e.g., HaploReg and LDlink) are based on whole-genome sequence (WGS) data from 2,504 individuals in the 1000 Genomes Project. Here, we present TOP-LD, an online tool to explore LD inferred with high-coverage (∼30×) WGS data from 15,578 individuals in the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. TOP-LD provides a significant upgrade compared to current LD tools, as the TOPMed WGS data provide a more comprehensive representation of genetic variation than the 1000 Genomes data, particularly for rare variants and in the specific populations that we analyzed. For example, TOP-LD encompasses LD information for 150.3, 62.2, and 36.7 million variants for European, African, and East Asian ancestral samples, respectively, offering 2.6- to 9.1-fold increase in variant coverage compared to HaploReg 4.0 or LDlink. In addition, TOP-LD includes tens of thousands of structural variants (SVs). We demonstrate the value of TOP-LD in fine-mapping at the GGT1 locus associated with gamma glutamyltransferase in the African ancestry participants in UK Biobank. Beyond fine-mapping, TOP-LD can facilitate a wide range of applications that are based on summary statistics and estimates of LD. TOP-LD is freely available online.
目前可用于快速探索标记之间连锁不平衡(LD)的公开工具(例如 HaploReg 和 LDlink)是基于 1000 基因组计划中 2504 个人的全基因组序列(WGS)数据。在这里,我们介绍了 TOP-LD,这是一种在线工具,用于探索来自 NHLBI 转化医学精准医学计划(TOPMed)中 15578 个人的高覆盖率(约 30×)WGS 数据推断出的 LD。与当前的 LD 工具相比,TOP-LD 提供了显著的升级,因为 TOPMed WGS 数据比 1000 基因组数据更全面地代表了遗传变异,特别是对于罕见变异和我们分析的特定人群。例如,TOP-LD 涵盖了欧洲、非洲和东亚祖先样本中分别为 150.3、62.2 和 36.7 百万个变体的 LD 信息,与 HaploReg 4.0 或 LDlink 相比,变体覆盖率分别增加了 2.6 到 9.1 倍。此外,TOP-LD 还包括数万种结构变体(SVs)。我们在英国生物库中非洲裔参与者中与γ-谷氨酰转移酶相关的 GGT1 基因座精细映射中展示了 TOP-LD 的价值。除了精细映射之外,TOP-LD 还可以促进基于汇总统计数据和 LD 估计的广泛应用。TOP-LD 可免费在线获取。