College of Animal Science, Yangtze University, Jingzhou 434025, Hubei, China.
College of Animal Science and Technology, Anhui Agricultural University, Hefei 230036, Anhui, China.
G3 (Bethesda). 2023 Aug 30;13(9). doi: 10.1093/g3journal/jkad154.
Linkage disequilibrium (LD) analysis is fundamental to the investigation of the genetic architecture of complex traits (e.g. human disease, animal and plant breeding) and population structure and evolution dynamics. However, until now, studies primarily focus on LD status between genetic variants located on the same chromosome. Moreover, genome (re)sequencing produces unprecedented numbers of genetic variants, and fast LD computation becomes a challenge. Here, we have developed GWLD, a parallelized and generalized tool designed for the rapid genome-wide calculation of LD values, including conventional D/D', r2, and (reduced) mutual information (MI and RMI) measures. LD between genetic variants within and across chromosomes can be rapidly computed and visualized in either an R package or a standalone C++ software package. To evaluate the accuracy and speed of LD calculation, we conducted comparisons using 4 real datasets. Interchromosomal LD patterns observed potentially reflect levels of selection intensity across different species. Both versions of GWLD, the R package (https://github.com/Rong-Zh/GWLD/GWLD-R) and the standalone C++ software (https://github.com/Rong-Zh/GWLD/GWLD-C++), are freely available on GitHub.
连锁不平衡(LD)分析是研究复杂性状(如人类疾病、动植物育种)遗传结构和群体结构与进化动态的基础。然而,到目前为止,研究主要集中在位于同一染色体上的遗传变异体之间的 LD 状态。此外,基因组(重)测序产生了前所未有的大量遗传变异体,快速 LD 计算成为一个挑战。在这里,我们开发了 GWLD,这是一种并行化和通用的工具,用于快速计算全基因组 LD 值,包括传统的 D/D'、r2 和(简化)互信息(MI 和 RMI)度量。可以在 R 包或独立的 C++软件包中快速计算和可视化染色体内和染色体间遗传变异体之间的 LD。为了评估 LD 计算的准确性和速度,我们使用 4 个真实数据集进行了比较。跨染色体 LD 模式的观察结果可能反映了不同物种之间选择强度的水平。GWLD 的 R 包(https://github.com/Rong-Zh/GWLD/GWLD-R)和独立的 C++软件(https://github.com/Rong-Zh/GWLD/GWLD-C++)版本都可以在 GitHub 上免费获得。