Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education & College of Animal Science and Technology, Huazhong Agricultural University, Wuhan 430070, PR China.
Frontiers Science Center for Animal Breeding and Sustainable Production, Wuhan 430070, PR China.
Nucleic Acids Res. 2023 May 8;51(8):3501-3512. doi: 10.1093/nar/gkad074.
Human diseases and agricultural traits can be predicted by modeling a genetic random polygenic effect in linear mixed models. To estimate variance components and predict random effects of the model efficiently with limited computational resources has always been of primary concern, especially when it involves increasing the genotype data scale in the current genomic era. Here, we thoroughly reviewed the development history of statistical algorithms used in genetic evaluation and theoretically compared their computational complexity and applicability for different data scenarios. Most importantly, we presented a computationally efficient, functionally enriched, multi-platform and user-friendly software package named 'HIBLUP' to address the challenges that are faced currently using big genomic data. Powered by advanced algorithms, elaborate design and efficient programming, HIBLUP computed fastest while using the lowest memory in analyses, and the greater the number of individuals that are genotyped, the greater the computational benefits from HIBLUP. We also demonstrated that HIBLUP is the only tool which can accomplish the analyses for a UK Biobank-scale dataset within 1 h using the proposed efficient 'HE + PCG' strategy. It is foreseeable that HIBLUP will facilitate genetic research for human, plants and animals. The HIBLUP software and user manual can be accessed freely at https://www.hiblup.com.
通过在线性混合模型中对遗传随机多基因效应进行建模,可以预测人类疾病和农业性状。在有限的计算资源下,有效地估计方差分量并预测模型的随机效应一直是首要关注的问题,尤其是在当前基因组时代增加基因型数据规模时。在这里,我们全面回顾了遗传评估中使用的统计算法的发展历史,并从理论上比较了它们的计算复杂性和在不同数据场景下的适用性。最重要的是,我们提出了一种计算效率高、功能丰富、多平台且用户友好的软件包,名为“ HIBLUP”,以解决当前使用大型基因组数据所面临的挑战。借助先进的算法、精心的设计和高效的编程,HIBLUP 在分析中计算速度最快,占用的内存最低,并且所检测的个体数量越多,从 HIBLUP 获得的计算优势就越大。我们还证明,HIBLUP 是唯一可以使用建议的高效“ HE + PCG”策略在 1 小时内完成 UK Biobank 规模数据集分析的工具。可以预见,HIBLUP 将促进人类,植物和动物的遗传研究。HIBLUP 软件和用户手册可在 https://www.hiblup.com 上免费获得。