Precision Medicine Institute, Seoul, 08511, Republic of Korea.
Department of Biomedical Sciences, Seoul National University Graduate School, Seoul, 03080, Republic of Korea.
Hum Genomics. 2021 Aug 28;15(1):58. doi: 10.1186/s40246-021-00357-w.
Low-pass sequencing (LPS) has been extensively investigated for applicability to various genetic studies due to its advantages over genotype array data including cost-effectiveness. Predicting the risk of complex diseases such as Parkinson's disease (PD) using polygenic risk score (PRS) based on the genetic variations has shown decent prediction accuracy. Although ultra-LPS has been shown to be effective in PRS calculation, array data has been favored to the majority of PRS analysis, especially for PD.
Using eight high-coverage WGS, we assessed imputation approaches for downsampled LPS data ranging from 0.5 × to 7.0 × . We demonstrated that uncertain genotype calls of LPS diminished imputation accuracy, and an imputation approach using genotype likelihoods was plausible for LPS. Additionally, comparing imputation accuracies between LPS and simulated array illustrated that LPS had higher accuracies particularly at rare frequencies. To evaluate ultra-low coverage data in PRS calculation for PD, we prepared low-coverage WGS and genotype array of 87 PD cases and 101 controls. Genotype imputation of array and downsampled LPS were conducted using a population-specific reference panel, and we calculated risk scores based on the PD-associated SNPs from an East Asian meta-GWAS. The PRS models discriminated cases and controls as previously reported when both LPS and genotype array were used. Also strong correlations in PRS models for PD between LPS and genotype array were discovered.
Overall, this study highlights the potentials of LPS under 1.0 × followed by genotype imputation in PRS calculation and suggests LPS as attractive alternatives to genotype array in the area of precision medicine for PD.
由于其具有成本效益等优势,因此在各种遗传研究中都广泛研究了低深度测序(LPS)的适用性。使用基于遗传变异的多基因风险评分(PRS)预测帕金森病(PD)等复杂疾病的风险已显示出相当不错的预测准确性。尽管超深度 LPS 已被证明在 PRS 计算中有效,但大多数 PRS 分析都倾向于使用阵列数据,尤其是对于 PD。
我们使用了八个高覆盖 WGS,评估了从 0.5×到 7.0×的 LPS 下采样数据的导入方法。我们证明了 LPS 的不确定基因型调用会降低导入准确性,并且使用基因型可能性的导入方法对于 LPS 是合理的。此外,比较 LPS 和模拟数组之间的导入准确性表明,LPS 的准确性更高,尤其是在罕见频率下。为了评估 PD 中PRS 计算的超低覆盖数据,我们准备了 87 例 PD 病例和 101 例对照的低覆盖 WGS 和基因型数组。使用特定于人群的参考面板对数组和下采样 LPS 进行基因型导入,我们根据东亚元 GWAS 中的 PD 相关 SNP 计算了风险评分。当同时使用 LPS 和基因型数组时,PRS 模型以前所报道的方式区分了病例和对照。还发现了 LPS 和基因型数组之间 PD 的 PRS 模型之间的强相关性。
总体而言,这项研究强调了在 PRS 计算中 LPS 低于 1.0×,然后进行基因型导入的潜力,并表明 LPS 是 PD 精准医学领域中替代基因型数组的有吸引力的选择。