Sardell J, Pearson M, Chocian K, Das S, Taylor K, Strivens M, Gupta R, Rochlin A, Gardner S
PrecisionLife Ltd., Unit 8b Bankside, Hanborough Business Park, Long Hanborough, OX29 8LJ, UK.
Metrodora Institute, 3535 South Market Street, West Valley City, UT, 84119, USA.
J Transl Med. 2025 May 8;23(1):516. doi: 10.1186/s12967-025-06535-x.
Long COVID is a major public health burden causing a diverse array of debilitating symptoms in tens of millions of patients globally. In spite of this overwhelming disease prevalence, staggering cost, severe impact on patients' lives and intense global research efforts, study of the disease has proved challenging due to its complexity. Genome-wide association studies (GWAS) have identified only four loci potentially associated with the disease, although these results did not statistically replicate between studies. A previous combinatorial analysis study identified a total of 73 genes that were highly associated with two long COVID cohorts in the predominantly (> 91%) white European ancestry Sano GOLD population, and we sought to reproduce these findings in the independent and ancestrally more diverse All of Us (AoU) population.
We assessed the reproducibility of the 5343 long COVID disease signatures from the original study in the AoU population. Because the very small population sizes provide very limited power to replicate findings, we initially tested whether we observed a statistically significant enrichment of the Sano GOLD disease signatures that are also positively correlated with long COVID in the AoU cohort after controlling for population substructure.
For the Sano GOLD disease signatures that have a case frequency greater than 5% in AoU, we consistently observed a significant enrichment (77-83%, p < 0.01) of signatures that are also positively associated with long COVID in the AoU cohort. These encompassed 92% of the genes identified in the original study. At least five of the disease signatures found in Sano GOLD were also shown to be individually significantly associated with increased long COVID prevalence in the AoU population. Rates of signature reproducibility are strongest among self-identified white patients, but we also observe significant enrichment of reproducing disease associations in self-identified black/African-American and Hispanic/Latino cohorts. Signatures associated with 11 out of the 13 drug repurposing candidates identified in the original Sano GOLD study were reproduced in this study.
These results demonstrate the reproducibility of long COVID disease signal found by combinatorial analysis, broadly validating the results of the original analysis. They provide compelling evidence for a much broader array of genetic associations with long COVID than previously identified through traditional GWAS studies. This strongly supports the hypothesis that genetic factors play a critical role in determining an individual's susceptibility to long COVID following recovery from acute SARS-CoV-2 infection. It also lends weight to the drug repurposing candidates identified in the original analysis. Together these results may help to stimulate much needed new precision medicine approaches to more effectively diagnose and treat the disease. This is also the first reproduction of long COVID genetic associations across multiple populations with substantially different ancestry distributions. Given the high reproducibility rate across diverse populations, these findings may have broader clinical application and promote better health equity. We hope that this will provide confidence to explore some of these mechanisms and drug targets and help advance research into novel ways to diagnose the disease and accelerate the discovery and selection of better therapeutic options, both in the form of newly discovered drugs and/or the immediate prioritization of coordinated investigations into the efficacy of repurposed drug candidates.
长期新冠是一项重大的公共卫生负担,在全球数千万患者中引发了一系列使人衰弱的症状。尽管该疾病患病率极高、成本惊人、对患者生活造成严重影响且全球都在进行深入研究,但由于其复杂性,对该疾病的研究已被证明具有挑战性。全基因组关联研究(GWAS)仅确定了四个可能与该疾病相关的基因座,尽管这些结果在不同研究之间未得到统计学上的重复验证。先前的一项组合分析研究在主要为(>91%)欧洲白人血统的萨诺黄金(Sano GOLD)人群中,确定了总共73个与两个长期新冠队列高度相关的基因,我们试图在独立且祖先背景更多样化的“我们所有人”(AoU)人群中重现这些发现。
我们评估了原始研究中5343个长期新冠疾病特征在AoU人群中的可重复性。由于样本量非常小,提供的复制研究结果的能力非常有限,我们首先测试在控制了人群亚结构后,是否在AoU队列中观察到与长期新冠也呈正相关的萨诺黄金疾病特征有统计学意义的富集。
对于在AoU中病例频率大于5%的萨诺黄金疾病特征,我们始终观察到在AoU队列中与长期新冠也呈正相关的特征有显著富集(77 - 83%,p < 0.01)。这些特征涵盖了原始研究中确定的92%的基因。在萨诺黄金中发现的至少五个疾病特征也被证明在AoU人群中分别与长期新冠患病率增加显著相关。特征可重复性在自我认定为白人的患者中最强,但我们也在自我认定为黑人/非裔美国人和西班牙裔/拉丁裔队列中观察到重现疾病关联的显著富集。在原始萨诺黄金研究中确定的13个药物再利用候选药物中,有11个的相关特征在本研究中得到了重现。
这些结果证明了通过组合分析发现的长期新冠疾病信号的可重复性,广泛验证了原始分析的结果。它们为与长期新冠相关的基因关联提供了比以前通过传统GWAS研究确定的更为广泛的令人信服的证据。这有力地支持了这样一种假设,即遗传因素在决定个体从急性SARS-CoV-2感染康复后对长期新冠的易感性方面起着关键作用。这也为原始分析中确定的药物再利用候选药物提供了支持。这些结果共同可能有助于激发急需的新的精准医学方法,以更有效地诊断和治疗该疾病。这也是首次在具有实质上不同祖先分布的多个人群中重现长期新冠的遗传关联。鉴于在不同人群中的高重现率,这些发现可能具有更广泛的临床应用并促进更好的健康公平性。我们希望这将为探索其中一些机制和药物靶点提供信心,并有助于推动对诊断该疾病的新方法的研究,加速发现和选择更好的治疗选择,无论是以新发现的药物形式,还是以立即优先对再利用候选药物的疗效进行协调研究的形式。