Lovaglio Pietro Giorgio, Borgonovo Fabio, Manzo Margiotta Alessandro, Mowafy Mohamed, Colaneri Marta, Bandera Alessandra, Gori Andrea, Capetti Amedeo Ferdinando
Department of Statistics and Quantitative Methods, University of Milano Bicocca, Milan, Italy.
Division of Infectious Diseases, Luigi Sacco Hospital, University of Milan, Milan, Italy.
Front Epidemiol. 2025 May 30;5:1597799. doi: 10.3389/fepid.2025.1597799. eCollection 2025.
Long COVID (LC) is a multisystem condition with prolonged symptoms persisting beyond acute SARS-CoV-2 infection. However, prevalence estimates vary widely due to differences in case definitions and sampling methodologies. This study aims to determine the prevalence of LC across different definitions and correct for selection bias using advanced statistical modeling.
We conducted a retrospective, observational study at Luigi Sacco Hospital (Milan, Italy), analyzing 3,344 COVID-19 patients from two pandemic waves (2020-2021). Participants included 1,537 outpatients from the ARCOVID clinic and 1,807 hospitalized patients. LC was defined based on WHO and NICE criteria, as well as two alternative definitions: symptoms persisting at 3 and 6 months post-infection. We used a bivariate censored Probit model to account for selection bias and estimate adjusted LC prevalence.
LC prevalence varied across definitions: 67.4% (WHO), 76.3% (NICE), 80.2% (3 months), and 79.6% (6 months). Adjusted prevalence estimates remained consistent across definitions. The most common symptoms were fatigue (58.6%), dyspnea (41.1%), and joint/muscle pain (39.2%). Risk factors included female sex (OR 2.165-2.379), metabolic disease (OR 1.587-1.629), and older age (40-50 years, OR 1.847). Protective factors included antiplatelets (OR 0.640-0.689), statins (OR 0.616), and hypoglycemics (OR 0.593-0.706). Vaccination, hydroxychloroquine, and antibiotics were associated with an increased risk of LC. Selection bias significantly influenced prevalence estimates, underscoring the need for robust statistical adjustments.
Our findings highlight the high prevalence of LC, particularly among specific subgroups, with strong selection effects influencing outpatient participation. Differences in prevalence estimates emphasize the impact of case definitions and study designs on LC research. The identification of risk and protective factors supports targeted interventions and patient management strategies.
This study provides one of the most comprehensive analyses of LC prevalence while accounting for selection bias. Our findings call for standardized LC definitions, improved epidemiological methodologies, and targeted prevention strategies. Future research should explore prospective cohorts to refine LC prevalence estimates and investigate long-term health outcomes.
长期新冠(LC)是一种多系统疾病,其症状在急性SARS-CoV-2感染后持续很长时间。然而,由于病例定义和抽样方法的差异,患病率估计值差异很大。本研究旨在确定不同定义下LC的患病率,并使用先进的统计模型校正选择偏倚。
我们在意大利米兰的路易吉·萨科医院进行了一项回顾性观察研究,分析了来自两个疫情波(2020 - 2021年)的3344例新冠患者。参与者包括ARCOVID诊所的1537名门诊患者和1807名住院患者。LC根据世界卫生组织(WHO)和英国国家卫生与临床优化研究所(NICE)的标准以及另外两个替代定义来定义:感染后3个月和6个月持续存在的症状。我们使用二元删失Probit模型来校正选择偏倚并估计调整后的LC患病率。
LC患病率因定义而异:67.4%(WHO)、76.3%(NICE)、80.2%(3个月)和79.6%(6个月)。调整后的患病率估计值在不同定义下保持一致。最常见症状为疲劳(58.6%)、呼吸困难(41.1%)和关节/肌肉疼痛(39.2%)。危险因素包括女性(比值比2.165 - 2.379)、代谢疾病(比值比1.587 - 1.629)和年龄较大(40 - 50岁,比值比1.847)。保护因素包括抗血小板药物(比值比0.640 - 0.689)、他汀类药物(比值比0.616)和降糖药(比值比0.