• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

逻辑回归模型在复杂调查数据中的正确应用:一项系统评价

The proper application of logistic regression model in complex survey data: a systematic review.

作者信息

Dey Devjit, Haque Md Samio, Islam Md Mojahedul, Aishi Umme Iffat, Shammy Sajida Sultana, Mayen Md Sabbir Ahmed, Noor Syed Toukir Ahmed, Uddin Md Jamal

机构信息

Department of Statistics, Shahjalal University of Science and Technology, Sylhet, 3114, Bangladesh.

Maternal and Child Health Division, International Centre for Diarrhoeal Disease Research, Bangladesh (Icddr,B), Dhaka, Bangladesh.

出版信息

BMC Med Res Methodol. 2025 Jan 22;25(1):15. doi: 10.1186/s12874-024-02454-5.

DOI:10.1186/s12874-024-02454-5
PMID:39844030
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11752662/
Abstract

BACKGROUND

Logistic regression is a useful statistical technique commonly used in many fields like healthcare, marketing, or finance to generate insights from binary outcomes (e.g., sick vs. not sick). However, when applying logistic regression to complex survey data, which includes complex sampling designs, specific methodological issues are often overlooked.

METHODS

The systematic review extensively searched the PubMed and ScienceDirect databases from January 2015 to December 2021, following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 guidelines, focusing primarily on the Demographic and Health Surveys (DHS) and Multiple Indicator Cluster Surveys (MICS). 810 articles met the inclusion criteria and were included in the analysis. When discussing logistic regression, the review considered multiple methodological problems such as the model adequacy assessment, handling dependence of observations, utilization of complex survey design, dealing with missing values, outliers, and more.

RESULTS

Among the selected articles, the DHS database was used the most (96%), with MICS accounting for only 3%, and both DHS and MICS accounting for 1%. Of these, it was found that only 19.7% of the studies employed multilevel mixed-effects logistic regression to account for data dependencies. Model validation techniques were not reported in 94.8% of the studies with limited uses of the bootstrap, jackknife, and other resampling methods. Moreover, sample weights, PSUs, and strata variables were used together in 40.4% of the articles, and 41.7% of the studies did not use any of these variables, which could have produced biased results. Goodness-of-fit assessments were not mentioned in 75.3% of the articles, and the Hosmer-Lemeshow and likelihood ratio test were the most common among those reported. Furthermore, 95.8% of studies did not mention outliers, and only 41.0% of studies corrected for missing information, while only 2.7% applied imputation techniques.

CONCLUSIONS

This systematic review highlights important gaps in the use of logistic regression with complex survey data, such as overlooking data dependencies, survey design, and proper validation techniques, along with neglecting outliers, missing data, and goodness-of-fit assessments, all of which point to the need for clearer methodological standards and more thorough reporting to improve the reliability of results. Future research should focus on consistently following these standards to ensure stronger and more dependable findings.

摘要

背景

逻辑回归是一种有用的统计技术,常用于医疗保健、市场营销或金融等许多领域,以从二元结果(例如,患病与未患病)中得出见解。然而,在将逻辑回归应用于复杂的调查数据时,包括复杂的抽样设计,特定的方法学问题常常被忽视。

方法

本系统评价按照系统评价和Meta分析的首选报告项目(PRISMA)2020指南,于2015年1月至2021年12月广泛检索了PubMed和ScienceDirect数据库,主要关注人口与健康调查(DHS)和多指标类集调查(MICS)。810篇文章符合纳入标准并被纳入分析。在讨论逻辑回归时,该评价考虑了多个方法学问题,如模型充分性评估、处理观测值的依赖性、复杂调查设计的利用、处理缺失值、异常值等。

结果

在所选文章中,DHS数据库使用最多(96%),MICS仅占3%,DHS和MICS均使用的占1%。其中,发现只有19.7%的研究采用多级混合效应逻辑回归来考虑数据依赖性。94.8%的研究未报告模型验证技术,对自抽样法、刀切法和其他重抽样方法的使用有限。此外,40.4%的文章同时使用了样本权重、初级抽样单元和分层变量,41.7%的研究未使用这些变量中的任何一个,这可能会产生有偏差的结果。75.3%的文章未提及拟合优度评估,在报告的评估中,Hosmer-Lemeshow检验和似然比检验最为常见。此外,95.8%的研究未提及异常值,只有41.0%的研究对缺失信息进行了校正,而只有2.7%的研究应用了插补技术。

结论

本系统评价突出了在使用逻辑回归分析复杂调查数据时存在的重要差距,如忽视数据依赖性、调查设计和适当的验证技术,以及忽略异常值、缺失数据和拟合优度评估,所有这些都表明需要更明确的方法学标准和更全面的报告,以提高结果的可靠性。未来的研究应专注于始终遵循这些标准,以确保得出更强有力和更可靠的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab8f/11752662/70398c332d6f/12874_2024_2454_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab8f/11752662/73672b6fed68/12874_2024_2454_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab8f/11752662/264dadccb601/12874_2024_2454_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab8f/11752662/08a771c43582/12874_2024_2454_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab8f/11752662/a22c8d5c3046/12874_2024_2454_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab8f/11752662/70398c332d6f/12874_2024_2454_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab8f/11752662/73672b6fed68/12874_2024_2454_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab8f/11752662/264dadccb601/12874_2024_2454_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab8f/11752662/08a771c43582/12874_2024_2454_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab8f/11752662/a22c8d5c3046/12874_2024_2454_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab8f/11752662/70398c332d6f/12874_2024_2454_Fig5_HTML.jpg

相似文献

1
The proper application of logistic regression model in complex survey data: a systematic review.逻辑回归模型在复杂调查数据中的正确应用:一项系统评价
BMC Med Res Methodol. 2025 Jan 22;25(1):15. doi: 10.1186/s12874-024-02454-5.
2
Evaluating the current methodological practices and issues in existing literature in pooling complex surveys: a systematic review.评估现有文献中复杂调查汇总的当前方法学实践和问题:系统评价。
BMC Med Res Methodol. 2024 Nov 13;24(1):279. doi: 10.1186/s12874-024-02400-5.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
5
Characterization of immunization secondary analyses using demographic and health surveys (DHS) and multiple indicator cluster surveys (MICS), 2006-2018.2006 - 2018年利用人口与健康调查(DHS)和多指标类集调查(MICS)进行免疫接种二次分析的特征描述
BMC Public Health. 2021 Feb 12;21(1):351. doi: 10.1186/s12889-021-10364-0.
6
Beyond the black stump: rapid reviews of health research issues affecting regional, rural and remote Australia.超越黑木树:影响澳大利亚地区、农村和偏远地区的健康研究问题的快速综述。
Med J Aust. 2020 Dec;213 Suppl 11:S3-S32.e1. doi: 10.5694/mja2.50881.
7
Deployment of personnel to military operations: impact on mental health and social functioning.人员部署到军事行动中:对心理健康和社会功能的影响。
Campbell Syst Rev. 2018 Jun 1;14(1):1-127. doi: 10.4073/csr.2018.6. eCollection 2018.
8
Bias due to selective inclusion and reporting of outcomes and analyses in systematic reviews of randomised trials of healthcare interventions.在医疗保健干预随机试验的系统评价中,因对结果和分析进行选择性纳入及报告而产生的偏倚。
Cochrane Database Syst Rev. 2014 Oct 1;2014(10):MR000035. doi: 10.1002/14651858.MR000035.pub2.
9
Public sector reforms and their impact on the level of corruption: A systematic review.公共部门改革及其对腐败程度的影响:一项系统综述。
Campbell Syst Rev. 2021 May 24;17(2):e1173. doi: 10.1002/cl2.1173. eCollection 2021 Jun.
10
Reporting and Methodological Quality of Systematic Reviews and Meta-Analyses of Nursing Interventions in Patients With Alzheimer's Disease: General Implications of the Findings.阿尔茨海默病患者护理干预的系统评价和荟萃分析的报告和方法学质量:研究结果的普遍意义。
J Nurs Scholarsh. 2019 May;51(3):308-316. doi: 10.1111/jnu.12462. Epub 2019 Feb 25.

引用本文的文献

1
Comment on "The Role of the BMI ≥ 40 kg/m Criterium in ASA-PS Classification for Metabolic and Bariatric Surgery (MBS)".关于“体重指数(BMI)≥40kg/m²标准在代谢和减重手术(MBS)的美国麻醉医师协会身体状况分级(ASA-PS)中的作用”的评论
Obes Surg. 2025 Sep 8. doi: 10.1007/s11695-025-08261-1.
2
The association of cyberbullying with major depressive disorders among Bangladeshi female adolescents: findings from the Bangladesh adolescent health and wellbeing survey 2019-20.孟加拉国女性青少年中网络欺凌与重度抑郁症的关联:2019 - 20年孟加拉国青少年健康与幸福调查结果
BMC Psychiatry. 2025 Aug 12;25(1):783. doi: 10.1186/s12888-025-07234-z.
3

本文引用的文献

1
The effect of high prevalence of missing data on estimation of the coefficients of a logistic regression model when using multiple imputation.高缺失数据发生率对使用多重插补的逻辑回归模型系数估计的影响。
BMC Med Res Methodol. 2022 Jul 18;22(1):196. doi: 10.1186/s12874-022-01671-0.
2
An introduction to inverse probability of treatment weighting in observational research.观察性研究中治疗权重逆概率法简介。
Clin Kidney J. 2021 Aug 26;15(1):14-20. doi: 10.1093/ckj/sfab158. eCollection 2022 Jan.
3
Variable selection - A review and recommendations for the practicing statistician.
Cardiometabolic index is associated with impaired renal function: Nonlinear threshold effects and dose-response relationships in a national cohort (NHANES 1999-2018).
心脏代谢指数与肾功能受损相关:一项全国性队列研究(1999 - 2018年美国国家健康与营养检查调查)中的非线性阈值效应和剂量反应关系
Medicine (Baltimore). 2025 Aug 1;104(31):e43653. doi: 10.1097/MD.0000000000043653.
4
The role of oxidative balance score in Cardiovascular-Kidney-Metabolic syndrome progression and mortality: insights from NHANES 1999-2020.氧化平衡评分在心血管-肾脏-代谢综合征进展及死亡率中的作用:来自1999 - 2020年美国国家健康与营养检查调查(NHANES)的见解
Front Nutr. 2025 Jul 1;12:1597693. doi: 10.3389/fnut.2025.1597693. eCollection 2025.
5
Associations of parental labour migration and childhood maltreatment with psychosocial health among adolescents and young adults in China.中国青少年和青年中父母劳务移民及童年期虐待与心理社会健康的关联。
Eur J Psychotraumatol. 2025 Dec;16(1):2500139. doi: 10.1080/20008066.2025.2500139. Epub 2025 May 12.
6
The Timeliness of Drug Therapy in Colorectal and Prostate Cancer in Antigua and Barbuda: The Role of Disease Stage.安提瓜和巴布达结直肠癌与前列腺癌药物治疗的及时性:疾病分期的作用
Healthcare (Basel). 2025 Apr 16;13(8):915. doi: 10.3390/healthcare13080915.
7
Letter to the editor: Statistical methodology critique and alternative approaches in H5Nx avian influenza seroprevalence study among French cats.致编辑的信:法国猫群中H5Nx禽流感血清流行率研究的统计方法批判及替代方法
Euro Surveill. 2025 Apr;30(15). doi: 10.2807/1560-7917.ES.2025.30.15.2500237.
8
Machine learning identifies potential diagnostic biomarkers associated with ferroptosis in obstructive sleep apnea.机器学习识别出与阻塞性睡眠呼吸暂停中铁死亡相关的潜在诊断生物标志物。
Exp Ther Med. 2025 Mar 13;29(5):95. doi: 10.3892/etm.2025.12845. eCollection 2025 May.
9
Interpretable prediction of stroke prognosis: SHAP for SVM and nomogram for logistic regression.中风预后的可解释预测:支持向量机的SHAP值及逻辑回归的列线图
Front Neurol. 2025 Mar 4;16:1522868. doi: 10.3389/fneur.2025.1522868. eCollection 2025.
10
Comparisons of Correlates of Viral Suppression Among Adults Living With HIV/AIDS in Tanzania: Analysis With and Without Including Survey Designs.坦桑尼亚感染艾滋病毒/艾滋病成年人病毒抑制相关因素的比较:包含和不包含调查设计的分析
AIDS Res Treat. 2025 Feb 18;2025:3817430. doi: 10.1155/arat/3817430. eCollection 2025.
变量选择——给执业统计学家的一篇综述与建议
Biom J. 2018 May;60(3):431-449. doi: 10.1002/bimj.201700067. Epub 2018 Jan 2.
4
Quality of reporting of multivariable logistic regression models in Chinese clinical medical journals.中文临床医学期刊中多变量逻辑回归模型的报告质量
Medicine (Baltimore). 2017 May;96(21):e6972. doi: 10.1097/MD.0000000000006972.
5
Pseudo maximum likelihood approach for the analysis of multivariate left-censored longitudinal data.用于分析多变量左删失纵向数据的伪最大似然方法。
Stat Med. 2017 Jan 15;36(1):81-91. doi: 10.1002/sim.7080. Epub 2016 Aug 18.
6
Model building strategy for logistic regression: purposeful selection.Logistic 回归模型构建策略:有目的的选择。
Ann Transl Med. 2016 Mar;4(6):111. doi: 10.21037/atm.2016.02.15.
7
Mixed-Effects Logistic Regression Models for Indirectly Observed Discrete Outcome Variables.混合效应逻辑回归模型在间接观测离散结局变量中的应用。
Multivariate Behav Res. 2005;40(3):281-301. doi: 10.1207/s15327906mbr4003_1.
8
Caste- and ethnicity-based inequalities in HIV/AIDS-related knowledge gap: a case of Nepal.基于种姓和族裔的艾滋病毒/艾滋病相关知识差距不平等:以尼泊尔为例。
Health Soc Work. 2015 May;40(2):100-7. doi: 10.1093/hsw/hlv010.
9
Predictors of skilled attendance at delivery among antenatal clinic attendants in Ghana: a cross-sectional study of population data.加纳产前保健门诊就诊者分娩时熟练医护服务的预测因素:一项人口数据横断面研究
BMJ Open. 2015 May 19;5(5):e007810. doi: 10.1136/bmjopen-2015-007810.
10
Tobacco use and associated factors among adults in Ethiopia: further analysis of the 2011 Ethiopian Demographic and Health Survey.埃塞俄比亚成年人的烟草使用及相关因素:对2011年埃塞俄比亚人口与健康调查的进一步分析
BMC Public Health. 2015 May 13;15:487. doi: 10.1186/s12889-015-1820-4.