• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多元逻辑回归中分离与多重共线性问题的一种解决方案。

A Solution to Separation and Multicollinearity in Multiple Logistic Regression.

作者信息

Shen Jianzhao, Gao Sujuan

机构信息

Division of Biostatistics, Department of Medicine, Indiana University School of Medicine, 1050 Wishard Blvd. RG4101, Indianapolis, IN 46202-2872, USA.

出版信息

J Data Sci. 2008 Oct 1;6(4):515-531.

PMID:20376286
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2849171/
Abstract

In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27-38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth's penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study.

摘要

在痴呆筛查测试中,可使用多元逻辑回归来选择项目以缩短现有的筛查测试。然而,由于大量高度相关项目导致的分离和多重共线性问题,此类逻辑回归模型的最大似然估计常常会出现严重偏差甚至不存在。弗思(1993年,《生物统计学》,80(1),27 - 38)提出了一种广义线性模型的惩罚似然估计器,结果表明它能减少偏差和不存在问题。岭回归已用于逻辑回归,以在多重共线性情况下稳定估计。然而,两者都无法解决对方的问题。在本文中,我们提出一种将弗思的惩罚似然方程与一个岭参数相结合的双重惩罚最大似然估计器。我们进行了一项模拟研究,评估双重惩罚似然估计器在中小样本量情况下的实证性能。我们使用来自一项基于社区的痴呆研究的当前筛查数据展示了所提出的方法。

相似文献

1
A Solution to Separation and Multicollinearity in Multiple Logistic Regression.多元逻辑回归中分离与多重共线性问题的一种解决方案。
J Data Sci. 2008 Oct 1;6(4):515-531.
2
On estimation for accelerated failure time models with small or rare event survival data.小样本或稀有事件生存数据的加速失效时间模型估计。
BMC Med Res Methodol. 2022 Jun 11;22(1):169. doi: 10.1186/s12874-022-01638-1.
3
Solutions to problems of nonexistence of parameter estimates and sparse data bias in Poisson regression.泊松回归中参数估计不存在问题和稀疏数据偏差的解决方案。
Stat Methods Med Res. 2022 Feb;31(2):253-266. doi: 10.1177/09622802211065405. Epub 2021 Dec 21.
4
To tune or not to tune, a case study of ridge logistic regression in small or sparse datasets.调参还是不调参,小数据集或稀疏数据集的岭 logistic 回归案例研究。
BMC Med Res Methodol. 2021 Sep 30;21(1):199. doi: 10.1186/s12874-021-01374-y.
5
Reparametrized Firth's Logistic Regressions for Dose-Finding Study With the Biased-Coin Design.重参数化 Firth 的逻辑回归在偏币设计剂量发现研究中的应用。
Pharm Stat. 2024 Nov-Dec;23(6):1117-1127. doi: 10.1002/pst.2423. Epub 2024 Jul 16.
6
Firth's logistic regression with rare events: accurate effect estimates and predictions?针对罕见事件的费思逻辑回归:准确的效应估计与预测?
Stat Med. 2017 Jun 30;36(14):2302-2317. doi: 10.1002/sim.7273. Epub 2017 Mar 12.
7
An investigation of penalization and data augmentation to improve convergence of generalized estimating equations for clustered binary outcomes.探讨惩罚和数据增强对改善聚类二项结局广义估计方程收敛性的作用。
BMC Med Res Methodol. 2022 Jun 9;22(1):168. doi: 10.1186/s12874-022-01641-6.
8
New ridge parameter estimators for the quasi-Poisson ridge regression model.拟泊松岭回归模型的新岭参数估计器。
Sci Rep. 2024 Apr 11;14(1):8489. doi: 10.1038/s41598-023-50085-5.
9
Tutorial on Firth's Logistic Regression Models for Biomarkers in Preclinical Space.临床前空间中生物标志物的费希尔逻辑回归模型教程。
Pharm Stat. 2025 Jan-Feb;24(1):e2422. doi: 10.1002/pst.2422. Epub 2024 Aug 6.
10
Bring More Data!-A Good Advice? Removing Separation in Logistic Regression by Increasing Sample Size.带来更多数据!—一个好建议?通过增加样本量消除逻辑回归中的分离。
Int J Environ Res Public Health. 2019 Nov 22;16(23):4658. doi: 10.3390/ijerph16234658.

引用本文的文献

1
Association between bystander intervention and emergency medical services and the return of spontaneous circulation in out-of-hospital cardiac arrests occurring at a train station in the Tokyo metropolitan area: A retrospective cohort study.东京都区内某火车站发生的院外心脏骤停中旁观者干预与紧急医疗服务及自主循环恢复之间的关联:一项回顾性队列研究。
Resusc Plus. 2023 Aug 9;15:100438. doi: 10.1016/j.resplu.2023.100438. eCollection 2023 Sep.
2
Spatial analysis of the impact of urban geometry and socio-demographic characteristics on COVID-19, a study in Hong Kong.城市形态和社会人口特征对COVID-19影响的空间分析:香港的一项研究
Sci Total Environ. 2021 Apr 10;764:144455. doi: 10.1016/j.scitotenv.2020.144455. Epub 2020 Dec 16.
3
A Maximum Entropy Procedure to Solve Likelihood Equations.一种求解似然方程的最大熵方法。
Entropy (Basel). 2019 Jun 15;21(6):596. doi: 10.3390/e21060596.
4
Innovative approach to identify multigenomic and environmental interactions associated with birth defects in family-based hybrid designs.创新方法识别与基于家系的杂交设计中出生缺陷相关的多基因组和环境相互作用。
Genet Epidemiol. 2021 Mar;45(2):171-189. doi: 10.1002/gepi.22363. Epub 2020 Sep 30.
5
Machine learning workflows to estimate class probabilities for precision cancer diagnostics on DNA methylation microarray data.机器学习工作流程,用于估计 DNA 甲基化微阵列数据精准癌症诊断的类别概率。
Nat Protoc. 2020 Feb;15(2):479-512. doi: 10.1038/s41596-019-0251-6. Epub 2020 Jan 13.
6
White Matter Connectome Correlates of Auditory Over-Responsivity: Edge Density Imaging and Machine-Learning Classifiers.听觉过度反应的白质连接组相关性:边缘密度成像与机器学习分类器
Front Integr Neurosci. 2019 Mar 29;13:10. doi: 10.3389/fnint.2019.00010. eCollection 2019.
7
Do prognostic variables predict a set of outcomes for patients with chronic low back pain: a long-term follow-up secondary analysis of a randomized control trial.预后变量能否预测慢性下腰痛患者的一系列结局:一项随机对照试验的长期随访二次分析
J Man Manip Ther. 2019 Sep;27(4):197-207. doi: 10.1080/10669817.2019.1597435. Epub 2019 Apr 4.
8
Validity and reliability of the (adjusted) Impact on Participation and Autonomy questionnaire for social-support populations.(调整后)社会支持人群参与和自主影响问卷的有效性和可靠性。
Health Qual Life Outcomes. 2019 Feb 26;17(1):41. doi: 10.1186/s12955-019-1106-0.
9
Predictors of chronic prescription opioid use after orthopedic surgery: derivation of a clinical prediction rule.骨科手术后慢性处方阿片类药物使用的预测因素:临床预测规则的推导
Perioper Med (Lond). 2018 Nov 22;7:25. doi: 10.1186/s13741-018-0105-8. eCollection 2018.
10
Can experienced physiotherapists identify which patients are likely to succeed with physical therapy treatment?经验丰富的物理治疗师能否识别出哪些患者可能从物理治疗中取得成功?
Arch Physiother. 2015 Jul 8;5:3. doi: 10.1186/s40945-015-0003-z. eCollection 2015.

本文引用的文献

1
An invariant form for the prior probability in estimation problems.估计问题中先验概率的一种不变形式。
Proc R Soc Lond A Math Phys Sci. 1946;186(1007):453-61. doi: 10.1098/rspa.1946.0056.
2
A solution to the problem of separation in logistic regression.逻辑回归中分离问题的一种解决方案。
Stat Med. 2002 Aug 30;21(16):2409-19. doi: 10.1002/sim.1047.
3
Bias correction in maximum likelihood logistic regression.最大似然逻辑回归中的偏差校正
Stat Med. 1983 Jan-Mar;2(1):71-8. doi: 10.1002/sim.4780020108.