• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

稀疏数据中多项逻辑回归的置信区间

Confidence intervals for multinomial logistic regression in sparse data.

作者信息

Bull Shelley B, Lewinger Juan Pablo, Lee Sophia S F

机构信息

Samuel Lunenfeld Research Institute, Prosserman Centre for Health Research, Mount Sinai Hospital, Toronto, Ont., Canada M5G 1X5.

出版信息

Stat Med. 2007 Feb 20;26(4):903-18. doi: 10.1002/sim.2518.

DOI:10.1002/sim.2518
PMID:16489602
Abstract

Logistic regression is one of the most widely used regression models in practice, but alternatives to conventional maximum likelihood estimation methods may be more appropriate for small or sparse samples. Modification of the logistic regression score function to remove first-order bias is equivalent to penalizing the likelihood by the Jeffreys prior, and yields penalized maximum likelihood estimates (PLEs) that always exist, even in samples in which maximum likelihood estimates (MLEs) are infinite. PLEs are an attractive alternative in small-to-moderate-sized samples, and are preferred to exact conditional MLEs when there are continuous covariates. We present methods to construct confidence intervals (CI) in the penalized multinomial logistic regression model, and compare CI coverage and length for the PLE-based methods to that of conventional MLE-based methods in trinomial logistic regressions with both binary and continuous covariates. Based on simulation studies in sparse data sets, we recommend profile CIs over asymptotic Wald-type intervals for the PLEs in all cases. Furthermore, when finite sample bias and data separation are likely to occur, we prefer PLE profile CIs over MLE methods.

摘要

逻辑回归是实践中使用最广泛的回归模型之一,但对于小样本或稀疏样本,传统最大似然估计方法的替代方法可能更合适。对逻辑回归得分函数进行修改以消除一阶偏差,等同于用杰弗里斯先验对似然进行惩罚,并产生总是存在的惩罚最大似然估计(PLE),即使在最大似然估计(MLE)为无穷大的样本中也是如此。在中小规模样本中,PLE是一种有吸引力的替代方法,并且在存在连续协变量时,比精确条件MLE更受青睐。我们提出了在惩罚多项逻辑回归模型中构建置信区间(CI)的方法,并在具有二元和连续协变量的三项逻辑回归中,将基于PLE的方法的CI覆盖率和长度与基于传统MLE的方法进行比较。基于稀疏数据集的模拟研究,我们在所有情况下都推荐使用轮廓CI而不是渐近Wald型区间来估计PLE。此外,当可能出现有限样本偏差和数据分离时,我们更喜欢基于PLE的轮廓CI而不是MLE方法。

相似文献

1
Confidence intervals for multinomial logistic regression in sparse data.稀疏数据中多项逻辑回归的置信区间
Stat Med. 2007 Feb 20;26(4):903-18. doi: 10.1002/sim.2518.
2
Penalized maximum likelihood inference under the mixture cure model in sparse data.稀疏数据下混合治愈模型的惩罚极大似然推断。
Stat Med. 2023 Jun 15;42(13):2134-2161. doi: 10.1002/sim.9715. Epub 2023 Mar 25.
3
Confidence intervals after multiple imputation: combining profile likelihood information from logistic regressions.多次插补后的置信区间:结合逻辑回归的似然信息。
Stat Med. 2013 Dec 20;32(29):5062-76. doi: 10.1002/sim.5899. Epub 2013 Jul 19.
4
Performance of Firth-and logF-type penalized methods in risk prediction for small or sparse binary data.Firth 法和对数 F 型惩罚方法在小样本或稀疏二元数据风险预测中的性能
BMC Med Res Methodol. 2017 Feb 23;17(1):33. doi: 10.1186/s12874-017-0313-9.
5
On estimation for accelerated failure time models with small or rare event survival data.小样本或稀有事件生存数据的加速失效时间模型估计。
BMC Med Res Methodol. 2022 Jun 11;22(1):169. doi: 10.1186/s12874-022-01638-1.
6
Bias-reduced and separation-proof conditional logistic regression with small or sparse data sets.具有小数据集或稀疏数据集的偏倚降低和分离证明的条件逻辑回归。
Stat Med. 2010 Mar 30;29(7-8):770-7. doi: 10.1002/sim.3794.
7
Dealing with separation or near-to-separation in the model for multinomial response with application to childhood health seeking behavior data from a complex survey.处理多项响应模型中的分离或近似分离问题,并应用于来自复杂调查的儿童就医行为数据。
J Appl Stat. 2021 Sep 17;49(16):4254-4277. doi: 10.1080/02664763.2021.1977260. eCollection 2022.
8
A solution to the problem of separation in logistic regression.逻辑回归中分离问题的一种解决方案。
Stat Med. 2002 Aug 30;21(16):2409-19. doi: 10.1002/sim.1047.
9
A comparative investigation of methods for logistic regression with separated or nearly separated data.对用于分离或近似分离数据的逻辑回归方法的比较研究。
Stat Med. 2006 Dec 30;25(24):4216-26. doi: 10.1002/sim.2687.
10
Analysis of sparse data in logistic regression in medical research: A newer approach.医学研究中逻辑回归稀疏数据的分析:一种新方法。
J Postgrad Med. 2016 Jan-Mar;62(1):26-31. doi: 10.4103/0022-3859.173193.

引用本文的文献

1
-Penalized Multinomial Regression: Estimation, Inference, and Prediction, With an Application to Risk Factor Identification for Different Dementia Subtypes.-惩罚多项回归:估计、推断与预测,及其在不同痴呆亚型风险因素识别中的应用。
Stat Med. 2024 Dec 30;43(30):5711-5747. doi: 10.1002/sim.10263. Epub 2024 Nov 12.
2
Risk Factors Associated With Upper Aerodigestive Tract or Coliform Bacterial Overgrowth of the Small Intestine in Symptomatic Patients.与症状患者的上呼吸道或大肠菌小肠过度生长相关的风险因素。
J Clin Gastroenterol. 2020 Feb;54(2):150-157. doi: 10.1097/MCG.0000000000001150.
3
Identification of low frequency and rare variants for hypertension using sparse-data methods.
使用稀疏数据方法鉴定高血压的低频和罕见变异体。
BMC Proc. 2016 Oct 11;10(Suppl 7):389-395. doi: 10.1186/s12919-016-0061-6. eCollection 2016.
4
No rationale for 1 variable per 10 events criterion for binary logistic regression analysis.二元逻辑回归分析中每10个事件对应1个变量的标准没有理论依据。
BMC Med Res Methodol. 2016 Nov 24;16(1):163. doi: 10.1186/s12874-016-0267-3.
5
Exome Sequencing of Familial Bipolar Disorder.家族性双相情感障碍的外显子组测序
JAMA Psychiatry. 2016 Jun 1;73(6):590-7. doi: 10.1001/jamapsychiatry.2016.0251.
6
Bias-corrected estimates for logistic regression models for complex surveys with application to the United States' Nationwide Inpatient Sample.用于复杂调查的逻辑回归模型的偏差校正估计及其在美国全国住院患者样本中的应用。
Stat Methods Med Res. 2017 Oct;26(5):2257-2269. doi: 10.1177/0962280215596550. Epub 2015 Aug 11.
7
High-throughput sequencing of the synaptome in major depressive disorder.重度抑郁症中突触体的高通量测序
Mol Psychiatry. 2016 May;21(5):650-5. doi: 10.1038/mp.2015.98. Epub 2015 Jul 28.
8
Avoiding bias due to perfect prediction in multiple imputation of incomplete categorical variables.在不完全分类变量的多重填补中避免因完美预测导致的偏差。
Comput Stat Data Anal. 2010 Oct 1;54(10):2267-2275. doi: 10.1016/j.csda.2010.04.005.
9
Bias correction for the proportional odds logistic regression model with application to a study of surgical complications.比例优势逻辑回归模型的偏差校正及其在手术并发症研究中的应用
J R Stat Soc Ser C Appl Stat. 2013 Mar;62(2):233-250. doi: 10.1111/j.1467-9876.2012.01057.x.
10
Survival, growth and reproduction of non-native Nile tilapia II: fundamental niche projections and invasion potential in the northern Gulf of Mexico.非本地尼罗罗非鱼的生存、生长和繁殖 II:在墨西哥湾北部的基础生态位预测和入侵潜力。
PLoS One. 2012;7(7):e41580. doi: 10.1371/journal.pone.0041580. Epub 2012 Jul 27.