Pediatric readmission classification using stacked regularized logistic regression models.

Author information

Stiglic Gregor, Wang Fei, Davey Adam, Obradovic Zoran

Affiliations

University of Maribor, Maribor, Slovenia.

IBM T.J. Watson Research Center, Yorktown Heights, NY.

Publication information

AMIA Annu Symp Proc. 2014 Nov 14;2014:1072-81. eCollection 2014.

Abstract

BACKGROUND

Regulations and privacy concerns often hinder the exchange of healthcare data between hospitals or other healthcare providers. Sharing predictive models built on the original data and averaging their results offers an alternative route to more efficient prediction of outcomes for new cases. Although many techniques exist for combining the outputs of different predictive models, few studies try to interpret the results obtained from ensemble-learning methods.

METHODS

We propose a novel approach to classification based on models from different hospitals that achieves a high level of performance while keeping the obtained results comprehensible. Our approach is based on regularized sparse regression models arranged in two hierarchical levels and exploits the interpretability of the obtained regression coefficients to rank the contribution of each hospital to outcome prediction.
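
The two-level idea described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and not the authors' implementation: the data are synthetic, the hospital names are hypothetical, a plain L1 penalty fitted by proximal gradient descent stands in for whatever regularized solver the paper used, and the second level is trained on a separate synthetic set of records.

```python
import math
import random

def sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def soft_threshold(w, t):
    # Proximal operator of the L1 penalty: shrink toward zero by t.
    return math.copysign(max(abs(w) - t, 0.0), w)

def fit_l1_logreg(X, y, lam=0.01, lr=0.5, epochs=300):
    """L1-penalized logistic regression via proximal gradient descent."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        gw, gb = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(b + sum(wj * xj for wj, xj in zip(w, xi))) - yi
            gb += err
            for j in range(d):
                gw[j] += err * xi[j]
        b -= lr * gb / n  # the intercept is not penalized
        w = [soft_threshold(wj - lr * gj / n, lr * lam) for wj, gj in zip(w, gw)]
    return w, b

def predict_proba(model, x):
    w, b = model
    return sigmoid(b + sum(wj * xj for wj, xj in zip(w, x)))

def make_data(rng, n, informative=True):
    # Synthetic records (assumption): 4 features, only the first two carry signal.
    X, y = [], []
    for _ in range(n):
        x = [rng.gauss(0, 1) for _ in range(4)]
        p = sigmoid(2.0 * x[0] - 1.5 * x[1]) if informative else 0.5
        X.append(x)
        y.append(1 if rng.random() < p else 0)
    return X, y

rng = random.Random(0)
hospitals = {  # hypothetical data sources
    "hospital_A": make_data(rng, 200),
    "hospital_B": make_data(rng, 200),
    "hospital_C": make_data(rng, 200, informative=False),  # labels are pure noise
}

# Level 1: each hospital fits a sparse model on its own records; only the model is shared.
level1 = {name: fit_l1_logreg(X, y) for name, (X, y) in hospitals.items()}

# Level 2: a second sparse model uses the level-1 predictions as its features.
X_stack, y_stack = make_data(rng, 200)
names = sorted(level1)
Z = [[predict_proba(level1[h], x) for h in names] for x in X_stack]
level2 = fit_l1_logreg(Z, y_stack, lam=0.01, lr=1.0, epochs=500)

# Rank hospitals by the magnitude of their level-2 coefficient.
ranking = sorted(zip(names, level2[0]), key=lambda pair: abs(pair[1]), reverse=True)
for name, coef in ranking:
    print(f"{name}: {coef:+.3f}")
```

In this sketch, a level-2 coefficient that is shrunk to or near zero suggests that the corresponding hospital model adds little to the combined prediction, which is the kind of reading the ranking is meant to support.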

RESULTS

The proposed approach was used to predict 30-day all-cause readmissions for pediatric patients in 54 Californian hospitals. Using repeated holdout evaluation on more than 60,000 hospital discharge records, we compared the proposed approach to alternative approaches. The performance of the two-level classification model was measured using the Area Under the ROC Curve (AUC), with an additional evaluation that uncovered the importance and contribution of each individual data source (i.e., hospital) to the final result. The results for the best distributed model (AUC=0.787, 95% CI: 0.780-0.794) show no significant difference in AUC when compared to a single elastic net model built on all available data (AUC=0.789, 95% CI: 0.781-0.796).
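
As a rough illustration of the evaluation described above, the sketch below computes AUC from its pairwise definition and summarizes repeated holdout splits as a mean with a 95% interval. The abstract does not say how its intervals were derived, so the normal approximation here is an assumption, as are the synthetic records and the toy `fit` and `score` functions, which are hypothetical stand-ins for a real model.

```python
import math
import random
import statistics

def auc(y_true, scores):
    """AUC as the chance that a random positive outscores a random negative (ties count 0.5)."""
    pos = [s for s, y in zip(scores, y_true) if y == 1]
    neg = [s for s, y in zip(scores, y_true) if y == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))

def repeated_holdout(records, fit, score, repeats=50, test_fraction=0.3, seed=0):
    """Reshuffle, split, refit, and score on the held-out part; return one AUC per repeat."""
    rng = random.Random(seed)
    aucs = []
    for _ in range(repeats):
        shuffled = records[:]
        rng.shuffle(shuffled)
        cut = int(len(shuffled) * test_fraction)
        test, train = shuffled[:cut], shuffled[cut:]
        model = fit(train)
        labels = [label for _, label in test]
        preds = [score(model, x) for x, _ in test]
        aucs.append(auc(labels, preds))
    return aucs

def mean_with_ci(values, z=1.96):
    # Normal-approximation interval around the mean (an assumption, see lead-in).
    m = statistics.mean(values)
    half = z * statistics.stdev(values) / math.sqrt(len(values))
    return m, m - half, m + half

# Synthetic records (assumption): one feature x, label more likely 1 when x is large.
data_rng = random.Random(1)
records = []
for _ in range(300):
    x = data_rng.gauss(0, 1)
    p = 1.0 / (1.0 + math.exp(-2.0 * x))
    records.append((x, 1 if data_rng.random() < p else 0))

def fit(train):
    # Toy model: learn only whether larger x goes with the positive class.
    pos = [x for x, y in train if y == 1]
    neg = [x for x, y in train if y == 0]
    return 1.0 if statistics.mean(pos) > statistics.mean(neg) else -1.0

def score(model, x):
    return model * x

aucs = repeated_holdout(records, fit, score)
mean_auc, ci_low, ci_high = mean_with_ci(aucs)
print(f"AUC={mean_auc:.3f}, 95% CI: {ci_low:.3f}-{ci_high:.3f}")
```

Two models evaluated on the same splits can then be compared by checking whether their intervals overlap, which is a coarse reading of "no significant difference" and not a formal test.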

CONCLUSIONS

This paper presents a novel approach to improved classification with shared predictive models for environments where centralized collection of data is not possible. The significant improvements in classification performance and interpretability of results demonstrate the effectiveness of our approach.
