优化神经网络在医学数据集上的应用：以新生儿呼吸暂停预测为例的研究

Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.

机构信息

Department of Computer Science and Engineering, Manipal Institute of Technology, Manipal Academcy of Higher Education, Manipal, India.

出版信息

Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25.

DOI:10.1016/j.artmed.2019.07.008

PMID:31521253

Abstract

OBJECTIVE

The neonatal period of a child is considered the most crucial phase of its physical development and future health. As per the World Health Organization, India has the highest number of pre-term births [1], with over 3.5 million babies born prematurely, and up to 40% of them are babies with low birth weights, highly prone to a multitude of diseases such as Jaundice, Sepsis, Apnea, and other Metabolic disorders. Apnea is the primary concern for caretakers of neonates in intensive care units. The real-time medical data is known to be noisy and nonlinear and to address the resultant complexity in classification and prediction of diseases; there is a need for optimizing learning models to maximize predictive performance. Our study attempts to optimize neural network architectures to predict the occurrence of apneic episodes in neonates, after the first week of admission to Neonatal Intensive Care Unit (NICU). The primary contribution of this study is the formulation and description of a set of generic steps involved in selecting various model-specific, training and hyper-parametric optimization algorithms, as well as model architectures for optimal predictive performance on complex and noisy medical datasets.

METHODS

The data used for the study being inherently complex and noisy, Kernel Principal Component Analysis (PCA) is used to reduce dataset dimensionality for the analysis such as interpretations and visualization of the dataset. Hyper-parametric and parametric optimization, in different categories, are considered, including learning rate updater algorithms, regularization methods, activation functions, gradient descent algorithms and depth of the network, based on their performance on the validation set, to obtain a holistically optimized neural network, that best model the given complex medical dataset. Deep Neural Network Architectures such as Deep Multilayer Perceptron's, Stacked Auto-encoders and Deep Belief Networks are employed to model the dataset, and their performance is compared to the optimized neural network obtained from the parametric exploration. Further, the results are compared with Support Vector Machine (SVM), K Nearest Neighbor, Decision Tree (DT) and Random Forest (RF) algorithms.

RESULTS

The results indicate that the optimized eight layer Multilayer Perceptron (MLP) model, with Adam Decay and Stochastic Gradient Descent (AUC 0.82) can outperform the conventional machine learning models, and perform comparably to the Deep Auto-encoder model (AUC 0.83) in predicting the presence of apnea in neonates.

CONCLUSION

The study shows that an MLP model can undergo significant improvements in predictive performance, by the proposed step-wise optimization. The optimized MLP is proved to be as accurate as deep neural network models such as Deep Belief Networks and Deep Auto-encoders for noisy and nonlinear data sets, and outperform all conventional models like Support Vector Machine (SVM), Decision Tree (DT), K Nearest Neighbor and Random Forest (RF) algorithms. The generic nature of the proposed step-wise optimization provides a framework to optimize neural networks on such complex nonlinear datasets. The investigated models can help neonatologists as a diagnostic tool.

摘要

目的

儿童的新生儿期被认为是其身体发育和未来健康的最关键阶段。根据世界卫生组织的数据，印度早产儿的数量最多[1]，超过 350 万婴儿早产，其中多达 40%的婴儿体重偏低，极易患上黄疸、败血症、呼吸暂停和其他代谢紊乱等多种疾病。呼吸暂停是重症监护病房新生儿护理人员最关心的问题。众所周知，实时医疗数据存在噪声和非线性，为了解决疾病分类和预测的复杂性，需要优化学习模型以最大限度地提高预测性能。我们的研究旨在优化神经网络架构，以预测新生儿在进入新生儿重症监护病房（NICU）后的第一周内发生呼吸暂停的情况。本研究的主要贡献是制定和描述了一组通用步骤，用于选择各种特定于模型的训练和超参数优化算法以及模型架构，以在复杂和嘈杂的医疗数据集上获得最佳的预测性能。

方法

由于研究中使用的数据本质上是复杂且嘈杂的，因此使用核主成分分析（PCA）来降低数据集的维度，以便对数据集进行解释和可视化等分析。考虑了不同类别中的超参数和参数优化，包括学习率更新算法、正则化方法、激活函数、梯度下降算法和网络深度等，根据它们在验证集上的性能，以获得最佳的整体优化神经网络，从而最好地对给定的复杂医疗数据集进行建模。使用深度神经网络架构，如深度多层感知机、堆叠自动编码器和深度置信网络来对数据集进行建模，并将其性能与从参数探索中获得的优化神经网络进行比较。此外，还将结果与支持向量机（SVM）、K 近邻、决策树（DT）和随机森林（RF）算法进行了比较。

结果

结果表明，经过逐步优化的八层多层感知机（MLP）模型（AUC 0.82）可以优于传统的机器学习模型，并且在预测新生儿呼吸暂停方面的性能可与深度自动编码器模型（AUC 0.83）相媲美。

结论

研究表明，通过提出的逐步优化方法，MLP 模型的预测性能可以得到显著提高。优化后的 MLP 被证明与深度神经网络模型（如深度信念网络和深度自动编码器）一样，可以对嘈杂和非线性数据集进行精确预测，并且优于所有传统模型（如支持向量机（SVM）、决策树（DT）、K 近邻和随机森林（RF）算法）。所提出的逐步优化的通用性质为在这种复杂的非线性数据集上优化神经网络提供了一个框架。所研究的模型可以作为新生儿科医生的诊断工具。

相似文献

Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.优化神经网络在医学数据集上的应用：以新生儿呼吸暂停预测为例的研究

Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25.

Estimation of Caffeine Regimens: A Machine Learning Approach for Enhanced Clinical Decision Making at a Neonatal Intensive Care Unit (NICU).咖啡因给药方案的评估：一种用于加强新生儿重症监护病房（NICU）临床决策的机器学习方法。

Crit Rev Biomed Eng. 2018;46(2):93-115. doi: 10.1615/CritRevBiomedEng.2018025933.

Deep convolutional neural network and IoT technology for healthcare.用于医疗保健的深度卷积神经网络和物联网技术。

Digit Health. 2024 Jan 17;10:20552076231220123. doi: 10.1177/20552076231220123. eCollection 2024 Jan-Dec.

Machine learning and deep learning methods that use omics data for metastasis prediction.利用组学数据进行转移预测的机器学习和深度学习方法。

Comput Struct Biotechnol J. 2021 Sep 4;19:5008-5018. doi: 10.1016/j.csbj.2021.09.001. eCollection 2021.

Deep generative learning for automated EHR diagnosis of traditional Chinese medicine.基于深度学习的中医电子病历自动化诊断

Comput Methods Programs Biomed. 2019 Jun;174:17-23. doi: 10.1016/j.cmpb.2018.05.008. Epub 2018 May 4.

Seminal quality prediction using data mining methods.使用数据挖掘方法进行精液质量预测。

Technol Health Care. 2014;22(4):531-45. doi: 10.3233/THC-140816.

Predicting Inpatient Payments Prior to Lower Extremity Arthroplasty Using Deep Learning: Which Model Architecture Is Best?利用深度学习预测下肢关节置换术患者的住院费用：哪种模型架构最佳？

J Arthroplasty. 2019 Oct;34(10):2235-2241.e1. doi: 10.1016/j.arth.2019.05.048. Epub 2019 Jun 3.

Prediction and feature selection of low birth weight using machine learning algorithms.利用机器学习算法预测和选择低出生体重。

J Health Popul Nutr. 2024 Oct 12;43(1):157. doi: 10.1186/s41043-024-00647-8.

Automated Amharic News Categorization Using Deep Learning Models.基于深度学习模型的阿姆哈拉语新闻自动分类。

Comput Intell Neurosci. 2021 Jul 27;2021:3774607. doi: 10.1155/2021/3774607. eCollection 2021.

Prediction and Diagnosis of Breast Cancer Using Machine and Modern Deep Learning Models.使用机器和现代深度学习模型预测和诊断乳腺癌。

Asian Pac J Cancer Prev. 2024 Mar 1;25(3):1077-1085. doi: 10.31557/APJCP.2024.25.3.1077.

引用本文的文献

Association of monocyte to lymphocyte ratio with length of stay in intensive care unit in neonatal apnea modified by treatment.治疗对新生儿呼吸暂停患者单核细胞与淋巴细胞比值和重症监护病房住院时间之间关联的影响

Transl Pediatr. 2025 Jun 27;14(6):1073-1086. doi: 10.21037/tp-2025-21. Epub 2025 Jun 25.

A decision tree analysis to predict massive pulmonary hemorrhage in extremely low birth weight infants: a nationwide large cohort database.预测极低出生体重儿大量肺出血的决策树分析：一项全国性大型队列数据库研究

Front Pediatr. 2025 Mar 21;13:1529712. doi: 10.3389/fped.2025.1529712. eCollection 2025.

Enhancing machine learning performance in cardiac surgery ICU: Hyperparameter optimization with metaheuristic algorithm.提高心脏手术重症监护病房中的机器学习性能：使用元启发式算法进行超参数优化。

PLoS One. 2025 Feb 10;20(2):e0311250. doi: 10.1371/journal.pone.0311250. eCollection 2025.

Neonatal apnea and hypopnea prediction in infants with Robin sequence with neural additive models for time series.基于时间序列神经加法模型的罗宾序列婴儿新生儿呼吸暂停和呼吸浅慢预测

PLOS Digit Health. 2024 Dec 13;3(12):e0000678. doi: 10.1371/journal.pdig.0000678. eCollection 2024 Dec.

Quantifying the Enhancement of Sarcopenic Skeletal Muscle Preservation Through a Hybrid Exercise Program: Randomized Controlled Trial.通过混合运动计划量化对少肌症性骨骼肌的保护增强作用：随机对照试验

JMIR Aging. 2024 Nov 15;7:e58175. doi: 10.2196/58175.

AI Algorithms for Modeling the Risk, Progression, and Treatment of Sepsis, Including Early-Onset Sepsis-A Systematic Review.用于模拟脓毒症（包括早发性脓毒症）风险、进展和治疗的人工智能算法——系统评价

J Clin Med. 2024 Oct 7;13(19):5959. doi: 10.3390/jcm13195959.

Establishment and validation of apnea risk prediction models in preterm infants: a retrospective case control study.早产儿呼吸暂停风险预测模型的建立与验证：一项回顾性病例对照研究。

BMC Pediatr. 2024 Oct 11;24(1):654. doi: 10.1186/s12887-024-05125-y.

RNA-Seq analysis for breast cancer detection: a study on paired tissue samples using hybrid optimization and deep learning techniques.RNA-Seq 分析在乳腺癌检测中的应用：基于混合优化和深度学习技术的配对组织样本研究。

J Cancer Res Clin Oncol. 2024 Oct 10;150(10):455. doi: 10.1007/s00432-024-05968-z.

A nomogram for predicting neonatal apnea: a retrospective analysis based on the MIMIC database.一种预测新生儿窒息的列线图：基于MIMIC数据库的回顾性分析。

Front Pediatr. 2024 Sep 5;12:1357972. doi: 10.3389/fped.2024.1357972. eCollection 2024.

Exploring Computational Techniques in Preprocessing Neonatal Physiological Signals for Detecting Adverse Outcomes: Scoping Review.探索用于检测不良结局的新生儿生理信号预处理中的计算技术：范围综述。

Interact J Med Res. 2024 Aug 20;13:e46946. doi: 10.2196/46946.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

优化神经网络在医学数据集上的应用：以新生儿呼吸暂停预测为例的研究

Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.

机构信息

出版信息

OBJECTIVE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献