Rahman Jessica, Brankovic Aida, Tracy Mark, Khanna Sankalp
Commonwealth Scientific and Industrial Research Organisation (CSIRO) Australian e-Health Research Centre, Australia, Sydney, Australia.
Commonwealth Scientific and Industrial Research Organisation (CSIRO) Australian e-Health Research Centre, Australia, Brisbane, Australia.
Interact J Med Res. 2024 Aug 20;13:e46946. doi: 10.2196/46946.
Computational signal preprocessing is a prerequisite for developing data-driven predictive models for clinical decision support. Thus, identifying the best practices that adhere to clinical principles is critical to ensure transparency and reproducibility to drive clinical adoption. It further fosters reproducible, ethical, and reliable conduct of studies. This procedure is also crucial for setting up a software quality management system to ensure regulatory compliance in developing software as a medical device aimed at early preclinical detection of clinical deterioration.
This scoping review focuses on the neonatal intensive care unit setting and summarizes the state-of-the-art computational methods used for preprocessing neonatal clinical physiological signals; these signals are used for the development of machine learning models to predict the risk of adverse outcomes.
Five databases (PubMed, Web of Science, Scopus, IEEE, and ACM Digital Library) were searched using a combination of keywords and MeSH (Medical Subject Headings) terms. A total of 3585 papers from 2013 to January 2023 were identified based on the defined search terms and inclusion criteria. After removing duplicates, 2994 (83.51%) papers were screened by title and abstract, and 81 (0.03%) were selected for full-text review. Of these, 52 (64%) were eligible for inclusion in the detailed analysis.
Of the 52 articles reviewed, 24 (46%) studies focused on diagnostic models, while the remainder (n=28, 54%) focused on prognostic models. The analysis conducted in these studies involved various physiological signals, with electrocardiograms being the most prevalent. Different programming languages were used, with MATLAB and Python being notable. The monitoring and capturing of physiological data used diverse systems, impacting data quality and introducing study heterogeneity. Outcomes of interest included sepsis, apnea, bradycardia, mortality, necrotizing enterocolitis, and hypoxic-ischemic encephalopathy, with some studies analyzing combinations of adverse outcomes. We found a partial or complete lack of transparency in reporting the setting and the methods used for signal preprocessing. This includes reporting methods to handle missing data, segment size for considered analysis, and details regarding the modification of the state-of-the-art methods for physiological signal processing to align with the clinical principles for neonates. Only 7 (13%) of the 52 reviewed studies reported all the recommended preprocessing steps, which could have impacts on the downstream analysis.
The review found heterogeneity in the techniques used and inconsistent reporting of parameters and procedures used for preprocessing neonatal physiological signals, which is necessary to confirm adherence to clinical and software quality management system practices, usefulness, and choice of best practices. Enhancing transparency in reporting and standardizing procedures will boost study interpretation and reproducibility and expedite clinical adoption, instilling confidence in the research findings and streamlining the translation of research outcomes into clinical practice, ultimately contributing to the advancement of neonatal care and patient outcomes.
计算信号预处理是开发用于临床决策支持的数据驱动预测模型的前提条件。因此,确定符合临床原则的最佳实践对于确保透明度和可重复性以推动临床应用至关重要。这进一步促进了研究的可重复、合乎道德且可靠的开展。此过程对于建立软件质量管理系统也至关重要,以确保在开发作为医疗设备的软件时符合监管要求,该软件旨在早期临床前检测临床恶化情况。
本范围综述聚焦于新生儿重症监护病房环境,总结用于预处理新生儿临床生理信号的最新计算方法;这些信号用于开发机器学习模型以预测不良结局的风险。
使用关键词和医学主题词(MeSH)术语组合搜索五个数据库(PubMed、科学网、Scopus、IEEE和ACM数字图书馆)。根据定义的搜索词和纳入标准,共识别出2013年至2023年1月的3585篇论文。去除重复项后,通过标题和摘要筛选出2994篇(83.51%)论文,81篇(0.03%)被选作全文评审。其中,52篇(64%)符合纳入详细分析的条件。
在 reviewed 的52篇文章中,24篇(46%)研究聚焦于诊断模型,其余(n = 28,54%)聚焦于预后模型。这些研究中进行的分析涉及各种生理信号,心电图最为常见。使用了不同的编程语言,MATLAB和Python较为突出。生理数据的监测和采集使用了多种系统,影响了数据质量并引入了研究异质性。关注的结局包括败血症、呼吸暂停、心动过缓、死亡率、坏死性小肠结肠炎和缺氧缺血性脑病,一些研究分析了不良结局的组合。我们发现报告信号预处理的设置和所用方法时部分或完全缺乏透明度。这包括处理缺失数据的报告方法、用于分析的片段大小以及对生理信号处理的最新方法进行修改以符合新生儿临床原则的细节。在 reviewed 的52项研究中,只有7项(13%)报告了所有推荐的预处理步骤,这可能会对下游分析产生影响。
该综述发现,在用于预处理新生儿生理信号的技术方面存在异质性,并且在报告参数和程序时不一致,而这些对于确认符合临床和软件质量管理系统实践、实用性以及最佳实践的选择是必要的。提高报告的透明度并规范程序将增强研究的可解释性和可重复性,并加快临床应用,增强对研究结果的信心,简化研究成果转化为临床实践的过程,最终推动新生儿护理和患者结局的改善。