
Using multivariate long short-term memory neural network to detect aberrant signals in health data for quality assurance.

Affiliation

Biomedical Informatics Center, School of Medicine and Health Sciences, George Washington University, Washington, D.C., USA.

Publication information

Int J Med Inform. 2021 Mar;147:104368. doi: 10.1016/j.ijmedinf.2020.104368. Epub 2020 Dec 16.

DOI:10.1016/j.ijmedinf.2020.104368
PMID:33401168
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9518650/
Abstract

BACKGROUND

The data quality of electronic health records (EHR) has been a topic of increasing interest to clinical and health services researchers. One indicator of possible errors in data is a large change in the frequency of observations in chronic illnesses. In this study, we built and demonstrated the utility of a stacked multivariate LSTM model to predict an acceptable range for the frequency of observations.
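As a rough illustration of what "predicting an acceptable range" could look like in practice, the sketch below builds a band around a forecast weekly count and flags observations that fall outside it. The abstract does not say how the range is derived, so the band width (a multiple `k` of the standard deviation of past forecast errors) and the function names are assumptions made for this example; in the study the forecast itself would come from the stacked multivariate LSTM, whereas here it is simply passed in as a number.

```python
from statistics import stdev

def acceptable_range(predicted, residuals, k=2.0):
    """Return a (low, high) band around a predicted weekly count.

    residuals: past forecast errors (observed - predicted).
    k: hypothetical width multiplier, not taken from the paper.
    """
    spread = k * stdev(residuals)
    # Counts cannot be negative, so clip the lower bound at zero.
    return max(0.0, predicted - spread), predicted + spread

def is_aberrant(observed, predicted, residuals, k=2.0):
    """Flag a week whose observed count falls outside the band."""
    low, high = acceptable_range(predicted, residuals, k)
    return observed < low or observed > high

# Illustrative numbers only, not data from the study.
past_errors = [-5, 3, 4, -2, 0, 1, -1]
normal_week = is_aberrant(104, 100, past_errors)   # False: inside the band
suspect_week = is_aberrant(60, 100, past_errors)   # True: far below the band
```

A wider `k` trades sensitivity for specificity, which is the balance the METHODS section evaluates.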

METHODS

We applied the LSTM approach to a large EHR dataset with over 400 million total encounters. We computed sensitivity and specificity for predicting if the frequency of an observation in a given week is an aberrant signal.
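The two metrics named above follow from a standard confusion-matrix count. The sketch below assumes each week carries a reference label (1 if it is truly aberrant) and a model flag (1 if the model called it aberrant); the function name and the sample labels are illustrative, not taken from the study.

```python
def sensitivity_specificity(y_true, y_flagged):
    """Compute (sensitivity, specificity) from paired binary labels."""
    pairs = list(zip(y_true, y_flagged))
    tp = sum(1 for t, p in pairs if t and p)          # aberrant, flagged
    fn = sum(1 for t, p in pairs if t and not p)      # aberrant, missed
    tn = sum(1 for t, p in pairs if not t and not p)  # normal, not flagged
    fp = sum(1 for t, p in pairs if not t and p)      # normal, flagged
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity

# Made-up weekly labels for illustration.
truth   = [1, 0, 0, 1, 1, 0, 0, 0]
flagged = [1, 0, 1, 1, 0, 0, 0, 0]
sens, spec = sensitivity_specificity(truth, flagged)
# Here 2 of 3 aberrant weeks are caught and 4 of 5 normal weeks are left alone.
```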

RESULTS

Compared with the simple frequency monitoring approach, our proposed multivariate LSTM approach increased the sensitivity of finding aberrant signals in 6 randomly selected diagnostic codes from 75% to 88% and the specificity from 68% to 91%. We also experimented with two different LSTM algorithms, namely direct multi-step and recursive multi-step. Both models detected the aberrant signals, but the recursive multi-step algorithm performed better.
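The difference between the two multi-step strategies can be shown without an LSTM. In the sketch below a one-variable least-squares line stands in for the network (a deliberate simplification, not the authors' model): the recursive strategy fits a single one-step model and feeds each prediction back in as the next input, while the direct strategy fits a separate model for each horizon. All names and the sample series are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = a * x + b."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    a = sxy / sxx
    return a, my - a * mx

def recursive_forecast(series, horizon):
    """One one-step model, applied repeatedly to its own output."""
    a, b = fit_line(series[:-1], series[1:])
    preds, last = [], series[-1]
    for _ in range(horizon):
        last = a * last + b
        preds.append(last)
    return preds

def direct_forecast(series, horizon):
    """A separate model per horizon h, each predicting y[t + h] from y[t]."""
    preds = []
    for h in range(1, horizon + 1):
        a, b = fit_line(series[:-h], series[h:])
        preds.append(a * series[-1] + b)
    return preds

# Illustrative weekly counts with a steady upward trend.
weekly_counts = [10, 12, 14, 16, 18, 20, 22, 24]
rec = recursive_forecast(weekly_counts, 3)
dirc = direct_forecast(weekly_counts, 3)
```

On this perfectly linear series both strategies give the same forecasts. On noisy data they diverge: recursive forecasts can accumulate error from step to step, while direct forecasts train each horizon on fewer usable pairs.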

CONCLUSIONS

Simply monitoring the frequency trend, as is the common practice in systems that do monitor data quality, cannot distinguish between fluctuations caused by seasonal disease changes, seasonal patient visits, or a change in data sources. Our study demonstrated the ability of stacked multivariate LSTM models to recognize true data quality issues rather than fluctuations with other causes, including seasonal changes and outbreaks.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8627/9518650/463a26885065/nihms-1657574-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8627/9518650/83876521b32e/nihms-1657574-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8627/9518650/8a9cea5685eb/nihms-1657574-f0002.jpg
