• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

电子病历的自动化数据提取:数据挖掘构建胃肠病学临床试验入组研究数据库的有效性。

Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials.

机构信息

Department of Medical Sciences, Gastroenterology and Hepatology, Uppsala University, Uppsala.

IQVIA Sweden AB, Solna, Stockholm, Sweden.

出版信息

Ups J Med Sci. 2022 Jan 27;127. doi: 10.48101/ujms.v127.8260. eCollection 2022.

DOI:10.48101/ujms.v127.8260
PMID:35173908
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8809051/
Abstract

BACKGROUND

Electronic medical records (EMRs) are adopted for storing patient-related healthcare information. Using data mining techniques, it is possible to make use of and derive benefit from this massive amount of data effectively. We aimed to evaluate validity of data extracted by the Customized eXtraction Program (CXP).

METHODS

The CXP extracts and structures data in rapid standardised processes. The CXP was programmed to extract TNFα-native active ulcerative colitis (UC) patients from EMRs using defined International Classification of Disease-10 (ICD-10) codes. Extracted data were read in parallel with manual assessment of the EMR to compare with CXP-extracted data.

RESULTS

From the complete EMR set, 2,802 patients with code K51 (UC) were extracted. Then, CXP extracted 332 patients according to inclusion and exclusion criteria. Of these, 97.5% were correctly identified, resulting in a final set of 320 cases eligible for the study. When comparing CXP-extracted data against manually assessed EMRs, the recovery rate was 95.6-101.1% over the years with 96.1% weighted average sensitivity.

CONCLUSION

Utilisation of the CXP software can be considered as an effective way to extract relevant EMR data without significant errors. Hence, by extracting from EMRs, CXP accurately identifies patients and has the capacity to facilitate research studies and clinical trials by finding patients with the requested code as well as funnel down itemised individuals according to specified inclusion and exclusion criteria. Beyond this, medical procedures and laboratory data can rapidly be retrieved from the EMRs to create tailored databases of extracted material for immediate use in clinical trials.

摘要

背景

电子病历(EMR)用于存储与患者相关的医疗保健信息。利用数据挖掘技术,可以有效地利用和从中受益于这些大量的数据。我们旨在评估自定义提取程序(CXP)提取的数据的有效性。

方法

CXP 以快速标准化的流程提取和构建数据。CXP 被编程为使用定义的国际疾病分类第 10 版(ICD-10)代码从 EMR 中提取 TNFα-原生活动性溃疡性结肠炎(UC)患者的数据。提取的数据与 EMR 的手动评估并行读取,以与 CXP 提取的数据进行比较。

结果

从完整的 EMR 集中,提取了 2802 名带有代码 K51(UC)的患者。然后,CXP 根据纳入和排除标准提取了 332 名患者。其中,97.5%的患者被正确识别,最终确定了 320 例符合研究条件的病例。将 CXP 提取的数据与手动评估的 EMR 进行比较,每年的回收率为 95.6-101.1%,加权平均灵敏度为 96.1%。

结论

使用 CXP 软件可以被认为是一种从 EMR 中提取相关数据而不会产生重大错误的有效方法。因此,通过从 EMR 中提取,CXP 可以准确地识别患者,并能够通过找到具有所需代码的患者以及根据指定的纳入和排除标准对项目进行分类,为研究和临床试验提供便利。除此之外,还可以从 EMR 中快速检索医疗程序和实验室数据,以创建定制的提取材料数据库,以便立即用于临床试验。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac5/8809051/f5169e3fbdc3/UJMS-127-8260-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac5/8809051/79a00e0d6da6/UJMS-127-8260-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac5/8809051/a19281b47b92/UJMS-127-8260-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac5/8809051/84c4dcce006d/UJMS-127-8260-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac5/8809051/f5169e3fbdc3/UJMS-127-8260-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac5/8809051/79a00e0d6da6/UJMS-127-8260-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac5/8809051/a19281b47b92/UJMS-127-8260-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac5/8809051/84c4dcce006d/UJMS-127-8260-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aac5/8809051/f5169e3fbdc3/UJMS-127-8260-g004.jpg

相似文献

1
Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials.电子病历的自动化数据提取:数据挖掘构建胃肠病学临床试验入组研究数据库的有效性。
Ups J Med Sci. 2022 Jan 27;127. doi: 10.48101/ujms.v127.8260. eCollection 2022.
2
Automated data extraction--a feasible way to construct patient registers of primary care utilization.自动化数据提取——构建初级保健利用患者登记册的可行方法。
Ups J Med Sci. 2012 Mar;117(1):52-6. doi: 10.3109/03009734.2011.653015.
3
Accuracy of claim data in the identification and classification of adults with congenital heart diseases in electronic medical records.电子病历中索赔数据对成人先天性心脏病的识别和分类的准确性。
Arch Cardiovasc Dis. 2019 Jan;112(1):31-43. doi: 10.1016/j.acvd.2018.07.002. Epub 2019 Jan 3.
4
Text-mining in electronic healthcare records can be used as efficient tool for screening and data collection in cardiovascular trials: a multicenter validation study.电子医疗记录中的文本挖掘可以作为心血管试验中筛选和数据收集的有效工具:一项多中心验证研究。
J Clin Epidemiol. 2021 Apr;132:97-105. doi: 10.1016/j.jclinepi.2020.11.014. Epub 2020 Nov 25.
5
Extracting information from the text of electronic medical records to improve case detection: a systematic review.从电子病历文本中提取信息以改善病例检测:一项系统综述
J Am Med Inform Assoc. 2016 Sep;23(5):1007-15. doi: 10.1093/jamia/ocv180. Epub 2016 Feb 5.
6
Challenges of Using ICD-9-CM and ICD-10-CM Codes for Soft-Tissue Sarcoma in Databases for Health Services Research.在卫生服务研究数据库中使用ICD - 9 - CM和ICD - 10 - CM编码对软组织肉瘤进行编码的挑战。
Perspect Health Inf Manag. 2019 Apr 1;16(Spring):1a. eCollection 2019 Spring.
7
[A customized method for information extraction from unstructured text data in the electronic medical records].[一种从电子病历非结构化文本数据中提取信息的定制方法]
Beijing Da Xue Xue Bao Yi Xue Ban. 2018 Apr 18;50(2):256-263.
8
Using text mining to extract depressive symptoms and to validate the diagnosis of major depressive disorder from electronic health records.利用文本挖掘技术从电子健康记录中提取抑郁症状并验证重性抑郁障碍的诊断。
J Affect Disord. 2020 Jan 1;260:617-623. doi: 10.1016/j.jad.2019.09.044. Epub 2019 Sep 11.
9
An eClinical trial system for cancer that integrates with clinical pathways and electronic medical records.癌症电子临床试验系统,与临床路径和电子病历集成。
Clin Trials. 2012 Aug;9(4):408-17. doi: 10.1177/1740774512445912. Epub 2012 May 17.
10
Text Mining in Electronic Medical Records Enables Quick and Efficient Identification of Pregnancy Cases Occurring After Breast Cancer.电子病历中的文本挖掘能够快速有效地识别乳腺癌后发生的妊娠病例。
JCO Clin Cancer Inform. 2019 Oct;3:1-12. doi: 10.1200/CCI.19.00031.

引用本文的文献

1
Urban-Rural Disparities in Community Participation after Spinal Cord Injury in Ontario.安大略省脊髓损伤后社区参与的城乡差异
Healthcare (Basel). 2024 Oct 20;12(20):2089. doi: 10.3390/healthcare12202089.
2
Automated algorithm for medical data structuring, and segmentation using artificial intelligence within secured environment for dataset creation.用于医学数据结构化的自动化算法,以及在安全环境中使用人工智能进行分割以创建数据集。
Eur J Radiol Open. 2024 Jun 27;13:100582. doi: 10.1016/j.ejro.2024.100582. eCollection 2024 Dec.
3
Behind the scenes: Key lessons learned from the RELIEVE-AKI clinical trial.

本文引用的文献

1
Automated data mining of the electronic health record for investigation of healthcare-associated outbreaks.电子健康记录的自动化数据挖掘在调查医疗保健相关暴发中的应用。
Infect Control Hosp Epidemiol. 2019 Mar;40(3):314-319. doi: 10.1017/ice.2018.343. Epub 2019 Feb 18.
2
Data Mining and Endocrine Diseases: A New Way to Classify?数据挖掘与内分泌疾病:一种新的分类方法?
Arch Med Res. 2018 Apr;49(3):213-215. doi: 10.1016/j.arcmed.2018.08.005. Epub 2018 Aug 14.
3
Data Processing and Text Mining Technologies on Electronic Medical Records: A Review.
幕后故事:RELIEVE-AKI 临床试验的重要经验教训。
J Crit Care. 2024 Oct;83:154845. doi: 10.1016/j.jcrc.2024.154845. Epub 2024 Jun 15.
4
A pragmatic methodology to extract anesthetic and physiological data from the electronic health record.一种从电子健康记录中提取麻醉和生理数据的实用方法。
Paediatr Anaesth. 2024 Apr;34(4):318-323. doi: 10.1111/pan.14817. Epub 2023 Dec 6.
5
Ulcerative colitis progression: a retrospective analysis of disease burden using electronic medical records.溃疡性结肠炎的进展:基于电子病历的疾病负担回顾性分析。
Ups J Med Sci. 2022 Oct 18;127. doi: 10.48101/ujms.v127.8833. eCollection 2022.
电子病历的数据处理和文本挖掘技术:综述。
J Healthc Eng. 2018 Apr 8;2018:4302425. doi: 10.1155/2018/4302425. eCollection 2018.
4
A systematic review of data mining and machine learning for air pollution epidemiology.空气污染流行病学中数据挖掘与机器学习的系统综述。
BMC Public Health. 2017 Nov 28;17(1):907. doi: 10.1186/s12889-017-4914-3.
5
Using Data Mining Strategies in Clinical Decision Making: A Literature Review.在临床决策中运用数据挖掘策略:文献综述
Comput Inform Nurs. 2016 Oct;34(10):448-454. doi: 10.1097/CIN.0000000000000282.
6
In this issue - Benefits and disbenefits of computerised medical records (CMR): clinical, patient-centred, implementation and sociotechnical factors.本期——电子病历(CMR)的益处与弊端:临床、以患者为中心、实施及社会技术因素
J Innov Health Inform. 2016 Jun 14;23(1):864. doi: 10.14236/jhi.v23i1.864.
7
Adoption of electronic health records in UK hospitals: lessons from the USA.英国医院采用电子健康记录:来自美国的经验教训。
Lancet. 2014 Jul 5;384(9937):8-9. doi: 10.1016/S0140-6736(14)61099-0.
8
Automated data extraction--a feasible way to construct patient registers of primary care utilization.自动化数据提取——构建初级保健利用患者登记册的可行方法。
Ups J Med Sci. 2012 Mar;117(1):52-6. doi: 10.3109/03009734.2011.653015.
9
Data-mining technologies for diabetes: a systematic review.用于糖尿病的数据挖掘技术:一项系统综述。
J Diabetes Sci Technol. 2011 Nov 1;5(6):1549-56. doi: 10.1177/193229681100500631.
10
Use of the noninvasive components of the Mayo score to assess clinical response in ulcerative colitis.使用梅奥评分的非侵入性成分评估溃疡性结肠炎的临床反应。
Inflamm Bowel Dis. 2008 Dec;14(12):1660-6. doi: 10.1002/ibd.20520.