• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

慢性乙型肝炎真实世界队列研究的自动化数据收集工具:利用光学字符识别和自然语言处理技术提高效率。

Automated data collection tool for real-world cohort studies of chronic hepatitis B: Leveraging OCR and NLP technologies for improved efficiency.

作者信息

Zhou Xiaomei, Zeng Tao, Zhang Yibo, Liao Yingying, Smith Jaime, Zhang Lin, Wang Chao, Li Qinghai, Wu Dongbo, Chong Yutian, Li Xinhua

机构信息

Information Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, 510630, China.

Department of Infectious Diseases, The Third Affiliated Hospital of Sun Yat-sen University, Guangdong Key Laboratory of Liver Disease, Guangzhou, 510630, China.

出版信息

New Microbes New Infect. 2024 Aug 28;62:101469. doi: 10.1016/j.nmni.2024.101469. eCollection 2024 Dec.

DOI:10.1016/j.nmni.2024.101469
PMID:39282140
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11402059/
Abstract

BACKGROUND

Collecting and standardizing clinical research data is a very tedious task. This study is to develop an intelligent data collection tool, named CHB-EDC, for real-world cohort studies of chronic hepatitis B (CHB), which can assist in standardized and efficient data collection.

METHODS

CHB_EDC is capable of automatically processing various formats of data, including raw data in image format, using internationally recognized data standards, OCR, and NLP models. It can automatically populate the data into eCRFs designed in the REDCap system, supporting the integration of patient data from electronic medical record systems through commonly used web application interfaces. This tool enables intelligent extraction and aggregation of data, as well as secure and anonymous data sharing.

RESULTS

For non-electronic data collection, the average accuracy of manual collection was 98.65 %, with an average time of 63.64 min to collect information for one patient. The average accuracy CHB-EDC was 98.66 %, with an average time of 3.57 min to collect information for one patient. In the same data collection task, CHB-EDC achieved a comparable average accuracy to manual collection. However, in terms of time, CHB-EDC significantly outperformed manual collection (p < 0.05). Our research has significantly reduced the required collection time and lowered the cost of data collection while ensuring accuracy.

CONCLUSION

The tool has significantly improved the efficiency of data collection while ensuring accuracy, enabling standardized collection of real-world data.

摘要

背景

收集和规范临床研究数据是一项非常繁琐的任务。本研究旨在开发一种名为CHB-EDC的智能数据收集工具,用于慢性乙型肝炎(CHB)的真实世界队列研究,该工具可协助进行标准化且高效的数据收集。

方法

CHB_EDC能够使用国际认可的数据标准、光学字符识别(OCR)和自然语言处理(NLP)模型自动处理各种格式的数据,包括图像格式的原始数据。它可以自动将数据填充到REDCap系统中设计的电子病例报告表(eCRF)中,支持通过常用的网络应用程序接口集成来自电子病历系统的患者数据。该工具能够实现数据的智能提取和汇总,以及安全且匿名的数据共享。

结果

对于非电子数据收集,人工收集的平均准确率为98.65%,为一名患者收集信息的平均时间为63.64分钟。CHB-EDC的平均准确率为98.66%,为一名患者收集信息的平均时间为3.57分钟。在相同的数据收集任务中,CHB-EDC的平均准确率与人工收集相当。然而,在时间方面,CHB-EDC明显优于人工收集(p < 0.05)。我们的研究在确保准确性的同时,显著减少了所需的收集时间并降低了数据收集成本。

结论

该工具在确保准确性的同时显著提高了数据收集效率,能够实现真实世界数据的标准化收集。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbbf/11402059/49174fa46693/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbbf/11402059/0c040fb40e08/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbbf/11402059/bafa7ea5b0fb/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbbf/11402059/bf51c9608a9c/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbbf/11402059/49174fa46693/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbbf/11402059/0c040fb40e08/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbbf/11402059/bafa7ea5b0fb/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbbf/11402059/bf51c9608a9c/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbbf/11402059/49174fa46693/gr4.jpg

相似文献

1
Automated data collection tool for real-world cohort studies of chronic hepatitis B: Leveraging OCR and NLP technologies for improved efficiency.慢性乙型肝炎真实世界队列研究的自动化数据收集工具:利用光学字符识别和自然语言处理技术提高效率。
New Microbes New Infect. 2024 Aug 28;62:101469. doi: 10.1016/j.nmni.2024.101469. eCollection 2024 Dec.
2
Clinical Source Data Production and Quality Control in Real-world Studies: Proposal for Development of the eSource Record System.真实世界研究中的临床源数据生成与质量控制:电子源记录系统开发提案
JMIR Res Protoc. 2022 Dec 23;11(12):e42754. doi: 10.2196/42754.
3
Automated Electronic Health Record to Electronic Data Capture Transfer in Clinical Studies in the German Health Care System: Feasibility Study and Gap Analysis.德国医疗体系中临床研究中自动电子健康记录到电子数据捕获的转换:可行性研究和差距分析。
J Med Internet Res. 2023 Aug 4;25:e47958. doi: 10.2196/47958.
4
Automating Access to Real-World Evidence.实现真实世界证据获取的自动化。
JTO Clin Res Rep. 2022 May 17;3(6):100340. doi: 10.1016/j.jtocrr.2022.100340. eCollection 2022 Jun.
5
Medical Registry Data Collection Efficiency: A Crossover Study Comparing Web-Based Electronic Data Capture and a Standard Spreadsheet.医疗登记数据收集效率:一项比较基于网络的电子数据捕获和标准电子表格的交叉研究。
J Med Internet Res. 2016 Jun 8;18(6):e141. doi: 10.2196/jmir.5576.
6
Facilitating clinical research through automation: Combining optical character recognition with natural language processing.通过自动化促进临床研究:结合光学字符识别和自然语言处理。
Clin Trials. 2022 Oct;19(5):504-511. doi: 10.1177/17407745221093621. Epub 2022 May 24.
7
Application of optical character recognition with natural language processing for large-scale quality metric data extraction in colonoscopy reports.光学字符识别与自然语言处理在结肠镜报告中大规模质量度量数据提取的应用。
Gastrointest Endosc. 2021 Mar;93(3):750-757. doi: 10.1016/j.gie.2020.08.038. Epub 2020 Sep 3.
8
Electronic Source Data Transcription for Electronic Case Report Forms in China: Validation of the Electronic Source Record Tool in a Real-world Ophthalmology Study.中国电子病例报告表的电子源数据转录:电子源记录工具在真实世界眼科研究中的验证
JMIR Form Res. 2022 Dec 16;6(12):e43229. doi: 10.2196/43229.
9
REDCap on FHIR: Clinical Data Interoperability Services.基于快速医疗互操作性资源(FHIR)的研究电子数据采集(REDCap):临床数据互操作性服务
J Biomed Inform. 2021 Sep;121:103871. doi: 10.1016/j.jbi.2021.103871. Epub 2021 Jul 21.
10
Improving the Efficacy of the Data Entry Process for Clinical Research With a Natural Language Processing-Driven Medical Information Extraction System: Quantitative Field Research.利用自然语言处理驱动的医学信息提取系统提高临床研究数据录入过程的效率:定量实地研究
JMIR Med Inform. 2019 Jul 16;7(3):e13331. doi: 10.2196/13331.

本文引用的文献

1
Large language models encode clinical knowledge.大语言模型编码临床知识。
Nature. 2023 Aug;620(7972):172-180. doi: 10.1038/s41586-023-06291-2. Epub 2023 Jul 12.
2
[Guidelines for the prevention and treatment of chronic hepatitis B (version 2022)].《慢性乙型肝炎防治指南(2022年版)》
Zhonghua Gan Zang Bing Za Zhi. 2022 Dec 20;30(12):1309-1331. doi: 10.3760/cma.j.cn501113-20221204-00607.
3
Artificial Intelligence in Hepatology- Ready for the Primetime.肝病学中的人工智能——准备好迎接黄金时代。
J Clin Exp Hepatol. 2023 Jan-Feb;13(1):149-161. doi: 10.1016/j.jceh.2022.06.009. Epub 2022 Jun 29.
4
A large language model for electronic health records.用于电子健康记录的大型语言模型。
NPJ Digit Med. 2022 Dec 26;5(1):194. doi: 10.1038/s41746-022-00742-2.
5
Artificial intelligence in liver diseases: Improving diagnostics, prognostics and response prediction.人工智能在肝脏疾病中的应用:改善诊断、预后评估及反应预测
JHEP Rep. 2022 Feb 2;4(4):100443. doi: 10.1016/j.jhepr.2022.100443. eCollection 2022 Apr.
6
Cost-effectiveness of universal screening for chronic hepatitis B virus infection in China: an economic evaluation.中国慢性乙型肝炎病毒感染的普遍筛查的成本效益:经济评估。
Lancet Glob Health. 2022 Feb;10(2):e278-e287. doi: 10.1016/S2214-109X(21)00517-9.
7
Artificial intelligence in precision medicine in hepatology.人工智能在肝脏病精准医学中的应用。
J Gastroenterol Hepatol. 2021 Mar;36(3):569-580. doi: 10.1111/jgh.15415.
8
[The guidelines of prevention and treatment for chronic hepatitis B (2019 version)].《慢性乙型肝炎防治指南(2019年版)》
Zhonghua Gan Zang Bing Za Zhi. 2019 Dec 20;27(12):938-961. doi: 10.3760/cma.j.issn.1007-3418.2019.12.007.
9
Deep learning in clinical natural language processing: a methodical review.深度学习在临床自然语言处理中的应用:系统综述。
J Am Med Inform Assoc. 2020 Mar 1;27(3):457-470. doi: 10.1093/jamia/ocz200.
10
Extraction from Medical Records.从医疗记录中提取信息。
Stud Health Technol Inform. 2019;261:62-67.