• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

识别传染病领域研究中最重要的数据:基于人工智能的思考。

Identifying the most important data for research in the field of infectious diseases: thinking on the basis of artificial intelligence.

机构信息

Carolina Garcia-Vidal, MD, PhD. Infectious Diseases Department, Hospital Clínic-IDIBAPS, Carrer de Villarroel 170, 08036, Barcelona, Spain.

出版信息

Rev Esp Quimioter. 2023 Dec;36(6):592-596. doi: 10.37201/req/032.2023. Epub 2023 Aug 12.

DOI:10.37201/req/032.2023
PMID:37575020
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10710675/
Abstract

OBJECTIVE

Clinical data on which artificial intelligence (AI) algorithms are trained and tested provide the basis to improve diagnosis or treatment of infectious diseases (ID). We aimed to identify important data for ID research to prioritise efforts being undertaken in AI programmes.

METHODS

We searched for 1,000 articlesfrom high-impact ID journals on PubMed, selecting 288 of the latest articles from 10 top journals. We classified them into structured or unstructured data. Variables were homogenised and grouped into the following categories: epidemiology, admission, demographics, comorbidities, clinical manifestations, laboratory, microbiology, other diagnoses, treatment, outcomes and other non-categorizable variables.

RESULTS

4,488 individual variables were collected, from the 288 articles. 3,670 (81.8%) variables were classified as structured data whilst 818 (18.2%) as unstructured data. From the structured data, 2,319 (63.2%) variables were classified as direct-retrievable from electronic health records-whilst 1,351 (36.8%) were indirect. The most frequent unstructured data were related to clinical manifestations and were repeated across articles. Data on demographics, comorbidities and microbiology constituted the most frequent group of variables.

CONCLUSIONS

This article identified that structured variables have comprised the most important data in research to generate knowledge in the field of ID. Extracting these data should be a priority when a medical centre intends to start an AI programme for ID. We also documented that the most important unstructured data in this field are those related to clinical manifestations. Such data could easily undergo some structuring with the use of semi-structured medical records focusing on a few symptoms.

摘要

目的

人工智能 (AI) 算法所训练和测试的临床数据为改善传染病 (ID) 的诊断或治疗提供了依据。我们旨在确定 ID 研究的重要数据,以优先考虑 AI 计划中正在进行的工作。

方法

我们在 PubMed 上搜索了 1000 篇高影响力的 ID 期刊文章,从 10 种顶级期刊中选择了最新的 288 篇文章。我们将它们分为结构化或非结构化数据。变量进行了同质化处理,并分为以下几类:流行病学、入院、人口统计学、合并症、临床表现、实验室、微生物学、其他诊断、治疗、结局和其他不可分类变量。

结果

从 288 篇文章中收集了 4488 个个体变量。288 篇文章中有 3670 个(81.8%)变量被归类为结构化数据,818 个(18.2%)为非结构化数据。从结构化数据中,2319 个(63.2%)变量被归类为可直接从电子健康记录中检索,而 1351 个(36.8%)为间接变量。最常见的非结构化数据与临床表现有关,并在文章中重复出现。人口统计学、合并症和微生物学数据构成了变量中最常见的组。

结论

本文确定了结构化变量构成了 ID 领域产生知识的研究中最重要的数据。当医疗中心打算为 ID 启动 AI 计划时,提取这些数据应是优先事项。我们还记录了该领域最重要的非结构化数据是与临床表现相关的数据。这些数据可以通过使用半结构化医疗记录聚焦于少数症状来轻松进行一些结构化处理。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f59/10710675/8a087d6b69d1/revespquimioter-36-592-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f59/10710675/8a087d6b69d1/revespquimioter-36-592-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f59/10710675/8a087d6b69d1/revespquimioter-36-592-g001.jpg

相似文献

1
Identifying the most important data for research in the field of infectious diseases: thinking on the basis of artificial intelligence.识别传染病领域研究中最重要的数据:基于人工智能的思考。
Rev Esp Quimioter. 2023 Dec;36(6):592-596. doi: 10.37201/req/032.2023. Epub 2023 Aug 12.
2
Developing Artificial Intelligence Models for Extracting Oncologic Outcomes from Japanese Electronic Health Records.开发人工智能模型,从日本电子健康记录中提取肿瘤学结局。
Adv Ther. 2023 Mar;40(3):934-950. doi: 10.1007/s12325-022-02397-7. Epub 2022 Dec 22.
3
Artificial intelligence-based mining of electronic health record data to accelerate the digital transformation of the national cardiovascular ecosystem: design protocol of the CardioMining study.基于人工智能的电子健康记录数据挖掘加速国家心血管生态系统的数字化转型:CardioMining 研究的设计方案。
BMJ Open. 2023 Apr 3;13(4):e068698. doi: 10.1136/bmjopen-2022-068698.
4
Artificial Intelligence Can Generate Fraudulent but Authentic-Looking Scientific Medical Articles: Pandora's Box Has Been Opened.人工智能可以生成虚假但看起来真实的科学医学文章:潘多拉的盒子已经被打开。
J Med Internet Res. 2023 May 31;25:e46924. doi: 10.2196/46924.
5
A hybrid solution for extracting structured medical information from unstructured data in medical records via a double-reading/entry system.一种通过双读/录入系统从病历中的非结构化数据提取结构化医学信息的混合解决方案。
BMC Med Inform Decis Mak. 2016 Aug 30;16(1):114. doi: 10.1186/s12911-016-0357-5.
6
Getting More Out of Large Databases and EHRs with Natural Language Processing and Artificial Intelligence: The Future Is Here.借助自然语言处理和人工智能从大型数据库和电子健康记录中获取更多价值:未来已来。
J Bone Joint Surg Am. 2022 Oct 19;104(Suppl 3):51-55. doi: 10.2106/JBJS.22.00567.
7
Automated algorithm for medical data structuring, and segmentation using artificial intelligence within secured environment for dataset creation.用于医学数据结构化的自动化算法,以及在安全环境中使用人工智能进行分割以创建数据集。
Eur J Radiol Open. 2024 Jun 27;13:100582. doi: 10.1016/j.ejro.2024.100582. eCollection 2024 Dec.
8
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
9
CardioNet: a manually curated database for artificial intelligence-based research on cardiovascular diseases.CardioNet:一个用于心血管疾病人工智能研究的人工整理数据库。
BMC Med Inform Decis Mak. 2021 Jan 28;21(1):29. doi: 10.1186/s12911-021-01392-2.
10
Using Artificial Intelligence With Natural Language Processing to Combine Electronic Health Record's Structured and Free Text Data to Identify Nonvalvular Atrial Fibrillation to Decrease Strokes and Death: Evaluation and Case-Control Study.利用人工智能与自然语言处理相结合,整合电子健康记录的结构化和自由文本数据,以识别非瓣膜性心房颤动,从而降低中风和死亡风险:评估和病例对照研究。
J Med Internet Res. 2021 Nov 9;23(11):e28946. doi: 10.2196/28946.

引用本文的文献

1
Strengths and weaknesses of computerized clinical decision support systems: insights from a digital control center (C3 COVID-19) for early and personalized treatment for COVID-19.计算机化临床决策支持系统的优势与不足:来自一个用于新冠病毒病早期个性化治疗的数字控制中心(C3 COVID-19)的见解
Rev Esp Quimioter. 2025 Feb;38(1):1-7. doi: 10.37201/req/088.2024. Epub 2024 Oct 29.

本文引用的文献

1
Artificial Intelligence: Promise, Pitfalls, and Perspective.人工智能:前景、陷阱与展望。
JAMA. 2020 Jun 23;323(24):2448-2449. doi: 10.1001/jama.2020.8737.
2
The potential for artificial intelligence in healthcare.人工智能在医疗保健领域的潜力。
Future Healthc J. 2019 Jun;6(2):94-98. doi: 10.7861/futurehosp.6-2-94.
3
Artificial intelligence to support clinical decision-making processes.支持临床决策过程的人工智能。
EBioMedicine. 2019 Aug;46:27-29. doi: 10.1016/j.ebiom.2019.07.019. Epub 2019 Jul 11.
4
Artificial intelligence-enabled healthcare delivery.人工智能赋能的医疗保健服务。
J R Soc Med. 2019 Jan;112(1):22-28. doi: 10.1177/0141076818815510. Epub 2018 Dec 3.
5
Looking to tomorrow's healthcare today: a participatory health perspective.展望明日医疗:参与式健康视角
Intern Med J. 2018 Jan;48(1):92-96. doi: 10.1111/imj.13661.