• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

EMR2vec:弥合患者数据与临床试验之间的差距。

EMR2vec: Bridging the gap between patient data and clinical trial.

作者信息

Dhayne Houssein, Kilany Rima, Haque Rafiqul, Taher Yehia

机构信息

Saint Joseph University, Mar Roukos, Beirut, Lebanon.

Intelligencia, 66 Avenue des Champs Elysees, Paris, France.

出版信息

Comput Ind Eng. 2021 Jun;156:107236. doi: 10.1016/j.cie.2021.107236. Epub 2021 Mar 15.

DOI:10.1016/j.cie.2021.107236
PMID:33746344
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7959675/
Abstract

The human suffering from diseases caused by life-threatening viruses such as SARS, Ebola, and COVID-19 motivated many of us to study and discover the best means to harness the potential of data integration to assist clinical researchers to curb these viruses. Integrating patients data with clinical trials data is enormously promising as it provides a comprehensive knowledge base that accelerates the clinical research response-ability to tackle emerging infectious disease outbreaks. This work introduces EMR2vec, a platform that customises advanced NLP, machine learning and semantic web techniques to link potential patients to suitable clinical trials. Linking these two different but complementary datasets allows clinicians and researchers to compare patients to clinical research opportunities or to automatically select patients for personalized clinical care. The platform derives a 'bag of medical terms' (BoMT) from eligibility criteria by normalizing extracted entities through SNOMED-CT ontology. With the usage of BoMT, an ontological reasoning method is proposed to represent EMR and clinical trials in a vector space model. The platform presents a matching process that reduces vector dimensionality using a neural network, then applies orthogonality projection to measure the similarity between vectors. Finally, the proposed EMR2vec platform is evaluated with an extendable prototype based on Big data tools.

摘要

感染严重急性呼吸综合征(SARS)、埃博拉病毒和新型冠状病毒肺炎(COVID-19)等危及生命的病毒所导致的人类苦难,促使我们许多人去研究并发现利用数据整合潜力的最佳方法,以协助临床研究人员控制这些病毒。将患者数据与临床试验数据整合极具前景,因为它提供了一个全面的知识库,可加快临床研究应对新出现的传染病爆发的能力。这项工作引入了EMR2vec平台,该平台定制了先进的自然语言处理、机器学习和语义网技术,以将潜在患者与合适的临床试验相匹配。将这两个不同但互补的数据集相链接,可使临床医生和研究人员将患者与临床研究机会进行比较,或自动为个性化临床护理选择患者。该平台通过SNOMED-CT本体对提取的实体进行标准化,从入选标准中得出“医学术语包”(BoMT)。利用BoMT,提出了一种本体推理方法,以在向量空间模型中表示电子病历(EMR)和临床试验。该平台展示了一个匹配过程,该过程使用神经网络降低向量维度,然后应用正交投影来测量向量之间的相似度。最后,基于大数据工具,使用一个可扩展的原型对所提出的EMR2vec平台进行了评估。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/cdf361ded76b/gr12_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/154e0202b4f0/gr1_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/cbff41da61ce/gr2_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/750a81a245eb/gr3_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/cc580a39cf4f/gr4_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/1e7ec8710d60/gr5_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/00790879018c/gr6_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/115f006d01dd/gr7_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/b7577e0d9c27/gr8_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/c3a7fb2b2c0a/gr9_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/d51948b0ab91/gr10_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/7b0984ed0407/gr11_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/cdf361ded76b/gr12_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/154e0202b4f0/gr1_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/cbff41da61ce/gr2_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/750a81a245eb/gr3_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/cc580a39cf4f/gr4_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/1e7ec8710d60/gr5_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/00790879018c/gr6_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/115f006d01dd/gr7_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/b7577e0d9c27/gr8_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/c3a7fb2b2c0a/gr9_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/d51948b0ab91/gr10_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/7b0984ed0407/gr11_lrg.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/956c/7959675/cdf361ded76b/gr12_lrg.jpg

相似文献

1
EMR2vec: Bridging the gap between patient data and clinical trial.EMR2vec:弥合患者数据与临床试验之间的差距。
Comput Ind Eng. 2021 Jun;156:107236. doi: 10.1016/j.cie.2021.107236. Epub 2021 Mar 15.
2
Integrating semantic dimension into openEHR archetypes for the management of cerebral palsy electronic medical records.将语义维度整合到openEHR原型中以管理脑瘫电子病历。
J Biomed Inform. 2016 Oct;63:307-324. doi: 10.1016/j.jbi.2016.08.018. Epub 2016 Aug 24.
3
Automatic SNOMED CT coding of Chinese clinical terms via attention-based semantic matching.通过基于注意力的语义匹配对中文临床术语进行自动SNOMED CT编码。
Int J Med Inform. 2022 Mar;159:104676. doi: 10.1016/j.ijmedinf.2021.104676. Epub 2021 Dec 28.
4
Cognitive IT-systems for big data analysis in medicine.用于医学大数据分析的认知信息技术系统。
Int J Risk Saf Med. 2015;27 Suppl 1:S108-9. doi: 10.3233/JRS-150711.
5
Automatic trial eligibility surveillance based on unstructured clinical data.基于非结构化临床数据的自动试验资格监测。
Int J Med Inform. 2019 Sep;129:13-19. doi: 10.1016/j.ijmedinf.2019.05.018. Epub 2019 May 23.
6
PREDOSE: a semantic web platform for drug abuse epidemiology using social media.前置:一个利用社交媒体进行药物滥用流行病学研究的语义网平台。
J Biomed Inform. 2013 Dec;46(6):985-97. doi: 10.1016/j.jbi.2013.07.007. Epub 2013 Jul 25.
7
Ontological and Non-Ontological Resources for Associating Medical Dictionary for Regulatory Activities Terms to SNOMED Clinical Terms With Semantic Properties.用于将监管活动医学词典术语与具有语义属性的SNOMED临床术语相关联的本体论和非本体论资源。
Front Pharmacol. 2019 Sep 10;10:975. doi: 10.3389/fphar.2019.00975. eCollection 2019.
8
The Translational Medicine Ontology and Knowledge Base: driving personalized medicine by bridging the gap between bench and bedside.转化医学本体与知识库:通过弥合实验室与临床之间的差距推动个性化医疗。
J Biomed Semantics. 2011 May 17;2 Suppl 2(Suppl 2):S1. doi: 10.1186/2041-1480-2-S2-S1.
9
An ontology network for Diabetes Mellitus in Mexico.墨西哥糖尿病本体论网络。
J Biomed Semantics. 2021 Oct 9;12(1):19. doi: 10.1186/s13326-021-00252-2.
10
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.

引用本文的文献

1
Enhancing Patient-Trial Matching With Large Language Models: A Scoping Review of Emerging Applications and Approaches.利用大语言模型加强患者与试验匹配:新兴应用与方法的范围综述
JCO Clin Cancer Inform. 2025 Jun;9:e2500071. doi: 10.1200/CCI-25-00071. Epub 2025 Jun 9.
2
Development and utility of a clinical research informatics application for participant recruitment and workflow management for a return of results pilot trial in familial hypercholesterolemia in the Million Veteran Program.百万退伍军人计划中家族性高胆固醇血症结果返还试点试验的临床研究信息学应用程序的开发及其在参与者招募和工作流程管理中的效用。
JAMIA Open. 2024 Mar 8;7(1):ooae020. doi: 10.1093/jamiaopen/ooae020. eCollection 2024 Apr.
3

本文引用的文献

1
Systematic review of the registered clinical trials for coronavirus disease 2019 (COVID-19).关于 2019 年冠状病毒病(COVID-19)的注册临床试验的系统评价。
J Transl Med. 2020 Jul 6;18(1):274. doi: 10.1186/s12967-020-02442-5.
2
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
3
Using electronic health records for clinical trials: Where do we stand and where can we go?
LeafAI: query generator for clinical cohort discovery rivaling a human programmer.
LeafAI:用于临床队列发现的查询生成器,可与人类程序员相媲美。
J Am Med Inform Assoc. 2023 Nov 17;30(12):1954-1964. doi: 10.1093/jamia/ocad149.
4
A review of research on eligibility criteria for clinical trials.临床试验入选标准研究述评。
Clin Exp Med. 2023 Oct;23(6):1867-1879. doi: 10.1007/s10238-022-00975-1. Epub 2023 Jan 5.
5
Optimizing deep neural networks to predict the effect of social distancing on COVID-19 spread.优化深度神经网络以预测社交距离对新冠病毒传播的影响。
Comput Ind Eng. 2022 Apr;166:107970. doi: 10.1016/j.cie.2022.107970. Epub 2022 Jan 29.
6
Clinical Research Informatics.临床研究信息学。
Yearb Med Inform. 2022 Aug;31(1):161-164. doi: 10.1055/s-0042-1742530. Epub 2022 Dec 4.
利用电子健康记录进行临床试验:我们目前的状况如何,又能走向何方?
CMAJ. 2019 Feb 4;191(5):E128-E133. doi: 10.1503/cmaj.180841.
4
Re-Evaluating Eligibility Criteria for Oncology Clinical Trials: Analysis of Investigational New Drug Applications in 2015.重新评估肿瘤学临床试验的资格标准:2015年研究性新药申请分析
J Clin Oncol. 2017 Nov 20;35(33):3745-3752. doi: 10.1200/JCO.2017.73.4186. Epub 2017 Oct 2.
5
EliIE: An open-source information extraction system for clinical trial eligibility criteria.EliIE:一个用于临床试验资格标准的开源信息提取系统。
J Am Med Inform Assoc. 2017 Nov 1;24(6):1062-1071. doi: 10.1093/jamia/ocx019.
6
MIMIC-III, a freely accessible critical care database.MIMIC-III,一个免费获取的重症监护数据库。
Sci Data. 2016 May 24;3:160035. doi: 10.1038/sdata.2016.35.
7
Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records.深度患者:一种从电子健康记录中预测患者未来的无监督表示。
Sci Rep. 2016 May 17;6:26094. doi: 10.1038/srep26094.
8
The value of structured data elements from electronic health records for identifying subjects for primary care clinical trials.电子健康记录中结构化数据元素在识别初级保健临床试验受试者方面的价值。
BMC Med Inform Decis Mak. 2016 Jan 11;16:1. doi: 10.1186/s12911-016-0239-x.
9
Case-based reasoning using electronic health records efficiently identifies eligible patients for clinical trials.使用电子健康记录的基于案例的推理能够有效地识别出符合临床试验条件的患者。
J Am Med Inform Assoc. 2015 Apr;22(e1):e141-50. doi: 10.1093/jamia/ocu050. Epub 2015 Mar 13.
10
Using electronic health records for clinical research: the case of the EHR4CR project.利用电子健康记录进行临床研究:以EHR4CR项目为例。
J Biomed Inform. 2015 Feb;53:162-73. doi: 10.1016/j.jbi.2014.10.006. Epub 2014 Oct 18.