A computable phenotype for patients with SARS-CoV2 testing that occurred outside the hospital.

Authors

Wang Lijing, Zipursky Amy, Geva Alon, McMurry Andrew J, Mandl Kenneth D, Miller Timothy A

Publication information

medRxiv. 2023 Jan 19:2023.01.19.23284738. doi: 10.1101/2023.01.19.23284738.

DOI:10.1101/2023.01.19.23284738
PMID:36711461
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9882620/

Abstract

OBJECTIVE

To identify a cohort of COVID-19 cases, including when evidence of virus positivity was only mentioned in the clinical text, not in structured laboratory data in the electronic health record (EHR).

MATERIALS AND METHODS

Statistical classifiers were trained on feature representations derived from unstructured text in patient EHRs. We used a proxy dataset of patients with COVID-19 polymerase chain reaction (PCR) tests for training. We selected a model based on its performance on the proxy dataset and applied it to instances without COVID-19 PCR tests. A physician reviewed a sample of these instances to validate the classifier.
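
The proxy-label workflow described above can be illustrated with a minimal sketch. The abstract does not name the classifier or the feature representation, so this example swaps in a bag-of-words multinomial naive Bayes model as an assumption. The notes, labels, and function names (`tokenize`, `train_nb`, `predict_nb`) are hypothetical and invented purely for illustration; they are not data or code from the study.

```python
import math
from collections import Counter

def tokenize(text):
    # Whitespace tokenization; a stand-in for the (unspecified) feature extraction.
    return text.lower().split()

def train_nb(notes, labels):
    """Collect per-class word counts for a multinomial naive Bayes model."""
    word_counts = {0: Counter(), 1: Counter()}
    class_counts = Counter(labels)
    for text, label in zip(notes, labels):
        word_counts[label].update(tokenize(text))
    vocab = set(word_counts[0]) | set(word_counts[1])
    return word_counts, class_counts, vocab

def predict_nb(model, text):
    """Return the class (0 or 1) with the higher log-posterior score."""
    word_counts, class_counts, vocab = model
    total = sum(class_counts.values())
    scores = {}
    for label in (0, 1):
        # Add-one smoothing: every vocabulary word gets one extra count.
        denom = sum(word_counts[label].values()) + len(vocab)
        score = math.log(class_counts[label] / total)
        for tok in tokenize(text):
            if tok in vocab:  # ignore words never seen in training
                score += math.log((word_counts[label][tok] + 1) / denom)
        scores[label] = score
    return max(scores, key=scores.get)

# Hypothetical proxy dataset: notes paired with an in-hospital PCR result,
# where the PCR result (1 = positive, 0 = negative) serves as the label.
proxy_notes = [
    "patient tested positive for covid at outside clinic",
    "covid pcr positive reports fever and cough",
    "covid test negative no fever",
    "routine visit covid swab negative",
]
proxy_labels = [1, 1, 0, 0]

model = train_nb(proxy_notes, proxy_labels)

# Apply the trained model to notes that have no in-hospital PCR test.
no_pcr_notes = [
    "tested positive for covid at home last week",
    "covid swab negative at routine visit",
]
predictions = [predict_nb(model, note) for note in no_pcr_notes]
print(predictions)
```

In the study, a physician then reviewed a sample of the predictions on instances without PCR tests; that manual validation step has no code equivalent here.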

RESULTS

On the test split of the proxy dataset, our best classifier obtained 0.56 F1, 0.6 precision, and 0.52 recall scores for SARS-CoV2 positive cases. In an expert validation, the classifier correctly identified 90.8% (79/87) as COVID-19 positive and 97.8% (91/93) as not SARS-CoV2 positive. The classifier identified an additional 960 positive cases that did not have SARS-CoV2 lab tests in hospital, and only 177 of those cases had the ICD-10 code for COVID-19.
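
As a quick consistency check, the reported figures can be recomputed from the numbers given in the abstract. The sketch below uses only values stated above; the function and variable names are illustrative.

```python
def f1_score(precision, recall):
    # F1 is the harmonic mean of precision and recall.
    return 2 * precision * recall / (precision + recall)

# Proxy-dataset test split: precision 0.6, recall 0.52.
f1 = f1_score(0.6, 0.52)

# Expert validation: share of reviewed instances the classifier labeled correctly.
positive_agreement = 79 / 87
negative_agreement = 91 / 93

# Share of the 960 additional positive cases that carried the COVID-19 ICD-10 code.
coded_share = 177 / 960

print(f"F1: {f1:.2f}")
print(f"Positive agreement: {positive_agreement:.1%}")
print(f"Negative agreement: {negative_agreement:.1%}")
print(f"Additional cases with ICD-10 code: {coded_share:.1%}")
```

The last figure shows that fewer than one in five of the additional text-identified cases would have been found through the diagnosis code alone.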

DISCUSSION

Proxy dataset performance may be worse because these instances sometimes include discussion of pending lab tests. The most predictive features are meaningful and interpretable. The type of external test that was performed is rarely mentioned.

CONCLUSION

COVID-19 cases that had testing done outside of the hospital can be reliably detected from the text in EHRs. Training on a proxy dataset was a suitable method for developing a highly performant classifier without labor-intensive labeling efforts.
