• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

电子健康记录表型在精准医疗中的应用:单机构乳腺癌治疗的观点和注意事项。

Electronic Health Record Phenotypes for Precision Medicine: Perspectives and Caveats From Treatment of Breast Cancer at a Single Institution.

机构信息

Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Center for Pharmacoepidemiology Research and Training, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA.

出版信息

Clin Transl Sci. 2018 Jan;11(1):85-92. doi: 10.1111/cts.12514.

DOI:10.1111/cts.12514
PMID:29084368
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5759745/
Abstract

Precision medicine is at the forefront of biomedical research. Cancer registries provide rich perspectives and electronic health records (EHRs) are commonly utilized to gather additional clinical data elements needed for translational research. However, manual annotation is resource-intense and not readily scalable. Informatics-based phenotyping presents an ideal solution, but perspectives obtained can be impacted by both data source and algorithm selection. We derived breast cancer (BC) receptor status phenotypes from structured and unstructured EHR data using rule-based algorithms, including natural language processing (NLP). Overall, the use of NLP increased BC receptor status coverage by 39.2% from 69.1% with structured medication information alone. Using all available EHR data, estrogen receptor-positive BC cases were ascertained with high precision (P = 0.976) and recall (R = 0.987) compared with gold standard chart-reviewed patients. However, status negation (R = 0.591) decreased 40.2% when relying on structured medications alone. Using multiple EHR data types (and thorough understanding of the perspectives offered) are necessary to derive robust EHR-based precision medicine phenotypes.

摘要

精准医学处于生物医学研究的前沿。癌症登记处提供了丰富的视角,电子健康记录(EHR)通常被用于收集转化研究所需的其他临床数据元素。然而,手动注释需要大量资源,且难以扩展。基于信息学的表型分析提供了一个理想的解决方案,但所获得的视角可能会受到数据源和算法选择的影响。我们使用基于规则的算法(包括自然语言处理(NLP))从结构化和非结构化的 EHR 数据中推导出乳腺癌(BC)受体状态表型。总的来说,与仅使用结构化药物信息相比,NLP 的使用将 BC 受体状态的覆盖率从 69.1%提高了 39.2%。使用所有可用的 EHR 数据,与金标准的图表审查患者相比,雌激素受体阳性的 BC 病例具有很高的精确性(P=0.976)和召回率(R=0.987)。然而,仅依靠结构化药物时,状态否定率(R=0.591)降低了 40.2%。为了推导出基于 EHR 的精准医学表型,需要使用多种 EHR 数据类型(并充分了解所提供的视角)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a9d/5759745/554788563065/CTS-11-85-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a9d/5759745/554788563065/CTS-11-85-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4a9d/5759745/554788563065/CTS-11-85-g001.jpg

相似文献

1
Electronic Health Record Phenotypes for Precision Medicine: Perspectives and Caveats From Treatment of Breast Cancer at a Single Institution.电子健康记录表型在精准医疗中的应用:单机构乳腺癌治疗的观点和注意事项。
Clin Transl Sci. 2018 Jan;11(1):85-92. doi: 10.1111/cts.12514.
2
Automated feature selection of predictors in electronic medical records data.电子病历数据中预测指标的自动特征选择
Biometrics. 2019 Mar;75(1):268-277. doi: 10.1111/biom.12987. Epub 2019 Apr 2.
3
Identifying Information Gaps in Electronic Health Records by Using Natural Language Processing: Gynecologic Surgery History Identification.利用自然语言处理识别电子健康记录中的信息空白:妇科手术史识别。
J Med Internet Res. 2022 Jan 28;24(1):e29015. doi: 10.2196/29015.
4
Scalable relevance ranking algorithm via semantic similarity assessment improves efficiency of medical chart review.通过语义相似性评估的可扩展相关性排序算法提高了医学图表审查的效率。
J Biomed Inform. 2022 Aug;132:104109. doi: 10.1016/j.jbi.2022.104109. Epub 2022 Jun 1.
5
Development and Evaluation of a Natural Language Processing Annotation Tool to Facilitate Phenotyping of Cognitive Status in Electronic Health Records: Diagnostic Study.开发和评估一种自然语言处理标注工具以促进电子健康记录中认知状态的表型分析:诊断研究。
J Med Internet Res. 2022 Aug 30;24(8):e40384. doi: 10.2196/40384.
6
Relevant Word Order Vectorization for Improved Natural Language Processing in Electronic Health Records.相关词序向量化提高电子健康记录中的自然语言处理能力。
Sci Rep. 2019 Jun 25;9(1):9253. doi: 10.1038/s41598-019-45705-y.
7
Truly personalized therapy - an end to the era of one size fits all.真正的个性化治疗——“一刀切”时代的终结。
Nat Rev Clin Oncol. 2019 Feb;16(2):77-78. doi: 10.1038/s41571-018-0165-1.
8
Natural language processing improves identification of colorectal cancer testing in the electronic medical record.自然语言处理提高电子病历中结直肠癌检测的识别能力。
Med Decis Making. 2012 Jan-Feb;32(1):188-97. doi: 10.1177/0272989X11400418. Epub 2011 Mar 10.
9
Text Mining for Precision Medicine: Bringing Structure to EHRs and Biomedical Literature to Understand Genes and Health.精准医学的文本挖掘:为电子健康记录和生物医学文献构建结构以理解基因与健康。
Adv Exp Med Biol. 2016;939:139-166. doi: 10.1007/978-981-10-1503-8_7.
10
The Value of Unstructured Electronic Health Record Data in Geriatric Syndrome Case Identification.非结构化电子健康记录数据在老年综合征病例识别中的价值。
J Am Geriatr Soc. 2018 Aug;66(8):1499-1507. doi: 10.1111/jgs.15411. Epub 2018 Jul 4.

引用本文的文献

1
CORAL: Expert-Curated Oncology Reports to Advance Language Model Inference.CORAL:经专家策划的肿瘤学报告,以推进语言模型推理。
NEJM AI. 2024 Apr;1(4). doi: 10.1056/aidbp2300110. Epub 2024 Mar 13.
2
A genome-wide Association study of the Count of Codeine prescriptions.一项关于可待因处方数量的全基因组关联研究。
Sci Rep. 2024 Oct 1;14(1):22780. doi: 10.1038/s41598-024-73925-4.
3
Natural Language Processing Applications for Computer-Aided Diagnosis in Oncology.用于肿瘤学计算机辅助诊断的自然语言处理应用

本文引用的文献

1
Triple-negative breast cancer: the importance of molecular and histologic subtyping, and recognition of low-grade variants.三阴性乳腺癌:分子和组织学亚型分类的重要性以及低级别变异型的识别
NPJ Breast Cancer. 2016 Nov 16;2:16036. doi: 10.1038/npjbcancer.2016.36. eCollection 2016.
2
Using machine learning to parse breast pathology reports.使用机器学习解析乳腺病理报告。
Breast Cancer Res Treat. 2017 Jan;161(2):203-211. doi: 10.1007/s10549-016-4035-1. Epub 2016 Nov 8.
3
An informatics research agenda to support precision medicine: seven key areas.
Diagnostics (Basel). 2023 Jan 12;13(2):286. doi: 10.3390/diagnostics13020286.
4
Assessment of Electronic Health Record for Cancer Research and Patient Care Through a Scoping Review of Cancer Natural Language Processing.通过癌症自然语言处理的范围综述评估癌症研究和患者护理的电子健康记录。
JCO Clin Cancer Inform. 2022 Jul;6:e2200006. doi: 10.1200/CCI.22.00006.
5
Effectiveness of Artificial Intelligence for Personalized Medicine in Neoplasms: A Systematic Review.人工智能在肿瘤个体化医学中的有效性:系统评价。
Biomed Res Int. 2022 Apr 7;2022:7842566. doi: 10.1155/2022/7842566. eCollection 2022.
6
CancerBERT: a cancer domain-specific language model for extracting breast cancer phenotypes from electronic health records.CancerBERT:一种癌症领域特定的语言模型,用于从电子健康记录中提取乳腺癌表型。
J Am Med Inform Assoc. 2022 Jun 14;29(7):1208-1216. doi: 10.1093/jamia/ocac040.
7
Automated Phenotyping Tool for Identifying Developmental Language Disorder Cases in Health Systems Data (APT-DLD): A New Research Algorithm for Deployment in Large-Scale Electronic Health Record Systems.用于在卫生系统数据中识别发育性语言障碍病例的自动化表型分析工具(APT-DLD):一种用于在大规模电子健康记录系统中部署的新研究算法。
J Speech Lang Hear Res. 2020 Sep 15;63(9):3019-3035. doi: 10.1044/2020_JSLHR-19-00397. Epub 2020 Aug 11.
8
Common data elements of breast cancer for research databases: A systematic review.研究数据库中乳腺癌的通用数据元素:一项系统综述。
J Family Med Prim Care. 2020 Mar 26;9(3):1296-1301. doi: 10.4103/jfmpc.jfmpc_931_19. eCollection 2020 Mar.
9
Annotating and detecting phenotypic information for chronic obstructive pulmonary disease.标注与检测慢性阻塞性肺疾病的表型信息。
JAMIA Open. 2019 Apr 26;2(2):261-271. doi: 10.1093/jamiaopen/ooz009. eCollection 2019 Jul.
10
A frame semantic overview of NLP-based information extraction for cancer-related EHR notes.基于框架语义的自然语言处理信息抽取在癌症相关电子病历中的应用综述。
J Biomed Inform. 2019 Dec;100:103301. doi: 10.1016/j.jbi.2019.103301. Epub 2019 Oct 4.
支持精准医学的信息学研究议程:七个关键领域。
J Am Med Inform Assoc. 2016 Jul;23(4):791-5. doi: 10.1093/jamia/ocv213. Epub 2016 Apr 23.
4
Toward precision medicine of breast cancer.迈向乳腺癌的精准医学。
Theor Biol Med Model. 2016 Feb 29;13:7. doi: 10.1186/s12976-016-0035-4.
5
Use of Biomarkers to Guide Decisions on Adjuvant Systemic Therapy for Women With Early-Stage Invasive Breast Cancer: American Society of Clinical Oncology Clinical Practice Guideline.利用生物标志物指导早期浸润性乳腺癌女性辅助性全身治疗决策:美国临床肿瘤学会临床实践指南
J Clin Oncol. 2016 Apr 1;34(10):1134-50. doi: 10.1200/JCO.2015.65.2289. Epub 2016 Feb 8.
6
Clinical trial designs incorporating predictive biomarkers.纳入预测性生物标志物的临床试验设计。
Cancer Treat Rev. 2016 Feb;43:74-82. doi: 10.1016/j.ctrv.2015.12.008. Epub 2016 Jan 5.
7
Extracting research-quality phenotypes from electronic health records to support precision medicine.从电子健康记录中提取研究质量的表型,以支持精准医学。
Genome Med. 2015 Apr 30;7(1):41. doi: 10.1186/s13073-015-0166-y. eCollection 2015.
8
HemOnc.org: A Collaborative Online Knowledge Platform for Oncology Professionals.HemOnc.org:肿瘤学专业人员的协作在线知识平台。
J Oncol Pract. 2015 May;11(3):e336-50. doi: 10.1200/JOP.2014.001511. Epub 2015 Mar 3.
9
Methods of integrating data to uncover genotype-phenotype interactions.整合数据以揭示基因型-表型相互作用的方法。
Nat Rev Genet. 2015 Feb;16(2):85-97. doi: 10.1038/nrg3868. Epub 2015 Jan 13.
10
US incidence of breast cancer subtypes defined by joint hormone receptor and HER2 status.美国联合激素受体和 HER2 状态定义的乳腺癌亚型发病率。
J Natl Cancer Inst. 2014 Apr 28;106(5):dju055. doi: 10.1093/jnci/dju055.