• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

精准医学的文本挖掘:为电子健康记录和生物医学文献构建结构以理解基因与健康。

Text Mining for Precision Medicine: Bringing Structure to EHRs and Biomedical Literature to Understand Genes and Health.

作者信息

Simmons Michael, Singhal Ayush, Lu Zhiyong

机构信息

National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), 8600 Rockville Pike, Bldg 38A, 10N1003A, Bethesda, MD, 20894, USA.

出版信息

Adv Exp Med Biol. 2016;939:139-166. doi: 10.1007/978-981-10-1503-8_7.

DOI:10.1007/978-981-10-1503-8_7
PMID:27807747
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5931382/
Abstract

The key question of precision medicine is whether it is possible to find clinically actionable granularity in diagnosing disease and classifying patient risk. The advent of next-generation sequencing and the widespread adoption of electronic health records (EHRs) have provided clinicians and researchers a wealth of data and made possible the precise characterization of individual patient genotypes and phenotypes. Unstructured text-found in biomedical publications and clinical notes-is an important component of genotype and phenotype knowledge. Publications in the biomedical literature provide essential information for interpreting genetic data. Likewise, clinical notes contain the richest source of phenotype information in EHRs. Text mining can render these texts computationally accessible and support information extraction and hypothesis generation. This chapter reviews the mechanics of text mining in precision medicine and discusses several specific use cases, including database curation for personalized cancer medicine, patient outcome prediction from EHR-derived cohorts, and pharmacogenomic research. Taken as a whole, these use cases demonstrate how text mining enables effective utilization of existing knowledge sources and thus promotes increased value for patients and healthcare systems. Text mining is an indispensable tool for translating genotype-phenotype data into effective clinical care that will undoubtedly play an important role in the eventual realization of precision medicine.

摘要

精准医学的关键问题在于,是否有可能在疾病诊断和患者风险分类中找到具有临床可操作性的精细程度。下一代测序技术的出现以及电子健康记录(EHR)的广泛应用,为临床医生和研究人员提供了大量数据,并使得精确描述个体患者的基因型和表型成为可能。在生物医学出版物和临床记录中发现的非结构化文本,是基因型和表型知识的重要组成部分。生物医学文献中的出版物为解读遗传数据提供了重要信息。同样,临床记录包含了电子健康记录中最丰富的表型信息来源。文本挖掘可以使这些文本在计算上易于获取,并支持信息提取和假设生成。本章回顾了精准医学中文本挖掘的机制,并讨论了几个具体的用例,包括个性化癌症医学的数据库管理、从电子健康记录衍生队列中预测患者结局以及药物基因组学研究。总体而言,这些用例展示了文本挖掘如何有效利用现有知识来源,从而为患者和医疗系统提升价值。文本挖掘是将基因型-表型数据转化为有效临床护理的不可或缺的工具,无疑将在精准医学的最终实现中发挥重要作用。

相似文献

1
Text Mining for Precision Medicine: Bringing Structure to EHRs and Biomedical Literature to Understand Genes and Health.精准医学的文本挖掘:为电子健康记录和生物医学文献构建结构以理解基因与健康。
Adv Exp Med Biol. 2016;939:139-166. doi: 10.1007/978-981-10-1503-8_7.
2
Text Mining Genotype-Phenotype Relationships from Biomedical Literature for Database Curation and Precision Medicine.从生物医学文献中挖掘基因型-表型关系以用于数据库管理和精准医学。
PLoS Comput Biol. 2016 Nov 30;12(11):e1005017. doi: 10.1371/journal.pcbi.1005017. eCollection 2016 Nov.
3
Text mining facilitates database curation - extraction of mutation-disease associations from Bio-medical literature.文本挖掘有助于数据库管理——从生物医学文献中提取突变与疾病的关联。
BMC Bioinformatics. 2015 Jun 6;16:185. doi: 10.1186/s12859-015-0609-x.
4
Overview of the BioCreative VI Precision Medicine Track: mining protein interactions and mutations for precision medicine.BioCreative VI 精准医学赛道概述:精准医学中的蛋白质相互作用和突变挖掘。
Database (Oxford). 2019 Jan 1;2019:bay147. doi: 10.1093/database/bay147.
5
Text Mining for Drug Discovery.用于药物发现的文本挖掘
Methods Mol Biol. 2019;1939:231-252. doi: 10.1007/978-1-4939-9089-4_13.
6
Integrating pharmacogenomics into electronic health records with clinical decision support.将药物基因组学与临床决策支持系统整合到电子健康记录中。
Am J Health Syst Pharm. 2016 Dec 1;73(23):1967-1976. doi: 10.2146/ajhp160030.
7
Text-mining clinically relevant cancer biomarkers for curation into the CIViC database.从临床相关癌症生物标志物文本中挖掘信息,将其纳入 CIViC 数据库。
Genome Med. 2019 Dec 3;11(1):78. doi: 10.1186/s13073-019-0686-y.
8
Recent progress in automatically extracting information from the pharmacogenomic literature.从药物基因组学文献中自动提取信息的最新进展。
Pharmacogenomics. 2010 Oct;11(10):1467-89. doi: 10.2217/pgs.10.136.
9
Measuring the Value of a Practical Text Mining Approach to Identify Patients With Housing Issues in the Free-Text Notes in Electronic Health Record: Findings of a Retrospective Cohort Study.衡量实用文本挖掘方法在电子健康记录中的自由文本记录中识别住房问题患者的价值:一项回顾性队列研究的结果。
Front Public Health. 2021 Aug 27;9:697501. doi: 10.3389/fpubh.2021.697501. eCollection 2021.
10
Developing knowledge resources to support precision medicine: principles from the Clinical Pharmacogenetics Implementation Consortium (CPIC).开发知识资源以支持精准医学:来自临床药物基因组学实施联盟(CPIC)的原则
J Am Med Inform Assoc. 2016 Jul;23(4):796-801. doi: 10.1093/jamia/ocw027. Epub 2016 Mar 28.

引用本文的文献

1
Personalized insights into liver disease management: a text mining analysis of online consultation data.肝病管理的个性化见解:在线咨询数据的文本挖掘分析
Front Public Health. 2025 May 9;13:1467117. doi: 10.3389/fpubh.2025.1467117. eCollection 2025.
2
SciLinker: a large-scale text mining framework for mapping associations among biological entities.SciLinker:一个用于映射生物实体之间关联的大规模文本挖掘框架。
Front Artif Intell. 2025 Mar 19;8:1528562. doi: 10.3389/frai.2025.1528562. eCollection 2025.
3
A simplified retriever to improve accuracy of phenotype normalizations by large language models.一种简化的检索器,用于提高大语言模型进行表型标准化的准确性。
Front Digit Health. 2025 Mar 4;7:1495040. doi: 10.3389/fdgth.2025.1495040. eCollection 2025.
4
Exposure to angiotensin-converting enzyme inhibitors that cross the blood-brain barrier and the risk of dementia among people with HIV.接触可穿过血脑屏障的血管紧张素转换酶抑制剂与HIV感染者患痴呆症的风险
AIDS. 2024 Dec 1;38(15):1999-2009. doi: 10.1097/QAD.0000000000004009. Epub 2024 Sep 9.
5
The overview of the BioRED (Biomedical Relation Extraction Dataset) track at BioCreative VIII.生物创意 VIII 中生物医学关系提取数据集(BioRED)赛道概述。
Database (Oxford). 2024 Aug 8;2024. doi: 10.1093/database/baae069.
6
Exposure to angiotensin-converting enzyme inhibitors that cross the blood-brain barrier and the risk of dementia among patients with human immunodeficiency virus.接触可穿过血脑屏障的血管紧张素转换酶抑制剂与人类免疫缺陷病毒患者的痴呆风险
medRxiv. 2024 Jan 17:2024.01.16.24301275. doi: 10.1101/2024.01.16.24301275.
7
Clinical Prediction Models for Hospital-Induced Delirium Using Structured and Unstructured Electronic Health Record Data: Protocol for a Development and Validation Study.使用结构化和非结构化电子健康记录数据的医院诱发谵妄临床预测模型:一项开发与验证研究的方案
JMIR Res Protoc. 2023 Nov 9;12:e48521. doi: 10.2196/48521.
8
Individualisation, personalisation and person-centredness in mental healthcare: a scoping review of concepts and linguistic network visualisation.精神卫生保健中的个体化、个性化和以患者为中心:概念和语言网络可视化的范围综述。
BMJ Ment Health. 2023 Oct;26(1). doi: 10.1136/bmjment-2023-300831.
9
Artificial Intelligence and Precision Medicine: A New Frontier for the Treatment of Brain Tumors.人工智能与精准医学:脑肿瘤治疗的新前沿
Life (Basel). 2022 Dec 22;13(1):24. doi: 10.3390/life13010024.
10
Exploring the Association of Cancer and Depression in Electronic Health Records: Combining Encoded Diagnosis and Mining Free-Text Clinical Notes.探索电子健康记录中癌症与抑郁症的关联:结合编码诊断与挖掘自由文本临床笔记
JMIR Cancer. 2022 Jul 11;8(3):e39003. doi: 10.2196/39003.

本文引用的文献

1
Text mining for precision medicine: automating disease-mutation relationship extraction from biomedical literature.精准医学的文本挖掘:从生物医学文献中自动提取疾病-突变关系
J Am Med Inform Assoc. 2016 Jul;23(4):766-72. doi: 10.1093/jamia/ocw041. Epub 2016 Apr 27.
2
Precision oncology: origins, optimism, and potential.精准肿瘤学:起源、乐观主义和潜力。
Lancet Oncol. 2016 Feb;17(2):e81-e86. doi: 10.1016/S1470-2045(15)00620-8.
3
The Framingham Heart Study--67 years of discovery in metabolic disease.弗雷明汉心脏研究——代谢性疾病 67 年的探索。
Nat Rev Endocrinol. 2016 Mar;12(3):177-83. doi: 10.1038/nrendo.2015.226. Epub 2016 Jan 18.
4
Human genotype-phenotype databases: aims, challenges and opportunities.人类基因型-表型数据库:目标、挑战与机遇。
Nat Rev Genet. 2015 Dec;16(12):702-15. doi: 10.1038/nrg3932. Epub 2015 Nov 10.
5
GNormPlus: An Integrative Approach for Tagging Genes, Gene Families, and Protein Domains.GNormPlus:一种用于标记基因、基因家族和蛋白质结构域的综合方法。
Biomed Res Int. 2015;2015:918710. doi: 10.1155/2015/918710. Epub 2015 Aug 25.
6
Clinical Research Methodology 2: Observational Clinical Research.临床研究方法2:观察性临床研究。
Anesth Analg. 2015 Oct;121(4):1043-1051. doi: 10.1213/ANE.0000000000000861.
7
Combining billing codes, clinical notes, and medications from electronic health records provides superior phenotyping performance.结合电子健康记录中的计费代码、临床记录和药物信息可提供卓越的表型分析性能。
J Am Med Inform Assoc. 2016 Apr;23(e1):e20-7. doi: 10.1093/jamia/ocv130. Epub 2015 Sep 2.
8
Impact of Precision Medicine in Diverse Cancers: A Meta-Analysis of Phase II Clinical Trials.精准医学在多种癌症中的影响:一项II期临床试验的荟萃分析
J Clin Oncol. 2015 Nov 10;33(32):3817-25. doi: 10.1200/JCO.2015.61.5997. Epub 2015 Aug 24.
9
Learning the Structure of Biomedical Relationships from Unstructured Text.从非结构化文本中学习生物医学关系的结构
PLoS Comput Biol. 2015 Jul 28;11(7):e1004216. doi: 10.1371/journal.pcbi.1004216. eCollection 2015 Jul.
10
Phenotype-Driven Plasma Biobanking Strategies and Methods.表型驱动的血浆生物样本库策略与方法
J Pers Med. 2015 May 14;5(2):140-52. doi: 10.3390/jpm5020140.