• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对研究不足的医学概念领域进行自动编码:将身体活动报告与《国际功能、残疾和健康分类》相联系。

Automated Coding of Under-Studied Medical Concept Domains: Linking Physical Activity Reports to the International Classification of Functioning, Disability, and Health.

作者信息

Newman-Griffis Denis, Fosler-Lussier Eric

机构信息

Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA.

Epidemiology & Biostatistics Section, Rehabilitation Medicine Department, National Institutes of Health Clinical Center, Bethesda, Maryland, USA.

出版信息

Front Digit Health. 2021 Mar;3. doi: 10.3389/fdgth.2021.620828. Epub 2021 Mar 10.

DOI:10.3389/fdgth.2021.620828
PMID:33791684
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8009547/
Abstract

Linking clinical narratives to standardized vocabularies and coding systems is a key component of unlocking the information in medical text for analysis. However, many domains of medical concepts, such as functional outcomes and social determinants of health, lack well-developed terminologies that can support effective coding of medical text. We present a framework for developing natural language processing (NLP) technologies for automated coding of medical information in under-studied domains, and demonstrate its applicability through a case study on physical mobility function. Mobility function is a component of many health measures, from post-acute care and surgical outcomes to chronic frailty and disability, and is represented as one domain of human activity in the International Classification of Functioning, Disability, and Health (ICF). However, mobility and other types of functional activity remain under-studied in the medical informatics literature, and neither the ICF nor commonly-used medical terminologies capture functional status terminology in practice. We investigated two data-driven paradigms, classification and candidate selection, to link narrative observations of mobility status to standardized ICF codes, using a dataset of clinical narratives from physical therapy encounters. Recent advances in language modeling and word embedding were used as features for established machine learning models and a novel deep learning approach, achieving a macro-averaged F-1 score of 84% on linking mobility activity reports to ICF codes. Both classification and candidate selection approaches present distinct strengths for automated coding in under-studied domains, and we highlight that the combination of (i) a small annotated data set; (ii) expert definitions of codes of interest; and (iii) a representative text corpus is sufficient to produce high-performing automated coding systems. This research has implications for continued development of language technologies to analyze functional status information, and the ongoing growth of NLP tools for a variety of specialized applications in clinical care and research.

摘要

将临床叙述与标准化词汇表和编码系统相链接,是解锁医学文本信息以进行分析的关键组成部分。然而,许多医学概念领域,如功能结局和健康的社会决定因素,缺乏能够支持有效编码医学文本的完善术语。我们提出了一个用于开发自然语言处理(NLP)技术的框架,以对研究较少领域的医学信息进行自动编码,并通过一项关于身体活动功能的案例研究来证明其适用性。活动功能是许多健康指标的一个组成部分,从急性后期护理和手术结局到慢性衰弱和残疾,并且在《国际功能、残疾和健康分类》(ICF)中被表示为人类活动的一个领域。然而,活动及其他类型的功能活动在医学信息学文献中仍研究不足,而且无论是ICF还是常用的医学术语在实践中都未涵盖功能状态术语。我们研究了两种数据驱动范式,即分类和候选选择,以将活动状态的叙述性观察与标准化的ICF编码相链接,使用了来自物理治疗会诊的临床叙述数据集。语言建模和词嵌入的最新进展被用作既定机器学习模型和一种新颖深度学习方法的特征,在将活动报告与ICF编码相链接方面实现了84%的宏平均F1分数。分类和候选选择方法在研究较少的领域进行自动编码时都具有明显优势,并且我们强调(i)一个小的带注释数据集;(ii)感兴趣编码的专家定义;以及(iii)一个有代表性的文本语料库的组合足以产生高性能的自动编码系统。这项研究对于持续开发用于分析功能状态信息的语言技术以及NLP工具在临床护理和研究中各种专门应用的持续增长具有重要意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/de89522f9c07/fdgth-03-620828-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/73f0a822fe7a/fdgth-03-620828-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/10772f799119/fdgth-03-620828-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/6aca9ca0d7e5/fdgth-03-620828-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/bcc428403034/fdgth-03-620828-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/0d9c2fe64046/fdgth-03-620828-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/bfcd584de8dd/fdgth-03-620828-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/e3a0025767ee/fdgth-03-620828-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/e4a70e1a0983/fdgth-03-620828-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/de89522f9c07/fdgth-03-620828-g0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/73f0a822fe7a/fdgth-03-620828-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/10772f799119/fdgth-03-620828-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/6aca9ca0d7e5/fdgth-03-620828-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/bcc428403034/fdgth-03-620828-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/0d9c2fe64046/fdgth-03-620828-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/bfcd584de8dd/fdgth-03-620828-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/e3a0025767ee/fdgth-03-620828-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/e4a70e1a0983/fdgth-03-620828-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e6f4/8521959/de89522f9c07/fdgth-03-620828-g0009.jpg

相似文献

1
Automated Coding of Under-Studied Medical Concept Domains: Linking Physical Activity Reports to the International Classification of Functioning, Disability, and Health.对研究不足的医学概念领域进行自动编码:将身体活动报告与《国际功能、残疾和健康分类》相联系。
Front Digit Health. 2021 Mar;3. doi: 10.3389/fdgth.2021.620828. Epub 2021 Mar 10.
2
Linking Free Text Documentation of Functioning and Disability to the ICF With Natural Language Processing.通过自然语言处理将功能与残疾的自由文本记录与《国际功能、残疾和健康分类》相联系。
Front Rehabil Sci. 2021 Nov;2. doi: 10.3389/fresc.2021.742702. Epub 2021 Nov 5.
3
Human and automated coding of rehabilitation discharge summaries according to the International Classification of Functioning, Disability, and Health.根据《国际功能、残疾和健康分类》对康复出院小结进行人工编码和自动编码。
J Am Med Inform Assoc. 2006 Sep-Oct;13(5):508-15. doi: 10.1197/jamia.M2107. Epub 2006 Jun 23.
4
Compiling standardized information from clinical practice: using content analysis and ICF Linking Rules in a goal-oriented youth rehabilitation program.从临床实践中编纂标准化信息:在以目标为导向的青年康复计划中使用内容分析和国际功能、残疾和健康分类链接规则。
Disabil Rehabil. 2019 Mar;41(5):613-621. doi: 10.1080/09638288.2017.1380718. Epub 2017 Sep 23.
5
Applying NLP methods to code functional performance in electronic health records using the international classification of functioning, disability, and health.运用自然语言处理方法,依据国际功能、残疾与健康分类对电子健康记录中的功能表现进行编码。
Disabil Health J. 2025 May 24:101888. doi: 10.1016/j.dhjo.2025.101888.
6
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
7
Autonomous International Classification of Diseases Coding Using Pretrained Language Models and Advanced Prompt Learning Techniques: Evaluation of an Automated Analysis System Using Medical Text.使用预训练语言模型和先进提示学习技术的自主国际疾病分类编码:对一个使用医学文本的自动分析系统的评估
JMIR Med Inform. 2025 Jan 6;13:e63020. doi: 10.2196/63020.
8
Automated recognition of functioning, activity and participation in COVID-19 from electronic patient records by natural language processing: a proof- of- concept.利用自然语言处理技术从电子病历中自动识别 COVID-19 患者的功能、活动和参与情况:概念验证。
Ann Med. 2022 Dec;54(1):235-243. doi: 10.1080/07853890.2021.2025418.
9
Qualitative assessment of the International Classification of Functioning, Disability, and Health with respect to the desiderata for controlled medical vocabularies.《国际功能、残疾和健康分类》关于受控医学词汇 desiderata 的定性评估
Int J Med Inform. 2006 May;75(5):384-95. doi: 10.1016/j.ijmedinf.2005.07.026. Epub 2005 Aug 24.
10
Classifying social determinants of health from unstructured electronic health records using deep learning-based natural language processing.利用基于深度学习的自然语言处理技术从非结构化电子健康记录中分类社会健康决定因素。
J Biomed Inform. 2022 Mar;127:103984. doi: 10.1016/j.jbi.2021.103984. Epub 2022 Jan 7.

引用本文的文献

1
Artificial intelligence-enhanced mapping of the international classification of functioning, disability and health via a mobile app: a randomized controlled trial.通过移动应用程序利用人工智能增强对《国际功能、残疾和健康分类》的映射:一项随机对照试验
Front Public Health. 2025 Aug 5;13:1590401. doi: 10.3389/fpubh.2025.1590401. eCollection 2025.
2
Applying NLP methods to code functional performance in electronic health records using the international classification of functioning, disability, and health.运用自然语言处理方法,依据国际功能、残疾与健康分类对电子健康记录中的功能表现进行编码。
Disabil Health J. 2025 May 24:101888. doi: 10.1016/j.dhjo.2025.101888.
3

本文引用的文献

1
Improving broad-coverage medical entity linking with semantic type prediction and large-scale datasets.利用语义类型预测和大规模数据集提高全面的医学实体链接。
J Biomed Inform. 2021 Sep;121:103880. doi: 10.1016/j.jbi.2021.103880. Epub 2021 Aug 12.
2
A comprehensive study of mobility functioning information in clinical notes: Entity hierarchy, corpus annotation, and sequence labeling.临床笔记中移动功能信息的综合研究:实体层次结构、语料库标注和序列标记。
Int J Med Inform. 2021 Mar;147:104351. doi: 10.1016/j.ijmedinf.2020.104351. Epub 2020 Dec 24.
3
Ambiguity in medical concept normalization: An analysis of types and coverage in electronic health record datasets.
AI Thinking: a framework for rethinking artificial intelligence in practice.
人工智能思维:一个在实践中重新思考人工智能的框架。
R Soc Open Sci. 2025 Jan 8;12(1):241482. doi: 10.1098/rsos.241482. eCollection 2025 Jan.
4
Realizing the potential of social determinants data in EHR systems: A scoping review of approaches for screening, linkage, extraction, analysis, and interventions.认识电子健康记录系统中社会决定因素数据的潜力:对筛查、关联、提取、分析和干预方法的范围审查
J Clin Transl Sci. 2024 Oct 10;8(1):e147. doi: 10.1017/cts.2024.571. eCollection 2024.
5
Natural language processing systems for extracting information from electronic health records about activities of daily living. A systematic review.用于从电子健康记录中提取日常生活活动信息的自然语言处理系统。一项系统综述。
JAMIA Open. 2024 May 24;7(2):ooae044. doi: 10.1093/jamiaopen/ooae044. eCollection 2024 Jul.
6
Natural Language Processing to Classify Caregiver Strategies Supporting Participation Among Children and Youth with Craniofacial Microsomia and Other Childhood-Onset Disabilities.用于对支持患有颅面微小畸形及其他儿童期起病残疾的儿童和青少年参与的照护者策略进行分类的自然语言处理
J Healthc Inform Res. 2023 Sep 18;7(4):480-500. doi: 10.1007/s41666-023-00149-y. eCollection 2023 Dec.
7
Classification of neurologic outcomes from medical notes using natural language processing.使用自然语言处理技术从医学记录中对神经学结果进行分类。
Expert Syst Appl. 2023 Mar 15;214. doi: 10.1016/j.eswa.2022.119171. Epub 2022 Nov 6.
8
A roadmap to reduce information inequities in disability with digital health and natural language processing.通过数字健康和自然语言处理减少残疾信息不平等的路线图。
PLOS Digit Health. 2022 Nov 17;1(11):e0000135. doi: 10.1371/journal.pdig.0000135. eCollection 2022 Nov.
9
Extracting body function information using rule-based methods: Highlighting structure and formatting challenges in clinical text.使用基于规则的方法提取身体功能信息:突出临床文本中的结构和格式挑战。
Front Digit Health. 2022 Sep 6;4:914171. doi: 10.3389/fdgth.2022.914171. eCollection 2022.
10
Capturing and Operationalizing Participation in Pediatric Re/Habilitation Research Using Artificial Intelligence: A Scoping Review.利用人工智能促进儿童康复研究中的参与并将其付诸实践:一项范围综述
Front Rehabil Sci. 2022;3. doi: 10.3389/fresc.2022.855240.
医学概念规范化中的歧义:电子健康记录数据集的类型和覆盖范围分析。
J Am Med Inform Assoc. 2021 Mar 1;28(3):516-532. doi: 10.1093/jamia/ocaa269.
4
HARE: a Flexible Highlighting Annotator for Ranking and Exploration.HARE:一种用于排序和探索的灵活高亮注释器。
Proc Conf Empir Methods Nat Lang Process. 2019 Nov;2019:85-90. doi: 10.18653/v1/d19-3015.
5
PheMap: a multi-resource knowledge base for high-throughput phenotyping within electronic health records.PheMap:一个用于电子健康记录中高通量表型分析的多资源知识库。
J Am Med Inform Assoc. 2020 Nov 1;27(11):1675-1687. doi: 10.1093/jamia/ocaa104.
6
Challenges of Developing a Natural Language Processing Method With Electronic Health Records to Identify Persons With Chronic Mobility Disability.开发一种使用电子健康记录识别慢性移动障碍患者的自然语言处理方法所面临的挑战。
Arch Phys Med Rehabil. 2020 Oct;101(10):1739-1746. doi: 10.1016/j.apmr.2020.04.024. Epub 2020 May 21.
7
Detecting Social and Behavioral Determinants of Health with Structured and Free-Text Clinical Data.利用结构化和自由文本临床数据检测健康的社会和行为决定因素。
Appl Clin Inform. 2020 Jan;11(1):172-181. doi: 10.1055/s-0040-1702214. Epub 2020 Mar 4.
8
Use of electronic health records and standardized terminologies: A nationwide survey of nursing staff experiences.使用电子健康记录和标准化术语:全国范围内护理人员体验的调查。
Int J Nurs Stud. 2020 Apr;104:103523. doi: 10.1016/j.ijnurstu.2020.103523. Epub 2020 Jan 7.
9
Recent advances in Swedish and Spanish medical entity recognition in clinical texts using deep neural approaches.最近在使用深度神经网络方法识别瑞典语和西班牙语临床文本中的医学实体方面取得了进展。
BMC Med Inform Decis Mak. 2019 Dec 23;19(Suppl 7):274. doi: 10.1186/s12911-019-0981-y.
10
Broadening horizons: the case for capturing function and the role of health informatics in its use.拓宽视野:捕捉功能的案例以及健康信息学在其使用中的作用。
BMC Public Health. 2019 Oct 15;19(1):1288. doi: 10.1186/s12889-019-7630-3.