• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PetBERT:一种用于在第一方兽医电子健康记录中检测爆发的自动 ICD-11 综合征疾病编码。

PetBERT: automated ICD-11 syndromic disease coding for outbreak detection in first opinion veterinary electronic health records.

机构信息

Department of Computer Science, Durham University, Durham, UK.

Centre for Health Informatics, Computing, and Statistics, Lancaster Medical School, Lancaster University, Lancaster, UK.

出版信息

Sci Rep. 2023 Oct 21;13(1):18015. doi: 10.1038/s41598-023-45155-7.

DOI:10.1038/s41598-023-45155-7
PMID:37865683
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10590382/
Abstract

Effective public health surveillance requires consistent monitoring of disease signals such that researchers and decision-makers can react dynamically to changes in disease occurrence. However, whilst surveillance initiatives exist in production animal veterinary medicine, comparable frameworks for companion animals are lacking. First-opinion veterinary electronic health records (EHRs) have the potential to reveal disease signals and often represent the initial reporting of clinical syndromes in animals presenting for medical attention, highlighting their possible significance in early disease detection. Yet despite their availability, there are limitations surrounding their free text-based nature, inhibiting the ability for national-level mortality and morbidity statistics to occur. This paper presents PetBERT, a large language model trained on over 500 million words from 5.1 million EHRs across the UK. PetBERT-ICD is the additional training of PetBERT as a multi-label classifier for the automated coding of veterinary clinical EHRs with the International Classification of Disease 11 framework, achieving F1 scores exceeding 83% across 20 disease codings with minimal annotations. PetBERT-ICD effectively identifies disease outbreaks, outperforming current clinician-assigned point-of-care labelling strategies up to 3 weeks earlier. The potential for PetBERT-ICD to enhance disease surveillance in veterinary medicine represents a promising avenue for advancing animal health and improving public health outcomes.

摘要

有效的公共卫生监测需要持续监测疾病信号,以便研究人员和决策者能够对疾病发生的变化做出动态反应。然而,虽然在生产动物兽医医学中有监测举措,但缺乏类似的伴侣动物框架。第一意见兽医电子健康记录 (EHR) 有可能揭示疾病信号,并且通常代表着动物就诊时临床综合征的初始报告,突出了它们在早期疾病检测中的可能意义。尽管它们已经存在,但由于其基于自由文本的性质存在限制,因此无法进行国家级的死亡率和发病率统计。本文介绍了 PetBERT,这是一个在英国 510 万份 EHR 中超过 5 亿字的大型语言模型。PetBERT-ICD 是对 PetBERT 的额外训练,作为一种多标签分类器,用于对兽医临床 EHR 进行国际疾病分类第 11 版的自动编码,在 20 种疾病编码中实现了超过 83%的 F1 分数,只需最小的注释。PetBERT-ICD 能够有效地识别疾病爆发,比当前临床医生分配的即时检测标签策略提前多达 3 周。PetBERT-ICD 增强兽医医学疾病监测的潜力代表了一个有前途的途径,可以促进动物健康和改善公共卫生结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d4e/10590382/0b3d0faf033c/41598_2023_45155_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d4e/10590382/4369d1c83f15/41598_2023_45155_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d4e/10590382/162efc121465/41598_2023_45155_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d4e/10590382/0b3d0faf033c/41598_2023_45155_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d4e/10590382/4369d1c83f15/41598_2023_45155_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d4e/10590382/162efc121465/41598_2023_45155_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d4e/10590382/0b3d0faf033c/41598_2023_45155_Fig3_HTML.jpg

相似文献

1
PetBERT: automated ICD-11 syndromic disease coding for outbreak detection in first opinion veterinary electronic health records.PetBERT:一种用于在第一方兽医电子健康记录中检测爆发的自动 ICD-11 综合征疾病编码。
Sci Rep. 2023 Oct 21;13(1):18015. doi: 10.1038/s41598-023-45155-7.
2
Explainable text-tabular models for predicting mortality risk in companion animals.用于预测伴侣动物死亡风险的可解释文本-表格模型。
Sci Rep. 2024 Jun 20;14(1):14217. doi: 10.1038/s41598-024-64551-1.
3
Using topic modelling for unsupervised annotation of electronic health records to identify an outbreak of disease in UK dogs.使用主题建模对电子健康记录进行无监督标注,以识别英国犬群中的疾病爆发。
PLoS One. 2021 Dec 9;16(12):e0260402. doi: 10.1371/journal.pone.0260402. eCollection 2021.
4
Exploiting ICD Hierarchy for Classification of EHRs in Spanish Through Multi-Task Transformers.利用 ICD 层级结构通过多任务转换器对西班牙电子病历进行分类。
IEEE J Biomed Health Inform. 2022 Mar;26(3):1374-1383. doi: 10.1109/JBHI.2021.3112130. Epub 2022 Mar 7.
5
Syndromic surveillance in an ICD-10 world.ICD-10 体系下的症状监测
AMIA Annu Symp Proc. 2014 Nov 14;2014:1806-14. eCollection 2014.
6
Veterinary syndromic surveillance: Current initiatives and potential for development.兽医综合征监测:当前的举措和发展潜力。
Prev Vet Med. 2011 Aug 1;101(1-2):1-17. doi: 10.1016/j.prevetmed.2011.05.004. Epub 2011 Jun 2.
7
Cardiology record multi-label classification using latent Dirichlet allocation.使用潜在狄利克雷分配进行心脏病学记录的多标签分类。
Comput Methods Programs Biomed. 2018 Oct;164:111-119. doi: 10.1016/j.cmpb.2018.07.002. Epub 2018 Jul 17.
8
The passive surveillance of ticks using companion animal electronic health records.利用伴侣动物电子健康记录对蜱虫进行被动监测。
Epidemiol Infect. 2017 Jul;145(10):2020-2029. doi: 10.1017/S0950268817000826. Epub 2017 May 2.
9
Explainable ICD multi-label classification of EHRs in Spanish with convolutional attention.基于卷积注意力的西班牙语电子病历可解释 ICD 多标签分类。
Int J Med Inform. 2022 Jan;157:104615. doi: 10.1016/j.ijmedinf.2021.104615. Epub 2021 Oct 29.
10
A Pseudo Label-Wise Attention Network for Automatic ICD Coding.基于伪标签注意力网络的 ICD 自动编码方法。
IEEE J Biomed Health Inform. 2022 Oct;26(10):5201-5212. doi: 10.1109/JBHI.2022.3193291. Epub 2022 Oct 5.

引用本文的文献

1
Developing electronic health records as a source of real-world data for veterinary pharmacoepidemiology.开发电子健康记录作为兽医药物流行病学真实世界数据的来源。
Front Vet Sci. 2025 Apr 1;12:1550468. doi: 10.3389/fvets.2025.1550468. eCollection 2025.
2
Premature mortality analysis of 52,000 deceased cats and dogs exposes socioeconomic disparities.52000 只死亡猫犬的过早死亡率分析揭示了社会经济差异。
Sci Rep. 2024 Nov 20;14(1):28763. doi: 10.1038/s41598-024-77385-8.
3
Text mining for disease surveillance in veterinary clinical data: part two, training computers to identify features in clinical text.

本文引用的文献

1
A systematic review of the prediction of hospital length of stay: Towards a unified framework.住院时间预测的系统评价:迈向统一框架
PLOS Digit Health. 2022 Apr 14;1(4):e0000017. doi: 10.1371/journal.pdig.0000017. eCollection 2022 Apr.
2
Accessing veterinary healthcare during the COVID-19 pandemic: A mixed-methods analysis of UK and Republic of Ireland dog owners' concerns and experiences.在 COVID-19 大流行期间获得兽医保健服务:对英、爱两国狗主人关注和经历的混合方法分析。
Vet Rec. 2022 Aug;191(3):e1681. doi: 10.1002/vetr.1681. Epub 2022 May 5.
3
Using topic modelling for unsupervised annotation of electronic health records to identify an outbreak of disease in UK dogs.
用于兽医临床数据疾病监测的文本挖掘:第二部分,训练计算机识别临床文本中的特征。
Front Vet Sci. 2024 Aug 22;11:1352726. doi: 10.3389/fvets.2024.1352726. eCollection 2024.
4
Explainable text-tabular models for predicting mortality risk in companion animals.用于预测伴侣动物死亡风险的可解释文本-表格模型。
Sci Rep. 2024 Jun 20;14(1):14217. doi: 10.1038/s41598-024-64551-1.
使用主题建模对电子健康记录进行无监督标注,以识别英国犬群中的疾病爆发。
PLoS One. 2021 Dec 9;16(12):e0260402. doi: 10.1371/journal.pone.0260402. eCollection 2021.
4
AZD1222/ChAdOx1 nCoV-19 vaccination induces a polyfunctional spike protein-specific T1 response with a diverse TCR repertoire.AZD1222/ChAdOx1 nCoV-19 疫苗接种诱导具有多样化 TCR 谱的多功能 Spike 蛋白特异性 T1 反应。
Sci Transl Med. 2021 Nov 17;13(620):eabj7211. doi: 10.1126/scitranslmed.abj7211.
5
SARS-CoV-2 neutralising antibodies in dogs and cats in the United Kingdom.英国犬猫体内的新型冠状病毒中和抗体
Curr Res Virol Sci. 2021;2:100011. doi: 10.1016/j.crviro.2021.100011. Epub 2021 Aug 5.
6
Outbreak of Severe Vomiting in Dogs Associated with a Canine Enteric Coronavirus, United Kingdom.英国一起犬传染性肠冠状病毒相关的严重呕吐疫情。
Emerg Infect Dis. 2021 Feb;27(2):517-528. doi: 10.3201/eid2702.202452.
7
Machine Learning Comes of Age: Local Impact versus National Generalizability.机器学习走向成熟:局部影响与全国通用性
Anesthesiology. 2020 May;132(5):939-941. doi: 10.1097/ALN.0000000000003223.
8
Venn-diaNet : venn diagram based network propagation analysis framework for comparing multiple biological experiments.Venn-diaNet:基于韦恩图的网络传播分析框架,用于比较多个生物学实验。
BMC Bioinformatics. 2019 Dec 27;20(Suppl 23):667. doi: 10.1186/s12859-019-3302-7.
9
Potential loss of revenue due to errors in clinical coding during the implementation of the Malaysia diagnosis related group (MY-DRG) Casemix system in a teaching hospital in Malaysia.马来西亚一家教学医院在实施马来西亚诊断相关分组(MY-DRG)病例组合系统过程中,因临床编码错误导致的潜在收入损失。
BMC Health Serv Res. 2018 Jan 25;18(1):38. doi: 10.1186/s12913-018-2843-1.
10
Small animal disease surveillance: GI disease and salmonellosis.小动物疾病监测:胃肠道疾病和沙门氏菌病。
Vet Rec. 2017 Sep 2;181(9):228-232. doi: 10.1136/vr.j3642.