• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

贝叶斯方法:将伤害叙述分类为原因组的有用工具。

Bayesian methods: a useful tool for classifying injury narratives into cause groups.

机构信息

School of Industrial Engineering, Purdue University, 1287 Grissom Hall, West Lafayette, IN 47907, USA.

出版信息

Inj Prev. 2009 Aug;15(4):259-65. doi: 10.1136/ip.2008.021337.

DOI:10.1136/ip.2008.021337
PMID:19652000
Abstract

To compare two Bayesian methods (Fuzzy and Naïve) for classifying injury narratives in large administrative databases into event cause groups, a dataset of 14 000 narratives was randomly extracted from claims filed with a worker's compensation insurance provider. Two expert coders assigned one-digit and two-digit Bureau of Labor Statistics (BLS) Occupational Injury and Illness Classification event codes to each narrative. The narratives were separated into a training set of 11 000 cases and a prediction set of 3000 cases. The training set was used to develop two Bayesian classifiers that assigned BLS codes to narratives. Each model was then evaluated for the prediction set. Both models performed well and tended to predict one-digit BLS codes more accurately than two-digit codes. The overall sensitivity of the Fuzzy method was, respectively, 78% and 64% for one-digit and two-digit codes, specificity was 93% and 95%, and positive predictive value (PPV) was 78% and 65%. The Naïve method showed similar accuracy: a sensitivity of 80% and 70%, specificity of 96% and 97%, and PPV of 80% and 70%. For large administrative databases, Bayesian methods show significant promise as a means of classifying injury narratives into cause groups. Overall, Naïve Bayes provided slightly more accurate predictions than Fuzzy Bayes.

摘要

为了比较两种贝叶斯方法(模糊和朴素)在将大型行政数据库中的伤害叙述分类为事件原因组中的应用,从一家工人赔偿保险公司提交的索赔中随机提取了 14000 个叙述的数据集。两位专家编码员为每个叙述分配了一个位数和两位数的 BLS(劳工统计局)职业伤害和疾病分类事件代码。叙述被分为 11000 个案例的训练集和 3000 个案例的预测集。使用训练集开发了两种将 BLS 代码分配给叙述的贝叶斯分类器。然后,对每种模型在预测集中进行了评估。两种模型的性能都很好,并且倾向于更准确地预测一位数 BLS 代码而不是两位数代码。模糊方法的总体敏感性分别为一位数和两位数代码的 78%和 64%,特异性分别为 93%和 95%,阳性预测值(PPV)分别为 78%和 65%。朴素方法表现出类似的准确性:一位数和两位数代码的敏感性分别为 80%和 70%,特异性分别为 96%和 97%,PPV 分别为 80%和 70%。对于大型行政数据库,贝叶斯方法作为将伤害叙述分类为原因组的一种手段具有很大的应用潜力。总体而言,朴素贝叶斯的预测结果比模糊贝叶斯略为准确。

相似文献

1
Bayesian methods: a useful tool for classifying injury narratives into cause groups.贝叶斯方法:将伤害叙述分类为原因组的有用工具。
Inj Prev. 2009 Aug;15(4):259-65. doi: 10.1136/ip.2008.021337.
2
A combined Fuzzy and Naive Bayesian strategy can be used to assign event codes to injury narratives.一种组合的模糊和朴素贝叶斯策略可用于为伤害描述分配事件代码。
Inj Prev. 2011 Dec;17(6):407-14. doi: 10.1136/ip.2010.030593. Epub 2011 Apr 11.
3
A practical tool for public health surveillance: Semi-automated coding of short injury narratives from large administrative databases using Naïve Bayes algorithms.一种用于公共卫生监测的实用工具:使用朴素贝叶斯算法对来自大型行政数据库的简短伤害描述进行半自动编码。
Accid Anal Prev. 2015 Nov;84:165-76. doi: 10.1016/j.aap.2015.06.014. Epub 2015 Sep 26.
4
Classifying injury narratives of large administrative databases for surveillance-A practical approach combining machine learning ensembles and human review.用于监测的大型行政数据库损伤叙述分类——一种结合机器学习集成和人工审核的实用方法。
Accid Anal Prev. 2017 Jan;98:359-371. doi: 10.1016/j.aap.2016.10.014. Epub 2016 Nov 15.
5
Near-miss narratives from the fire service: a Bayesian analysis.消防部门的险些事故叙述:贝叶斯分析。
Accid Anal Prev. 2014 Jan;62:119-29. doi: 10.1016/j.aap.2013.09.012. Epub 2013 Oct 1.
6
Capturing cases in workers' compensation databases: the example of neck pain.在工伤赔偿数据库中获取病例:以颈部疼痛为例。
Am J Ind Med. 2006 Jul;49(7):557-68. doi: 10.1002/ajim.20329.
7
Concordance between ANSI occupational back injury codes and claim form diagnoses and a lower bound estimate of the fraction associated with disc displacement/herniation.美国国家标准学会(ANSI)职业性背部损伤代码与索赔表诊断之间的一致性以及与椎间盘移位/突出相关部分的下限估计。
Am J Ind Med. 1996 Nov;30(5):556-68. doi: 10.1002/(SICI)1097-0274(199611)30:5<556::AID-AJIM4>3.0.CO;2-R.
8
Bayesian logistic injury severity score: a method for predicting mortality using international classification of disease-9 codes.贝叶斯逻辑斯蒂损伤严重度评分:一种使用国际疾病分类第9版编码预测死亡率的方法。
Acad Emerg Med. 2008 May;15(5):466-75. doi: 10.1111/j.1553-2712.2008.00105.x.
9
Computerized coding of injury narrative data from the National Health Interview Survey.来自美国国家健康访谈调查的伤害叙述数据的计算机编码。
Accid Anal Prev. 2004 Mar;36(2):165-71. doi: 10.1016/s0001-4575(02)00146-x.
10
Comparison of methods for auto-coding causation of injury narratives.损伤描述因果关系自动编码方法的比较
Accid Anal Prev. 2016 Mar;88:117-23. doi: 10.1016/j.aap.2015.12.006. Epub 2015 Dec 30.

引用本文的文献

1
Workers' compensation claim counts and rates by injury event/exposure among state-insured private employers in Ohio, 2007-2017.2007-2017 年俄亥俄州参保私营雇主按伤害事件/暴露分类的工人赔偿索赔数和费率。
J Safety Res. 2021 Dec;79:148-167. doi: 10.1016/j.jsr.2021.08.015. Epub 2021 Sep 17.
2
Testing and Validating Semi-automated Approaches to the Occupational Exposure Assessment of Polycyclic Aromatic Hydrocarbons.测试和验证多环芳烃职业暴露评估的半自动方法。
Ann Work Expo Health. 2021 Jul 3;65(6):682-693. doi: 10.1093/annweh/wxab002.
3
Applying Machine Learning to Workers' Compensation Data to Identify Industry-Specific Ergonomic and Safety Prevention Priorities: Ohio, 2001 to 2011.
应用机器学习分析工人赔偿数据,以确定特定行业的人体工程学和安全预防重点:俄亥俄州,2001 年至 2011 年。
J Occup Environ Med. 2018 Jan;60(1):55-73. doi: 10.1097/JOM.0000000000001162.
4
Computerized "Learn-As-You-Go" classification of traumatic brain injuries using NEISS narrative data.利用国家电子伤害监督系统(NEISS)叙述性数据对创伤性脑损伤进行计算机化的“边学边分类”。
Accid Anal Prev. 2016 Apr;89:111-7. doi: 10.1016/j.aap.2016.01.012. Epub 2016 Feb 3.
5
Comparison of methods for auto-coding causation of injury narratives.损伤描述因果关系自动编码方法的比较
Accid Anal Prev. 2016 Mar;88:117-23. doi: 10.1016/j.aap.2015.12.006. Epub 2015 Dec 30.
6
Harnessing information from injury narratives in the 'big data' era: understanding and applying machine learning for injury surveillance.在“大数据”时代利用伤害叙事中的信息:理解并应用机器学习进行伤害监测。
Inj Prev. 2016 Apr;22 Suppl 1(Suppl 1):i34-42. doi: 10.1136/injuryprev-2015-041813. Epub 2016 Jan 4.
7
Workers' compensation claims for musculoskeletal disorders among wholesale and retail trade industry workers--Ohio, 2005-2009.批发和零售贸易行业工人肌肉骨骼疾病的工人赔偿索赔——俄亥俄州,2005-2009 年。
MMWR Morb Mortal Wkly Rep. 2013 Jun 7;62(22):437-42.
8
Development and evaluation of a Naïve Bayesian model for coding causation of workers' compensation claims.开发和评估用于编码工人赔偿索赔因果关系的朴素贝叶斯模型。
J Safety Res. 2012 Dec;43(5-6):327-32. doi: 10.1016/j.jsr.2012.10.012. Epub 2012 Nov 1.