• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

来自美国国家健康访谈调查的伤害叙述数据的计算机编码。

Computerized coding of injury narrative data from the National Health Interview Survey.

作者信息

Wellman Helen M, Lehto Mark R, Sorock Gary S, Smith Gordon S

机构信息

Liberty Mutual Research Institute for Safety, 71 Frankland Road, Hopkinton, MA 01748, USA.

出版信息

Accid Anal Prev. 2004 Mar;36(2):165-71. doi: 10.1016/s0001-4575(02)00146-x.

DOI:10.1016/s0001-4575(02)00146-x
PMID:14642871
Abstract

OBJECTIVE

To investigate the accuracy of a computerized method for classifying injury narratives into external-cause-of-injury and poisoning (E-code) categories.

METHODS

This study used injury narratives and corresponding E-codes assigned by experts from the 1997 and 1998 US National Health Interview Survey (NHIS). A Fuzzy Bayesian model was used to assign injury descriptions to 13 E-code categories. Sensitivity, specificity and positive predictive value were measured by comparing the computer generated codes with E-code categories assigned by experts.

RESULTS

The computer program correctly classified 4695 (82.7%) of the 5677 injury narratives when multiple words were included as keywords in the model. The use of multiple-word predictors compared with using single words alone improved both the sensitivity and specificity of the computer generated codes. The program is capable of identifying and filtering out cases that would benefit most from manual coding. For example, the program could be used to code the narrative if the maximum probability of a category given the keywords in the narrative was at least 0.9. If the maximum probability was lower than 0.9 (which will be the case for approximately 33% of the narratives) the case would be filtered out for manual review.

CONCLUSIONS

A computer program based on Fuzzy Bayes logic is capable of accurately categorizing cause-of-injury codes from injury narratives. The capacity to filter out certain cases for manual coding improves the utility of this process.

摘要

目的

探讨一种将损伤描述分类为损伤外因和中毒(E编码)类别的计算机方法的准确性。

方法

本研究使用了1997年和1998年美国国家健康访谈调查(NHIS)专家分配的损伤描述及相应的E编码。采用模糊贝叶斯模型将损伤描述分配到13个E编码类别中。通过将计算机生成的编码与专家分配的E编码类别进行比较,来测量敏感性、特异性和阳性预测值。

结果

当模型中包含多个单词作为关键词时,计算机程序正确分类了5677条损伤描述中的4695条(82.7%)。与仅使用单个单词相比,使用多个单词预测器提高了计算机生成编码的敏感性和特异性。该程序能够识别并筛选出最适合人工编码的案例。例如,如果给定损伤描述关键词的类别最大概率至少为0.9,则该程序可用于对损伤描述进行编码。如果最大概率低于0.9(约33%的损伤描述会出现这种情况),则该案例将被筛选出来进行人工审核。

结论

基于模糊贝叶斯逻辑的计算机程序能够准确地将损伤描述中的损伤原因编码进行分类。筛选出某些案例进行人工编码的能力提高了这一过程的实用性。

相似文献

1
Computerized coding of injury narrative data from the National Health Interview Survey.来自美国国家健康访谈调查的伤害叙述数据的计算机编码。
Accid Anal Prev. 2004 Mar;36(2):165-71. doi: 10.1016/s0001-4575(02)00146-x.
2
A combined Fuzzy and Naive Bayesian strategy can be used to assign event codes to injury narratives.一种组合的模糊和朴素贝叶斯策略可用于为伤害描述分配事件代码。
Inj Prev. 2011 Dec;17(6):407-14. doi: 10.1136/ip.2010.030593. Epub 2011 Apr 11.
3
Bayesian methods: a useful tool for classifying injury narratives into cause groups.贝叶斯方法:将伤害叙述分类为原因组的有用工具。
Inj Prev. 2009 Aug;15(4):259-65. doi: 10.1136/ip.2008.021337.
4
Accuracy of external cause of injury codes reported in Washington State hospital discharge records.华盛顿州医院出院记录中报告的伤害外部原因编码的准确性。
Inj Prev. 2001 Dec;7(4):334-8. doi: 10.1136/ip.7.4.334.
5
Classifying injury narratives of large administrative databases for surveillance-A practical approach combining machine learning ensembles and human review.用于监测的大型行政数据库损伤叙述分类——一种结合机器学习集成和人工审核的实用方法。
Accid Anal Prev. 2017 Jan;98:359-371. doi: 10.1016/j.aap.2016.10.014. Epub 2016 Nov 15.
6
A practical tool for public health surveillance: Semi-automated coding of short injury narratives from large administrative databases using Naïve Bayes algorithms.一种用于公共卫生监测的实用工具:使用朴素贝叶斯算法对来自大型行政数据库的简短伤害描述进行半自动编码。
Accid Anal Prev. 2015 Nov;84:165-76. doi: 10.1016/j.aap.2015.06.014. Epub 2015 Sep 26.
7
Near-miss narratives from the fire service: a Bayesian analysis.消防部门的险些事故叙述:贝叶斯分析。
Accid Anal Prev. 2014 Jan;62:119-29. doi: 10.1016/j.aap.2013.09.012. Epub 2013 Oct 1.
8
Machine learning of motor vehicle accident categories from narrative data.从叙述性数据中对机动车事故类别进行机器学习。
Methods Inf Med. 1996 Dec;35(4-5):309-16.
9
Improving autocoding performance of rare categories in injury classification: Is more training data or filtering the solution?提高伤害分类中罕见类别的自动编码性能:更多的训练数据还是过滤是解决方案?
Accid Anal Prev. 2018 Jan;110:115-127. doi: 10.1016/j.aap.2017.10.020. Epub 2017 Nov 8.
10
Accuracy of e-codes assigned to emergency department records.分配给急诊科记录的电子代码的准确性。
Acad Emerg Med. 1995 Jul;2(7):615-20. doi: 10.1111/j.1553-2712.1995.tb03599.x.

引用本文的文献

1
Computerized "Learn-As-You-Go" classification of traumatic brain injuries using NEISS narrative data.利用国家电子伤害监督系统(NEISS)叙述性数据对创伤性脑损伤进行计算机化的“边学边分类”。
Accid Anal Prev. 2016 Apr;89:111-7. doi: 10.1016/j.aap.2016.01.012. Epub 2016 Feb 3.
2
Comparison of methods for auto-coding causation of injury narratives.损伤描述因果关系自动编码方法的比较
Accid Anal Prev. 2016 Mar;88:117-23. doi: 10.1016/j.aap.2015.12.006. Epub 2015 Dec 30.
3
Harnessing information from injury narratives in the 'big data' era: understanding and applying machine learning for injury surveillance.
在“大数据”时代利用伤害叙事中的信息:理解并应用机器学习进行伤害监测。
Inj Prev. 2016 Apr;22 Suppl 1(Suppl 1):i34-42. doi: 10.1136/injuryprev-2015-041813. Epub 2016 Jan 4.
4
Injury narrative text classification using factorization model.基于因子分解模型的损伤叙事文本分类
BMC Med Inform Decis Mak. 2015;15 Suppl 1(Suppl 1):S5. doi: 10.1186/1472-6947-15-S1-S5. Epub 2015 May 20.
5
Development and evaluation of a Naïve Bayesian model for coding causation of workers' compensation claims.开发和评估用于编码工人赔偿索赔因果关系的朴素贝叶斯模型。
J Safety Res. 2012 Dec;43(5-6):327-32. doi: 10.1016/j.jsr.2012.10.012. Epub 2012 Nov 1.
6
Blurring the distinctions between on and off the job injuries: similarities and differences in circumstances.模糊工伤与非工伤之间的界限:情况中的异同
Inj Prev. 2006 Aug;12(4):236-41. doi: 10.1136/ip.2006.011676.
7
Using narrative text and coded data to develop hazard scenarios for occupational injury interventions.使用叙述性文本和编码数据来制定职业伤害干预的危险场景。
Inj Prev. 2004 Aug;10(4):249-54. doi: 10.1136/ip.2004.005181.