The use of Benford's law for evaluation of quality of occupational hygiene data.

Author information

De Vocht Frank, Kromhout Hans

Affiliation

Centre for Occupational and Environmental Health, School of Community Based Medicine, Manchester Academic Health Science Centre, The University of Manchester, Ellen Wilkinson Building, Oxford Road, Manchester M13 9PL, UK.

Publication information

Ann Occup Hyg. 2013 Apr;57(3):296-304. doi: 10.1093/annhyg/mes067. Epub 2012 Sep 20.

DOI:10.1093/annhyg/mes067

PMID:22997413

Abstract

Benford's law is the contra-intuitive empirical observation that the digits 1-9 are not equally likely to appear as the initial digit in numbers resulting from the same phenomenon. Manipulated, unrelated, or created numbers usually do not follow Benford's law, and as such this law has been used in the investigation of fraudulent data in, for example, accounting and to identify errors in data sets due to, for example, data transfer. We describe the use of Benford's law to screen occupational hygiene measurement data sets using exposure data from the European rubber manufacturing industry as an illustration. Two rubber process dust measurement data sets added to the European Union ExAsRub project but initially collected by the UK Health and Safety Executive (HSE) and British Rubber Manufacturers' Association (BRMA) and one pre- and one post-treatment n-nitrosamines data set collated in the German MEGA database and also added to the ExAsRub database were compared with the expected first-digit (1BL) and second-digit (2BL) Benford distributions. Evaluation indicated only small deviations from the expected 1BL and 2BL distributions for the data sets collated by the UK HSE and industry (BRMA), respectively, while for the MEGA data larger deviations were observed. To a large extent the latter could be attributed to imputation and replacement by a constant of n-nitrosamine measurements below the limit of detection, but further evaluation of these data to determine why other deviations from 1BL and 2BL expected distributions exist may be beneficial. Benford's law is a straightforward and easy-to-implement analytical tool to evaluate the quality of occupational hygiene data sets, and as such can be used to detect potential problems in large data sets that may be caused by malcontent a priori or a posteriori manipulation of data sets and by issues like treatment of observations below the limit of detection, rounding and transfer of data.
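The screening idea in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch: the function names are hypothetical, the data are synthetic (a log-uniform grid, not the ExAsRub measurements), and the Pearson chi-square statistic is an assumed choice of goodness-of-fit measure, since the abstract does not say which statistic the authors used. The second half mimics the effect the abstract attributes to the MEGA data, where values below the limit of detection (LOD) are replaced by a constant (here LOD/2, also an assumption).

```python
import math
from collections import Counter


def benford_first_digit():
    """Expected first-digit (1BL) probabilities for digits 1-9."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def benford_second_digit():
    """Expected second-digit (2BL) probabilities for digits 0-9."""
    return {
        d2: sum(math.log10(1 + 1 / (10 * d1 + d2)) for d1 in range(1, 10))
        for d2 in range(10)
    }


def significant_digits(value):
    """Return the significant digits of a positive number as a string."""
    # Scientific notation puts the leading non-zero digit first.
    mantissa = f"{abs(value):.14e}".split("e")[0]
    return mantissa.replace(".", "")


def chi_square(values, position, expected):
    """Pearson chi-square of the digit at `position` against `expected`."""
    digits = [int(significant_digits(v)[position]) for v in values if v > 0]
    counts = Counter(digits)
    n = len(digits)
    return sum(
        (counts.get(d, 0) - n * p) ** 2 / (n * p) for d, p in expected.items()
    )


# Synthetic "measurements": evenly spaced in log10 over three decades,
# which conforms closely to Benford's law.
clean = [10 ** (k / 200) for k in range(600)]

# Hypothetical LOD handling: every value below the LOD becomes LOD / 2.
LOD = 5.0
censored = [v if v >= LOD else LOD / 2 for v in clean]

# Critical chi-square value at alpha = 0.05 with 8 degrees of freedom
# (nine first-digit categories).
CRITICAL_1BL = 15.507

p1 = benford_first_digit()
p2 = benford_second_digit()
chi_clean = chi_square(clean, 0, p1)
chi_censored = chi_square(censored, 0, p1)

print(f"1BL chi-square, clean data:    {chi_clean:.2f}")
print(f"1BL chi-square, censored data: {chi_censored:.2f}")
print(f"2BL chi-square, censored data: {chi_square(censored, 1, p2):.2f}")
```

In this sketch the substituted constant piles extra observations onto a single first digit (2) and a single second digit (5), which is the kind of deviation a Benford screen is meant to flag for closer inspection rather than a proof of manipulation.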


Similar articles

The use of Benford's law for evaluation of quality of occupational hygiene data.

Ann Occup Hyg. 2013 Apr;57(3):296-304. doi: 10.1093/annhyg/mes067. Epub 2012 Sep 20.

Agreement of drug discovery data with Benford's law.

Expert Opin Drug Discov. 2013 Jan;8(1):1-5. doi: 10.1517/17460441.2013.740007. Epub 2012 Nov 3.

Benford's Law and the screening of analytical data: the case of pollutant concentrations in ambient air.

Analyst. 2005 Sep;130(9):1280-5. doi: 10.1039/b504462f. Epub 2005 Jul 26.

Use of Benford's law in drug discovery data.

Drug Discov Today. 2010 May;15(9-10):328-31. doi: 10.1016/j.drudis.2010.03.003. Epub 2010 Mar 16.

Benford's law and the quality of occupational hygiene data.

Ann Occup Hyg. 2014 Apr;58(3):397-400. doi: 10.1093/annhyg/meu003. Epub 2014 Feb 15.

Benford's Law for Quality Assurance of Manner of Death Counts in Small and Large Databases.

J Forensic Sci. 2017 Sep;62(5):1326-1331. doi: 10.1111/1556-4029.13437. Epub 2017 Jun 20.

A database of exposures in the rubber manufacturing industry: design and quality control.

Ann Occup Hyg. 2005 Nov;49(8):691-701. doi: 10.1093/annhyg/mei035. Epub 2005 Aug 26.

Using the Benford's Law as a First Step to Assess the Quality of the Cancer Registry Data.

Front Public Health. 2016 Oct 13;4:225. doi: 10.3389/fpubh.2016.00225. eCollection 2016.

Benford's Law: textbook exercises and multiple-choice testbanks.

PLoS One. 2015 Feb 17;10(2):e0117972. doi: 10.1371/journal.pone.0117972. eCollection 2015.

Evaluation of Large-scale Data to Detect Irregularity in Payment for Medical Services. An Extended Use of Benford's Law.

Methods Inf Med. 2016 May 17;55(3):284-91. doi: 10.3414/ME15-01-0076. Epub 2016 Apr 20.

Cited by

Injuries and fatalities in Colombian mining emergencies (2005-2018): a retrospective ecological study.

Rev Bras Med Trab. 2023 Feb 13;20(4):591-598. doi: 10.47626/1679-4435-2022-799. eCollection 2022 Oct-Dec.

Job-exposure matrix for historical exposures to rubber dust, rubber fumes and n-Nitrosamines in the British rubber industry.

Occup Environ Med. 2019 Apr;76(4):259-267. doi: 10.1136/oemed-2018-105182. Epub 2019 Feb 16.

An analysis of bibliometric indicators to JCR according to Benford's law.

Scientometrics. 2016;107:1489-1499. doi: 10.1007/s11192-016-1908-3. Epub 2016 Mar 15.
