Kendall G M, Appleton J D, Chernyavskiy P, Arsham A, Little M P
Cancer Epidemiology Unit, NDPH, University of Oxford, Richard Doll Building, Old Road Campus, Headington, Oxford, OX3 7LF, UK.
British Geological Survey, Kingsley Dunham Centre, Nicker Hill, Keyworth, Nottingham, NG12 5GG, UK.
J Environ Radioact. 2025 Feb;282:107595. doi: 10.1016/j.jenvrad.2024.107595. Epub 2024 Dec 27.
We investigate methods that improve the estimation of indoor gamma ray dose rates at locations where measurements had not been made. These new predictions use a greater range of modelling techniques and larger variety of explanatory variables than our previous examinations of this subject. Specifically, we now employ three types of machine learning models in addition to the geostatistical, nearest neighbour and other earlier models. A large number of parameters, mostly describing the characteristics of dwellings in the area in question, have been added to the set of explanatory variables. The use of machine learning methods results in significantly improved predictions over earlier models. The machine learning models are noisy and there is some instability in the relative importance of particular explanatory variables although there are general and consistent tendencies supporting the importance of certain classes of variable. However, the range of predicted indoor gamma ray dose rates is much smaller than that of the measurements. It is probable that epidemiological studies using such predictions will have lower statistical power than those based on direct measurements.
我们研究了在未进行测量的地点改进室内伽马射线剂量率估算的方法。与我们之前对该主题的研究相比,这些新的预测使用了更广泛的建模技术和更多种类的解释变量。具体而言,除了地质统计、最近邻和其他早期模型之外,我们现在还采用了三种类型的机器学习模型。大量参数(大多描述相关区域内住宅的特征)已被添加到解释变量集中。与早期模型相比,机器学习方法的使用使得预测有了显著改进。机器学习模型存在噪声,特定解释变量的相对重要性存在一些不稳定性,尽管存在支持某些变量类重要性的一般且一致的趋势。然而,预测的室内伽马射线剂量率范围比测量值的范围小得多。使用此类预测的流行病学研究的统计效力可能低于基于直接测量的研究。