Center for Radiological Research, Columbia University Irving Medical Center, New York, NY, USA.
J Environ Radioact. 2023 Aug;264:107205. doi: 10.1016/j.jenvrad.2023.107205. Epub 2023 May 15.
Radioactive contamination of forests by long-lived radionuclides from nuclear accidents such as Chernobyl and Fukushima continues to be studied and quantitatively modeled. Whereas traditional statistical and machine learning (ML) techniques generate predictions by focusing on correlations between variables, quantification of causal effects of radioactivity deposition levels on contamination of plant tissues represents a more fundamental and relevant research goal. Modeling of cause-and-effect relationships is advantageous over standard predictive modeling, particularly by improving the generalizability of results to other situations, where the distributions of variables, including potential confounders, differ from those in the training data. Here we used the state-of-the-art causal forest (CF) algorithm to quantify the causal effect of Cs land contamination after the Fukushima accident on Cs activity concentrations in the wood of four common Japanese forest tree species: Hinoki cypress (Chamaecyparis obtusa), konara oak (Quercus serrata), red pine (Pinus densiflora), and Sugi cedar (Cryptomeria japonica). We estimated the average causal effect for the population, quantified how it was influenced by other environmental variables, and produced effect estimates at the individual level. The estimated causal effect was quite robust to various refutation methods, and was negatively influenced by high mean annual precipitation, elevation, and time after the accident. Wood subtype (e.g. sapwood, heartwood) and tree species made smaller contributions to the causal effect. We believe that causal ML techniques have promising potential in radiation ecology and can usefully expand the toolkit of modeling approaches available to researchers in this field.
放射性核素在森林中的长期污染一直受到研究和定量模型的关注,这些放射性核素来自切尔诺贝利和福岛等核事故。传统的统计和机器学习 (ML) 技术通过关注变量之间的相关性来生成预测,而放射性沉积水平对植物组织污染的因果效应的量化则代表了一个更基本和相关的研究目标。因果关系模型比标准预测模型具有优势,特别是通过提高结果对其他情况的泛化能力,其中包括潜在混杂因素在内的变量分布与训练数据不同。在这里,我们使用最先进的因果森林 (CF) 算法来量化福岛事故后 Cs 土地污染对四种常见日本森林树种木材中 Cs 活度浓度的因果效应:扁柏(Chamaecyparis obtusa)、柯那拉栎(Quercus serrata)、红松(Pinus densiflora)和柳杉(Cryptomeria japonica)。我们估计了种群的平均因果效应,量化了它如何受到其他环境变量的影响,并在个体水平上产生了效应估计。估计的因果效应对各种反驳方法都相当稳健,受年均降水量高、海拔和事故后时间的负面影响。木材亚型(如边材、心材)和树种对因果效应的贡献较小。我们相信因果 ML 技术在辐射生态学中具有广阔的应用前景,可以为该领域的研究人员提供有用的建模方法工具包扩展。