Suppr超能文献

应用分类回归树评估 pH 值预测的地域特异性。

Assessing the geographic specificity of pH prediction by classification and regression trees.

机构信息

Department of Biochemistry, Northeastern University, Boston, Massachusetts, United States of America.

Department of Science, New Rochelle High School, New Rochelle, New York, United States of America.

出版信息

PLoS One. 2021 Aug 11;16(8):e0255119. doi: 10.1371/journal.pone.0255119. eCollection 2021.

Abstract

Soil pH effects a wide range of critical biogeochemical processes that dictate plant growth and diversity. Previous literature has established the capacity of classification and regression trees (CARTs) to predict soil pH, but limitations of CARTs in this context have not been fully explored. The current study collected soil pH, climatic, and topographic data from 100 locations across New York's Temperate Deciduous Forests (in the United States of America) to investigate the extrapolative capacity of a previously developed CART model as compared to novel CART and random forest (RF) models. Results showed that the previously developed CART underperformed in terms of predictive accuracy (RRMSE = 14.52%) when compared to a novel tree (RRMSE = 9.33%), and that a novel random forest outperformed both models (RRMSE = 8.88%), though its predictions did not differ significantly from the novel tree (p = 0.26). The most important predictors for model construction were climatic factors. These findings confirm existing reports that CART models are constrained by the spatial autocorrelation of geographic data and encourage the restricted application of relevant machine learning models to regions from which training data was collected. They also contradict previous literature implying that random forests should meaningfully boost the predictive accuracy of CARTs in the context of soil pH.

摘要

土壤 pH 值影响着广泛的关键生物地球化学过程,这些过程决定着植物的生长和多样性。先前的文献已经确立了分类回归树(CART)预测土壤 pH 值的能力,但在这种情况下,CART 的局限性尚未得到充分探索。本研究从美国纽约温带落叶林的 100 个地点收集了土壤 pH 值、气候和地形数据,以调查先前开发的 CART 模型与新的 CART 和随机森林(RF)模型相比的外推能力。结果表明,与新树(RRMSE=9.33%)相比,先前开发的 CART 在预测精度方面表现不佳(RRMSE=14.52%),而新的随机森林在两种模型中表现最佳(RRMSE=8.88%),尽管其预测结果与新树没有显著差异(p=0.26)。模型构建的最重要预测因子是气候因素。这些发现证实了现有的报告,即 CART 模型受到地理数据空间自相关的限制,并鼓励将相关机器学习模型限制应用于从其收集训练数据的区域。它们还与先前的文献相矛盾,这些文献暗示在土壤 pH 值的背景下,随机森林应该显著提高 CART 的预测精度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecaa/8357141/a3ef13cd87f9/pone.0255119.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验