• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在空间数据科学中的可解释性预测因子:信息视界。

On the interpretability of predictors in spatial data science: the information horizon.

机构信息

Cluster of Excellence "Machine Learning: New Perspectives for Science", Eberhard Karls University Tuebingen, Maria von Linden Str. 6, 72076, Tübingen, Germany.

Soil and Spatial Data Science, Soilution GbR, Heiligegeiststrasse 13, 06484, Quedlinburg, Germany.

出版信息

Sci Rep. 2020 Oct 7;10(1):16737. doi: 10.1038/s41598-020-73773-y.

DOI:10.1038/s41598-020-73773-y
PMID:33028910
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7542468/
Abstract

Two important theories in spatial modelling relate to structural and spatial dependence. Structural dependence refers to environmental state-factor models, where an environmental property is modelled as a function of the states and interactions of environmental predictors, such as climate, parent material or relief. Commonly, the functions are regression or supervised classification algorithms. Spatial dependence is present in most environmental properties and forms the basis for spatial interpolation and geostatistics. In machine learning, modelling with geographic coordinates or Euclidean distance fields, which resemble linear variograms with infinite ranges, can produce similar interpolations. Interpolations do not lend themselves to causal interpretations. Conversely, with structural modeling, one can, potentially, extract knowledge from the modelling. Two important characteristics of such interpretable environmental modelling are scale and information content. Scale is relevant because very coarse scale predictors can show nearly infinite ranges, falling out of what we call the information horizon, i.e. interpretation using domain knowledge isn't possible. Regarding information content, recent studies have shown that meaningless predictors, such as paintings or photographs of faces, can be used for spatial environmental modelling of ecological and soil properties, with accurate evaluation statistics. Here, we examine under which conditions modelling with such predictors can lead to accurate statistics and whether an information horizon can be derived for scale and information content.

摘要

空间建模中有两个重要的理论,分别与结构性和空间依赖性有关。结构性依赖指的是环境状态因子模型,其中环境属性被建模为环境预测因子(如气候、母质或地形)的状态和相互作用的函数。通常,这些函数是回归或有监督分类算法。空间依赖性存在于大多数环境属性中,是空间插值和地统计学的基础。在机器学习中,使用地理坐标或欧几里得距离场建模,其类似于具有无限范围的线性变程,可以产生类似的插值。插值不适用于因果解释。相反,通过结构性建模,人们可以从建模中提取知识。这种可解释的环境建模的两个重要特征是尺度和信息含量。尺度是相关的,因为非常粗糙的尺度预测因子可能会显示出几乎无限的范围,超出了我们所说的信息范围,即使用领域知识进行解释是不可能的。关于信息含量,最近的研究表明,无意义的预测因子(如绘画或人脸照片)可以用于生态和土壤属性的空间环境建模,并且具有准确的评估统计数据。在这里,我们研究了在何种条件下,使用此类预测因子进行建模可以导致准确的统计数据,以及是否可以为尺度和信息含量推导信息范围。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/f9a1f0c17928/41598_2020_73773_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/c4ebe37c460d/41598_2020_73773_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/5612be4f6822/41598_2020_73773_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/69e5fa9a444c/41598_2020_73773_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/b4b939c129d8/41598_2020_73773_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/d34ba0810ef8/41598_2020_73773_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/be8b995ec74e/41598_2020_73773_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/8ba99a5cdda5/41598_2020_73773_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/f9a1f0c17928/41598_2020_73773_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/c4ebe37c460d/41598_2020_73773_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/5612be4f6822/41598_2020_73773_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/69e5fa9a444c/41598_2020_73773_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/b4b939c129d8/41598_2020_73773_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/d34ba0810ef8/41598_2020_73773_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/be8b995ec74e/41598_2020_73773_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/8ba99a5cdda5/41598_2020_73773_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5487/7542468/f9a1f0c17928/41598_2020_73773_Fig8_HTML.jpg

相似文献

1
On the interpretability of predictors in spatial data science: the information horizon.在空间数据科学中的可解释性预测因子:信息视界。
Sci Rep. 2020 Oct 7;10(1):16737. doi: 10.1038/s41598-020-73773-y.
2
Contextual spatial modelling in the horizontal and vertical domains.上下文空间建模在水平和垂直领域。
Sci Rep. 2022 Jun 9;12(1):9496. doi: 10.1038/s41598-022-13514-5.
3
The relevant range of scales for multi-scale contextual spatial modelling.多尺度上下文空间建模的相关尺度范围。
Sci Rep. 2019 Oct 15;9(1):14800. doi: 10.1038/s41598-019-51395-3.
4
Evaluating heterogeneity in indoor and outdoor air pollution using land-use regression and constrained factor analysis.利用土地利用回归和约束因子分析评估室内和室外空气污染的异质性。
Res Rep Health Eff Inst. 2010 Dec(152):5-80; discussion 81-91.
5
Comparison of Machine Learning and Land Use Regression for fine scale spatiotemporal estimation of ambient air pollution: Modeling ozone concentrations across the contiguous United States.机器学习和土地利用回归在精细时空估算环境空气污染中的比较:在美国大陆范围内模拟臭氧浓度。
Environ Int. 2020 Sep;142:105827. doi: 10.1016/j.envint.2020.105827. Epub 2020 Jun 25.
6
High Resolution Mapping of Soil Properties Using Remote Sensing Variables in South-Western Burkina Faso: A Comparison of Machine Learning and Multiple Linear Regression Models.利用遥感变量对布基纳法索西南部土壤特性进行高分辨率制图:机器学习与多元线性回归模型的比较
PLoS One. 2017 Jan 23;12(1):e0170478. doi: 10.1371/journal.pone.0170478. eCollection 2017.
7
GIS, geostatistics, metadata banking, and tree-based models for data analysis and mapping in environmental monitoring and epidemiology.地理信息系统、地统计学、元数据库以及用于环境监测和流行病学数据分析与绘图的树状模型。
Int J Med Microbiol. 2006 May;296 Suppl 40:23-36. doi: 10.1016/j.ijmm.2006.02.015. Epub 2006 Apr 4.
8
Geographically weighted regression and geostatistical techniques to construct the geogenic radon potential map of the Lazio region: A methodological proposal for the European Atlas of Natural Radiation.用于构建拉齐奥地区地源氡潜能图的地理加权回归和地质统计技术:欧洲自然辐射地图集的方法建议
J Environ Radioact. 2017 Jan;166(Pt 2):355-375. doi: 10.1016/j.jenvrad.2016.05.010. Epub 2016 May 27.
9
Mapping the geogenic radon potential for Germany by machine learning.基于机器学习的德国地球成因氡潜力图绘制。
Sci Total Environ. 2021 Feb 1;754:142291. doi: 10.1016/j.scitotenv.2020.142291. Epub 2020 Sep 14.
10
Modelling non-stationary spatial covariance structure from space-time monitoring data.从时空监测数据中建模非平稳空间协方差结构。
Ciba Found Symp. 1997;210:38-48; discussion 48-51, 68-78. doi: 10.1002/9780470515419.ch4.

引用本文的文献

1
Stochastic lithofacies and petrophysical property modeling for fast history matching in heterogeneous clastic reservoir applications.用于非均质碎屑岩储层快速历史拟合的随机岩相和岩石物理性质建模
Sci Rep. 2024 Jan 2;14(1):22. doi: 10.1038/s41598-023-50853-3.
2
Contextual spatial modelling in the horizontal and vertical domains.上下文空间建模在水平和垂直领域。
Sci Rep. 2022 Jun 9;12(1):9496. doi: 10.1038/s41598-022-13514-5.
3
Towards a Situated Spatial Epidemiology of Violence: A Placially-Informed Geospatial Analysis of Homicide in Alagoas, Brazil.
迈向暴力的情境空间流行病学:巴西阿拉戈斯州杀人案的地点信息启发的地理空间分析。
Int J Environ Res Public Health. 2020 Dec 11;17(24):9283. doi: 10.3390/ijerph17249283.