Suppr超能文献

使用手持式近红外分光光度计构建近红外(NIR)土壤光谱数据集和预测性机器学习模型。

Building a near-infrared (NIR) soil spectral dataset and predictive machine learning models using a handheld NIR spectrophotometer.

作者信息

Partida Colleen, Safanelli Jose Lucas, Mitu Sadia Mannan, Murad Mohammad Omar Faruk, Ge Yufeng, Ferguson Richard, Shepherd Keith, Sanderman Jonathan

机构信息

Woodwell Climate Research Center, 149 Woods Hole Rd., Falmouth, MA, 02540, United States.

Department of Biological Systems Engineering, University of Nebraska-Lincoln, E Campus Mall, Lincoln, NE, 68583, United States.

出版信息

Data Brief. 2024 Dec 16;58:111229. doi: 10.1016/j.dib.2024.111229. eCollection 2025 Feb.

Abstract

This near-infrared spectral dataset consists of 2,106 diverse mineral soil samples scanned, on average, on six different units of the same low-cost commercially available handheld spectrophotometer. Most soil samples were selected from the USDA NRCS National Soil Survey Center-Kellogg Soil Survey Laboratory (NSSC-KSSL) soil archives to represent the diversity of mineral soils (0-30 cm) found in the United States, while 90 samples were selected from Ghana, Kenya, and Nigeria to represent available African soils in the same archive. All scanning was performed on dried and sieved (<2 mm) soil samples. Machine learning predictive models were developed for soil organic carbon (SOC), pH, bulk density (BD), carbonate (CaCO3), exchangeable potassium (Ex. K), sand, silt, and clay content from their spectra in the R programming language using most of this dataset (1,976 US soils) and are included in this data release. Two model types, Cubist and partial least squares regression (PLSR) were developed using two strategies: (1) using an average of the spectral scans across devices for each sample and, (2) using the replicate spectral scans across devices for each sample. We present the internal performance of these models here. The dry spectra and Cubist models for these soil properties are available for download from 10.5281/zenodo.7586621. An example of detailed code used to produce these models is hosted at the Open Soil Spectral Library, a free service of the Soil Spectroscopy for the Global Good Network (soilspectroscopy.org), enabling broad use of these data for multiple soil monitoring applications.

摘要

这个近红外光谱数据集包含2106个不同的矿质土壤样本,平均在同一台低成本商用手持式分光光度计的六个不同单元上进行扫描。大多数土壤样本选自美国农业部自然资源保护局国家土壤调查中心 - 凯洛格土壤调查实验室(NSSC - KSSL)的土壤档案,以代表美国发现的矿质土壤(0 - 30厘米)的多样性,而90个样本选自加纳、肯尼亚和尼日利亚,以代表同一档案中可用的非洲土壤。所有扫描均在干燥并过筛(<2毫米)的土壤样本上进行。使用该数据集的大部分(1976个美国土壤样本),在R编程语言中根据土壤光谱开发了土壤有机碳(SOC)、pH值、容重(BD)、碳酸盐(CaCO3)、交换性钾(Ex. K)、砂、粉砂和粘土含量的机器学习预测模型,并包含在本次数据发布中。使用两种策略开发了两种模型类型,即Cubist模型和偏最小二乘回归(PLSR)模型:(1)对每个样本使用跨设备光谱扫描的平均值,以及(2)对每个样本使用跨设备的重复光谱扫描。我们在此展示这些模型的内部性能。这些土壤属性的干燥光谱和Cubist模型可从10.5281/zenodo.7586621下载。用于生成这些模型的详细代码示例托管在开放土壤光谱库中,这是土壤光谱造福全球网络(soilspectroscopy.org)的一项免费服务,可使这些数据广泛用于多种土壤监测应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c0ac/11731769/c67e3d681e78/gr1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验