Suppr超能文献

预测近期收获的番茄和番茄萼片对未来真菌感染的敏感性。

Predicting sensitivity of recently harvested tomatoes and tomato sepals to future fungal infections.

机构信息

BioSense Institute, University of Novi Sad, 21000, Novi Sad, Serbia.

Wageningen University and Research, 6708 PB, Wageningen, The Netherlands.

出版信息

Sci Rep. 2021 Nov 30;11(1):23109. doi: 10.1038/s41598-021-02302-2.

Abstract

Tomato is an important commercial product which is perishable by nature and highly susceptible to fungal incidence once it is harvested. Not all tomatoes are equally vulnerable to pathogenic fungi, and an early detection of the vulnerable ones can help in taking timely preventive actions, ranging from isolating tomato batches to adjusting storage conditions, but also in making right business decisions like dynamic pricing based on quality or better shelf life estimate. More importantly, early detection of vulnerable produce can help in taking timely actions to minimize potential post-harvest losses. This paper investigates Near-infrared (NIR) hyperspectral imaging (1000-1700 nm) and machine learning to build models to automatically predict the susceptibility of sepals of recently harvested tomatoes to future fungal infections. Hyperspectral images of newly harvested tomatoes (cultivar Brioso) from 5 different growers were acquired before the onset of any visible fungal infection. After imaging, the tomatoes were placed under controlled conditions suited for fungal germination and growth for a 4-day period, and then imaged using normal color cameras. All sepals in the color images were ranked for fungal severity using crowdsourcing, and the final severity of each sepal was fused using principal component analysis. A novel hyperspectral data processing pipeline is presented which was used to automatically segment the tomato sepals from spectral images with multiple tomatoes connected via a truss. The key modelling question addressed in this research is whether there is a correlation between the hyperspectral data captured at harvest and the fungal infection observed 4 days later. Using 10-fold and group k-fold cross-validation, XG-Boost and Random Forest based regression models were trained on the features derived from the hyperspectral data corresponding to each sepal in the training set and tested on hold out test set. The best model found a Pearson correlation of 0.837, showing that there is strong linear correlation between the NIR spectra and the future fungal severity of the sepal. The sepal specific predictions were aggregated to predict the susceptibility of individual tomatoes, and a correlation of 0.92 was found. Besides modelling, focus is also on model interpretation, particularly to understand which spectral features are most relevant to model prediction. Two approaches to model interpretation were explored, feature importance and SHAP (SHapley Additive exPlanations), resulting in similar conclusions that the NIR range between 1390-1420 nm contributes most to the model's final decision.

摘要

番茄是一种重要的商业产品,具有易腐性,收获后极易受到真菌的影响。并非所有的番茄都容易受到病原菌的影响,早期发现易感染的番茄可以帮助及时采取预防措施,包括隔离番茄批次、调整储存条件,也可以根据质量或更好的保质期估计做出正确的商业决策,如动态定价。更重要的是,早期发现易腐产品可以帮助及时采取行动,将潜在的产后损失降到最低。本文研究了近红外(NIR)高光谱成像(1000-1700nm)和机器学习,以建立模型来自动预测最近收获的番茄萼片对未来真菌感染的易感性。从 5 个不同种植者处采集新收获的番茄(Brioso 品种)的高光谱图像,在出现任何可见真菌感染之前进行。成像后,将番茄置于适合真菌发芽和生长的控制条件下 4 天,然后使用普通彩色相机进行成像。使用众包对彩色图像中的所有萼片进行真菌严重程度排名,并使用主成分分析融合每个萼片的最终严重程度。提出了一种新的高光谱数据处理管道,用于自动从通过桁架连接的多个番茄的光谱图像中分割番茄萼片。本研究中解决的关键建模问题是,在收获时捕获的高光谱数据与 4 天后观察到的真菌感染之间是否存在相关性。使用 10 折和组 k 折交叉验证,在训练集中基于每个萼片中的高光谱数据特征训练 XG-Boost 和随机森林回归模型,并在测试集上进行测试。发现最佳模型的皮尔逊相关系数为 0.837,表明 NIR 光谱与萼片未来的真菌严重程度之间存在很强的线性相关性。将萼片特异性预测值聚合以预测单个番茄的易感性,发现相关性为 0.92。除了建模,还关注模型解释,特别是了解哪些光谱特征与模型预测最相关。探索了两种模型解释方法,特征重要性和 SHAP(SHapley Additive exPlanations),得出了相似的结论,即 NIR 范围在 1390-1420nm 之间对模型的最终决策贡献最大。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e0b7/8633320/2a9ae50e51f2/41598_2021_2302_Fig1_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验