• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用机器学习预测昆虫爆发:以山地松甲虫为例

Predicting insect outbreaks using machine learning: A mountain pine beetle case study.

作者信息

Ramazi Pouria, Kunegel-Lion Mélodie, Greiner Russell, Lewis Mark A

机构信息

Department of Mathematical and Statistical Sciences University of Alberta Edmonton AB Canada.

Department of Computing Science University of Alberta Edmonton AB Canada.

出版信息

Ecol Evol. 2021 Sep 12;11(19):13014-13028. doi: 10.1002/ece3.7921. eCollection 2021 Oct.

DOI:10.1002/ece3.7921
PMID:34646449
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8495826/
Abstract

Planning forest management relies on predicting insect outbreaks such as mountain pine beetle, particularly in the intermediate-term future, e.g., 5-year. Machine-learning algorithms are potential solutions to this challenging problem due to their many successes across a variety of prediction tasks. However, there are many subtle challenges in applying them: identifying the best learning models and the best subset of available covariates (including time lags) and properly evaluating the models to avoid misleading performance-measures. We systematically address these issues in predicting the chance of a mountain pine beetle outbreak in the Cypress Hills area and seek models with the best performance at predicting future 1-, 3-, 5- and 7-year infestations. We train nine machine-learning models, including two generalized boosted regression trees (GBM) that predict future 1- and 3-year infestations with 92% and 88% AUC, and two novel mixed models that predict future 5- and 7-year infestations with 86% and 84% AUC, respectively. We also consider forming the train and test datasets by splitting the original dataset rather than using the appropriate year-based approach and show that this may obtain models that score high on the test dataset but low in practice, resulting in inaccurate performance evaluations. For example, a -nearest neighbor model with the actual performance of 68% AUC, scores the misleadingly high 78% on a test dataset obtained from a random split, but the more accurate 66% on a year-based split. We then investigate how the prediction accuracy varies with respect to the provided history length of the covariates and find that neural network and naive Bayes, predict more accurately as history-length increases, particularly for future 1- and 3-year predictions, and roughly the same holds with GBM. Our approach is applicable to other invasive species. The resulting predictors can be used in planning forest and pest management and planning sampling locations in field studies.

摘要

规划森林管理依赖于预测诸如山地松甲虫等昆虫的爆发,尤其是在中期未来,例如5年。机器学习算法因其在各种预测任务中取得的诸多成功而成为解决这一具有挑战性问题的潜在方案。然而,应用它们存在许多细微的挑战:识别最佳学习模型和可用协变量(包括时间滞后)的最佳子集,并正确评估模型以避免误导性的性能度量。我们系统地解决了在预测柏树山地区山地松甲虫爆发可能性时的这些问题,并寻求在预测未来1年、3年、5年和7年虫害方面具有最佳性能的模型。我们训练了九个机器学习模型,包括两个广义增强回归树(GBM),它们预测未来1年和3年虫害的曲线下面积(AUC)分别为92%和88%,以及两个新颖的混合模型,它们预测未来5年和7年虫害的AUC分别为86%和84%。我们还考虑通过分割原始数据集而不是使用适当的基于年份的方法来形成训练和测试数据集,并表明这可能会得到在测试数据集上得分高但在实际中得分低的模型,从而导致不准确的性能评估。例如,一个实际AUC为68%的最近邻模型,在从随机分割获得的测试数据集上得分高达78%,但在基于年份的分割上得分更准确的66%。然后,我们研究预测准确性如何随协变量的提供历史长度而变化,发现神经网络和朴素贝叶斯随着历史长度的增加预测更准确,特别是对于未来1年和3年的预测,GBM大致也是如此。我们的方法适用于其他入侵物种。所得的预测器可用于规划森林和害虫管理以及实地研究中的采样地点规划。

相似文献

1
Predicting insect outbreaks using machine learning: A mountain pine beetle case study.利用机器学习预测昆虫爆发:以山地松甲虫为例
Ecol Evol. 2021 Sep 12;11(19):13014-13028. doi: 10.1002/ece3.7921. eCollection 2021 Oct.
2
Dataset of mountain pine beetle outbreak dynamics and direct control in Cypress Hills, SK.加拿大萨斯喀彻温省柏树山地区山松甲虫爆发动态及直接防治数据集。
Data Brief. 2020 Feb 17;29:105293. doi: 10.1016/j.dib.2020.105293. eCollection 2020 Apr.
3
Larger Resin Ducts Are Linked to the Survival of Lodgepole Pine Trees During Mountain Pine Beetle Outbreak.更大的树脂道与山松甲虫爆发期间黑松的存活有关。
Front Plant Sci. 2019 Nov 26;10:1459. doi: 10.3389/fpls.2019.01459. eCollection 2019.
4
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
5
Recent mountain pine beetle outbreaks, wildfire severity, and postfire tree regeneration in the US Northern Rockies.美国北落基山脉近期的山松甲虫爆发、野火严重程度及火灾后树木再生情况。
Proc Natl Acad Sci U S A. 2014 Oct 21;111(42):15120-5. doi: 10.1073/pnas.1411346111. Epub 2014 Sep 29.
6
Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.优化神经网络在医学数据集上的应用:以新生儿呼吸暂停预测为例的研究
Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25.
7
Accurate Prediction of Coronary Heart Disease for Patients With Hypertension From Electronic Health Records With Big Data and Machine-Learning Methods: Model Development and Performance Evaluation.利用大数据和机器学习方法从电子健康记录中准确预测高血压患者的冠心病:模型开发与性能评估
JMIR Med Inform. 2020 Jul 6;8(7):e17257. doi: 10.2196/17257.
8
Machine learning algorithms, bull genetic information, and imbalanced datasets used in abortion incidence prediction models for Iranian Holstein dairy cattle.机器学习算法、公牛遗传信息和不平衡数据集用于伊朗荷斯坦奶牛流产发生率预测模型。
Prev Vet Med. 2020 Feb;175:104869. doi: 10.1016/j.prevetmed.2019.104869. Epub 2019 Dec 17.
9
Climate and weather influences on spatial temporal patterns of mountain pine beetle populations in Washington and Oregon.气候和天气对华盛顿州和俄勒冈州山松甲虫种群时空格局的影响。
Ecology. 2012 Nov;93(11):2421-34. doi: 10.1890/11-1412.1.
10
Determination of the compositional change (1999-2006) in the pine forests of British Columbia due to mountain pine beetle infestation.测定不列颠哥伦比亚省因山松甲虫虫害而导致的松林组成变化(1999-2006 年)。
Environ Monit Assess. 2009 Nov;158(1-4):593-608. doi: 10.1007/s10661-008-0607-9. Epub 2008 Nov 20.

引用本文的文献

1
Machine Learning Using Template-Based-Predicted Structure of Haemagglutinin Predicts Pathogenicity of Avian Influenza.基于模板预测的血凝素结构的机器学习预测禽流感的致病性。
J Microbiol Biotechnol. 2024 Oct 28;34(10):2033-2040. doi: 10.4014/jmb.2405.05022. Epub 2024 Aug 6.
2
TinyML-Sensor for Shelf Life Estimation of Fresh Date Fruits.基于 TinyML 的鲜枣货架期预测传感器。
Sensors (Basel). 2023 Aug 10;23(16):7081. doi: 10.3390/s23167081.
3
Modeling Tick Populations: An Ecological Test Case for Gradient Boosted Trees.蜱虫种群建模:梯度提升树的一个生态测试案例

本文引用的文献

1
Accurate long-range forecasting of COVID-19 mortality in the USA.美国 COVID-19 死亡率的精确长期预测。
Sci Rep. 2021 Jul 5;11(1):13822. doi: 10.1038/s41598-021-91365-2.
2
Factors governing outbreak dynamics in a forest intensively managed for mountain pine beetle.影响山松甲虫密集管理森林中疫情动态的因素。
Sci Rep. 2020 May 5;10(1):7601. doi: 10.1038/s41598-020-63388-8.
3
Dataset of mountain pine beetle outbreak dynamics and direct control in Cypress Hills, SK.加拿大萨斯喀彻温省柏树山地区山松甲虫爆发动态及直接防治数据集。
bioRxiv. 2023 Nov 29:2023.03.13.532443. doi: 10.1101/2023.03.13.532443.
4
From Policy to Prediction: Forecasting COVID-19 Dynamics Under Imperfect Vaccination.从政策到预测:在不完全疫苗接种下预测 COVID-19 动态。
Bull Math Biol. 2022 Jul 20;84(9):90. doi: 10.1007/s11538-022-01047-x.
5
A Hypothesis-Free Bridging of Disease Dynamics and Non-pharmaceutical Policies.无假设的疾病动态与非药物干预策略桥接
Bull Math Biol. 2022 Apr 8;84(5):57. doi: 10.1007/s11538-022-01012-8.
Data Brief. 2020 Feb 17;29:105293. doi: 10.1016/j.dib.2020.105293. eCollection 2020 Apr.
4
Application of feature selection and regression models for chlorophyll-a prediction in a shallow lake.特征选择和回归模型在浅水湖泊叶绿素-a 预测中的应用。
Environ Sci Pollut Res Int. 2018 Jul;25(20):19488-19498. doi: 10.1007/s11356-018-2147-3. Epub 2018 May 5.
5
Statistical and Machine Learning forecasting methods: Concerns and ways forward.统计和机器学习预测方法:关注问题与未来发展方向。
PLoS One. 2018 Mar 27;13(3):e0194889. doi: 10.1371/journal.pone.0194889. eCollection 2018.
6
Weathering the storm: how lodgepole pine trees survive mountain pine beetle outbreaks.经受住风暴:扭叶松如何在山松甲虫爆发中存活下来。
Oecologia. 2017 Jun;184(2):469-478. doi: 10.1007/s00442-017-3865-9. Epub 2017 Apr 18.
7
Networks of conforming or nonconforming individuals tend to reach satisfactory decisions.由合格或不合格个体组成的网络往往能达成令人满意的决策。
Proc Natl Acad Sci U S A. 2016 Nov 15;113(46):12985-12990. doi: 10.1073/pnas.1610244113. Epub 2016 Oct 31.
8
mRMRe: an R package for parallelized mRMR ensemble feature selection.mRMRe:一个用于并行化 mRMR 集成特征选择的 R 包。
Bioinformatics. 2013 Sep 15;29(18):2365-8. doi: 10.1093/bioinformatics/btt383. Epub 2013 Jul 3.
9
Climate and weather influences on spatial temporal patterns of mountain pine beetle populations in Washington and Oregon.气候和天气对华盛顿州和俄勒冈州山松甲虫种群时空格局的影响。
Ecology. 2012 Nov;93(11):2421-34. doi: 10.1890/11-1412.1.
10
Temporal autocorrelation functions for movement rates from global positioning system radiotelemetry data.基于全球定位系统无线电遥测数据的运动速率的时自相关函数。
Philos Trans R Soc Lond B Biol Sci. 2010 Jul 27;365(1550):2213-9. doi: 10.1098/rstb.2010.0080.