• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

蜱虫种群建模:梯度提升树的一个生态测试案例

Modeling Tick Populations: An Ecological Test Case for Gradient Boosted Trees.

作者信息

Manley William, Tran Tam, Prusinski Melissa, Brisson Dustin

机构信息

University of Pennsylvania.

New York State Department of Health (NYSDOH).

出版信息

bioRxiv. 2023 Nov 29:2023.03.13.532443. doi: 10.1101/2023.03.13.532443.

DOI:10.1101/2023.03.13.532443
PMID:36993623
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10054924/
Abstract

General linear models have been the foundational statistical framework used to discover the ecological processes that explain the distribution and abundance of natural populations. Analyses of the rapidly expanding cache of environmental and ecological data, however, require advanced statistical methods to contend with complexities inherent to extremely large natural data sets. Modern machine learning frameworks such as gradient boosted trees efficiently identify complex ecological relationships in massive data sets, which are expected to result in accurate predictions of the distribution and abundance of organisms in nature. However, rigorous assessments of the theoretical advantages of these methodologies on natural data sets are rare. Here we compare the abilities of gradient boosted and linear models to identify environmental features that explain observed variations in the distribution and abundance of blacklegged tick () populations in a data set collected across New York State over a ten-year period. The gradient boosted and linear models use similar environmental features to explain tick demography, although the gradient boosted models found non-linear relationships and interactions that are difficult to anticipate and often impractical to identify with a linear modeling framework. Further, the gradient boosted models predicted the distribution and abundance of ticks in years and areas beyond the training data with much greater accuracy than their linear model counterparts. The flexible gradient boosting framework also permitted additional model types that provide practical advantages for tick surveillance and public health. The results highlight the potential of gradient boosted models to discover novel ecological phenomena affecting pathogen demography and as a powerful public health tool to mitigate disease risks.

摘要

一般线性模型一直是用于发现解释自然种群分布和丰度的生态过程的基础统计框架。然而,对迅速扩充的环境和生态数据集进行分析,需要先进的统计方法来应对超大型自然数据集固有的复杂性。诸如梯度提升树等现代机器学习框架能够在海量数据集中高效识别复杂的生态关系,有望准确预测自然界中生物的分布和丰度。然而,对这些方法在自然数据集上的理论优势进行严格评估的情况却很少见。在此,我们比较了梯度提升模型和线性模型识别环境特征的能力,这些环境特征可解释在纽约州十年间收集的数据集中黑腿蜱()种群分布和丰度的观测变化。梯度提升模型和线性模型使用相似的环境特征来解释蜱虫种群统计学特征,不过梯度提升模型发现了非线性关系和相互作用,这些关系难以预测,在线性建模框架下往往也难以识别。此外,梯度提升模型在预测训练数据之外的年份和区域的蜱虫分布和丰度时,比对应的线性模型精确得多。灵活的梯度提升框架还允许使用其他模型类型,这些模型类型在蜱虫监测和公共卫生方面具有实际优势。研究结果凸显了梯度提升模型在发现影响病原体种群统计学特征的新生态现象方面的潜力,以及作为减轻疾病风险的强大公共卫生工具的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c0a0/10695235/e6cce121c35d/nihpp-2023.03.13.532443v4-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c0a0/10695235/3f6cc916aba0/nihpp-2023.03.13.532443v4-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c0a0/10695235/1a9b98fe1204/nihpp-2023.03.13.532443v4-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c0a0/10695235/47d1bb124a0b/nihpp-2023.03.13.532443v4-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c0a0/10695235/e6cce121c35d/nihpp-2023.03.13.532443v4-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c0a0/10695235/3f6cc916aba0/nihpp-2023.03.13.532443v4-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c0a0/10695235/1a9b98fe1204/nihpp-2023.03.13.532443v4-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c0a0/10695235/47d1bb124a0b/nihpp-2023.03.13.532443v4-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c0a0/10695235/e6cce121c35d/nihpp-2023.03.13.532443v4-f0004.jpg

相似文献

1
Modeling Tick Populations: An Ecological Test Case for Gradient Boosted Trees.蜱虫种群建模:梯度提升树的一个生态测试案例
bioRxiv. 2023 Nov 29:2023.03.13.532443. doi: 10.1101/2023.03.13.532443.
2
Mechanistic movement models to predict geographic range expansions of ticks and tick-borne pathogens: Case studies with Ixodes scapularis and Amblyomma americanum in eastern North America.机制运动模型预测蜱虫和蜱传病原体的地理范围扩张:以东北美地区的肩突硬蜱和美洲钝眼蜱为例。
Ticks Tick Borne Dis. 2023 Jul;14(4):102161. doi: 10.1016/j.ttbdis.2023.102161. Epub 2023 Mar 28.
3
Monitoring the patterns of submission and presence of tick-borne pathogens in Ixodes scapularis collected from humans and companion animals in Ontario, Canada (2011-2017).监测加拿大安大略省从人类和宠物身上采集的印鼠客蚤携带的病原体的提交和存在模式(2011-2017 年)。
Parasit Vectors. 2021 May 17;14(1):260. doi: 10.1186/s13071-021-04750-1.
4
Microclimate conditions alter Ixodes scapularis (Acari: Ixodidae) overwinter survival across climate gradients in Maine, United States.在美国缅因州,小气候条件会改变肩突硬蜱(蜱螨目:硬蜱科)在不同气候梯度下的越冬存活率。
Ticks Tick Borne Dis. 2022 Jan;13(1):101872. doi: 10.1016/j.ttbdis.2021.101872. Epub 2021 Nov 19.
5
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
6
Multiomics Reveals Symbionts, Pathogens, and Tissue-Specific Microbiome of Blacklegged Ticks (Ixodes scapularis) from a Lyme Disease Hot Spot in Southeastern Ontario, Canada.多组学揭示了安大略省东南部莱姆病热点地区黑腿蜱(Ixodes scapularis)的共生体、病原体和组织特异性微生物组。
Microbiol Spectr. 2023 Jun 15;11(3):e0140423. doi: 10.1128/spectrum.01404-23. Epub 2023 May 15.
7
Adverse moisture events predict seasonal abundance of Lyme disease vector ticks (Ixodes scapularis).不利的湿度事件可预测莱姆病传播媒介 ticks(Ixodes scapularis)的季节性丰度。
Parasit Vectors. 2014 Apr 14;7:181. doi: 10.1186/1756-3305-7-181.
8
Monitoring Risk: Tick and Public Participatory Surveillance in the Canadian Maritimes, 2012-2020.监测风险:2012 - 2020年加拿大海洋省份的蜱虫与公众参与式监测
Pathogens. 2021 Oct 6;10(10):1284. doi: 10.3390/pathogens10101284.
9
A machine learning approach to small area estimation: predicting the health, housing and well-being of the population of Netherlands.一种用于小区域估计的机器学习方法:预测荷兰人口的健康、住房和福祉。
Int J Health Geogr. 2022 Jun 6;21(1):4. doi: 10.1186/s12942-022-00304-5.
10
A Generalized Additive Model Correlating Blacklegged Ticks With White-Tailed Deer Density, Temperature, and Humidity in Maine, USA, 1990-2013.美国缅因州 1990-2013 年黑腿蜱与白尾鹿密度、温度和湿度的广义加性模型相关性
J Med Entomol. 2021 Jan 12;58(1):125-138. doi: 10.1093/jme/tjaa180.

本文引用的文献

1
Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead.停止为高风险决策解释黑箱机器学习模型,转而使用可解释模型。
Nat Mach Intell. 2019 May;1(5):206-215. doi: 10.1038/s42256-019-0048-x. Epub 2019 May 13.
2
The Crimean-Congo haemorrhagic fever tick vector Hyalomma marginatum in the south of France: Modelling its distribution and determination of factors influencing its establishment in a newly invaded area.法国南部克里米亚-刚果出血热媒介褐黄血蜱的分布模型:影响其在新入侵地区建立的因素的确定。
Transbound Emerg Dis. 2022 Sep;69(5):e2351-e2365. doi: 10.1111/tbed.14578. Epub 2022 May 19.
3
Estimating disease vector population size from citizen science data.
从公民科学数据估算病媒种群规模。
J R Soc Interface. 2021 Nov;18(184):20210610. doi: 10.1098/rsif.2021.0610. Epub 2021 Nov 24.
4
Predicting the zoonotic capacity of mammals to transmit SARS-CoV-2.预测哺乳动物传播 SARS-CoV-2 的人畜共患能力。
Proc Biol Sci. 2021 Nov 24;288(1963):20211651. doi: 10.1098/rspb.2021.1651. Epub 2021 Nov 17.
5
The role of host phenology for parasite transmission.宿主物候学在寄生虫传播中的作用。
Theor Ecol. 2021;14(1):123-143. doi: 10.1007/s12080-020-00484-5. Epub 2020 Nov 11.
6
Predicting insect outbreaks using machine learning: A mountain pine beetle case study.利用机器学习预测昆虫爆发:以山地松甲虫为例
Ecol Evol. 2021 Sep 12;11(19):13014-13028. doi: 10.1002/ece3.7921. eCollection 2021 Oct.
7
Spatio-temporal variation in environmental features predicts the distribution and abundance of Ixodes scapularis.环境特征的时空变化可预测扁虱的分布和丰度。
Int J Parasitol. 2021 Mar;51(4):311-320. doi: 10.1016/j.ijpara.2020.10.002. Epub 2020 Dec 24.
8
Performance evaluation of cetacean species distribution models developed using generalized additive models and boosted regression trees.使用广义相加模型和提升回归树开发的鲸类物种分布模型的性能评估
Ecol Evol. 2020 May 11;10(12):5759-5784. doi: 10.1002/ece3.6316. eCollection 2020 Jun.
9
Harnessing Deep Learning in Ecology: An Example Predicting Bark Beetle Outbreaks.在生态学中应用深度学习:预测树皮甲虫爆发的一个实例
Front Plant Sci. 2019 Oct 28;10:1327. doi: 10.3389/fpls.2019.01327. eCollection 2019.
10
Malaria risk assessment and mapping using satellite imagery and boosted regression trees in the Peruvian Amazon.利用卫星图像和提升回归树进行秘鲁亚马逊地区疟疾风险评估和制图。
Sci Rep. 2019 Oct 23;9(1):15173. doi: 10.1038/s41598-019-51564-4.