• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

标记仅存在数据中的未检测采样偏差。

Nondetection sampling bias in marked presence-only data.

机构信息

Department of Statistics and School of Natural Resources, University of Nebraska-Lincoln 234 Hardin Hall, 3310 Holdrege Street, Lincoln, Nebraska, 68583.

School of Natural Resources, University of Nebraska-Lincoln 416 Hardin Hall, 3310 Holdrege Street, Lincoln, Nebraska, 68583.

出版信息

Ecol Evol. 2013 Dec;3(16):5225-36. doi: 10.1002/ece3.887. Epub 2013 Dec 2.

DOI:10.1002/ece3.887
PMID:24455151
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3892331/
Abstract

Species distribution models (SDM) are tools used to determine environmental features that influence the geographic distribution of species' abundance and have been used to analyze presence-only records. Analysis of presence-only records may require correction for nondetection sampling bias to yield reliable conclusions. In addition, individuals of some species of animals may be highly aggregated and standard SDMs ignore environmental features that may influence aggregation behavior.We contend that nondetection sampling bias can be treated as missing data. Statistical theory and corrective methods are well developed for missing data, but have been ignored in the literature on SDMs. We developed a marked inhomogeneous Poisson point process model that accounted for nondetection and aggregation behavior in animals and tested our methods on simulated data.Correcting for nondetection sampling bias requires estimates of the probability of detection which must be obtained from auxiliary data, as presence-only data do not contain information about the detection mechanism. Weighted likelihood methods can be used to correct for nondetection if estimates of the probability of detection are available. We used an inhomogeneous Poisson point process model to model group abundance, a zero-truncated generalized linear model to model group size, and combined these two models to describe the distribution of abundance. Our methods performed well on simulated data when nondetection was accounted for and poorly when detection was ignored.We recommend researchers consider the effects of nondetection sampling bias when modeling species distributions using presence-only data. If information about the detection process is available, we recommend researchers explore the effects of nondetection and, when warranted, correct the bias using our methods. We developed our methods to analyze opportunistic presence-only records of whooping cranes (Grus americana), but expect that our methods will be useful to ecologists analyzing opportunistic presence-only records of other species of animals.

摘要

物种分布模型(SDM)是用于确定影响物种丰度地理分布的环境特征的工具,已被用于分析仅有存在记录的数据。分析仅有存在记录的数据可能需要纠正未检测采样偏差,以得出可靠的结论。此外,某些动物物种的个体可能高度聚集,而标准的 SDM 忽略了可能影响聚集行为的环境特征。我们认为未检测采样偏差可以视为缺失数据。缺失数据的统计理论和校正方法已经得到了很好的发展,但在 SDM 文献中被忽视了。我们开发了一种标记非均匀泊松点过程模型,该模型考虑了动物的未检测和聚集行为,并在模拟数据上测试了我们的方法。纠正未检测采样偏差需要从辅助数据中获得检测概率的估计值,因为仅有存在数据不包含关于检测机制的信息。如果可以获得检测概率的估计值,则可以使用加权似然方法进行校正。我们使用非均匀泊松点过程模型来建模群体丰度,使用零截断广义线性模型来建模群体大小,并将这两个模型结合起来描述丰度的分布。当考虑到未检测时,我们的方法在模拟数据上表现良好,而当忽略检测时则表现不佳。我们建议研究人员在使用仅有存在数据建模物种分布时考虑未检测采样偏差的影响。如果有关于检测过程的信息,我们建议研究人员探索未检测的影响,并在有必要时使用我们的方法纠正偏差。我们开发了我们的方法来分析美洲鹤(Grus americana)的机会性仅有存在记录,但期望我们的方法将对分析其他动物物种的机会性仅有存在记录的生态学家有用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dce/3892331/ae44e01c66f4/ece30003-5225-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dce/3892331/0ce4aa23c841/ece30003-5225-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dce/3892331/6df4d746803b/ece30003-5225-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dce/3892331/389eb1f759c6/ece30003-5225-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dce/3892331/ae44e01c66f4/ece30003-5225-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dce/3892331/0ce4aa23c841/ece30003-5225-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dce/3892331/6df4d746803b/ece30003-5225-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dce/3892331/389eb1f759c6/ece30003-5225-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dce/3892331/ae44e01c66f4/ece30003-5225-f4.jpg

相似文献

1
Nondetection sampling bias in marked presence-only data.标记仅存在数据中的未检测采样偏差。
Ecol Evol. 2013 Dec;3(16):5225-36. doi: 10.1002/ece3.887. Epub 2013 Dec 2.
2
Use of opportunistic sightings and expert knowledge to predict and compare Whooping Crane stopover habitat.利用机会性观察和专业知识预测和比较美洲鹤中途停歇地的生境。
Conserv Biol. 2015 Oct;29(5):1337-46. doi: 10.1111/cobi.12515. Epub 2015 Apr 29.
3
Integrating count and detection-nondetection data to model population dynamics.整合计数和检测-未检测数据来建模种群动态。
Ecology. 2017 Jun;98(6):1640-1650. doi: 10.1002/ecy.1831. Epub 2017 May 11.
4
Generalized model-based solutions to false-positive error in species detection/nondetection data.广义基于模型的方法解决物种检测/未检测数据中的假阳性错误。
Ecology. 2021 Feb;102(2):e03241. doi: 10.1002/ecy.3241. Epub 2021 Jan 18.
5
The effects of sampling bias and model complexity on the predictive performance of MaxEnt species distribution models.采样偏差和模型复杂度对最大熵物种分布模型预测性能的影响。
PLoS One. 2013;8(2):e55158. doi: 10.1371/journal.pone.0055158. Epub 2013 Feb 14.
6
Haemosporida prevalence and diversity are similar in endangered wild whooping cranes (Grus americana) and sympatric sandhill cranes (Grus canadensis).在濒危的野生美洲鹤(Grus americana)和同域分布的沙丘鹤(Grus canadensis)中,血孢子虫的流行率和多样性相似。
Parasitology. 2017 Apr;144(5):629-640. doi: 10.1017/S0031182016002298. Epub 2016 Dec 12.
7
Accounting for preferential sampling in species distribution models.物种分布模型中优先抽样的核算
Ecol Evol. 2018 Dec 26;9(1):653-663. doi: 10.1002/ece3.4789. eCollection 2019 Jan.
8
Migrating Whooping Cranes avoid wind-energy infrastructure when selecting stopover habitat.迁徙的美洲鹤在选择中途停留的栖息地时会避开风能基础设施。
Ecol Appl. 2021 Jul;31(5):e02324. doi: 10.1002/eap.2324. Epub 2021 Apr 7.
9
Distribution models calibrated with independent field data predict two million ancient and veteran trees in England.利用独立的野外数据进行校准的分布模型预测英格兰有两百万棵古树和老龄树。
Ecol Appl. 2022 Dec;32(8):e2695. doi: 10.1002/eap.2695. Epub 2022 Aug 9.
10
Mapping species distributions with MAXENT using a geographically biased sample of presence data: a performance assessment of methods for correcting sampling bias.使用存在数据的地理偏差样本,通过最大熵模型(MAXENT)绘制物种分布:校正采样偏差方法的性能评估
PLoS One. 2014 May 12;9(5):e97122. doi: 10.1371/journal.pone.0097122. eCollection 2014.

引用本文的文献

1
Occurrence Data Sources Matter for Species Distribution Modeling: A Case Study of Based on Biomod2.物种分布建模中出现数据来源很重要:基于Biomod2的案例研究
Ecol Evol. 2025 May 8;15(5):e71390. doi: 10.1002/ece3.71390. eCollection 2025 May.
2
Treating gaps and biases in biodiversity data as a missing data problem.将生物多样性数据中的差距和偏差视为一个缺失数据问题。
Biol Rev Camb Philos Soc. 2025 Feb;100(1):50-67. doi: 10.1111/brv.13127. Epub 2024 Aug 8.
3
How to make use of unlabeled observations in species distribution modeling using point process models.

本文引用的文献

1
Finite-Sample Equivalence in Statistical Models for Presence-Only Data.仅存在数据统计模型中的有限样本等价性。
Ann Appl Stat. 2013 Dec 1;7(4):1917-1939. doi: 10.1214/13-AOAS667.
2
Inference from presence-only data; the ongoing controversy.仅存在数据的推断:持续的争议。
Ecography. 2013 Aug 1;36(8):864-867. doi: 10.1111/j.1600-0587.2013.00321.x.
3
Advancing our thinking in presence-only and used-available analysis.推进仅存在数据和已用数据分析方法的应用。
如何在使用点过程模型进行物种分布建模时利用未标记的观测数据。
Ecol Evol. 2021 Apr 1;11(10):5220-5243. doi: 10.1002/ece3.7411. eCollection 2021 May.
4
The value of long-term citizen science data for monitoring koala populations.长期公民科学数据在监测考拉种群中的价值。
Sci Rep. 2019 Jul 11;9(1):10037. doi: 10.1038/s41598-019-46376-5.
5
Delineating and identifying long-term changes in the whooping crane (Grus americana) migration corridor.描绘和识别美洲鹤(Grus americana)迁徙路线的长期变化。
PLoS One. 2018 Feb 15;13(2):e0192737. doi: 10.1371/journal.pone.0192737. eCollection 2018.
6
Bias correction in species distribution models: pooling survey and collection data for multiple species.物种分布模型中的偏差校正:整合多个物种的调查和收集数据
Methods Ecol Evol. 2015 Apr;6(4):424-438. doi: 10.1111/2041-210X.12242. Epub 2014 Oct 10.
J Anim Ecol. 2013 Nov;82(6):1125-34. doi: 10.1111/1365-2656.12071. Epub 2013 Mar 14.
4
Equivalence of MAXENT and Poisson point process models for species distribution modeling in ecology.用于生态学中物种分布建模的MAXENT模型与泊松点过程模型的等效性
Biometrics. 2013 Mar;69(1):274-81. doi: 10.1111/j.1541-0420.2012.01824.x. Epub 2013 Feb 4.
5
Fitting and interpreting occupancy models.拟合和解释占有率模型。
PLoS One. 2013;8(1):e52015. doi: 10.1371/journal.pone.0052015. Epub 2013 Jan 10.
6
Predicting the geographic distribution of a species from presence-only data subject to detection errors.从存在性数据预测物种的地理分布,存在检测误差。
Biometrics. 2012 Dec;68(4):1303-12. doi: 10.1111/j.1541-0420.2012.01779.x. Epub 2012 Aug 31.
7
Eliciting expert knowledge in conservation science. eliciting expert knowledge in conservation science
Conserv Biol. 2012 Feb;26(1):29-38. doi: 10.1111/j.1523-1739.2011.01806.x.
8
How much can we learn about missing data?: an exploration of a clinical trial in psychiatry.我们能从缺失数据中学到多少?:一项精神病学临床试验的探索
J R Stat Soc Ser A Stat Soc. 2010 Jul;173(3):593-612. doi: 10.1111/j.1467-985X.2009.00627.x.
9
Sample selection bias and presence-only distribution models: implications for background and pseudo-absence data.样本选择偏差与仅存在分布模型:对背景数据和伪缺失数据的影响
Ecol Appl. 2009 Jan;19(1):181-97. doi: 10.1890/07-2153.1.
10
A working guide to boosted regression trees.提升回归树实用指南。
J Anim Ecol. 2008 Jul;77(4):802-13. doi: 10.1111/j.1365-2656.2008.01390.x. Epub 2008 Apr 8.