To bin or not to bin: why parasite abundance data should not be lumped into categories for statistical analysis.

Author

Poulin Robert

Affiliation

Department of Zoology, University of Otago, Dunedin, New Zealand.

Publication

Parasitology. 2025 Mar;152(3):338-345. doi: 10.1017/S003118202500040X.

DOI:10.1017/S003118202500040X
PMID:40123484
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12186089/

Abstract

The impact of macroparasites on their hosts is proportional to the number of parasites per host, or parasite abundance. Abundance values are count data, i.e. integers ranging from 0 to some maximum number, depending on the host-parasite system. When using parasite abundance as a predictor in statistical analysis, a common approach is to bin values, i.e. group hosts into infection categories based on abundance, and test for differences in some response variable (e.g. a host trait) among these categories. There are well-documented pitfalls associated with this approach. Here, I use a literature review to show that binning abundance values for analysis has been used in one-third of studies published in parasitological journals over the past 15 years, and half of the studies in ecological and behavioural journals, often without any justification. Binning abundance data into arbitrary categories has been much more common among studies using experimental infections than among those using naturally infected hosts. I then use simulated data to demonstrate that true and significant relationships between parasite abundance and host traits can be missed when abundance values are binned for analysis, and vice versa that when there is no underlying relationship between abundance and host traits, analysis of binned data can create a spurious one. This holds regardless of the prevalence of infection or the level of parasite aggregation in a host sample. These findings argue strongly for the practice of binning abundance data as a predictor variable to be abandoned in favour of more appropriate analytical approaches.
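The information loss described in the abstract can be illustrated with a small simulation. The sketch below is not the paper's simulation design, which the abstract does not specify. It assumes abundance follows a negative binomial distribution (drawn as a gamma-Poisson mixture), that the host trait declines linearly with abundance, and that hosts are binned as uninfected, low or high with an arbitrary cut-off of 5 parasites. All function names, parameter values and cut-offs are hypothetical choices made for illustration only.

```python
import math
import random


def sample_abundance(n, mean, k, rng):
    """Draw n negative-binomial counts (mean, aggregation k) as a gamma-Poisson mixture."""
    counts = []
    for _ in range(n):
        lam = rng.gammavariate(k, mean / k)  # host-specific expected burden
        # Knuth's multiplication method for one Poisson(lam) draw
        limit, prod, x = math.exp(-lam), rng.random(), 0
        while prod > limit:
            x += 1
            prod *= rng.random()
        counts.append(x)
    return counts


def bin_abundance(count, low_max=5):
    """Assign an infection category; the cut-off low_max is arbitrary (assumption)."""
    if count == 0:
        return "uninfected"
    return "low" if count <= low_max else "high"


def r_squared(x, y):
    """Variance in y explained by a linear fit on the raw values of x."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy * sxy / (sxx * syy)


def eta_squared(labels, y):
    """Variance in y explained by group membership (one-way ANOVA effect size)."""
    grand = sum(y) / len(y)
    groups = {}
    for g, v in zip(labels, y):
        groups.setdefault(g, []).append(v)
    ss_between = sum(len(v) * (sum(v) / len(v) - grand) ** 2 for v in groups.values())
    ss_total = sum((v - grand) ** 2 for v in y)
    return ss_between / ss_total


def simulate(n=200, mean=5.0, k=0.5, slope=-0.5, noise_sd=5.0, seed=1):
    """Return (r2 on raw counts, eta2 on binned counts) for one simulated host sample."""
    rng = random.Random(seed)
    abundance = sample_abundance(n, mean, k, rng)
    trait = [20.0 + slope * a + rng.gauss(0.0, noise_sd) for a in abundance]
    labels = [bin_abundance(a) for a in abundance]
    return r_squared(abundance, trait), eta_squared(labels, trait)


if __name__ == "__main__":
    r2, eta2 = simulate()
    print(f"raw counts r^2 = {r2:.3f}; binned eta^2 = {eta2:.3f}")
```

With `noise_sd=0.0` the trait is an exact linear function of abundance, so the raw-count fit explains all of the variance, while the binned analysis cannot, because hosts with different burdens inside the same category are treated as identical. This sketch covers only the missed-relationship case; it does not reproduce the spurious-relationship result the abstract also reports.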

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5366/12186089/7aa2c077359a/S003118202500040X_figAb1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5366/12186089/bdc8d05b80ce/S003118202500040X_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5366/12186089/711bd616e314/S003118202500040X_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5366/12186089/53d6003e8222/S003118202500040X_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5366/12186089/8668c37144ce/S003118202500040X_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5366/12186089/84fed95bb72b/S003118202500040X_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5366/12186089/12ab990189fe/S003118202500040X_fig6.jpg