• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在记录无法关联时估计人口规模和重复率。

Estimating population size and duplication rates when records cannot be linked.

作者信息

Laska Eugene M, Meisner Morris, Wanderling Joseph, Siegel Carole

机构信息

Statistical Sciences and Epidemiology Division, The Nathan S. Kline Institute for Psychiatric Research, 140 Orangeburg Road, Orangeburg, NY, USA.

出版信息

Stat Med. 2003 Nov 15;22(21):3403-17. doi: 10.1002/sim.1640.

DOI:10.1002/sim.1640
PMID:14566923
Abstract

The capture-recapture approach to estimating the size of a population is a well-studied area of statistics. The number of distinct individuals, N(A) and N(B), on each of two lists, A and B, and the number common to both lists, N(AB), are used to form an estimate of the binomial probability of being on one of the lists, which then allows an estimate to be made of the size of the population. Critical to the method is an accurate count of N(AB). We consider situations in which this count is not available. Such problems arise in a variety of behavioural health contexts in which the need for protection of privacy may prevent sharing identifying information, so it is not possible to specifically match an individual who appears on one list with an individual on the other. Suppose that the birth dates and/or other demographics of individuals on each list are known. We introduce two methods for estimating the duplication rates and the size of the population. Conditioning on the set beta of birth dates of those on list B, N(A) and N(B), the maximum likelihood estimators (MLEs) and their variance are derived. The MLEs are based on the proportion of individuals on list A whose birth dates fall in beta. This approach is particularly useful if list B itself contains duplicates. The second model utilizes the full sample distribution of the birth dates. We generalize this approach to accommodate multiple demographic characteristics. The approaches are applied to the problem of estimating duplication rates and the population size of veterans who have mental illness in Kings County, NY. The data are lists of those receiving service from the Veterans Administration system and from providers funded or certified by the New York State Office of Mental Health.

摘要

用于估计种群规模的捕获再捕获方法是统计学中一个经过充分研究的领域。两个列表A和B上各自不同个体的数量N(A)和N(B),以及两个列表共有的个体数量N(AB),被用于形成对处于其中一个列表的二项概率的估计,进而可以对种群规模进行估计。该方法的关键在于对N(AB)的准确计数。我们考虑无法获得该计数的情况。此类问题出现在各种行为健康背景中,在这些背景下,出于隐私保护的需要可能会阻止共享识别信息,因此无法将出现在一个列表上的个体与另一个列表上的个体进行具体匹配。假设每个列表上个体的出生日期和/或其他人口统计学特征是已知的。我们介绍两种估计重复率和种群规模的方法。以列表B上个体的出生日期集合β为条件,推导了N(A)和N(B)的最大似然估计量(MLEs)及其方差。MLEs基于列表A上出生日期落在β中的个体比例。如果列表B本身包含重复项,这种方法特别有用。第二个模型利用出生日期的全样本分布。我们对这种方法进行推广以适应多种人口统计学特征。这些方法被应用于估计纽约州金斯县患有精神疾病的退伍军人的重复率和种群规模问题。数据是从退伍军人管理系统以及从由纽约州心理健康办公室资助或认证的提供者处接受服务的人员列表。

相似文献

1
Estimating population size and duplication rates when records cannot be linked.在记录无法关联时估计人口规模和重复率。
Stat Med. 2003 Nov 15;22(21):3403-17. doi: 10.1002/sim.1640.
2
Estimating population size when duplicates are present.
Stat Med. 1996 Aug 15;15(15):1635-46. doi: 10.1002/(SICI)1097-0258(19960815)15:15<1635::AID-SIM337>3.0.CO;2-T.
3
Problems in using birth certificate files in the capture-recapture model to estimate the completeness of case ascertainment in a population-based birth defects registry in New York State.在纽约州基于人群的出生缺陷登记处中,使用出生证明文件进行捕获-再捕获模型以估计病例确诊完整性时存在的问题。
Birth Defects Res A Clin Mol Teratol. 2006 Nov;76(11):772-7. doi: 10.1002/bdra.20293.
4
Population size estimation in a two-list surveillance system with a discrete covariate.具有离散协变量的双列表监测系统中的种群规模估计
Biometrics. 2008 Jun;64(2):371-6. doi: 10.1111/j.1541-0420.2007.00901.x. Epub 2008 Mar 5.
5
Model-based multiplicity estimation of population size.基于模型的种群大小多重性估计
Stat Med. 2009 Jul 30;28(17):2230-52. doi: 10.1002/sim.3614.
6
Spatially explicit maximum likelihood methods for capture-recapture studies.用于捕获-再捕获研究的空间明确最大似然法。
Biometrics. 2008 Jun;64(2):377-85. doi: 10.1111/j.1541-0420.2007.00927.x. Epub 2007 Oct 26.
7
Multilist population estimation with incomplete and partial stratification.具有不完全和部分分层的多重列表总体估计。
Biometrics. 2007 Sep;63(3):910-6. doi: 10.1111/j.1541-0420.2007.00767.x.
8
A unified likelihood-based approach for estimating population size in continuous-time capture-recapture experiments with frailty.一种基于统一似然法的方法,用于在具有脆弱性的连续时间捕获-再捕获实验中估计种群大小。
Biometrics. 2007 Mar;63(1):228-36. doi: 10.1111/j.1541-0420.2006.00623.x.
9
Multi-list methods using incomplete lists in closed populations.在封闭人群中使用不完整列表的多列表方法。
Biometrics. 2005 Mar;61(1):134-40. doi: 10.1111/j.0006-341X.2005.021126.x.
10
A multilevel model for continuous time population estimation.一种用于连续时间种群估计的多层次模型。
Biometrics. 2009 Sep;65(3):841-9. doi: 10.1111/j.1541-0420.2008.01129.x. Epub 2009 Jan 23.

引用本文的文献

1
Estimating capacity requirements for mental health services after a disaster has occurred: a call for new data.灾难发生后心理健康服务能力需求的评估:呼吁新数据。
Am J Public Health. 2004 Apr;94(4):582-5. doi: 10.2105/ajph.94.4.582.