Stratified sampling design and loss to follow-up in survival models: evaluation of efficiency and bias.

Affiliation

Department of Statistics, Federal University of Minas Gerais, Belo Horizonte, Minas Gerais, Brazil.

Publication information

BMC Med Res Methodol. 2011 Jun 26;11:99. doi: 10.1186/1471-2288-11-99.

DOI:10.1186/1471-2288-11-99
PMID:21703013
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3144452/
Abstract

BACKGROUND

Longitudinal studies often employ complex sample designs to optimize sample size, over-representing population groups of interest. The effect of sample design on parameter estimates is quite often ignored, particularly when fitting survival models. Another major problem in long-term cohort studies is the potential bias due to loss to follow-up.

METHODS

In this paper we simulated a dataset with approximately 50,000 individuals as the target population and 15,000 participants to be followed up for 40 years, both based on real cohort studies of cardiovascular diseases. Two sampling strategies (simple random, our gold standard, and stratified by professional group with non-proportional allocation) and two loss to follow-up scenarios (non-informative censoring and losses related to the professional group) were analyzed.

RESULTS

Two modeling approaches were evaluated: weighted and non-weighted fit. Our results indicate that under the correctly specified model, ignoring the sample weights does not affect the results. However, the model ignoring the interaction of sample strata with the variable of interest and the crude estimates were highly biased.
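The effect of non-proportional allocation on a crude estimate can be illustrated with a minimal sketch. This is not the authors' simulation: the three strata, their sizes, and their constant hazards are hypothetical values chosen only so that the totals resemble the 50,000-person population and 15,000-person sample described in the Methods. Each stratum receives a design weight equal to its population size divided by its sample size, and the crude event rate (events per person-year) is computed with and without those weights.

```python
import math
import random

# Hypothetical strata (not from the paper): population size N, sample size n,
# and a constant event hazard per year for each professional group.
STRATA = {
    "group_a": {"N": 30000, "n": 5000, "hazard": 0.01},
    "group_b": {"N": 12000, "n": 5000, "hazard": 0.02},
    "group_c": {"N": 8000, "n": 5000, "hazard": 0.05},
}
FOLLOW_UP_YEARS = 40.0


def design_weights(strata):
    """Sampling weight per stratum: population size over sample size."""
    return {name: s["N"] / s["n"] for name, s in strata.items()}


def draw_stratified_sample(strata, rng):
    """Draw exponential event times, censored at the end of follow-up."""
    records = []
    for name, s in strata.items():
        for _ in range(s["n"]):
            t = rng.expovariate(s["hazard"])
            event = t <= FOLLOW_UP_YEARS
            records.append((name, min(t, FOLLOW_UP_YEARS), event))
    return records


def crude_rate(records, weights=None):
    """Events per person-year, optionally weighted by stratum."""
    events = person_time = 0.0
    for name, time, event in records:
        w = 1.0 if weights is None else weights[name]
        events += w if event else 0.0
        person_time += w * time
    return events / person_time


def population_rate(strata):
    """Expected crude rate in the full population under the same model."""
    events = person_time = 0.0
    for s in strata.values():
        p_event = 1.0 - math.exp(-s["hazard"] * FOLLOW_UP_YEARS)
        events += s["N"] * p_event
        # Expected follow-up time of a censored exponential is p_event / hazard.
        person_time += s["N"] * p_event / s["hazard"]
    return events / person_time


rng = random.Random(2011)
sample = draw_stratified_sample(STRATA, rng)
weights = design_weights(STRATA)
target = population_rate(STRATA)
unweighted = crude_rate(sample)
weighted = crude_rate(sample, weights)
print(f"population {target:.4f}  unweighted {unweighted:.4f}  weighted {weighted:.4f}")
```

Because the small, high-hazard group is over-represented in the sample, the unweighted crude rate is pulled toward that group, while the weighted rate stays close to the population value. The sketch covers only crude rates under administrative censoring; it does not fit the survival models or reproduce the loss to follow-up scenarios evaluated in the paper.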

CONCLUSIONS

In epidemiological studies, model misspecification should always be considered, as different sources of variability, related to the individuals and not captured by the covariates, are always present. Therefore, allowance must be made for the possibility of unknown confounders and interactions with the main variable of interest in our data. It is strongly recommended always to correct for the sample weights.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d50/3144452/0dd104911a28/1471-2288-11-99-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d50/3144452/12689d595fc6/1471-2288-11-99-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d50/3144452/fafebaef4ac6/1471-2288-11-99-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d50/3144452/fea6e6bbb414/1471-2288-11-99-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d50/3144452/e7015d74c063/1471-2288-11-99-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d50/3144452/b930263a7ad4/1471-2288-11-99-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d50/3144452/b55507fb14fc/1471-2288-11-99-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d50/3144452/b9c9a863fe30/1471-2288-11-99-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d50/3144452/49a248141d75/1471-2288-11-99-8.jpg
