Lee Ji-Sung, Kim Jaiyong, Baik Sei-Hyun, Park Ie-Byung, Lee Juneyoung
Department of Public Health, Korea University Graduate School, Korea.
J Prev Med Public Health. 2009 Mar;42(2):135-42. doi: 10.3961/jpmph.2009.42.2.135.
An appropriate sampling strategy for estimating an epidemiologic volume of diabetes has been evaluated through a simulation.
We analyzed about 250 million medical insurance claims data submitted to the Health Insurance Review & Assessment Service with diabetes as principal or subsequent diagnoses, more than or equal to once per year, in 2003. The database was re-constructed to a 'patient-hospital profile' that had 3,676,164 cases, and then to a 'patient profile' that consisted of 2,412,082 observations. The patient profile data was then used to test the validity of a proposed sampling frame and methods of sampling to develop diabetic-related epidemiologic indices.
Simulation study showed that a use of a stratified two-stage cluster sampling design with a total sample size of 4,000 will provide an estimate of 57.04% (95% prediction range, 49.83 - 64.24%) for a treatment prescription rate of diabetes. The proposed sampling design consists, at first, stratifying the area of the nation into "metropolitan/city/county" and the types of hospital into "tertiary/secondary/primary/clinic" with a proportion of 5:10:10:75. Hospitals were then randomly selected within the strata as a primary sampling unit, followed by a random selection of patients within the hospitals as a secondly sampling unit. The difference between the estimate and the parameter value was projected to be less than 0.3%.
The sampling scheme proposed will be applied to a subsequent nationwide field survey not only for estimating the epidemiologic volume of diabetes but also for assessing the present status of nationwide diabetes control.
通过模拟评估了一种用于估计糖尿病流行病学数据量的适当抽样策略。
我们分析了2003年提交给健康保险审查与评估服务机构的约2.5亿份医疗保险理赔数据,这些数据以糖尿病作为主要或后续诊断,且每年出现一次及以上。该数据库被重建为一个包含3,676,164个病例的“患者-医院概况”,然后再重建为一个由2,412,082条观测值组成的“患者概况”。然后使用患者概况数据来检验所提议抽样框架和抽样方法的有效性,以制定与糖尿病相关的流行病学指标。
模拟研究表明,采用总样本量为4000的分层两阶段整群抽样设计,对于糖尿病治疗处方率的估计值为57.04%(95%预测范围为49.83 - 64.24%)。所提议的抽样设计首先将全国区域按“大城市/城市/县”分层,将医院类型按“三级/二级/一级/诊所”分层,比例为5:10:10:75。然后在各层内随机选择医院作为初级抽样单位,接着在医院内随机选择患者作为次级抽样单位。估计值与参数值之间的差异预计小于0.3%。
所提议的抽样方案将应用于后续的全国性实地调查,不仅用于估计糖尿病的流行病学数据量,还用于评估全国糖尿病控制的现状。