Alexander N, Lenhart A E, Romero-Vivas C M E, Barbazan P, Morrison A C, Barrera R, Arredondo-Jiménez J I, Focks D A
Infectious Disease Epidemiology Unit, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, U.K.
Ann Trop Med Parasitol. 2006 Apr;100 Suppl 1:S5-S16. doi: 10.1179/136485906X105471.
A method has been developed for estimating the sample sizes needed to identify categories that comprise a large proportion of a compositional data-set. The method is to be used in the design of surveys of mosquito pupae, for identifying the key container types from which the majority of adult dengue vectors emerge. Although a finite-population correction was devised for estimating the mean of a negative binomial distribution, other complications of parametric approaches make them unlikely to yield methods simple enough to be practically applicable. The Shannon-Wiener index was therefore investigated as a more useful alternative, at the cost of theoretical generalizability, in an approach based on re-sampling methods in conjunction with the use of entropy. This index can be used to summarize the degree to which pupae are either concentrated in a few container types, or dispersed among many. An empirical relationship between the index and the repeatability of surveys of differing sample sizes was observed. A step-wise rule, based on the entropy of the cumulative data, was devised for determining the sample size, in terms of the number of houses positive for pupae, at which a pupal survey might reasonably be stopped.
已开发出一种方法,用于估计识别构成成分数据集很大比例的类别所需的样本量。该方法将用于蚊蛹调查设计,以识别大多数成年登革热传播媒介出现的关键容器类型。尽管设计了有限总体校正来估计负二项分布的均值,但参数方法的其他复杂性使得它们不太可能产生足够简单以便实际应用的方法。因此,在基于重采样方法并结合熵使用的方法中,研究了香农 - 维纳指数作为一种更有用的替代方法,代价是理论通用性。该指数可用于总结蛹集中在少数几种容器类型中或分散在多种容器类型中的程度。观察到该指数与不同样本量调查的可重复性之间的经验关系。根据累积数据的熵设计了一个逐步规则,用于根据蛹呈阳性的房屋数量确定样本量,达到该样本量时蛹调查可能合理停止。