Siri Jose G, Lindblade Kim A, Rosen Daniel H, Onyango Bernard, Vulule John, Slutsker Laurence, Wilson Mark L
Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, Michigan, USA.
Malar J. 2008 Feb 25;7:34. doi: 10.1186/1475-2875-7-34.
Although sub-Saharan Africa (SSA) is rapidly urbanizing, the terms used to classify urban ecotypes are poorly defined in the context of malaria epidemiology. Lack of clear definitions may cause misclassification error, which likely decreases the accuracy of continent-wide estimates of malaria burden, limits the generalizability of urban malaria studies, and makes identification of high-risk areas for targeted interventions within cities more difficult. Accordingly, clustering techniques were applied to a set of urbanization- and malaria-related variables in Kisumu, Kenya, to produce a quantitative classification of the urban environment for malaria research.
Seven variables with a known or expected relationship with malaria in the context of urbanization were identified and measured at the census enumeration area (EA) level, using three sources: a) the results of a citywide knowledge, attitudes and practices (KAP) survey; b) a high-resolution multispectral satellite image; and c) national census data. Principal components analysis (PCA) was used to identify three factors explaining higher proportions of the combined variance than the original variables. A k-means clustering algorithm was applied to the EA-level factor scores to assign EAs to one of three categories: "urban," "peri-urban," or "semi-rural." The results were compared with classifications derived from two other approaches: a) administrative designation of urban/rural by the census or b) population density thresholds.
Urban zones resulting from the clustering algorithm were more geographically coherent than those delineated by population density. Clustering distributed population more evenly among zones than either of the other methods and more accurately predicted variation in other variables related to urbanization, but not used for classification.
Effective urban malaria epidemiology and control would benefit from quantitative methods to identify and characterize urban areas. Cluster analysis techniques were used to classify Kisumu, Kenya, into levels of urbanization in a repeatable and unbiased manner, an approach that should permit more relevant comparisons among and within urban areas. To the extent that these divisions predict meaningful intra-urban differences in malaria epidemiology, they should inform targeted urban malaria interventions in cities across SSA.
尽管撒哈拉以南非洲地区(SSA)正在迅速城市化,但在疟疾流行病学背景下,用于对城市生态类型进行分类的术语定义不明确。缺乏明确的定义可能会导致错误分类,这可能会降低全大陆疟疾负担估计的准确性,限制城市疟疾研究的可推广性,并使确定城市内针对性干预的高风险地区更加困难。因此,聚类技术被应用于肯尼亚基苏木市一组与城市化和疟疾相关的变量,以产生用于疟疾研究的城市环境定量分类。
确定了七个在城市化背景下与疟疾有已知或预期关系的变量,并在人口普查枚举区(EA)层面进行测量,使用了三个来源:a)全市知识、态度和实践(KAP)调查的结果;b)高分辨率多光谱卫星图像;c)全国人口普查数据。主成分分析(PCA)用于识别三个解释组合方差比例高于原始变量的因素。k均值聚类算法应用于EA层面的因素得分,将EA分为三类之一:“城市”、“城郊”或“半农村”。将结果与另外两种方法得出的分类进行比较:a)人口普查的城乡行政划分;b)人口密度阈值。
聚类算法产生的城市区域在地理上比人口密度划定的区域更连贯。与其他两种方法相比,聚类在各区域间更均匀地分布人口,并且更准确地预测了与城市化相关但未用于分类的其他变量的变化。
有效的城市疟疾流行病学和控制将受益于识别和描述城市地区的定量方法。聚类分析技术被用于以可重复且无偏的方式将肯尼亚基苏木市划分为不同的城市化水平,这种方法应允许在城市地区之间和内部进行更相关的比较。就这些划分预测城市内疟疾流行病学有意义的差异而言,它们应为SSA各城市的针对性城市疟疾干预提供信息。