Emergint Corporation, Louisville, KY, USA.
J Urban Health. 2012 Jun;89(3):565-86. doi: 10.1007/s11524-012-9676-8.
Time-location sampling (TLS) is useful for collecting information on a hard-to-reach population (such as men who have sex with men [MSM]) by sampling locations where persons of interest can be found, and then sampling those who attend. These studies have typically been analyzed as a simple random sample (SRS) from the population of interest. If this population is the source population, as we assume here, such an analysis is likely to be biased, because it ignores possible associations between outcomes of interest and frequency of attendance at the locations sampled, and is likely to underestimate the uncertainty in the estimates, as a result of ignoring both the clustering within locations and the variation in the probability of sampling among members of the population who attend sampling locations. We propose that TLS data be analyzed as a two-stage sample survey using a simple weighting procedure based on the inverse of the approximate probability that a person was sampled and using sample survey analysis software to estimate the standard errors of estimates (to account for the effects of clustering within the first stage [locations] and variation in the weights). We use data from the Young Men's Survey Phase II, a study of MSM, to show that, compared with an analysis assuming a SRS, weighting can affect point prevalence estimates and estimates of associations and that weighting and clustering can substantially increase estimates of standard errors. We describe data on location attendance that would yield improved estimates of weights. We comment on the advantages and disadvantages of TLS and respondent-driven sampling.
时间-地点抽样(TLS)通过在可以找到目标人群的地点进行抽样,然后对参加者进行抽样,从而有助于收集难以接触到的人群(如男男性行为者[MSM])的信息。这些研究通常被分析为从目标人群中抽取的简单随机样本(SRS)。如果该人群是我们这里假设的源人群,那么这种分析可能存在偏差,因为它忽略了目标人群的结果与抽样地点的出席频率之间可能存在的关联,并且由于忽略了地点内的聚类和抽样人群中参加抽样地点的成员之间的抽样概率的变化,结果可能会低估估计的不确定性。我们建议使用基于被抽样者的近似抽样概率的倒数的简单加权程序,将 TLS 数据作为两阶段抽样调查进行分析,并使用抽样调查分析软件来估计估计值的标准误差(以考虑到第一阶段[地点]内的聚类和权重变化的影响)。我们使用来自年轻男性调查第二期的 MSM 研究数据,表明与假设 SRS 的分析相比,加权会影响点患病率估计值和关联估计值,并且加权和聚类会大大增加标准误差的估计值。我们描述了可提高权重估计值的数据。我们评论了 TLS 和受访者驱动抽样的优缺点。