识别互联网研究中的多重提交:维护数据完整性。
Identifying multiple submissions in Internet research: preserving data integrity.
作者信息
Bowen Anne M, Daniel Candice M, Williams Mark L, Baird Grayson L
机构信息
Department of Psychology, University of Wyoming, Laramie, WY 82071, USA.
出版信息
AIDS Behav. 2008 Nov;12(6):964-73. doi: 10.1007/s10461-007-9352-2. Epub 2008 Feb 1.
Internet-based sexuality research with hidden populations has become increasingly popular. Respondent anonymity may encourage participation and lower social desirability, but associated disinhibition may promote multiple submissions, especially when incentives are offered. The goal of this study was to identify the usefulness of different variables for detecting multiple submissions from repeat responders and to explore incentive effects. The data included 1,900 submissions from a three-session Internet intervention with a pretest and three post-test questionnaires. Participants were men who have sex with men and incentives were offered to rural participants for completing each questionnaire. The final number of submissions included 1,273 "unique", 132 first submissions by "repeat responders" and 495 additional submissions by the "repeat responders" (N = 1,900). Four categories of repeat responders were identified: "infrequent" (2-5 submissions), "persistent" (6-10 submissions), "very persistent" (11-30 submissions), and "hackers" (more than 30 submissions). Internet Provider (IP) addresses, user names, and passwords were the most useful for identifying "infrequent" repeat responders. "Hackers" often varied their IP address and identifying information to prevent easy identification, but investigating the data for small variations in IP, using reverse telephone look up, and patterns across usernames and passwords were helpful. Incentives appeared to play a role in stimulating multiple submissions, especially from the more sophisticated "hackers". Finally, the web is ever evolving and it will be necessary to have good programmers and staff who evolve as fast as "hackers".
针对隐蔽人群开展的基于互联网的性取向研究越来越普遍。受访者匿名可能会鼓励参与并降低社会期望性,但由此产生的行为抑制解除可能会促使多次提交,尤其是在提供奖励的情况下。本研究的目的是确定不同变量在检测重复受访者的多次提交方面的有用性,并探讨奖励的影响。数据包括来自一项三阶段互联网干预的1900份提交内容,该干预包括一次预测试和三次后测试问卷。参与者为男同性恋者,为农村参与者完成每份问卷提供奖励。最终提交内容的数量包括1273份“唯一”提交、“重复受访者”的132份首次提交以及“重复受访者”的495份额外提交(N = 1900)。确定了四类重复受访者:“不频繁”(2 - 5次提交)、“持续”(6 - 10次提交)、“非常持续”(11 - 30次提交)和“黑客”(超过30次提交)。互联网服务提供商(IP)地址、用户名和密码在识别“不频繁”的重复受访者方面最有用。“黑客”经常更改他们的IP地址和识别信息以防止被轻易识别,但调查IP的微小变化数据、使用反向电话查询以及用户名和密码的模式会有所帮助。奖励似乎在刺激多次提交方面起到了作用,尤其是对更老练的“黑客”而言。最后,网络在不断发展,有优秀的程序员和与“黑客”发展速度一样快的工作人员将很有必要。