冒名顶替者、机器人及在线研究中数据完整性面临的其他威胁:文献综述与最佳实践建议

Imposters, Bots, and Other Threats to Data Integrity in Online Research: Scoping Review of the Literature and Recommendations for Best Practices.

作者信息

Strickland Isabella B, Ferketich Amy K, Tackett Alayna P, Patterson Joanne G, Breitborde Nicholas J K, Davis Jade, Roberts Megan

机构信息

College of Public Health, The Ohio State University, 1841 Neil Ave, Columbus, OH, 43210, United States, 1 6142924647.

Center for Tobacco Research, The Ohio State University Comprehensive Cancer Center, The Ohio State University, Columbus, OH, United States.

出版信息

Online J Public Health Inform. 2025 Aug 29;17:e70926. doi: 10.2196/70926.

Abstract

BACKGROUND

Threats to data integrity have always existed in online human subjects research, but it appears these threats have become more common and more advanced in recent years. Researchers have proposed various techniques to address satisficers, repeat participants, bots, and fraudulent participants; yet, no synthesis of this literature has been conducted.

OBJECTIVE

This study undertakes a scoping review of recent methods and ethical considerations for addressing threats to data integrity in online research.

METHODS

A PubMed search was used to identify 90 articles published from 2020 to 2024 that were written in English, that discussed online human subjects research, and that had at least one paragraph dedicated to discussing threats to online data integrity.

RESULTS

We cataloged 16 types of techniques for addressing threats to online data integrity. Techniques to authenticate personal information (eg, videoconferencing and mailing incentives to a physical address) appear to be very effective at deterring or identifying fraudulent participants. Yet such techniques also come with ethical considerations, including participant burden and increased threats to privacy. Other techniques, such as Completely Automated Public Turing test to tell Computers and Humans Apart (reCAPTCHA; Google LLC), scores, and checking IP addresses, although very common, were also deemed by several researchers as no longer sufficient protections against advanced threats to data integrity.

CONCLUSIONS

Overall, this review demonstrates the importance of shifting online research protocols as bots and fraudulent participants become more sophisticated.

摘要

背景

在线人体研究中,数据完整性一直面临威胁,但近年来这些威胁似乎变得更加普遍和复杂。研究人员提出了各种技术来应对敷衍了事者、重复参与者、机器人程序和欺诈性参与者;然而,尚未对该文献进行综合分析。

目的

本研究对近期解决在线研究中数据完整性威胁的方法和伦理考量进行了范围综述。

方法

通过PubMed检索,识别出2020年至2024年发表的9篇英文文章,这些文章讨论了在线人体研究,且至少有一段专门讨论在线数据完整性的威胁。

结果

我们梳理了16种应对在线数据完整性威胁的技术类型。验证个人信息的技术(如视频会议和向实际地址邮寄激励措施)在威慑或识别欺诈性参与者方面似乎非常有效。然而,这些技术也带来了伦理考量,包括参与者负担和对隐私的更大威胁。其他技术,如区分计算机和人类的完全自动化公开图灵测试(reCAPTCHA;谷歌有限责任公司)、评分和检查IP地址,虽然非常常见,但也有几位研究人员认为,这些技术已不足以防范对数据完整性的高级威胁。

结论

总体而言,随着机器人程序和欺诈性参与者变得更加复杂,本综述表明了调整在线研究方案的重要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7592/12396152/4e63232b8a04/ojphi-v17-e70926-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索