• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估和提高基于网络的调查中的数据完整性:COVID-19研究中欺诈检测系统的比较

Assessing and Improving Data Integrity in Web-Based Surveys: Comparison of Fraud Detection Systems in a COVID-19 Study.

作者信息

Bonett Stephen, Lin Willey, Sexton Topper Patrina, Wolfe James, Golinkoff Jesse, Deshpande Aayushi, Villarruel Antonia, Bauermeister José

机构信息

School of Nursing, University of Pennsylvania, Philadelphia, PA, United States.

Department of Psychology, Ashoka University, Sonepat, India.

出版信息

JMIR Form Res. 2024 Jan 12;8:e47091. doi: 10.2196/47091.

DOI:10.2196/47091
PMID:38214962
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10818231/
Abstract

BACKGROUND

Web-based surveys increase access to study participation and improve opportunities to reach diverse populations. However, web-based surveys are vulnerable to data quality threats, including fraudulent entries from automated bots and duplicative submissions. Widely used proprietary tools to identify fraud offer little transparency about the methods used, effectiveness, or representativeness of resulting data sets. Robust, reproducible, and context-specific methods of accurately detecting fraudulent responses are needed to ensure integrity and maximize the value of web-based survey research.

OBJECTIVE

This study aims to describe a multilayered fraud detection system implemented in a large web-based survey about COVID-19 attitudes, beliefs, and behaviors; examine the agreement between this fraud detection system and a proprietary fraud detection system; and compare the resulting study samples from each of the 2 fraud detection methods.

METHODS

The PhillyCEAL Common Survey is a cross-sectional web-based survey that remotely enrolled residents ages 13 years and older to assess how the COVID-19 pandemic impacted individuals, neighborhoods, and communities in Philadelphia, Pennsylvania. Two fraud detection methods are described and compared: (1) a multilayer fraud detection strategy developed by the research team that combined automated validation of response data and real-time verification of study entries by study personnel and (2) the proprietary fraud detection system used by the Qualtrics (Qualtrics) survey platform. Descriptive statistics were computed for the full sample and for responses classified as valid by 2 different fraud detection methods, and classification tables were created to assess agreement between the methods. The impact of fraud detection methods on the distribution of vaccine confidence by racial or ethnic group was assessed.

RESULTS

Of 7950 completed surveys, our multilayer fraud detection system identified 3228 (40.60%) cases as valid, while the Qualtrics fraud detection system identified 4389 (55.21%) cases as valid. The 2 methods showed only "fair" or "minimal" agreement in their classifications (κ=0.25; 95% CI 0.23-0.27). The choice of fraud detection method impacted the distribution of vaccine confidence by racial or ethnic group.

CONCLUSIONS

The selection of a fraud detection method can affect the study's sample composition. The findings of this study, while not conclusive, suggest that a multilayered approach to fraud detection that includes conservative use of automated fraud detection and integration of human review of entries tailored to the study's specific context and its participants may be warranted for future survey research.

摘要

背景

基于网络的调查增加了参与研究的机会,并改善了接触不同人群的可能性。然而,基于网络的调查容易受到数据质量威胁,包括自动机器人的欺诈性条目和重复提交。广泛使用的用于识别欺诈的专有工具几乎没有提供关于所使用方法、有效性或所得数据集代表性的透明度。需要稳健、可重复且针对具体情况的准确检测欺诈性回复的方法,以确保基于网络的调查研究的完整性并最大化其价值。

目的

本研究旨在描述在一项关于新冠疫情态度、信念和行为的大型基于网络的调查中实施的多层欺诈检测系统;检验该欺诈检测系统与一个专有欺诈检测系统之间的一致性;并比较两种欺诈检测方法各自所得的研究样本。

方法

费城社区参与与评估共同调查是一项基于网络的横断面调查,远程招募13岁及以上居民,以评估新冠疫情如何影响宾夕法尼亚州费城的个人、社区和邻里。描述并比较了两种欺诈检测方法:(1)研究团队开发的多层欺诈检测策略,该策略结合了对回复数据的自动验证以及研究人员对研究条目的实时核查;(2)Qualtrics(Qualtrics)调查平台使用的专有欺诈检测系统。计算了整个样本以及被两种不同欺诈检测方法分类为有效的回复的描述性统计数据,并创建了分类表以评估两种方法之间的一致性。评估了欺诈检测方法对按种族或族裔划分的疫苗信心分布的影响。

结果

在7950份完成的调查中,我们的多层欺诈检测系统将3228例(40.60%)判定为有效,而Qualtrics欺诈检测系统将4389例(55.21%)判定为有效。两种方法在分类上仅显示出“一般”或“极小”的一致性(κ = 0.25;95%置信区间0.23 - 0.27)。欺诈检测方法的选择影响了按种族或族裔划分的疫苗信心分布。

结论

欺诈检测方法的选择会影响研究的样本构成。本研究的结果虽不具有决定性,但表明对于未来的调查研究,可能有必要采用多层欺诈检测方法,包括谨慎使用自动欺诈检测以及结合根据研究的具体情况及其参与者量身定制的人工条目审核。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8d4/10818231/fe75e05acded/formative_v8i1e47091_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8d4/10818231/4706bfaffbfa/formative_v8i1e47091_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8d4/10818231/017a8c0c2c8b/formative_v8i1e47091_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8d4/10818231/2c078de92aa9/formative_v8i1e47091_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8d4/10818231/fe75e05acded/formative_v8i1e47091_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8d4/10818231/4706bfaffbfa/formative_v8i1e47091_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8d4/10818231/017a8c0c2c8b/formative_v8i1e47091_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8d4/10818231/2c078de92aa9/formative_v8i1e47091_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a8d4/10818231/fe75e05acded/formative_v8i1e47091_fig4.jpg

相似文献

1
Assessing and Improving Data Integrity in Web-Based Surveys: Comparison of Fraud Detection Systems in a COVID-19 Study.评估和提高基于网络的调查中的数据完整性:COVID-19研究中欺诈检测系统的比较
JMIR Form Res. 2024 Jan 12;8:e47091. doi: 10.2196/47091.
2
Challenges and Lessons Learned in Managing Web-Based Survey Fraud for the Garnering Effective Outreach and Research in Georgia for Impact Alliance-Community Engagement Alliance Survey Administrations.在为佐治亚州影响联盟 - 社区参与联盟调查管理进行有效外展和研究的网络调查欺诈管理中面临的挑战及经验教训。
JMIR Public Health Surveill. 2024 Dec 24;10:e51786. doi: 10.2196/51786.
3
Assessment of Fraud Deterrence and Detection Procedures Used in a Web-Based Survey Study With Adult Black Cisgender Women: Description of Lessons Learned and Recommendations.对一项针对成年黑人顺性别女性的网络调查研究中使用的欺诈威慑和检测程序的评估:经验教训描述与建议
JMIR Form Res. 2025 Mar 12;9:e59955. doi: 10.2196/59955.
4
From Doubt to Confidence-Overcoming Fraudulent Submissions by Bots and Other Takers of a Web-Based Survey.从怀疑到信任——克服机器人及其他基于网络调查作弊者的欺诈性提交行为
J Med Internet Res. 2024 Dec 16;26:e60184. doi: 10.2196/60184.
5
Fraud Detection Protocol for Web-Based Research Among Men Who Have Sex With Men: Development and Descriptive Evaluation.男男性行为者基于网络研究的欺诈检测协议:制定与描述性评估。
JMIR Public Health Surveill. 2019 Feb 4;5(1):e12344. doi: 10.2196/12344.
6
Identifying Fraudulent Responses in a Study Exploring Delivery Options for Pregnancies Impacted by Gestational Diabetes: Lessons Learned From a Web-Based Survey.在一项探索妊娠期糖尿病影响的妊娠分娩选择的研究中识别欺诈性回复:基于网络调查的经验教训
J Med Internet Res. 2025 Jan 20;27:e58450. doi: 10.2196/58450.
7
Data Verification and Respondent Validity for a Web-Based Sexual Health Survey: Tutorial.基于网络的性健康调查的数据验证与应答者有效性:教程
JMIR Form Res. 2024 Dec 9;8:e56788. doi: 10.2196/56788.
8
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
9
Methods for Authenticating Participants in Fully Web-Based Mobile App Trials from the iReach Project: Cross-sectional Study.基于 iReach 项目的全网络移动应用试验中参与者认证方法:横断面研究。
JMIR Mhealth Uhealth. 2021 Aug 31;9(8):e28232. doi: 10.2196/28232.
10
Combating Fraudulent Participation in Urban American Indian and Alaska Native Virtual Health Research: Protocol for Increasing Data Integrity in Online Research (PRIOR).打击美国城市印第安人和阿拉斯加原住民虚拟健康研究中的欺诈性参与:提高在线研究数据完整性的方案(PRIOR)。
JMIR Res Protoc. 2024 Jun 13;13:e52281. doi: 10.2196/52281.

引用本文的文献

1
Imposters, Bots, and Other Threats to Data Integrity in Online Research: Scoping Review of the Literature and Recommendations for Best Practices.冒名顶替者、机器人及在线研究中数据完整性面临的其他威胁:文献综述与最佳实践建议
Online J Public Health Inform. 2025 Aug 29;17:e70926. doi: 10.2196/70926.
2
Increasing Rigor in Online Health Surveys Through the Reduction of Fraudulent Data.通过减少欺诈性数据提高在线健康调查的严谨性。
J Med Internet Res. 2025 Aug 21;27:e68092. doi: 10.2196/68092.
3
Improving Data Integrity in Samples Obtained From Web-Based Recruitment: Protocol for the Development of a Novel System for Assessing Participant Authenticity in a Remote Longitudinal Cohort Study of Polysubstance Use.

本文引用的文献

1
Are Your Participants Real? Dealing with Fraud in Recruiting Older Adults Online.参与者是真实的吗?在线招募老年人时应对欺诈行为。
West J Nurs Res. 2023 Jan;45(1):93-99. doi: 10.1177/01939459221098468. Epub 2022 May 19.
2
Evaluating and improving the quality of survey data from panel and crowd-sourced samples: A practical guide for psychological research.评估和提高面板和众包样本调查数据的质量:心理研究的实用指南。
Exp Clin Psychopharmacol. 2022 Aug;30(4):400-408. doi: 10.1037/pha0000564. Epub 2022 Apr 4.
3
Using a consistency check during data collection to identify invalid responding in an online cannabis screening survey.
提高基于网络招募的样本的数据完整性:开发一种新型系统以评估多物质使用远程纵向队列研究中参与者真实性的方案。
JMIR Res Protoc. 2025 Aug 14;14:e69956. doi: 10.2196/69956.
4
It's raining bots: how easier access to internet surveys has created the perfect storm.“机器人”如雨下:网络调查获取途径的便捷化如何引发了一场完美风暴。
BMJ Open Qual. 2025 Jun 1;14(2):e003208. doi: 10.1136/bmjoq-2024-003208.
5
Accessible, not Exploitable: Navigating fraud prevention in disability research.可及而非可利用:残疾研究中的欺诈预防导航
Disabil Health J. 2025 Apr 29:101843. doi: 10.1016/j.dhjo.2025.101843.
6
The relationship between racial discrimination in healthcare, loneliness, and mental health among Black Philadelphia residents.费城黑人居民在医疗保健方面遭受的种族歧视、孤独感与心理健康之间的关系。
Int J Equity Health. 2025 Apr 21;24(1):109. doi: 10.1186/s12939-025-02475-6.
7
A Cross-Sectional Study of Online Survey Respondents' Knowledge and Attitudes Toward Delirium.一项关于在线调查受访者对谵妄的认知与态度的横断面研究。
J Gen Intern Med. 2025 Mar 3. doi: 10.1007/s11606-025-09445-2.
8
Successful (and Unsuccessful) Recruitment Approaches and Participant Loss in a Down Syndrome Survey.唐氏综合征调查中成功(与不成功)的招募方法及参与者流失情况
Am J Intellect Dev Disabil. 2025 Mar 1;130(2):131-145. doi: 10.1352/1944-7558-130.2.131.
9
Challenges and Lessons Learned in Managing Web-Based Survey Fraud for the Garnering Effective Outreach and Research in Georgia for Impact Alliance-Community Engagement Alliance Survey Administrations.在为佐治亚州影响联盟 - 社区参与联盟调查管理进行有效外展和研究的网络调查欺诈管理中面临的挑战及经验教训。
JMIR Public Health Surveill. 2024 Dec 24;10:e51786. doi: 10.2196/51786.
10
Identifying and counteracting fraudulent responses in online recruitment for health research: a scoping review.识别和应对健康研究在线招募中的欺诈性回复:一项范围综述
BMJ Evid Based Med. 2025 May 20;30(3):173-182. doi: 10.1136/bmjebm-2024-113170.
在数据收集过程中使用一致性检查来识别在线大麻筛查调查中的无效回答。
BMC Med Res Methodol. 2022 Mar 13;22(1):67. doi: 10.1186/s12874-022-01556-2.
4
Ensuring survey research data integrity in the era of internet bots.在互联网机器人时代确保调查研究数据的完整性。
Qual Quant. 2022;56(4):2841-2852. doi: 10.1007/s11135-021-01252-1. Epub 2021 Oct 5.
5
Methods for Authenticating Participants in Fully Web-Based Mobile App Trials from the iReach Project: Cross-sectional Study.基于 iReach 项目的全网络移动应用试验中参与者认证方法:横断面研究。
JMIR Mhealth Uhealth. 2021 Aug 31;9(8):e28232. doi: 10.2196/28232.
6
Survey Fraud and the Integrity of Web-Based Survey Research.网络调查研究中的欺诈行为与诚信问题。
Am J Health Promot. 2022 Jan;36(1):18-20. doi: 10.1177/08901171211037531. Epub 2021 Aug 10.
7
Remote data collection for public health research in a COVID-19 era: ethical implications, challenges and opportunities.新冠疫情时代公共卫生研究中的远程数据收集:伦理影响、挑战与机遇
Health Policy Plan. 2021 Apr 21;36(3):360-368. doi: 10.1093/heapol/czaa158.
8
The Expanding Digital Divide: Digital Health Access Inequities during the COVID-19 Pandemic in New York City.不断扩大的数字鸿沟:纽约市新冠疫情期间数字健康获取的不平等现象
J Urban Health. 2021 Apr;98(2):183-186. doi: 10.1007/s11524-020-00508-9. Epub 2021 Jan 20.
9
Threats of Bots and Other Bad Actors to Data Quality Following Research Participant Recruitment Through Social Media: Cross-Sectional Questionnaire.社交媒体招募研究参与者后对数据质量的机器人和其他不良行为者的威胁:横断面问卷调查。
J Med Internet Res. 2020 Oct 7;22(10):e23021. doi: 10.2196/23021.
10
Social media as a recruitment platform for a nationwide online survey of COVID-19 knowledge, beliefs, and practices in the United States: methodology and feasibility analysis.社交媒体作为美国全国性在线 COVID-19 知识、信念和实践调查的招募平台:方法学和可行性分析。
BMC Med Res Methodol. 2020 May 13;20(1):116. doi: 10.1186/s12874-020-01011-0.