• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

排除报告患有“ Syndomitis”或“Chekalism”的人对数据质量的影响:来自亚马逊 Mechanical Turk 的样本的纵向健康调查。

Effects of Excluding Those Who Report Having "Syndomitis" or "Chekalism" on Data Quality: Longitudinal Health Survey of a Sample From Amazon's Mechanical Turk.

机构信息

Division of General Internal Medicine and Health Services Research, Department of Medicine, University of California, Los Angeles, CA, United States.

Behavioral and Policy Sciences, RAND Corporation, Santa Monica, CA, United States.

出版信息

J Med Internet Res. 2023 Aug 4;25:e46421. doi: 10.2196/46421.

DOI:10.2196/46421
PMID:37540543
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10439462/
Abstract

BACKGROUND

Researchers have implemented multiple approaches to increase data quality from existing web-based panels such as Amazon's Mechanical Turk (MTurk).

OBJECTIVE

This study extends prior work by examining improvements in data quality and effects on mean estimates of health status by excluding respondents who endorse 1 or both of 2 fake health conditions ("Syndomitis" and "Chekalism").

METHODS

Survey data were collected in 2021 at baseline and 3 months later from MTurk study participants, aged 18 years or older, with an internet protocol address in the United States, and who had completed a minimum of 500 previous MTurk "human intelligence tasks." We included questions about demographic characteristics, health conditions (including the 2 fake conditions), and the Patient Reported Outcomes Measurement Information System (PROMIS)-29+2 (version 2.1) preference-based score survey. The 3-month follow-up survey was only administered to those who reported having back pain and did not endorse a fake condition at baseline.

RESULTS

In total, 15% (996/6832) of the sample endorsed at least 1 of the 2 fake conditions at baseline. Those who endorsed a fake condition at baseline were more likely to identify as male, non-White, younger, report more health conditions, and take longer to complete the survey than those who did not endorse a fake condition. They also had substantially lower internal consistency reliability on the PROMIS-29+2 scales than those who did not endorse a fake condition: physical function (0.69 vs 0.89), pain interference (0.80 vs 0.94), fatigue (0.80 vs 0.92), depression (0.78 vs 0.92), anxiety (0.78 vs 0.90), sleep disturbance (-0.27 vs 0.84), ability to participate in social roles and activities (0.77 vs 0.92), and cognitive function (0.65 vs 0.77). The lack of reliability of the sleep disturbance scale for those endorsing a fake condition was because it includes both positively and negatively worded items. Those who reported a fake condition reported significantly worse self-reported health scores (except for sleep disturbance) than those who did not endorse a fake condition. Excluding those who endorsed a fake condition improved the overall mean PROMIS-29+2 (version 2.1) T-scores by 1-2 points and the PROMIS preference-based score by 0.04. Although they did not endorse a fake condition at baseline, 6% (n=59) of them endorsed at least 1 of them on the 3-month survey and they had lower PROMIS-29+2 score internal consistency reliability and worse mean scores on the 3-month survey than those who did not report having a fake condition. Based on these results, we estimate that 25% (1708/6832) of the MTurk respondents provided careless or dishonest responses.

CONCLUSIONS

This study provides evidence that asking about fake health conditions can help to screen out respondents who may be dishonest or careless. We recommend this approach be used routinely in samples of members of MTurk.

摘要

背景

研究人员已经实施了多种方法来提高来自现有基于网络的面板(如亚马逊的 Mechanical Turk (MTurk))的数据质量。

目的

本研究通过检查排除了对 2 种虚假健康状况(“Syndomitis”和“Chekalism”)中的 1 种或 2 种均表示认可的受访者后,数据质量的改进和对健康状况平均估计的影响,扩展了之前的工作。

方法

在 2021 年,通过 MTurk 研究参与者的基线和 3 个月后的调查数据进行研究,年龄在 18 岁或以上,在美国拥有互联网协议地址,并且完成了至少 500 项之前的 MTurk“人类智能任务”。我们询问了人口统计学特征、健康状况(包括 2 种虚假状况)以及患者报告的结果测量信息系统(PROMIS)-29+2(版本 2.1)偏好得分调查。只有那些在基线时报告有背痛且不认可虚假状况的人会收到 3 个月的随访调查。

结果

总共有 15%(996/6832)的样本在基线时认可了至少 1 种虚假状况。那些在基线时认可虚假状况的人更可能是男性、非白人、更年轻、报告更多的健康状况、并且比不认可虚假状况的人完成调查所需的时间更长。他们在 PROMIS-29+2 量表上的内部一致性可靠性也明显低于不认可虚假状况的人:身体功能(0.69 与 0.89)、疼痛干扰(0.80 与 0.94)、疲劳(0.80 与 0.92)、抑郁(0.78 与 0.92)、焦虑(0.78 与 0.90)、睡眠障碍(-0.27 与 0.84)、参与社会角色和活动的能力(0.77 与 0.92)以及认知功能(0.65 与 0.77)。认可虚假状况的人睡眠障碍量表缺乏可靠性,是因为它包括了正向和负向措辞的项目。那些报告虚假状况的人自我报告的健康状况明显比不认可虚假状况的人差(除了睡眠障碍)。排除认可虚假状况的人将总体平均 PROMIS-29+2(版本 2.1)T 分数提高了 1-2 分,将 PROMIS 偏好得分提高了 0.04。尽管他们在基线时没有认可虚假状况,但 6%(n=59)的人在 3 个月的调查中认可了至少 1 种虚假状况,他们在 PROMIS-29+2 量表上的内部一致性可靠性较低,3 个月调查时的平均得分也比那些没有报告虚假状况的人差。根据这些结果,我们估计 25%(1708/6832)的 MTurk 受访者提供了粗心或不诚实的回答。

结论

本研究提供了证据表明,询问虚假健康状况可以帮助筛选出可能不诚实或粗心的受访者。我们建议在 MTurk 成员的样本中定期使用这种方法。

相似文献

1
Effects of Excluding Those Who Report Having "Syndomitis" or "Chekalism" on Data Quality: Longitudinal Health Survey of a Sample From Amazon's Mechanical Turk.排除报告患有“ Syndomitis”或“Chekalism”的人对数据质量的影响:来自亚马逊 Mechanical Turk 的样本的纵向健康调查。
J Med Internet Res. 2023 Aug 4;25:e46421. doi: 10.2196/46421.
2
Comparing the Recruitment of Research Participants With Chronic Low Back Pain Using Amazon Mechanical Turk With the Recruitment of Patients From Chiropractic Clinics: A Quasi-Experimental Study.使用亚马逊土耳其机器人招募慢性下背痛研究参与者与从脊骨神经科诊所招募患者进行比较:一项准实验研究。
J Manipulative Physiol Ther. 2021 Oct;44(8):601-611. doi: 10.1016/j.jmpt.2022.02.004. Epub 2022 Jun 18.
3
Prevalence of and Factors Associated With Patient Nondisclosure of Medically Relevant Information to Clinicians.患者向临床医生隐瞒与医疗相关信息的流行率及相关因素。
JAMA Netw Open. 2018 Nov 2;1(7):e185293. doi: 10.1001/jamanetworkopen.2018.5293.
4
Assessment of Patient Nondisclosures to Clinicians of Experiencing Imminent Threats.评估患者对即将面临威胁的临床医生的隐瞒情况。
JAMA Netw Open. 2019 Aug 2;2(8):e199277. doi: 10.1001/jamanetworkopen.2019.9277.
5
Comparing Data Collected on Amazon's Mechanical Turk to National Surveys.将亚马逊土耳其机器人上收集的数据与全国性调查进行比较。
Am J Health Behav. 2022 Oct 17;46(5):497-502. doi: 10.5993/AJHB.46.5.1.
6
Screening Amazon's Mechanical Turk for Adults With ADHD.在亚马逊土耳其机器人上为患有注意力缺陷多动障碍的成年人进行筛查。
J Atten Disord. 2019 Aug;23(10):1178-1187. doi: 10.1177/1087054715597471. Epub 2015 Aug 5.
7
The Promise and Pitfalls of Using Crowdsourcing in Research Prioritization for Back Pain: Cross-Sectional Surveys.众包在背痛研究优先级确定中的前景与陷阱:横断面调查
J Med Internet Res. 2017 Oct 6;19(10):e341. doi: 10.2196/jmir.8821.
8
Self-reported Health Status Differs for Amazon's Mechanical Turk Respondents Compared With Nationally Representative Surveys.与全国代表性调查相比,亚马逊土耳其机器人受访者自我报告的健康状况有所不同。
Med Care. 2018 Mar;56(3):211-215. doi: 10.1097/MLR.0000000000000871.
9
Characteristics of a Mild Traumatic Brain Injury Sample Recruited Using Amazon's Mechanical Turk.使用亚马逊的 Mechanical Turk 招募轻度创伤性脑损伤样本的特征。
PM R. 2018 Jan;10(1):45-55. doi: 10.1016/j.pmrj.2017.06.010. Epub 2017 Jun 19.
10
Is Amazon's Mechanical Turk (MTurk) a comparable recruitment source for trauma studies?亚马逊的 Mechanical Turk(MTurk)是否可以作为创伤研究的一种可比较的招募来源?
Psychol Trauma. 2020 May;12(4):381-388. doi: 10.1037/tra0000502. Epub 2019 Aug 5.

引用本文的文献

1
Introducing Novel Methods to Identify Fraudulent Responses (Sampling With Sisyphus): Web-Based LGBTQ2S+ Mixed-Methods Study.引入识别欺诈性回复的新方法(西西弗斯抽样法):基于网络的LGBTQ2S+混合方法研究
J Med Internet Res. 2025 Mar 17;27:e63252. doi: 10.2196/63252.
2
Comparing Health Survey Data Cost and Quality Between Amazon's Mechanical Turk and Ipsos' KnowledgePanel: Observational Study.比较亚马逊土耳其机器人和益普索知识面板的健康调查数据成本和质量:观察性研究。
J Med Internet Res. 2024 Nov 29;26:e63032. doi: 10.2196/63032.
3
Dropout in a Longitudinal Survey of Amazon Mechanical Turk Workers With Low Back Pain: Observational Study.对患有腰痛的亚马逊土耳其机器人平台工人进行的纵向调查中的失访情况:观察性研究。
Interact J Med Res. 2024 Nov 11;13:e58771. doi: 10.2196/58771.
4
Support for a Single Underlying Dimension of Self-Reported Health in a Sample of Adults with Low Back Pain in the United States.在美国患有腰痛的成年人样本中,自我报告健康状况的单一潜在维度的支持情况。
Appl Res Qual Life. 2024 Oct;19(5):2213-2226. doi: 10.1007/s11482-024-10327-8. Epub 2024 May 14.
5
Agreement of PROMIS Preference (PROPr) scores generated from the PROMIS-29 + 2 and the PROMIS-16.由PROMIS-29 + 2和PROMIS-16生成的PROMIS偏好(PROPr)评分的一致性。
Qual Life Res. 2025 Jan;34(1):43-51. doi: 10.1007/s11136-024-03827-5. Epub 2024 Nov 7.
6
Longitudinal validation of the PROMIS-16 in a sample of adults in the United States with back pain.PROMIS-16在美国背痛成人样本中的纵向验证。
Qual Life Res. 2025 Jan;34(1):35-42. doi: 10.1007/s11136-024-03826-6. Epub 2024 Nov 6.
7
The Understanding America Study (UAS).《理解美国研究》(UAS)。
BMJ Open. 2024 Oct 23;14(10):e088183. doi: 10.1136/bmjopen-2024-088183.
8
Crosswalking 4 Pain Impact Measures in a Nationally Representative Sample of Adults With Back Pain.在全国代表性的背痛成人样本中对4种疼痛影响测量方法进行交叉比对
Arch Phys Med Rehabil. 2025 Mar;106(3):351-357. doi: 10.1016/j.apmr.2024.09.007. Epub 2024 Sep 26.
9
Comparing patient-reported outcomes measurement information system (PROMIS)-16 domain scores with the PROMIS-29 and 5-item PROMIS cognitive function scores.比较患者报告结局测量信息系统(PROMIS)-16领域得分与PROMIS-29及5项PROMIS认知功能得分。
Qual Life Res. 2025 Jan;34(1):27-34. doi: 10.1007/s11136-024-03747-4. Epub 2024 Aug 14.
10
Comparison of the EQ-5D-5L and the patient-reported outcomes measurement information system preference score (PROPr) in the United States.美国 EQ-5D-5L 与患者报告结局测量信息系统偏好评分(PROPr)的比较。
J Patient Rep Outcomes. 2024 Jul 19;8(1):76. doi: 10.1186/s41687-024-00749-1.

本文引用的文献

1
Sampling Sexual and Gender Minority Youth With UnACoRN (Understanding Affirming Communities, Relationships, and Networks): Lessons From a Web-Based Survey.使用 UnACoRN(理解肯定社区、关系和网络)对性少数和性别少数青年进行抽样调查:来自网络调查的经验教训。
J Med Internet Res. 2023 Jan 12;25:e44175. doi: 10.2196/44175.
2
Comparing Data Collected on Amazon's Mechanical Turk to National Surveys.将亚马逊土耳其机器人上收集的数据与全国性调查进行比较。
Am J Health Behav. 2022 Oct 17;46(5):497-502. doi: 10.5993/AJHB.46.5.1.
3
Comparing the Recruitment of Research Participants With Chronic Low Back Pain Using Amazon Mechanical Turk With the Recruitment of Patients From Chiropractic Clinics: A Quasi-Experimental Study.使用亚马逊土耳其机器人招募慢性下背痛研究参与者与从脊骨神经科诊所招募患者进行比较:一项准实验研究。
J Manipulative Physiol Ther. 2021 Oct;44(8):601-611. doi: 10.1016/j.jmpt.2022.02.004. Epub 2022 Jun 18.
4
Careless responding in crowdsourced alcohol research: A systematic review and meta-analysis of practices and prevalence.众包酒精研究中的粗心回应:实践和流行率的系统评价和荟萃分析。
Exp Clin Psychopharmacol. 2022 Aug;30(4):381-399. doi: 10.1037/pha0000546. Epub 2022 Feb 7.
5
Liar! Liar! Identifying eligibility fraud by applicants in digital health research.骗子!骗子!识别数字健康研究中申请人的资格欺诈行为。
Internet Interv. 2021 May 9;25:100401. doi: 10.1016/j.invent.2021.100401. eCollection 2021 Sep.
6
Online panels in social science research: Expanding sampling methods beyond Mechanical Turk.在线panel 调查在社会科学研究中的运用:超越 Mechanical Turk 的抽样方法。
Behav Res Methods. 2019 Oct;51(5):2022-2038. doi: 10.3758/s13428-019-01273-7.
7
PROMIS Adult Health Profiles: Efficient Short-Form Measures of Seven Health Domains.PROMIS 成人健康概况量表:七个健康领域的高效简明测量工具。
Value Health. 2019 May;22(5):537-544. doi: 10.1016/j.jval.2019.02.004.
8
Estimation of a Preference-Based Summary Score for the Patient-Reported Outcomes Measurement Information System: The PROMIS-Preference (PROPr) Scoring System.患者报告结局测量信息系统(PROMIS)偏好评分系统:基于偏好的总结评分估算。
Med Decis Making. 2018 Aug;38(6):683-698. doi: 10.1177/0272989X18776637. Epub 2018 Jun 26.
9
PROMIS-29 v2.0 profile physical and mental health summary scores.PROMIS-29 v2.0 概况身心健康综合评分。
Qual Life Res. 2018 Jul;27(7):1885-1891. doi: 10.1007/s11136-018-1842-3. Epub 2018 Mar 22.
10
Sources of Error in Substance Use Prevalence Surveys.物质使用流行率调查中的误差来源。
Int Sch Res Notices. 2014 Nov 5;2014:923290. doi: 10.1155/2014/923290. eCollection 2014.