• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Comparing Methods for Assessing Reliability.评估可靠性的方法比较
J Surv Stat Methodol. 2020 Sep 8;9(4):651-673. doi: 10.1093/jssam/smaa018. eCollection 2021 Sep.
2
Population Assessment of Tobacco and Health (PATH) reliability and validity study: selected reliability and validity estimates.人口评估烟草与健康 (PATH) 可靠性和有效性研究:部分可靠性和有效性评估。
Tob Control. 2019 Nov;28(6):663-668. doi: 10.1136/tobaccocontrol-2018-054561. Epub 2018 Oct 8.
3
Who Can You Count On? Understanding The Determinants of Reliability.你能依靠谁?理解可靠性的决定因素。
J Surv Stat Methodol. 2020 Nov;8(5):903-931. doi: 10.1093/jssam/smz034. Epub 2019 Oct 3.
4
Survey Reliability: Models, Methods, and Findings.调查的可靠性:模型、方法与发现
J Surv Stat Methodol. 2020 Oct 21;9(5):961-991. doi: 10.1093/jssam/smaa021. eCollection 2021 Nov.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
Validity and reliability in reporting sexual partners and condom use in a Swiss population survey.瑞士一项人口调查中报告性伴侣及避孕套使用情况的有效性和可靠性
Eur J Epidemiol. 1998 Feb;14(2):139-46. doi: 10.1023/a:1007435824281.
7
Health-related quality of life in early breast cancer.早期乳腺癌患者的健康相关生活质量
Dan Med Bull. 2010 Sep;57(9):B4184.
8
[The estimation of premorbid intelligence levels in French speakers].[法语使用者病前智力水平的评估]
Encephale. 2005 Jan-Feb;31(1 Pt 1):31-43. doi: 10.1016/s0013-7006(05)82370-x.
9
Applications of computerized adaptive testing (CAT) to the assessment of headache impact.计算机自适应测试(CAT)在头痛影响评估中的应用。
Qual Life Res. 2003 Dec;12(8):935-52. doi: 10.1023/a:1026115230284.
10

引用本文的文献

1
No Effect of Cigarette Smoking in the Outcome of Arthroscopic Management for Femoroacetabular Impingement: A Systematic Review.吸烟对髋关节撞击综合征关节镜治疗结果无影响:一项系统评价
J Clin Med. 2024 Nov 27;13(23):7214. doi: 10.3390/jcm13237214.

本文引用的文献

1
Who Can You Count On? Understanding The Determinants of Reliability.你能依靠谁?理解可靠性的决定因素。
J Surv Stat Methodol. 2020 Nov;8(5):903-931. doi: 10.1093/jssam/smz034. Epub 2019 Oct 3.
2
Population Assessment of Tobacco and Health (PATH) reliability and validity study: selected reliability and validity estimates.人口评估烟草与健康 (PATH) 可靠性和有效性研究:部分可靠性和有效性评估。
Tob Control. 2019 Nov;28(6):663-668. doi: 10.1136/tobaccocontrol-2018-054561. Epub 2018 Oct 8.
3
When do latent class models overstate accuracy for diagnostic and other classifiers in the absence of a gold standard?在没有金标准的情况下,潜在类别模型何时会高估诊断及其他分类器的准确性?
Biometrics. 2012 Jun;68(2):559-66. doi: 10.1111/j.1541-0420.2011.01694.x. Epub 2011 Oct 21.
4
Weighted kappa: nominal scale agreement with provision for scaled disagreement or partial credit.加权kappa系数:用于衡量名义尺度上的一致性,并考虑了尺度不一致或部分得分的情况。
Psychol Bull. 1968 Oct;70(4):213-20. doi: 10.1037/h0026256.
5
Context effects in national health surveys: effects of preceding questions on reporting serious difficulty seeing and legal blindness.国民健康调查中的背景效应:先前问题对报告严重视力困难和法定失明的影响。
Public Opin Q. 2000 Spring;64(1):65-76.
6
Testing the self-perception explanation of dissonance phenomena: on the salience of premanipulation attitudes.检验失调现象的自我认知解释:关于操纵前态度的显著性
J Pers Soc Psychol. 1970 Jan;14(1):23-31. doi: 10.1037/h0020916.
7
A proposed solution to the base rate problem in the kappa statistic.kappa统计量中基础概率问题的一个提议解决方案。
Arch Gen Psychiatry. 1985 Jul;42(7):725-8. doi: 10.1001/archpsyc.1985.01790300093012.
8
Implicit and explicit memory for new associations in normal and amnesic subjects.正常人和失忆症患者对新关联的内隐记忆和外显记忆。
J Exp Psychol Learn Mem Cogn. 1985 Jul;11(3):501-18. doi: 10.1037//0278-7393.11.3.501.
9
High agreement but low kappa: II. Resolving the paradoxes.高一致性但低卡帕值:II. 解决悖论
J Clin Epidemiol. 1990;43(6):551-8. doi: 10.1016/0895-4356(90)90159-m.

评估可靠性的方法比较

Comparing Methods for Assessing Reliability.

作者信息

Tourangeau Roger, Sun Hanyu, Yan Ting

机构信息

Westat Inc., Methodology, Westat, 1600 Research Boulevard, Rockville, MD 20850, USA.

Westat Inc., Statistical Department, Westat, 1600 Research Boulevard, Rockville, MD 20850, USA.

出版信息

J Surv Stat Methodol. 2020 Sep 8;9(4):651-673. doi: 10.1093/jssam/smaa018. eCollection 2021 Sep.

DOI:10.1093/jssam/smaa018
PMID:34671685
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8519302/
Abstract

The usual method for assessing the reliability of survey data has been to conduct reinterviews a short interval (such as one to two weeks) after an initial interview and to use these data to estimate relatively simple statistics, such as gross difference rates (GDRs). More sophisticated approaches have also been used to estimate reliability. These include estimates from multi-trait, multi-method experiments, models applied to longitudinal data, and latent class analyses. To our knowledge, no prior study has systematically compared these different methods for assessing reliability. The Population Assessment of Tobacco and Health Reliability and Validity (PATH-RV) Study, done on a national probability sample, assessed the reliability of answers to the Wave 4 questionnaire from the PATH Study. Respondents in the PATH-RV were interviewed twice about two weeks apart. We examined whether the classic survey approach yielded different conclusions from the more sophisticated methods. We also examined two methods for assessing problems with survey questions and item nonresponse rates and response times to see how strongly these related to the different reliability estimates. We found that kappa was highly correlated with both GDRs and over-time correlations, but the latter two statistics were less highly correlated, particularly for adult respondents; estimates from longitudinal analyses of the same items in the main PATH study were also highly correlated with the traditional reliability estimates. The latent class analysis results, based on fewer items, also showed a high level of agreement with the traditional measures. The other methods and indicators had at best weak relationships with the reliability estimates derived from the reinterview data. Although the Question Understanding Aid seems to tap a different factor from the other measures, for adult respondents, it did predict item nonresponse and response latencies and thus may be a useful adjunct to the traditional measures.

摘要

评估调查数据可靠性的常用方法是在首次访谈后短时间间隔(如1至2周)内进行重新访谈,并使用这些数据来估计相对简单的统计数据,如总差异率(GDRs)。也采用了更复杂的方法来估计可靠性。这些方法包括多特质、多方法实验的估计、应用于纵向数据的模型以及潜在类别分析。据我们所知,此前没有研究系统地比较过这些评估可靠性的不同方法。烟草与健康人口评估可靠性与有效性(PATH-RV)研究基于全国概率样本,评估了PATH研究第4轮问卷答案的可靠性。PATH-RV的受访者接受了两次间隔约两周的访谈。我们研究了经典调查方法是否会得出与更复杂方法不同的结论。我们还研究了两种评估调查问题、项目无回答率和回答时间问题的方法,以了解它们与不同可靠性估计的关联程度。我们发现,kappa与GDRs和随时间的相关性都高度相关,但后两个统计数据的相关性较低,尤其是对于成年受访者;对PATH主要研究中相同项目的纵向分析估计也与传统可靠性估计高度相关。基于较少项目的潜在类别分析结果也与传统测量方法高度一致。其他方法和指标与重新访谈数据得出的可靠性估计最多只有微弱的关系。尽管问题理解辅助工具似乎涉及到与其他测量方法不同的因素,但对于成年受访者,它确实可以预测项目无回答和回答延迟,因此可能是传统测量方法的有用补充。