Smidt Nynke, Rutjes Anne W S, van der Windt Daniëlle A W M, Ostelo Raymond W J G, Bossuyt Patrick M, Reitsma Johannes B, Bouter Lex M, de Vet Henrica C W
Institute for Research in Extramural Medicine, VU University Medical Center, Van der Boechorststraat 7, 1081 BT Amsterdam, The Netherlands.
BMC Med Res Methodol. 2006 Mar 15;6:12. doi: 10.1186/1471-2288-6-12.
In January 2003, the STAndards for the Reporting of Diagnostic accuracy studies (STARD) statement was published in a number of journals to improve the quality of reporting of diagnostic accuracy studies. We designed a study to investigate the inter-assessment reproducibility, as well as the intra- and inter-observer reproducibility, of the items in the STARD statement.
Thirty-two diagnostic accuracy studies published in 2000 in medical journals with an impact factor of at least 4 were included. Two reviewers independently evaluated the quality of reporting of these studies using the 25 items of the STARD statement. A consensus evaluation was obtained by discussing and resolving disagreements between the reviewers. Almost two years later, the same studies were evaluated by the same reviewers. For each item, the percentage of agreement and Cohen's kappa between the first and second consensus assessments (inter-assessment) were calculated. Intraclass correlation coefficients (ICCs) were calculated to evaluate the overall reliability of the assessments.
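The abstract does not reproduce the authors' analysis code. As a rough illustration of the two per-item agreement measures named above, the Python sketch below computes the percentage of agreement and Cohen's kappa for a pair of hypothetical item scores; the functions, variable names and data are invented for illustration and are not part of the published study.

```python
# Minimal sketch (not the authors' code) of per-item agreement between two
# consensus assessments. Item scores are assumed to be coded "yes"/"no" for
# whether a STARD item was reported; the data below are invented.

def percent_agreement(a, b):
    """Proportion of studies on which the two assessments give the same score."""
    return sum(x == y for x, y in zip(a, b)) / len(a)

def cohens_kappa(a, b):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    categories = set(a) | set(b)
    n = len(a)
    p_observed = percent_agreement(a, b)
    # Agreement expected if the two assessments were statistically independent.
    p_expected = sum((a.count(c) / n) * (b.count(c) / n) for c in categories)
    return (p_observed - p_expected) / (1 - p_expected)

# Hypothetical scores for one STARD item across eight studies.
first_assessment  = ["yes", "yes", "no", "yes", "no", "no", "yes", "yes"]
second_assessment = ["yes", "no",  "no", "yes", "no", "yes", "yes", "yes"]

print(f"agreement = {percent_agreement(first_assessment, second_assessment):.2f}")
print(f"kappa     = {cohens_kappa(first_assessment, second_assessment):.2f}")
```

Because kappa corrects the observed agreement for the agreement expected by chance, an item can show a high percentage of agreement yet a modest kappa, which is the pattern reported for several items below.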
The overall inter-assessment agreement for all items of the STARD statement was 85% (Cohen's kappa 0.70) and varied from 63% to 100% for individual items. The largest differences between the two assessments were found for the reporting of the rationale for the reference standard (kappa 0.37), the number of included participants that underwent tests (kappa 0.28), the distribution of the severity of disease (kappa 0.23), a cross-tabulation of the results of the index test by the results of the reference standard (kappa 0.33), and how indeterminate results, missing data and outliers were handled (kappa 0.25). Large differences for these items were also observed within and between reviewers. The inter-assessment reliability of the STARD checklist was satisfactory (ICC = 0.79 [95% CI: 0.62 to 0.89]).
Although the overall reproducibility of assessments of the quality of reporting of diagnostic accuracy studies using the STARD statement was found to be good, substantial disagreements were found for specific items. These disagreements were caused not so much by differences in the reviewers' interpretation of the items as by difficulties in assessing the reporting of these items due to a lack of clarity within the articles. Including a flow diagram in all reports of diagnostic accuracy studies would be very helpful in reducing confusion among readers and reviewers.