• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种潜在尺度模型,用于在国家草坪草评估计划的视觉评级数据分析中尽量减少主观性。

A latent scale model to minimize subjectivity in the analysis of visual rating data for the National Turfgrass Evaluation Program.

作者信息

Qu Yuanshuo, Kne Len, Graham Steve, Watkins Eric, Morris Kevin

机构信息

National Turfgrass Evaluation Program, Beltsville, MD, United States.

U-Spatial, University of Minnesota, Minneapolis, MN, United States.

出版信息

Front Plant Sci. 2023 Jul 6;14:1135918. doi: 10.3389/fpls.2023.1135918. eCollection 2023.

DOI:10.3389/fpls.2023.1135918
PMID:37528968
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10390229/
Abstract

INTRODUCTION

Traditional evaluation procedure in National Turfgrass Evaluation Program (NTEP) relies on visually assessing replicated turf plots at multiple testing locations. This process yields ordinal data; however, statistical models that falsely assume these to be interval or ratio data have almost exclusively been applied in the subsequent analysis. This practice raises concerns about procedural subjectivity, preventing objective comparisons of cultivars across different test locations. It may also lead to serious errors, such as increased false alarms, failures to detect effects, and even inversions of differences among groups.

METHODS

We reviewed this problem, identified sources of subjectivity, and presented a model-based approach to minimize subjectivity, allowing objective comparisons of cultivars across different locations and better monitoring of the evaluation procedure. We demonstrate how to fit the described model in a Bayesian framework with Stan, using datasets on overall turf quality ratings from the 2017 NTEP Kentucky bluegrass trials at seven testing locations.

RESULTS

Compared with the existing method, ours allows the estimation of additional parameters, i.e., category thresholds, rating severity, and within-field spatial variations, and provides better separation of cultivar means and more realistic standard deviations.

DISCUSSION

To implement the proposed model, additional information on rater identification, trial layout, rating date is needed. Given the model assumptions, we recommend small trials to reduce rater fatigue. For large trials, ratings can be conducted for each replication on multiple occasions instead of all at once. To minimize subjectivity, multiple raters are required. We also proposed new ideas on temporal analysis, incorporating existing knowledge of turfgrass.

摘要

引言

国家草坪草评价计划(NTEP)中的传统评价程序依赖于在多个测试地点对重复的草坪地块进行目视评估。这一过程产生的是有序数据;然而,在后续分析中几乎完全应用了错误地将这些数据假定为区间或比率数据的统计模型。这种做法引发了对程序主观性的担忧,阻碍了不同测试地点品种之间的客观比较。它还可能导致严重错误,如误报增加、未能检测到效应,甚至组间差异的颠倒。

方法

我们审视了这个问题,确定了主观性的来源,并提出了一种基于模型的方法来尽量减少主观性,从而实现不同地点品种之间的客观比较,并更好地监测评价程序。我们展示了如何使用来自2017年NTEP肯塔基蓝草在七个测试地点试验的总体草坪质量评级数据集,在贝叶斯框架下用Stan拟合所描述的模型。

结果

与现有方法相比,我们的方法允许估计额外的参数,即类别阈值、评级严重性和田间空间变异,并能更好地分离品种均值和给出更现实的标准差。

讨论

要实施所提出的模型,需要关于评分者识别、试验布局、评级日期的额外信息。鉴于模型假设,我们建议进行小规模试验以减少评分者疲劳。对于大型试验,可以对每个重复进行多次评级,而不是一次性完成所有评级。为了尽量减少主观性,需要多个评分者。我们还提出了关于时间分析的新想法,纳入了草坪草的现有知识。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/ec6e9105df35/fpls-14-1135918-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/c28fd09eebd6/fpls-14-1135918-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/b6269be285b9/fpls-14-1135918-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/b1e6369af020/fpls-14-1135918-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/3602382a031e/fpls-14-1135918-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/e337b021e43b/fpls-14-1135918-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/3939854074ea/fpls-14-1135918-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/fb92b432f31d/fpls-14-1135918-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/ec6e9105df35/fpls-14-1135918-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/c28fd09eebd6/fpls-14-1135918-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/b6269be285b9/fpls-14-1135918-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/b1e6369af020/fpls-14-1135918-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/3602382a031e/fpls-14-1135918-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/e337b021e43b/fpls-14-1135918-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/3939854074ea/fpls-14-1135918-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/fb92b432f31d/fpls-14-1135918-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1264/10390229/ec6e9105df35/fpls-14-1135918-g008.jpg

相似文献

1
A latent scale model to minimize subjectivity in the analysis of visual rating data for the National Turfgrass Evaluation Program.一种潜在尺度模型,用于在国家草坪草评估计划的视觉评级数据分析中尽量减少主观性。
Front Plant Sci. 2023 Jul 6;14:1135918. doi: 10.3389/fpls.2023.1135918. eCollection 2023.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Beneficial Effects of Mixing Kentucky Bluegrass With Red Fescue Plant-Soil Interactions in Black Soil of Northeast China.肯塔基蓝草与紫羊茅混播在中国东北黑土中的有益效应及植物-土壤相互作用
Front Microbiol. 2020 Oct 28;11:556118. doi: 10.3389/fmicb.2020.556118. eCollection 2020.
4
Vehicle-Mounted Optical Sensing: An Objective Means for Evaluating Turf Quality.车载光学传感:一种评估草坪质量的客观方法。
Crop Sci. 2002 Jan;42(1):197-201. doi: 10.2135/cropsci2002.1970.
5
[Application of near-infrared spectroscopy in golf turfgrass management].近红外光谱在高尔夫草坪管理中的应用
Guang Pu Xue Yu Guang Pu Fen Xi. 2008 Jul;28(7):1539-43.
6
7
Measuring rater bias in diagnostic tests with ordinal ratings.用等级评定测量诊断测试中的评分者偏倚。
Stat Med. 2021 Jul 30;40(17):4014-4033. doi: 10.1002/sim.9011. Epub 2021 May 9.
8
9
Assessment and statistical modeling of the relationship between remotely sensed aerosol optical depth and PM2.5 in the eastern United States.美国东部地区遥感气溶胶光学厚度与PM2.5之间关系的评估及统计建模
Res Rep Health Eff Inst. 2012 May(167):5-83; discussion 85-91.
10
Molecular Analysis of Turfgrass Rusts Reveals the Widespread Distribution of Puccinia coronata as a Pathogen of Kentucky Bluegrass in the United States.
Plant Dis. 2011 Dec;95(12):1547-1557. doi: 10.1094/PDIS-01-11-0073.

本文引用的文献

1
Rating scales and Rasch measurement.量表和 Rasch 测量。
Expert Rev Pharmacoecon Outcomes Res. 2011 Oct;11(5):571-85. doi: 10.1586/erp.11.59.
2
Derivation and evaluation of a labeled hedonic scale.标签偏好量表的推导和评估。
Chem Senses. 2009 Nov;34(9):739-51. doi: 10.1093/chemse/bjp054.
3
Ratio scales and category scales for a dozen perceptual continua.用于十二个感知连续统的比率量表和类别量表。
J Exp Psychol. 1957 Dec;54(6):377-411. doi: 10.1037/h0043680.
4
Conditional pairwise estimation in the Rasch model for ordered response categories using principal components.使用主成分对有序反应类别Rasch模型中的条件成对估计。
J Appl Meas. 2003;4(3):205-21.
5
Systematic review of studies of quality of clinical care in general practice in the UK, Australia and New Zealand.对英国、澳大利亚和新西兰全科医疗临床护理质量研究的系统评价。
Qual Health Care. 2001 Sep;10(3):152-8. doi: 10.1136/qhc.0100152...
6
The sweetness and pleasantness of sugars.糖的甜味和宜人特性。
Am J Psychol. 1971 Sep;84(3):387-405.
7
The category effect with rating scales: number of categories, number of stimuli, and method of presentation.评定量表的类别效应:类别数量、刺激数量及呈现方法。
J Exp Psychol Hum Percept Perform. 1986 Nov;12(4):496-516. doi: 10.1037//0096-1523.12.4.496.