• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Enhancing Precision in Predicting Magnitude of Differential Item Functioning: An M-DIF Pretrained Model Approach.提高预测项目功能差异程度的精度:一种M-DIF预训练模型方法。
Educ Psychol Meas. 2024 Oct 1:00131644241279882. doi: 10.1177/00131644241279882.
2
Assessing DIF among small samples with separate calibration t and Mantel-Haenszel χ² statistics in the Rasch model.在Rasch模型中,使用单独校准的t统计量和Mantel-Haenszel卡方统计量评估小样本中的差异性项目功能。
J Appl Meas. 2013;14(4):389-99.
3
The Interaction of Ability Differences and Guessing When Modeling Differential Item Functioning With the Rasch Model: Conventional and Tailored Calibration.使用拉施模型对项目功能差异进行建模时能力差异与猜测的相互作用:传统校准与定制校准
Educ Psychol Meas. 2015 Aug;75(4):610-633. doi: 10.1177/0013164414554082. Epub 2014 Oct 20.
4
Improvement in Detection of Differential Item Functioning Using a Mixture Item Response Theory Model.使用混合项目反应理论模型改进差异项目功能检测
Multivariate Behav Res. 2010 Nov 30;45(6):975-99. doi: 10.1080/00273171.2010.533047.
5
Evaluating measurement equivalence using the item response theory log-likelihood ratio (IRTLR) method to assess differential item functioning (DIF): applications (with illustrations) to measures of physical functioning ability and general distress.使用项目反应理论对数似然比(IRTLR)方法评估测量等价性,以评估项目功能差异(DIF):身体功能能力和一般痛苦测量的应用(附说明)
Qual Life Res. 2007;16 Suppl 1:43-68. doi: 10.1007/s11136-007-9186-4. Epub 2007 May 5.
6
Measurement Equivalence of the Patient Reported Outcomes Measurement Information System (PROMIS) Anxiety Short Forms in Ethnically Diverse Groups.患者报告结局测量信息系统(PROMIS)焦虑简表在不同种族群体中的测量等效性
Psychol Test Assess Model. 2016;58(1):183-219.
7
Modern psychometric methods for detection of differential item functioning: application to cognitive assessment measures.用于检测项目功能差异的现代心理测量方法:在认知评估测量中的应用。
Stat Med. 2000;19(11-12):1651-83. doi: 10.1002/(sici)1097-0258(20000615/30)19:11/12<1651::aid-sim453>3.0.co;2-h.
8
Measurement Equivalence of the Patient Reported Outcomes Measurement Information System (PROMIS) Pain Interference Short Form Items: Application to Ethnically Diverse Cancer and Palliative Care Populations.患者报告结局测量信息系统(PROMIS)疼痛干扰简表条目的测量等效性:在不同种族癌症和姑息治疗人群中的应用。
Psychol Test Assess Model. 2016;58(2):309-352.
9
Improving the assessment of measurement invariance: Using regularization to select anchor items and identify differential item functioning.改进测量不变性评估:使用正则化选择锚定项目并识别差异项目功能。
Psychol Methods. 2020 Dec;25(6):673-690. doi: 10.1037/met0000253. Epub 2020 Jan 9.
10
Psychometric Properties and Performance of the Patient Reported Outcomes Measurement Information System (PROMIS) Depression Short Forms in Ethnically Diverse Groups.患者报告结局测量信息系统(PROMIS)抑郁简表在不同种族群体中的心理测量特性及表现
Psychol Test Assess Model. 2016;58(1):141-181.

本文引用的文献

1
Enhancing measurement validity in diverse populations: Modern approaches to evaluating differential item functioning.提高不同人群中的测量有效性:评估项目间差异的现代方法。
Br J Math Stat Psychol. 2023 Nov;76(3):435-461. doi: 10.1111/bmsp.12316. Epub 2023 Jul 10.
2
Combining Item Purification and Multiple Comparison Adjustment Methods in Detection of Differential Item Functioning.结合项目纯化和多重比较调整方法检测差异项目功能。
Multivariate Behav Res. 2024 Jan-Feb;59(1):46-61. doi: 10.1080/00273171.2023.2205393. Epub 2023 May 23.
3
A New Stopping Criterion for Rasch Trees Based on the Mantel-Haenszel Effect Size Measure for Differential Item Functioning.一种基于用于项目功能差异的曼特尔-亨塞尔效应量度量的拉施树新停止准则。
Educ Psychol Meas. 2023 Feb;83(1):181-212. doi: 10.1177/00131644221077135. Epub 2022 Feb 28.
4
Treatments of Differential Item Functioning: A Comparison of Four Methods.差异项目功能的处理方法:四种方法的比较
Educ Psychol Meas. 2022 Apr;82(2):225-253. doi: 10.1177/00131644211012050. Epub 2021 May 10.
5
Detecting Differential Item Functioning Using Multiple-Group Cognitive Diagnosis Models.使用多组认知诊断模型检测项目功能差异
Appl Psychol Meas. 2021 Jan;45(1):37-53. doi: 10.1177/0146621620965745. Epub 2020 Oct 21.
6
One model to rule them all? Using machine learning algorithms to determine the number of factors in exploratory factor analysis.一统天下的模式?使用机器学习算法确定探索性因素分析中的因素数量。
Psychol Methods. 2020 Dec;25(6):776-786. doi: 10.1037/met0000262. Epub 2020 Mar 5.
7
Improving the assessment of measurement invariance: Using regularization to select anchor items and identify differential item functioning.改进测量不变性评估:使用正则化选择锚定项目并识别差异项目功能。
Psychol Methods. 2020 Dec;25(6):673-690. doi: 10.1037/met0000253. Epub 2020 Jan 9.
8
A Comparison of Differential Item Functioning Detection Methods in Cognitive Diagnostic Models.认知诊断模型中差异项目功能检测方法的比较
Front Psychol. 2019 May 17;10:1137. doi: 10.3389/fpsyg.2019.01137. eCollection 2019.
9
Scientists rise up against statistical significance.科学家们奋起反对统计显著性。
Nature. 2019 Mar;567(7748):305-307. doi: 10.1038/d41586-019-00857-9.
10
Using Odds Ratios to Detect Differential Item Functioning.使用优势比来检测项目功能差异
Appl Psychol Meas. 2018 Nov;42(8):613-629. doi: 10.1177/0146621618762738. Epub 2018 Mar 21.

提高预测项目功能差异程度的精度:一种M-DIF预训练模型方法。

Enhancing Precision in Predicting Magnitude of Differential Item Functioning: An M-DIF Pretrained Model Approach.

作者信息

Huang Shan, Ishii Hidetoki

机构信息

Nagoya University, Japan.

出版信息

Educ Psychol Meas. 2024 Oct 1:00131644241279882. doi: 10.1177/00131644241279882.

DOI:10.1177/00131644241279882
PMID:39554774
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11562883/
Abstract

Despite numerous studies on the magnitude of differential item functioning (DIF), different DIF detection methods often define effect sizes inconsistently and fail to adequately account for testing conditions. To address these limitations, this study introduces the unified M-DIF model, which defines the magnitude of DIF as the difference in item difficulty parameters between reference and focal groups. The M-DIF model can incorporate various DIF detection methods and test conditions to form a quantitative model. The pretrained approach was employed to leverage a sufficiently representative large sample as the training set and ensure the model's generalizability. Once the pretrained model is constructed, it can be directly applied to new data. Specifically, a training dataset comprising 144 combinations of test conditions and 144,000 potential DIF items, each equipped with 29 statistical metrics, was used. We adopt the XGBoost method for modeling. Results show that, based on root mean square error (RMSE) and BIAS metrics, the M-DIF model outperforms the baseline model in both validation sets: under consistent and inconsistent test conditions. Across all 360 combinations of test conditions (144 consistent and 216 inconsistent with the training set), the M-DIF model demonstrates lower RMSE in 357 cases (99.2%), illustrating its robustness. Finally, we provided an empirical example to showcase the practical feasibility of implementing the M-DIF model.

摘要

尽管针对差异项目功能(DIF)的大小进行了大量研究,但不同的DIF检测方法往往对效应大小的定义不一致,并且未能充分考虑测试条件。为了解决这些局限性,本研究引入了统一的M-DIF模型,该模型将DIF的大小定义为参考组和焦点组之间项目难度参数的差异。M-DIF模型可以纳入各种DIF检测方法和测试条件,以形成一个定量模型。采用预训练方法利用一个具有充分代表性的大样本作为训练集,并确保模型的通用性。一旦构建了预训练模型,就可以直接将其应用于新数据。具体而言,使用了一个包含144种测试条件组合和144,000个潜在DIF项目的训练数据集,每个项目配备29个统计指标。我们采用XGBoost方法进行建模。结果表明,基于均方根误差(RMSE)和偏差指标,M-DIF模型在两个验证集中均优于基线模型:在一致和不一致的测试条件下。在所有360种测试条件组合(144种与训练集一致,216种与训练集不一致)中,M-DIF模型在357例(99.2%)中表现出较低的RMSE,说明了其稳健性。最后,我们提供了一个实证例子来展示实施M-DIF模型的实际可行性。