Department of Psychology, Stanford University, 450 Serra Mall, Stanford, CA, 94305, USA.
School of Communication Sciences and Disorders, McGill University, 2001 Avenue McGill College Suite 800, Montreal, QC, H3A 1G1, Canada.
J Neurodev Disord. 2020 Jul 24;12(1):20. doi: 10.1186/s11689-020-09321-6.
Matching is one commonly utilized method in quasi-experimental designs involving individuals with neurodevelopmental disorders (NDD). This method ensures two or more groups (e.g., individuals with an NDD versus neurotypical individuals) are balanced on pre-existing covariates (e.g., IQ), enabling researchers to interpret performance on outcome measures as being attributed to group membership. While much attention has been paid to the statistical criteria of how to assess whether groups are well-matched, relatively little attention has been given to a crucial prior step: the selection of the individuals that are included in matched groups. The selection of individuals is often an undocumented process, which can invite unintentional, arbitrary, and biased decision-making. Limited documentation can result in findings that have limited reproducibility and replicability and thereby have poor potential for generalization to the broader population. Especially given the heterogeneity of individuals with NDDs, interpretation of research findings depends on minimizing bias at all stages of data collection and analysis.
In the spirit of open science, this tutorial demonstrates how a workflow can be used to provide a transparent, reproducible, and replicable process to select individuals for matched groups. Our workflow includes the following key steps: Assess data, Select covariates, Conduct matching, and Diagnose matching. Our sample dataset is from children with autism spectrum disorder (ASD; n = 25) and typically developing children (n = 43) but can be adapted to comparisons of any two groups in quasi-experimental designs. We work through this method to conduct and document matching using propensity scores implemented with the R package MatchIt. Data and code are publicly available, and a template for this workflow is provided in the Additional file 1 as well as on a public repository.
It is important to provide clear documentation regarding the selection process to establish matched groups. This documentation ensures better transparency in participant selection and data analysis in NDD research. We hope the adoption of such a workflow will ultimately advance our ability to replicate findings and help improve the lives of individuals with NDDs.
匹配是涉及神经发育障碍(NDD)个体的准实验设计中常用的方法之一。这种方法确保两个或更多组(例如,NDD 个体与神经典型个体)在预先存在的协变量(例如,智商)上平衡,使研究人员能够将结果测量上的表现解释为归因于组别的成员身份。尽管人们非常关注如何评估组是否匹配良好的统计标准,但相对较少关注一个关键的先前步骤:选择纳入匹配组的个体。个体的选择通常是一个未记录的过程,这可能会导致无意识、任意和有偏见的决策。文档记录有限会导致可重复性和可复制性有限的发现,从而对更广泛的人群推广的潜力较差。特别是考虑到 NDD 个体的异质性,研究结果的解释取决于在数据收集和分析的所有阶段最小化偏差。
本着开放科学的精神,本教程演示了如何使用工作流程为匹配组选择个体提供透明、可重现和可复制的过程。我们的工作流程包括以下关键步骤:评估数据、选择协变量、进行匹配和诊断匹配。我们的样本数据集来自自闭症谱系障碍(ASD;n=25)和典型发育儿童(n=43),但可以适应准实验设计中任何两个组之间的比较。我们使用 R 包 MatchIt 实现的倾向得分来执行和记录匹配,完成了这个方法。数据和代码是公开的,这个工作流程的模板在附加文件 1 中以及公共存储库中都有提供。
提供关于选择过程的明确文档以建立匹配组非常重要。这种文档记录确保了在 NDD 研究中参与者选择和数据分析的更好透明度。我们希望采用这样的工作流程最终能够提高我们复制研究结果的能力,并帮助改善 NDD 个体的生活。