Cheung Yin-Bun, Gladstone Melissa, Maleta Ken, Duan Xiaolian, Ashorn Per
Clinical Trials and Epidemiology Research Unit, Block A #03-02, 226 Outram Road, Singapore.
Trop Med Int Health. 2008 Aug;13(8):987-93. doi: 10.1111/j.1365-3156.2008.02104.x. Epub 2008 Jun 28.
Assessment of child development often yields a large number of binary (pass/fail) item outcomes. There is no agreed way to combine them into a score of a child's developmental status. Conventional methods include age-standardized Z-scores and a simple sum of the number of items passed. Recently, two approaches based on the Rasch model and the concept of 'developmental age' have been proposed. This study compares the performance of the four approaches.
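For readers unfamiliar with the Rasch model, its standard dichotomous form is given below. The symbols are notation chosen here, not taken from the study: \theta_i is the ability of child i, b_j is the difficulty of item j, and X_{ij} = 1 means child i passes item j. How the two Rasch-based approaches build on this form is not specified in this abstract.

```latex
P(X_{ij} = 1 \mid \theta_i, b_j) = \frac{\exp(\theta_i - b_j)}{1 + \exp(\theta_i - b_j)}
```

Under this form, a child whose ability equals an item's difficulty (\theta_i = b_j) passes that item with probability 0.5.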
In a longitudinal study, 473 Malawian children were measured for growth status at age 36 months and administered a new test of developmental milestones between age 3 and 6 years. The test consisted of four domains: gross motor (GM), fine motor (FM), social and language development. The four approaches were used to score the developmental level of each child in each domain, and the results compared.
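The two conventional scores can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the age-band width, and the data are hypothetical, and the standardization here uses the sample's own age bands as the reference, which is an assumption made for illustration only.

```python
from collections import defaultdict
from statistics import mean, stdev


def sum_scores(responses):
    """Simple sum score: number of items passed (1 = pass, 0 = fail) per child."""
    return [sum(items) for items in responses]


def age_standardized_z(scores, ages_months, band_months=12):
    """Z-score of each child's score relative to children in the same age band.

    The band width is a hypothetical choice; bands with fewer than two
    children or with zero spread yield NaN.
    """
    bands = defaultdict(list)
    for score, age in zip(scores, ages_months):
        bands[age // band_months].append(score)

    z = []
    for score, age in zip(scores, ages_months):
        peers = bands[age // band_months]
        if len(peers) < 2 or stdev(peers) == 0:
            z.append(float("nan"))
        else:
            z.append((score - mean(peers)) / stdev(peers))
    return z


# Hypothetical pass/fail data for six children on four items in one domain.
responses = [
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
]
ages = [37, 38, 40, 60, 62, 64]  # age at testing, in months

scores = sum_scores(responses)
z = age_standardized_z(scores, ages, band_months=12)
```

In this toy data the raw sum scores rise with age, while the Z-scores compare each child only with peers in the same age band, which is the point of age standardization.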
In this sample, the Rasch model-based approach produced development scores that were closer to a normal distribution than those from the other approaches. The four sets of scores were highly correlated with each other. They gave similar estimates of the effect of height-for-age on GM, social and language development. In FM development, the maximum difference between the effect size estimates was statistically significant (P = 0.009) but amounted to only 0.04 standard deviations.
The four approaches were practically equivalent in the context of the estimation of an intervention effect or association. Their relative advantages and disadvantages are discussed. None of them can be universally recommended.