Robledo Kristy P, Rieger Ingrid, Finlayson Sarah, Tarnow-Mordi William, Martin Andrew J
NHMRC Clinical Trials Centre, The University of Sydney, Camperdown, New South Wales, Australia
RPA Women and Babies, Camperdown, New South Wales, Australia.
Arch Dis Child Fetal Neonatal Ed. 2025 Jun 19;110(4):409-414. doi: 10.1136/archdischild-2024-327762.
Large-scale mortality trials require reliable secondary assessments of impairment. We compared the Ages and Stages Questionnaire (ASQ-3), a screening tool self-administered by parents, in classifying impairment using the 'gold standard' Bayley Scales of Infant Development (Bayley-III), a diagnostic tool administered by trained assessors.
Analysis of 405 children around 2 years corrected age from the Australian Placental Transfusion Study, a trial conducted over 8 years.
Secondary analysis of international, open-label, multicentre randomised trial.
Children born <30 weeks gestation.
Immediate (<10 s) versus delayed (60 s+) cord clamping.
ASQ-3 and Bayley-III assessments around 2 years corrected age. Impairment (or developmental delay) was defined as <2 SD below the mean (<70) for Bayley-III domains.
The area under the receiver operating curve for ASQ-3 domains predicting delay was 0.75-0.99. Sensitivity for predicting delay was 57%-100%, while specificity was 88%-90%.We modelled the cost and sample size using a less expensive, though less precise, screening assessment for impairment compared with a more costly diagnostic assessment. For detecting a 25% reduction in the relative risk of delay, using ASQ-3 rather than Bayley-III could require double the sample size (15 000 to 30 000), but outcome assessment cost savings would be US$13M (EUR$12M). However, assessment cost savings may be outweighed by upscaling.
When measuring developmental outcomes in a large-scale clinical trial, using a more precise diagnostic tool may be financially prohibitive, so increasing the sample size and using a less precise but appropriately calibrated tool may be more affordable.
ACTRN12610000633088.
大规模死亡率试验需要对损伤进行可靠的二次评估。我们比较了由父母自行填写的筛查工具《年龄与发育阶段问卷》(ASQ - 3)与由经过培训的评估人员使用的诊断工具《贝利婴幼儿发展量表》(贝利 - III)在对损伤进行分类时的情况,后者被视为“金标准”。
对澳大利亚胎盘输血研究中405名矫正年龄约2岁的儿童进行分析,该试验历时8年。
对一项国际、开放标签、多中心随机试验的二次分析。
孕周小于30周出生的儿童。
即刻(<10秒)与延迟(60秒以上)脐带结扎。
矫正年龄约2岁时的ASQ - 3和贝利 - III评估。损伤(或发育迟缓)定义为贝利 - III各领域得分低于均值(<70)2个标准差以下。
ASQ - 3各领域预测发育迟缓的受试者工作特征曲线下面积为0.75 - 0.99。预测发育迟缓的敏感性为57% - 100%,特异性为88% - 90%。我们对成本和样本量进行了建模,与成本更高的诊断性评估相比,使用一种成本较低但精度稍差的损伤筛查评估方法。为检测发育迟缓相对风险降低25%,使用ASQ - 3而非贝利 - III可能需要将样本量翻倍(从15000增至30000),但结局评估成本可节省1300万美元(1200万欧元)。然而,评估成本的节省可能会因扩大规模而被抵消。
在大规模临床试验中测量发育结局时,使用更精确的诊断工具在经济上可能难以承受,因此增加样本量并使用精度稍差但经过适当校准的工具可能更经济实惠。
ACTRN12610000633088。