在大规模死亡率试验中评估婴儿发育时平衡精度与可承受性：一项随机对照试验的二次分析

Balancing precision and affordability in assessing infant development in large-scale mortality trials: secondary analysis of a randomised controlled trial.

作者信息

Robledo Kristy P, Rieger Ingrid, Finlayson Sarah, Tarnow-Mordi William, Martin Andrew J

机构信息

NHMRC Clinical Trials Centre, The University of Sydney, Camperdown, New South Wales, Australia

RPA Women and Babies, Camperdown, New South Wales, Australia.

出版信息

Arch Dis Child Fetal Neonatal Ed. 2025 Jun 19;110(4):409-414. doi: 10.1136/archdischild-2024-327762.

DOI:10.1136/archdischild-2024-327762

PMID:39788720

Abstract

OBJECTIVE

Large-scale mortality trials require reliable secondary assessments of impairment. We compared the Ages and Stages Questionnaire (ASQ-3), a screening tool self-administered by parents, in classifying impairment using the 'gold standard' Bayley Scales of Infant Development (Bayley-III), a diagnostic tool administered by trained assessors.

DESIGN

Analysis of 405 children around 2 years corrected age from the Australian Placental Transfusion Study, a trial conducted over 8 years.

SETTING

Secondary analysis of international, open-label, multicentre randomised trial.

PATIENTS

Children born <30 weeks gestation.

INTERVENTIONS

Immediate (<10 s) versus delayed (60 s+) cord clamping.

MAIN OUTCOMES

ASQ-3 and Bayley-III assessments around 2 years corrected age. Impairment (or developmental delay) was defined as <2 SD below the mean (<70) for Bayley-III domains.

RESULTS

The area under the receiver operating curve for ASQ-3 domains predicting delay was 0.75-0.99. Sensitivity for predicting delay was 57%-100%, while specificity was 88%-90%.We modelled the cost and sample size using a less expensive, though less precise, screening assessment for impairment compared with a more costly diagnostic assessment. For detecting a 25% reduction in the relative risk of delay, using ASQ-3 rather than Bayley-III could require double the sample size (15 000 to 30 000), but outcome assessment cost savings would be US$13M (EUR$12M). However, assessment cost savings may be outweighed by upscaling.

CONCLUSIONS

When measuring developmental outcomes in a large-scale clinical trial, using a more precise diagnostic tool may be financially prohibitive, so increasing the sample size and using a less precise but appropriately calibrated tool may be more affordable.

TRIAL REGISTRATION NUMBER

ACTRN12610000633088.

摘要

目的

大规模死亡率试验需要对损伤进行可靠的二次评估。我们比较了由父母自行填写的筛查工具《年龄与发育阶段问卷》（ASQ - 3）与由经过培训的评估人员使用的诊断工具《贝利婴幼儿发展量表》（贝利 - III）在对损伤进行分类时的情况，后者被视为“金标准”。

设计

对澳大利亚胎盘输血研究中405名矫正年龄约2岁的儿童进行分析，该试验历时8年。

背景

对一项国际、开放标签、多中心随机试验的二次分析。

研究对象

孕周小于30周出生的儿童。

干预措施

即刻（<10秒）与延迟（60秒以上）脐带结扎。

主要结局

矫正年龄约2岁时的ASQ - 3和贝利 - III评估。损伤（或发育迟缓）定义为贝利 - III各领域得分低于均值（<70）2个标准差以下。

结果

ASQ - 3各领域预测发育迟缓的受试者工作特征曲线下面积为0.75 - 0.99。预测发育迟缓的敏感性为57% - 100%，特异性为88% - 90%。我们对成本和样本量进行了建模，与成本更高的诊断性评估相比，使用一种成本较低但精度稍差的损伤筛查评估方法。为检测发育迟缓相对风险降低25%，使用ASQ - 3而非贝利 - III可能需要将样本量翻倍（从15000增至30000），但结局评估成本可节省1300万美元（1200万欧元）。然而，评估成本的节省可能会因扩大规模而被抵消。