Thornton Christopher B, Kolehmainen Niina, Nazarpour Kianoush
Population Health Sciences Institute, Faculty of Medical Sciences, Newcastle University, United Kingdom.
Great North Children's Hospital, Newcastle upon Tyne NHS Hospitals Trust, Unite Kingdom.
PLOS Digit Health. 2023 Apr 5;2(4):e0000220. doi: 10.1371/journal.pdig.0000220. eCollection 2023 Apr.
Accelerometers are widely used to measure physical activity behaviour, including in children. The traditional method for processing acceleration data uses cut points to define physical activity intensity, relying on calibration studies that relate the magnitude of acceleration to energy expenditure. However, these relationships do not generalise across diverse populations and hence they must be parametrised for each subpopulation (e.g., age groups) which is costly and makes studies across diverse populations and over time difficult. A data-driven approach that allows physical activity intensity states to emerge from the data, without relying on parameters derived from external populations, offers a new perspective on this problem and potentially improved results. We applied an unsupervised machine learning approach, namely a hidden semi-Markov model, to segment and cluster the raw accelerometer data recorded (using a waist-worn ActiGraph GT3X+) from 279 children (9-38 months old) with a diverse range of developmental abilities (measured using the Paediatric Evaluation of Disability Inventory-Computer Adaptive Testing measure). We benchmarked this analysis with the cut points approach, calculated using thresholds from the literature which had been validated using the same device and for a population which most closely matched ours. Time spent active as measured by this unsupervised approach correlated more strongly with PEDI-CAT measures of the child's mobility (R2: 0.51 vs 0.39), social-cognitive capacity (R2: 0.32 vs 0.20), responsibility (R2: 0.21 vs 0.13), daily activity (R2: 0.35 vs 0.24), and age (R2: 0.15 vs 0.1) than that measured using the cut points approach. Unsupervised machine learning offers the potential to provide a more sensitive, appropriate, and cost-effective approach to quantifying physical activity behaviour in diverse populations, compared to the current cut points approach. This, in turn, supports research that is more inclusive of diverse or rapidly changing populations.
加速度计被广泛用于测量身体活动行为,包括在儿童中。处理加速度数据的传统方法使用切点来定义身体活动强度,这依赖于将加速度大小与能量消耗相关联的校准研究。然而,这些关系并不能推广到不同人群,因此必须针对每个亚人群(例如年龄组)进行参数化,这成本高昂,并且使得跨不同人群和随时间的研究变得困难。一种数据驱动的方法,即允许身体活动强度状态从数据中显现出来,而不依赖于从外部人群得出的参数,为这个问题提供了一个新视角,并可能带来更好的结果。我们应用了一种无监督机器学习方法,即隐半马尔可夫模型,对279名发育能力各异(使用儿童残疾评定量表-计算机自适应测试进行测量)的儿童(9至38个月大)佩戴在腰部的ActiGraph GT3X+记录的原始加速度计数据进行分段和聚类。我们使用文献中的阈值计算切点方法对该分析进行了基准测试,这些阈值已在使用相同设备且与我们的人群最匹配的人群中得到验证。通过这种无监督方法测量的活跃时间与儿童运动能力(R2:0.51对0.39)、社会认知能力(R2:0.32对0.20)、责任感(R2:0.21对0.13)、日常活动(R2:0.35对0.24)和年龄(R2:0.15对0.1)的儿童残疾评定量表-计算机自适应测试测量值的相关性,比使用切点方法测量的更强。与当前的切点方法相比,无监督机器学习有可能提供一种更敏感、合适且具有成本效益的方法来量化不同人群的身体活动行为。这反过来又支持了更具包容性的不同或快速变化人群的研究。