Siampou Maria Despoina, Nocera Luciano, Oh Jinseok, Smith Beth A, Shahabi Cyrus
Annu Int Conf IEEE Eng Med Biol Soc. 2024 Jul;2024:1-4. doi: 10.1109/EMBC53108.2024.10782656.
The inherent challenges in recruiting human subjects, particularly infants, often hinder the acquisition of sufficiently large datasets for health research, thereby limiting the applicability of conventional machine-learning (ML) approaches. In this study, we analyze full-day motion recordings from two groups: typically developing infants (N = 12) and infants at risk for developmental disabilities (N = 24), further divided into those with good (N = 10) and poor (N = 9) developmental outcomes at 24 months. The goal is to differentiate at-risk (AR) infants from those with typical development (TD) and predict outcomes for the at-risk category using wearable data. Due to its limited size, previous studies on this dataset, employing statistical and machine learning methods, raise reliability concerns. To address this, we introduce a novel algorithmic approach to extract meaningful patterns, referred to as Motifs, from the raw signals. The abundance of Motifs serves as highly informative indicators, enabling effective differentiation between the groups. Evaluation on this limited-size dataset demonstrates the effectiveness of Motifs in distinguishing AR from TD infants and predicting future outcomes for the at-risk category.
招募人类受试者,尤其是婴儿,存在诸多内在挑战,这常常阻碍为健康研究获取足够大的数据集,从而限制了传统机器学习(ML)方法的适用性。在本研究中,我们分析了两组全天的运动记录:发育正常的婴儿(N = 12)和有发育障碍风险的婴儿(N = 24),后者又进一步分为在24个月时发育结果良好(N = 10)和不良(N = 9)的两组。目标是使用可穿戴数据将有风险(AR)的婴儿与发育正常(TD)的婴儿区分开来,并预测有风险类别的结果。由于该数据集规模有限,此前使用统计和机器学习方法对其进行的研究引发了可靠性方面的担忧。为解决这一问题,我们引入了一种新颖的算法方法,从原始信号中提取有意义的模式,即基序(Motifs)。大量的基序可作为极具信息量的指标,实现两组之间的有效区分。在这个规模有限的数据集上进行的评估表明,基序在区分AR婴儿和TD婴儿以及预测有风险类别的未来结果方面是有效的。