Chiu Yi-Hung, Lee Ying-Han, Wang San-Yuan, Ouyang Chen-Sen, Wu Rong-Ching, Yang Rei-Cheng, Lin Lung-Chang
Department of Information Engineering, I-Shou University, No. 1, University Road, Yanchao District, Kaohsiung City, 824005, Taiwan.
Department of General Medicine, Shin Kong Wu Ho-Su Memorial Hospital, No. 95, Wenchang Road, Shilin District, Taipei City, 111045, Taiwan.
J Neurodev Disord. 2024 Dec 24;16(1):71. doi: 10.1186/s11689-024-09588-z.
Attention deficit hyperactivity disorder (ADHD) is a common childhood neurodevelopmental disorder, affecting between 5% and 7% of school-age children. ADHD is typically characterized by persistent patterns of inattention or hyperactivity-impulsivity, and it is diagnosed on the basis of the criteria outlined in the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition, through subjective observations and information provided by parents and teachers. Diagnosing ADHD in children is challenging, despite several assessment tools, such as the Swanson, Nolan, and Pelham questionnaire, being widely available. Such scales provide only a subjective understanding of the disorder. In this study, we employed video pixel subtraction and machine learning classification to objectively categorize 85 participants (43 with a diagnosis of ADHD and 42 without) into an ADHD group or a non-ADHD group by quantifying their movements.
We employed pixel subtraction movement quantization by analyzing movement features in videos of patients in outpatient consultation rooms. Pixel subtraction is a technique in which the number of pixels in one frame is subtracted from that in another frame to detect changes between the two frames. A difference between the pixel values indicates the presence of movement. In the current study, the patients' subtracted image sequences were characterized using three movement feature values: mean, variance, and Shannon entropy value. A classification analysis based on six machine learning models was performed to compare the performance indices and the discriminatory power of various features.
The results revealed that compared with the non-ADHD group, the ADHD group had significantly larger values for all movement features. Notably, the Shannon entropy values were 2.38 ± 0.59 and 1.0 ± 0.38 in the ADHD and non-ADHD groups, respectively (P < 0.0001). The Random Forest machine learning classification model achieved the most favorable results, with an accuracy of 90.24%, sensitivity of 88.85%, specificity of 91.75%, and area under the curve of 93.87%.
Our pixel subtraction and machine learning classification approach is an objective and practical method that can aid to clinical decisions regarding ADHD diagnosis.
注意力缺陷多动障碍(ADHD)是一种常见的儿童神经发育障碍,影响5%至7%的学龄儿童。ADHD的典型特征是持续的注意力不集中或多动-冲动模式,它是根据《精神疾病诊断与统计手册》第五版中概述的标准,通过家长和教师提供的主观观察和信息来诊断的。尽管有几种评估工具,如Swanson、Nolan和Pelham问卷,广泛可用,但诊断儿童ADHD仍具有挑战性。此类量表仅提供对该疾病的主观理解。在本研究中,我们采用视频像素减法和机器学习分类,通过量化85名参与者(43名被诊断为ADHD,42名未被诊断为ADHD)的动作,将他们客观地分为ADHD组或非ADHD组。
我们通过分析门诊咨询室患者视频中的动作特征,采用像素减法动作量化。像素减法是一种从一帧中的像素数量减去另一帧中的像素数量以检测两帧之间变化的技术。像素值之间的差异表明存在动作。在本研究中,使用三个动作特征值(均值、方差和香农熵值)对患者的减法图像序列进行表征。基于六种机器学习模型进行分类分析,以比较各种特征的性能指标和鉴别能力。
结果显示,与非ADHD组相比,ADHD组所有动作特征的值均显著更大。值得注意的是,ADHD组和非ADHD组的香农熵值分别为2.38±0.59和1.0±0.38(P<0.0001)。随机森林机器学习分类模型取得了最有利的结果,准确率为90.24%,灵敏度为88.85%,特异性为91.75%,曲线下面积为93.87%。
我们的像素减法和机器学习分类方法是一种客观实用的方法,有助于ADHD诊断的临床决策。