Yaghouby Farid, Sunderam Sridhar
Department of Biomedical Engineering, University of Kentucky, Lexington, KY 40506-0108, USA.
Department of Biomedical Engineering, University of Kentucky, Lexington, KY 40506-0108, USA.
Comput Biol Med. 2015 Apr;59:54-63. doi: 10.1016/j.compbiomed.2015.01.012. Epub 2015 Jan 23.
The limitations of manual sleep scoring make computerized methods highly desirable. Scoring errors can arise from human rater uncertainty or inter-rater variability. Sleep scoring algorithms either come as supervised classifiers that need scored samples of each state to be trained, or as unsupervised classifiers that use heuristics or structural clues in unscored data to define states. We propose a quasi-supervised classifier that models observations in an unsupervised manner but mimics a human rater wherever training scores are available. EEG, EMG, and EOG features were extracted in 30s epochs from human-scored polysomnograms recorded from 42 healthy human subjects (18-79 years) and archived in an anonymized, publicly accessible database. Hypnograms were modified so that: 1. Some states are scored but not others; 2. Samples of all states are scored but not for transitional epochs; and 3. Two raters with 67% agreement are simulated. A framework for quasi-supervised classification was devised in which unsupervised statistical models-specifically Gaussian mixtures and hidden Markov models--are estimated from unlabeled training data, but the training samples are augmented with variables whose values depend on available scores. Classifiers were fitted to signal features incorporating partial scores, and used to predict scores for complete recordings. Performance was assessed using Cohen's Κ statistic. The quasi-supervised classifier performed significantly better than an unsupervised model and sometimes as well as a completely supervised model despite receiving only partial scores. The quasi-supervised algorithm addresses the need for classifiers that mimic scoring patterns of human raters while compensating for their limitations.
人工睡眠评分的局限性使得计算机化方法备受青睐。评分误差可能源于人工评分者的不确定性或评分者之间的差异。睡眠评分算法要么是需要对每个状态的评分样本进行训练的监督分类器,要么是利用未评分数据中的启发式方法或结构线索来定义状态的无监督分类器。我们提出了一种准监督分类器,它以无监督方式对观测值进行建模,但在有训练分数的情况下模仿人工评分者。从42名健康人类受试者(18 - 79岁)记录的人工评分多导睡眠图中,以30秒时段提取脑电图(EEG)、肌电图(EMG)和眼电图(EOG)特征,并将其存档于一个匿名的、可公开访问的数据库中。对睡眠图进行了修改,以便:1. 对某些状态进行评分而对其他状态不评分;2. 对所有状态的样本进行评分,但不对过渡时段进行评分;3. 模拟两名一致性为67%的评分者。设计了一个准监督分类框架,其中从未标记的训练数据中估计无监督统计模型——具体为高斯混合模型和隐马尔可夫模型——但训练样本用其值取决于可用分数的变量进行扩充。将分类器拟合到包含部分分数的信号特征上,并用于预测完整记录的分数。使用科恩κ统计量评估性能。尽管只获得了部分分数,但准监督分类器的表现明显优于无监督模型,有时与完全监督模型相当。准监督算法满足了对模仿人工评分者评分模式同时弥补其局限性的分类器的需求。