Grigull Lorenz, Lechner Werner, Petri Susanne, Kollewe Katja, Dengler Reinhard, Mehmecke Sandra, Schumacher Ulrike, Lücke Thomas, Schneider-Gold Christiane, Köhler Cornelia, Güttsches Anne-Katrin, Kortum Xiaowei, Klawonn Frank
Department of Pediatric Hematology and Oncology, Hannover Medical School, Carl-Neuberg Str. 1, D-30623, Hannover, Germany.
Improved Medical Diagnostics, IMD GmbH, Hannover, Germany.
BMC Med Inform Decis Mak. 2016 Mar 8;16:31. doi: 10.1186/s12911-016-0268-5.
Diagnosis of neuromuscular diseases in primary care is often challenging. Rare diseases such as Pompe disease are easily overlooked by the general practitioner. We therefore aimed to develop a diagnostic support tool using patient-oriented questions and combined data mining algorithms recognizing answer patterns in individuals with selected neuromuscular diseases. A multicenter prospective study for the proof of concept was conducted thereafter.
First, 16 interviews with patients were conducted focusing on their pre-diagnostic observations and experiences. From these interviews, we developed a questionnaire with 46 items. Then, patients with diagnosed neuromuscular diseases as well as patients without such a disease answered the questionnaire to establish a database for data mining. For proof of concept, initially only six diagnoses were chosen (myotonic dystrophy and myotonia (MdMy), Pompe disease (MP), amyotrophic lateral sclerosis (ALS), polyneuropathy (PNP), spinal muscular atrophy (SMA), other neuromuscular diseases, and no neuromuscular disease (NND). A prospective study was performed to validate the automated malleable system, which included six different classification methods combined in a fusion algorithm proposing a final diagnosis. Finally, new diagnoses were incorporated into the system.
In total, questionnaires from 210 individuals were used to train the system. 89.5 % correct diagnoses were achieved during cross-validation. The sensitivity of the system was 93-97 % for individuals with MP, with MdMy and without neuromuscular diseases, but only 69 % in SMA and 81 % in ALS patients. In the prospective trial, 57/64 (89 %) diagnoses were predicted correctly by the computerized system. All questions, or rather all answers, increased the diagnostic accuracy of the system, with the best results reached by the fusion of different classifier methods. Receiver operating curve (ROC) and p-value analyses confirmed the results.
A questionnaire-based diagnostic support tool using data mining methods exhibited good results in predicting selected neuromuscular diseases. Due to the variety of neuromuscular diseases, additional studies are required to measure beneficial effects in the clinical setting.
基层医疗中神经肌肉疾病的诊断往往具有挑战性。像庞贝病这样的罕见疾病很容易被全科医生忽视。因此,我们旨在开发一种诊断支持工具,该工具使用以患者为导向的问题以及结合数据挖掘算法来识别患有特定神经肌肉疾病个体的答案模式。此后进行了一项多中心前瞻性概念验证研究。
首先,对16名患者进行了访谈,重点关注他们诊断前的观察和经历。通过这些访谈,我们编制了一份包含46个条目的问卷。然后,患有已确诊神经肌肉疾病的患者以及未患此类疾病的患者回答了问卷,以建立用于数据挖掘的数据库。为了进行概念验证,最初仅选择了六种诊断(强直性肌营养不良和肌强直(MdMy)、庞贝病(MP)、肌萎缩侧索硬化症(ALS)、多发性神经病(PNP)、脊髓性肌萎缩症(SMA)、其他神经肌肉疾病以及无神经肌肉疾病(NND))。进行了一项前瞻性研究以验证自动可变系统,该系统包括六种不同的分类方法,这些方法组合在一个融合算法中以提出最终诊断。最后,将新的诊断纳入该系统。
总共使用了210人的问卷来训练该系统。在交叉验证期间,正确诊断率达到89.5%。该系统对患有MP、MdMy和无神经肌肉疾病的个体的敏感性为93% - 97%,但对SMA患者仅为69%,对ALS患者为81%。在前瞻性试验中,计算机化系统正确预测了57/64(89%)的诊断。所有问题,或者更确切地说所有答案,都提高了系统的诊断准确性,不同分类器方法融合时效果最佳。受试者工作特征曲线(ROC)和p值分析证实了这些结果。
一种使用数据挖掘方法的基于问卷的诊断支持工具在预测特定神经肌肉疾病方面表现出良好的效果。由于神经肌肉疾病的多样性,需要进行更多研究以衡量其在临床环境中的有益效果。