Batool Humaira, Usman Akram M, Batool Fouzia, Butt Wasi Haider
National University of Sciences and Technology, Islamabad, Pakistan.
Riphah College of Rehabilitation Sciences, Riphah International University Islamabad, Islamabad, Pakistan.
Springerplus. 2016 Oct 21;5(1):1840. doi: 10.1186/s40064-016-3537-y. eCollection 2016.
Frozen shoulder is a disease in which shoulder becomes stiff. Accurate diagnosis of frozen shoulder is helpful in providing economical and effective treatment for patients. This research provides the classification of unstructured data using data mining techniques. Prediction results are validated by K-fold cross-validation method. It also provides accurate diagnosis of frozen shoulder using Naïve Bayesian and Random Forest models. At the end results are presented by performance measure techniques.
In this research, 145 respondents (patients) with a severe finding of frozen shoulder are included. They are selected on premise of (clinical) assessment confirmed after by MRI. This data is taken from the department of Orthopedics (Pakistan Institute of Medical Sciences Islamabad and Railway Hospital Rawalpindi) between September 2014 to November 2015. Frozen shoulder is categorized on the basis of MRI result. The predictor variables are taken from patient survey and patient reports, which consisted of 35+ variables. The outcome variable is coded into numeric system of "intact" and "no-intact". The outcome variable is assigned into numeric code, 1 for "intact" and 0 for "no-intact". "Intact" group is used as an indication that tissue is damaged badly and "no-intact" is classified as normal. Distribution of result is 110 patients for "Intact" group and 35 patients for "No-Intact" group (false positive rate was 24 %). In this research we have utilized two methods i.e. Naive Bayes and Random Forest. A statistics regression model (Logistic regression) to categorize frozen shoulder finding into "intact" and "no-intact" classes. In the end, we validated our results by Bayesian theorem. This gives a rough estimate about the probability of frozen shoulder.
In this research, our anticipated and predictive procedures gave better outcome as compared to statistical techniques. The specificity and sensitivity ratio of predicting a frozen shoulder are better in the Naïve Bayes as compared to Random Forest. In end the likelihood ratio results are used with Bayesian theorem for final evaluation of the results, from this we conclude predictive model is valid model for classification of frozen shoulder.
We have used three predictive models in our study to classify frozen shoulder. Then we validated our predictive results by Bayesian theorem to give a rough estimate about the probability of occurrence of disease or not. This enhances the clinical decision making regarding frozen shoulder.
肩周炎是一种肩部僵硬的疾病。准确诊断肩周炎有助于为患者提供经济有效的治疗。本研究使用数据挖掘技术对非结构化数据进行分类。预测结果通过K折交叉验证法进行验证。它还使用朴素贝叶斯和随机森林模型对肩周炎进行准确诊断。最后通过性能度量技术展示结果。
本研究纳入了145名有严重肩周炎症状的受访者(患者)。他们是在经MRI确认的(临床)评估基础上被挑选出来的。这些数据取自2014年9月至2015年11月期间的骨科(伊斯兰堡巴基斯坦医学科学研究所和拉瓦尔品第铁路医院)。肩周炎根据MRI结果进行分类。预测变量取自患者调查和患者报告,其中包含35个以上的变量。结果变量被编码为“完整”和“不完整”的数字系统。结果变量被分配数字代码,“完整”为1,“不完整”为0。“完整”组表示组织严重受损,“不完整”组被归类为正常。结果分布为“完整”组110名患者,“不完整”组35名患者(假阳性率为24%)。在本研究中,我们使用了两种方法,即朴素贝叶斯和随机森林。一个统计回归模型(逻辑回归)将肩周炎症状分类为“完整”和“不完整”类别。最后,我们通过贝叶斯定理验证了结果。这给出了肩周炎概率的大致估计。
在本研究中,与统计技术相比,我们的预期和预测程序产生了更好的结果。与随机森林相比,朴素贝叶斯预测肩周炎的特异性和灵敏度比率更高。最后,似然比结果与贝叶斯定理一起用于结果的最终评估,由此我们得出预测模型是肩周炎分类的有效模型。
我们在研究中使用了三种预测模型对肩周炎进行分类。然后我们通过贝叶斯定理验证了预测结果,以给出疾病发生概率的大致估计。这增强了关于肩周炎的临床决策。