Shakeel P Mohamed, Baskar S, Dhulipala V R Sarma, Jaber Mustafa Musa
1Faculty of Information and Communication Technology, Universiti Teknikal Malaysia Melaka, Durian Tunggal, Malaysia.
Department of ECE, Karpagam Academy of Higher Education, Coimbatore, India.
Health Inf Sci Syst. 2018 Sep 24;6(1):16. doi: 10.1007/s13755-018-0054-0. eCollection 2018 Dec.
Diabetes mellitus is a serious health problem affecting the entire population all over the world for many decades. It is a group of metabolic disorder characterized by chronic disease which occurs due to high blood sugar, unhealthy foods, lack of physical activity and also hereditary. The sorts of diabetes mellitus are type1, type2 and gestational diabetes. The type1 appears during childhood and type2 diabetes develop at any age, mostly affects older than 40. The gestational diabetes occurs for pregnant women. According to the statistical report of WHO 79% of deaths occurred in people under the age of 60, due to diabetes. With a specific end goal to deal with the vast volume, speed, assortment, veracity and estimation of information a scalable environment is needed. Cloud computing is an interesting computing model suitable for accommodating huge volume of dynamic data. To overcome the data handling problems this work focused on Hadoop framework along with clustering technique. This work also predicts the occurrence of diabetes under various circumstances which is more useful for the human. This paper also compares the efficiency of two different clustering techniques suitable for the environment. The predicted result is used to diagnose which age group and gender are mostly affected by diabetes. Further some of the attributes such as hyper tension and work nature are also taken into consideration for analysis.
几十年来,糖尿病一直是影响全球全体人口的严重健康问题。它是一组代谢紊乱疾病,其特征为慢性病,由高血糖、不健康饮食、缺乏体育活动以及遗传因素导致。糖尿病的类型有1型、2型和妊娠期糖尿病。1型糖尿病出现在儿童期,2型糖尿病在任何年龄都可能发生,主要影响40岁以上人群。妊娠期糖尿病发生在孕妇身上。根据世界卫生组织的统计报告,79%的死亡发生在60岁以下的人群中,原因是糖尿病。为了处理大量、快速、多样、准确和有价值的信息,需要一个可扩展的环境。云计算是一种适用于处理大量动态数据的有趣计算模型。为了克服数据处理问题,这项工作聚焦于Hadoop框架以及聚类技术。这项工作还预测了在各种情况下糖尿病的发生,这对人类更有用。本文还比较了适用于该环境的两种不同聚类技术的效率。预测结果用于诊断哪些年龄组和性别受糖尿病影响最大。此外,还考虑了一些诸如高血压和工作性质等属性进行分析。