Isola Rahul, Carvalho Rebeck, Tripathy Amiya Kumar
IEEE Trans Inf Technol Biomed. 2012 Nov;16(6):1287-95. doi: 10.1109/TITB.2012.2215044. Epub 2012 Aug 23.
Medical data is an ever-growing source of information generated from the hospitals in the form of patient records. When mined properly the information hidden in these records is a huge resource bank for medical research. As of now, this data is mostly used only for clinical work. This data often contains hidden patterns and relationships, that can lead to better diagnosis, better medicines, better treatment and overall, a platform to better understand the mechanisms governing almost all aspects of the medical domain. Unfortunately, discovery of these hidden patterns and relationships often goes unexploited. However there is on-going research in medical diagnosis which can predict the diseases of the heart, lungs and various tumours based on the past data collected from the patients.They are mostly limited to domain specific systems that predict diseases restricted to their area of operation like heart, brain and various other domains. These are not applicable to the whole medical dataset. The system proposed in this paper uses this vast storage of information so that diagnosis based on this historical data can be made. It focuses on computing the probability of occurrence of a particular ailment from the medical data by mining it using a unique algorithm which increases accuracy of such diagnosis by combining the key points of Neural Networks, Large Memory Storage and Retrieval (LAMSTAR), k-NN and Differential Diagnosis all integrated into one single algorithm. The system uses a Service-Oriented Architecture wherein the system components of diagnosis, information portal and other miscellaneous services are provided.This algorithm can be used in solving a few common problems that are encountered in automated diagnosis these days, which include: diagnosis of multiple diseases showing similar symptoms, diagnosis of a person suffering from multiple diseases, receiving faster and more accurate second opinion and faster identification of trends present in the medical records.
医疗数据是医院以患者记录形式生成的一个不断增长的信息源。如果挖掘得当,隐藏在这些记录中的信息对于医学研究来说是一个巨大的资源库。截至目前,这些数据大多仅用于临床工作。这些数据通常包含隐藏的模式和关系,能够带来更好的诊断、更优的药物、更有效的治疗,总体而言,还能形成一个更好地理解几乎所有医学领域控制机制的平台。不幸的是,这些隐藏模式和关系的发现往往未得到利用。不过,目前正在进行医学诊断方面的研究,这些研究能够根据从患者那里收集到的过往数据预测心脏、肺部和各种肿瘤疾病。它们大多局限于特定领域的系统,这些系统只能预测限于其操作领域(如心脏、大脑和其他各种领域)的疾病,并不适用于整个医疗数据集。本文提出的系统利用了这一庞大的信息存储,以便能够基于这些历史数据进行诊断。它专注于通过使用一种独特算法挖掘医疗数据来计算特定疾病发生的概率,该算法通过将神经网络、大容量存储与检索(LAMSTAR)、k近邻算法和鉴别诊断的关键点整合到一个单一算法中,提高了此类诊断的准确性。该系统采用面向服务的架构,其中提供了诊断、信息门户和其他杂项服务等系统组件。这种算法可用于解决如今自动诊断中遇到的一些常见问题,这些问题包括:诊断表现出相似症状的多种疾病、诊断患有多种疾病的人、获得更快更准确的第二种意见以及更快识别病历中呈现的趋势。