Goodwin L, Prather J, Schlitz K, Iannacchione M A, Hage M, Hammond W E, Grzymala-Busse J
Duke University Durham, NC 27710, USA.
Biomed Sci Instrum. 1997;34:291-6.
Issues obstructing progress in data mining for improved health outcomes include data quality problems, data redundancy, data inconsistency, repeated measures, temporal (time-contextual) measures, and data volume. Related issues involve theoretical and technical problems involving uncertainty management, missing data and missing values, and matching appropriate data mining techniques to patient data sets. Results of data mining research in progress are reported for Duke University's perinatal database that contains nearly a decade of clinical patient data, 71,753 database (patient) records and 4-5000 variables per patient.
阻碍通过数据挖掘改善健康结果的问题包括数据质量问题、数据冗余、数据不一致、重复测量、时间(时间背景)测量和数据量。相关问题涉及理论和技术问题,包括不确定性管理、缺失数据和缺失值,以及将合适的数据挖掘技术与患者数据集相匹配。本文报告了杜克大学围产期数据库正在进行的数据挖掘研究结果,该数据库包含近十年的临床患者数据、71753条数据库(患者)记录,每位患者有4000至5000个变量。