IBM Research Europe.
Stud Health Technol Inform. 2021 Nov 18;287:8-12. doi: 10.3233/SHTI210800.
There is a growing trend in building deep learning patient representations from health records to obtain a comprehensive view of a patient's data for machine learning tasks. This paper proposes a reproducible approach to generate patient pathways from health records and to transform them into a machine-processable image-like structure useful for deep learning tasks. Based on this approach, we generated over a million pathways from FAIR synthetic health records and used them to train a convolutional neural network. Our initial experiments show the accuracy of the CNN on a prediction task is comparable or better than other autoencoders trained on the same data, while requiring significantly less computational resources for training. We also assess the impact of the size of the training dataset on autoencoders performances. The source code for generating pathways from health records is provided as open source.
从健康记录中构建深度学习患者表示以获取患者数据的全面视图,从而用于机器学习任务,这已经成为一种趋势。本文提出了一种从健康记录中生成患者路径并将其转换为可用于深度学习任务的机器可处理的图像结构的可重现方法。基于这种方法,我们从 FAIR 合成健康记录中生成了超过一百万条路径,并使用它们来训练卷积神经网络。我们的初步实验表明,在预测任务上,CNN 的准确性与在相同数据上训练的其他自动编码器相当或更好,同时训练所需的计算资源明显更少。我们还评估了训练数据集大小对自动编码器性能的影响。生成健康记录路径的代码已作为开源提供。