Liu Xiaonan, Chen Kewei, Weidman David, Wu Teresa, Lure Fleming, Li Jing
Industrial Engineering, Arizona State University, Tempe, AZ, USA.
Banner Alzheimer's Institute, Phoenix, AZ, USA.
IISE Trans. 2021;53(9):1010-1022. doi: 10.1080/24725854.2020.1798569. Epub 2020 Sep 17.
Multimodality datasets are becoming increasingly common in various domains to provide complementary information for predictive analytics. One significant challenge in fusing multimodality data is that the multiple modalities are not universally available for all samples due to cost and accessibility constraints. This results in a unique data structure called Incomplete Multimodality Dataset (IMD). We propose a novel Incomplete-Multimodality Transfer Learning (IMTL) model that builds a predictive model for each sub-cohort of samples with the same missing modality pattern, and meanwhile couples the model estimation processes for different sub-cohorts to allow for transfer learning. We develop an Expectation-Maximization (EM) algorithm to estimate the parameters of IMTL and further extend it to a collaborative learning paradigm that is specifically valuable for patient privacy preservation in health care applications. We prove two advantageous properties of IMTL: the ability for out-of-sample prediction and a theoretical guarantee for a larger Fisher information compared with models without transfer learning. IMTL is applied to diagnosis and prognosis of the Alzheimer's Disease (AD) at an early stage called Mild Cognitive Impairment (MCI) using incomplete multimodality imaging data. IMTL achieves higher accuracy than competing methods without transfer learning. Supplementary materials are available for this article on the publisher's website.
多模态数据集在各个领域正变得越来越普遍,以提供用于预测分析的补充信息。融合多模态数据的一个重大挑战是,由于成本和可获取性限制,并非所有样本都能普遍获得多种模态的数据。这导致了一种独特的数据结构,称为不完全多模态数据集(IMD)。我们提出了一种新颖的不完全多模态迁移学习(IMTL)模型,该模型为具有相同缺失模态模式的每个样本子群组构建预测模型,同时将不同子群组的模型估计过程耦合起来以实现迁移学习。我们开发了一种期望最大化(EM)算法来估计IMTL的参数,并进一步将其扩展为一种协作学习范式,这在医疗保健应用中对保护患者隐私特别有价值。我们证明了IMTL的两个有利特性:样本外预测能力以及与无迁移学习的模型相比具有更大Fisher信息的理论保证。IMTL使用不完全多模态成像数据应用于阿尔茨海默病(AD)早期阶段即轻度认知障碍(MCI)的诊断和预后。与无迁移学习的竞争方法相比,IMTL实现了更高的准确率。本文的补充材料可在出版商网站上获取。