Levin-Schwartz Yuri, Calhoun Vince D, Adali Tulay
IEEE Trans Med Imaging. 2017 Jul;36(7):1385-1395. doi: 10.1109/TMI.2017.2678483. Epub 2017 Mar 6.
The extraction of information from multiple sets of data is a problem inherent to many disciplines. This is possible by either analyzing the data sets jointly as in data fusion or separately and then combining as in data integration. However, selecting the optimal method to combine and analyze multiset data is an ever-present challenge. The primary reason for this is the difficulty in determining the optimal contribution of each data set to an analysis as well as the amount of potentially exploitable complementary information among data sets. In this paper, we propose a novel classification rate-based technique to unambiguously quantify the contribution of each data set to a fusion result as well as facilitate direct comparisons of fusion methods on real data and apply a new method, independent vector analysis (IVA), to multiset fusion. This classification rate-based technique is used on functional magnetic resonance imaging data collected from 121 patients with schizophrenia and 150 healthy controls during the performance of three tasks. Through this application, we find that though optimal performance is achieved by exploiting all tasks, each task does not contribute equally to the result and this framework enables effective quantification of the value added by each task. Our results also demonstrate that data fusion methods are more powerful than data integration methods, with the former achieving a classification rate of 73.5 % and the latter achieving one of 70.9 %, a difference which we show is significant when all three tasks are analyzed together. Finally, we show that IVA, due to its flexibility, has equivalent or superior performance compared with the popular data fusion method, joint independent component analysis.
从多组数据中提取信息是许多学科所固有的问题。这可以通过像数据融合那样联合分析数据集,或者像数据整合那样分别分析然后合并来实现。然而,选择最优的方法来合并和分析多集数据一直是个挑战。主要原因在于难以确定每个数据集对分析的最优贡献以及各数据集之间潜在可利用的互补信息量。在本文中,我们提出一种基于分类率的新颖技术,以明确量化每个数据集对融合结果的贡献,并便于在真实数据上直接比较融合方法,同时将一种新方法——独立向量分析(IVA)应用于多集融合。这种基于分类率的技术用于从121名精神分裂症患者和150名健康对照在执行三项任务期间收集的功能磁共振成像数据。通过此应用,我们发现虽然利用所有任务可实现最优性能,但每个任务对结果的贡献并不相同,并且该框架能够有效量化每个任务所增加的价值。我们的结果还表明,数据融合方法比数据整合方法更强大,前者的分类率为73.5%,后者为70.9%,当对所有三项任务一起分析时,我们表明这种差异是显著的。最后,我们表明,由于其灵活性,IVA与流行的数据融合方法联合独立成分分析相比,具有同等或更优的性能。