Lyons Emily P Hendryx
Department of Mathematics and Statistics, University of Central Oklahoma.
Wiley Interdiscip Rev Comput Stat. 2024 May-Jun;16(3). doi: 10.1002/wics.1653. Epub 2024 May 5.
The discrete empirical interpolation method (DEIM) is well-established as a means of performing model order reduction in approximating solutions to differential equations, but it has also more recently demonstrated potential in performing data class detection through subset selection. Leveraging the singular value decomposition for dimension reduction, DEIM uses interpolatory projection to identify the representative rows and/or columns of a data matrix. This approach has been adapted to develop additional algorithms, including a CUR matrix factorization for performing dimension reduction while preserving the interpretability of the data. DEIM-oversampling techniques have also been developed expressly for the purpose of index selection in identifying more DEIM representatives than would typically be allowed by the matrix rank. Even with these developments, there is still a relatively large gap in the literature regarding the use of DEIM in performing unsupervised learning tasks to analyze large data sets. Known examples of DEIM's demonstrated applicability include contexts such as physics-based modeling/monitoring, electrocardiogram data summarization and classification, and document term subset selection. This overview presents a description of DEIM and some DEIM-related algorithms, discusses existing results from the literature with an emphasis on more statistical-learning-based tasks, and identifies areas for further exploration moving forward.
离散经验插值方法(DEIM)作为一种在逼近微分方程解时进行模型降阶的手段已得到广泛认可,但最近它在通过子集选择进行数据类别检测方面也展现出了潜力。DEIM利用奇异值分解进行降维,通过插值投影来识别数据矩阵的代表性行和/或列。这种方法已被用于开发其他算法,包括一种CUR矩阵分解,用于在降维的同时保留数据的可解释性。DEIM过采样技术也是专门为索引选择而开发的,目的是识别比矩阵秩通常允许的更多的DEIM代表。即便有了这些进展,在文献中关于使用DEIM执行无监督学习任务以分析大数据集方面仍存在较大差距。DEIM已证明的适用性的已知示例包括基于物理的建模/监测、心电图数据汇总和分类以及文档术语子集选择等背景。本综述介绍了DEIM和一些与DEIM相关的算法,讨论了文献中的现有结果,重点是更多基于统计学习的任务,并确定了未来进一步探索的领域。