The discrete empirical interpolation method in class identification and data summarization.

Author information

Lyons Emily P Hendryx

Affiliation

Department of Mathematics and Statistics, University of Central Oklahoma.

Publication information

Wiley Interdiscip Rev Comput Stat. 2024 May-Jun;16(3). doi: 10.1002/wics.1653. Epub 2024 May 5.

Abstract

The discrete empirical interpolation method (DEIM) is well-established as a means of performing model order reduction in approximating solutions to differential equations, but it has also more recently demonstrated potential in performing data class detection through subset selection. Leveraging the singular value decomposition for dimension reduction, DEIM uses interpolatory projection to identify the representative rows and/or columns of a data matrix. This approach has been adapted to develop additional algorithms, including a CUR matrix factorization for performing dimension reduction while preserving the interpretability of the data. DEIM-oversampling techniques have also been developed expressly for the purpose of index selection in identifying more DEIM representatives than would typically be allowed by the matrix rank. Even with these developments, there is still a relatively large gap in the literature regarding the use of DEIM in performing unsupervised learning tasks to analyze large data sets. Known examples of DEIM's demonstrated applicability include contexts such as physics-based modeling/monitoring, electrocardiogram data summarization and classification, and document term subset selection. This overview presents a description of DEIM and some DEIM-related algorithms, discusses existing results from the literature with an emphasis on more statistical-learning-based tasks, and identifies areas for further exploration moving forward.
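The selection step described in the abstract can be illustrated with a minimal sketch, assuming NumPy and the standard greedy DEIM formulation applied to the leading singular vectors. The function names `deim_indices` and `deim_cur`, the rank parameter `k`, and the pseudoinverse-based middle factor are illustrative choices here, not code from the article.

```python
import numpy as np


def deim_indices(V):
    """Greedy DEIM selection: pick one index per column of V.

    V is assumed to have linearly independent columns
    (e.g. leading singular vectors of a data matrix).
    """
    k = V.shape[1]
    # First index: largest-magnitude entry of the first vector.
    p = [int(np.argmax(np.abs(V[:, 0])))]
    for j in range(1, k):
        # Interpolate column j at the selected indices using the first j columns.
        c = np.linalg.solve(V[p, :j], V[p, j])
        # The residual is zero at already-selected indices, so a new index is chosen.
        r = V[:, j] - V[:, :j] @ c
        p.append(int(np.argmax(np.abs(r))))
    return np.array(p)


def deim_cur(A, k):
    """Rank-k CUR sketch: DEIM picks rows and columns from the SVD of A."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    rows = deim_indices(U[:, :k])     # representative rows
    cols = deim_indices(Vt[:k, :].T)  # representative columns
    C = A[:, cols]
    R = A[rows, :]
    # Middle factor chosen so that C @ M @ R approximates A (assumed choice).
    M = np.linalg.pinv(C) @ A @ np.linalg.pinv(R)
    return rows, cols, C, M, R


# Illustrative data: a random matrix of exact rank 3.
rng = np.random.default_rng(0)
A = rng.standard_normal((8, 3)) @ rng.standard_normal((3, 6))
rows, cols, C, M, R = deim_cur(A, k=3)
print("selected rows:", rows, "selected columns:", cols)
```

Because `C` and `R` are actual columns and rows of the data matrix, the factors remain interpretable in terms of the original observations and features, which is the property of the CUR factorization that the abstract highlights. Note that this basic procedure yields at most `k` indices per side; the oversampling variants mentioned above are not covered by the sketch.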
