Institute for Intelligent Systems, 365 Innovation Drive, Suite 303, Memphis, TN 38152, USA.
Behav Res Methods. 2011 Jun;43(2):414-23. doi: 10.3758/s13428-010-0050-z.
Latent semantic analysis (LSA) is a statistical technique for representing word meaning that has been widely used for making semantic similarity judgments between words, sentences, and documents. In order to perform an LSA analysis, an LSA space is created in a two-stage procedure, involving the construction of a word frequency matrix and the dimensionality reduction of that matrix through singular value decomposition (SVD). This article presents LANSE, an SVD algorithm specifically designed for LSA, which allows extremely large matrices to be processed using off-the-shelf computer hardware.
潜在语义分析(LSA)是一种用于表示词义的统计技术,已广泛用于对词、句和文档之间的语义相似度进行判断。为了进行 LSA 分析,需要通过两步程序创建 LSA 空间,这两步程序涉及构建单词频率矩阵和通过奇异值分解(SVD)降低该矩阵的维数。本文介绍了 LANSE,这是一种专门为 LSA 设计的 SVD 算法,它允许使用现成的计算机硬件处理非常大的矩阵。