Suppr超能文献

大规模潜在语义分析。

Large-scale latent semantic analysis.

机构信息

Institute for Intelligent Systems, 365 Innovation Drive, Suite 303, Memphis, TN 38152, USA.

出版信息

Behav Res Methods. 2011 Jun;43(2):414-23. doi: 10.3758/s13428-010-0050-z.

Abstract

Latent semantic analysis (LSA) is a statistical technique for representing word meaning that has been widely used for making semantic similarity judgments between words, sentences, and documents. In order to perform an LSA analysis, an LSA space is created in a two-stage procedure, involving the construction of a word frequency matrix and the dimensionality reduction of that matrix through singular value decomposition (SVD). This article presents LANSE, an SVD algorithm specifically designed for LSA, which allows extremely large matrices to be processed using off-the-shelf computer hardware.

摘要

潜在语义分析(LSA)是一种用于表示词义的统计技术,已广泛用于对词、句和文档之间的语义相似度进行判断。为了进行 LSA 分析,需要通过两步程序创建 LSA 空间,这两步程序涉及构建单词频率矩阵和通过奇异值分解(SVD)降低该矩阵的维数。本文介绍了 LANSE,这是一种专门为 LSA 设计的 SVD 算法,它允许使用现成的计算机硬件处理非常大的矩阵。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验