In text classification problems, the representation of a document has a strong impact on the performance of learning systems. The high dimensionality of the classical structured representations can lead to burdensome computations because of the large size of real-world data. Consequently, the quantity of handled information needs to be reduced to improve the classification process. In this paper, we propose a method to reduce the dimensionality of a classical text representation based on a clustering technique to group documents and a previously developed Hidden Markov Model to represent them. We tested the proposed dimensionality reduction technique with the k-NN and SVM classifiers on the OHSUMED and TREC benchmark text corpora. The experimental results are very satisfactory compared to commonly used techniques like InfoGain, and the statistical tests performed demonstrate the suitability of the proposed technique for the preprocessing step in a text classification task.

Improving the text classification using clustering and a novel HMM to reduce the dimensionality.

Author information

Seara Vieira A, Borrajo L, Iglesias E L

Affiliation

Department of Computer Science, Higher Technical School of Computer Engineering, University of Vigo, 32004 Ourense, Spain.

Publication information

Comput Methods Programs Biomed. 2016 Nov;136:119-30. doi: 10.1016/j.cmpb.2016.08.018. Epub 2016 Aug 26.
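
The abstract names InfoGain as a commonly used baseline for dimensionality reduction. The record does not describe the clustering and HMM method in enough detail to reproduce it, so the following is only a minimal sketch of that baseline: ranking terms by information gain and keeping the top k. The toy corpus, the binary term-presence representation, and the function names (`entropy`, `information_gain`, `select_top_terms`) are all assumptions made for illustration and do not come from the paper.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(docs, labels, term):
    """Reduction in class entropy from knowing whether `term` occurs in a document.

    Assumes each document is a set of terms (binary presence, not frequency).
    """
    with_term = [y for d, y in zip(docs, labels) if term in d]
    without_term = [y for d, y in zip(docs, labels) if term not in d]
    n = len(labels)
    conditional = (len(with_term) / n) * entropy(with_term) \
        + (len(without_term) / n) * entropy(without_term)
    return entropy(labels) - conditional


def select_top_terms(docs, labels, k):
    """Return the k terms with the highest information gain."""
    vocabulary = sorted(set().union(*docs))
    ranked = sorted(
        vocabulary,
        key=lambda t: information_gain(docs, labels, t),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical toy corpus: four documents, two classes.
docs = [
    {"heart", "attack", "patient"},
    {"heart", "surgery", "patient"},
    {"gene", "protein", "patient"},
    {"gene", "sequence", "patient"},
]
labels = ["cardio", "cardio", "genetics", "genetics"]

top_terms = select_top_terms(docs, labels, 2)
print(top_terms)
```

In this toy corpus, "patient" occurs in every document and so carries no information about the class, while "heart" and "gene" separate the two classes perfectly. Keeping only the top-ranked terms shrinks the feature space before a classifier such as k-NN or SVM is trained.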
