Suppr超能文献

大规模科技文献查询服务的文献聚类算法研究。

Research on Literature Clustering Algorithm for Massive Scientific and Technical Literature Query Service.

机构信息

Wuhan University of Science & Technology Library, Wuhan 430081, Hubei, China.

出版信息

Comput Intell Neurosci. 2022 Aug 21;2022:3392489. doi: 10.1155/2022/3392489. eCollection 2022.

Abstract

Traditional science and technology literature search mainly provides users with reliable and detailed information materials and services through technical means, data resources, and service strategies. With the development of network technology, computer technology, and information technology, digital information resources are increasing day by day, which continuously impact the traditional knowledge service mode. Some traditional technical methods and service means can no longer meet the information needs of users under large data sets. This paper proposes a model of large-scale literature search service in the context of big data by studying the technical means and service modes used for scientific and technical literature search in universities in the era of big data. Specifically, this paper proposes a method for fast literature retrieval by combining R-tree indexing for the characteristics of diverse data types and large data volume of science and technology literature. The method uses an improved k-mean clustering algorithm to construct an R-tree clustering model and improve the retrieval efficiency of the system by retrieving scientific and technical literature data through R-tree indexing. Experiments on university science and technology literature datasets show that the method in this paper improves both efficiency and precision when searching literature.

摘要

传统的科技文献检索主要通过技术手段、数据资源和服务策略为用户提供可靠、详细的信息资料和服务。随着网络技术、计算机技术和信息技术的发展,数字信息资源日益增多,不断冲击着传统的知识服务模式。一些传统的技术方法和服务手段在大数据集下已不能满足用户的信息需求。本文通过研究大数据时代高校科技文献检索所采用的技术手段和服务模式,提出了一种大数据环境下的大规模文献检索服务模型。具体来说,本文针对科技文献数据类型多样、数据量大的特点,提出了一种结合 R 树索引的快速文献检索方法。该方法使用改进的 k-均值聚类算法构建 R 树聚类模型,并通过 R 树索引检索科技文献数据,提高系统的检索效率。在高校科技文献数据集上的实验表明,本文提出的方法在文献检索时提高了效率和精度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dc38/9420566/b45db9f0563d/CIN2022-3392489.001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验