用于高效基于文本的信息检索的混合优化与基于本体的语义模型。

Hybrid optimization and ontology-based semantic model for efficient text-based information retrieval.

作者信息

Kumar Ram, Sharma S C

机构信息

Electronics and Computer Discipline, DPT, Indian Institute of Technology, Roorkee, India.

出版信息

J Supercomput. 2023;79(2):2251-2280. doi: 10.1007/s11227-022-04708-9. Epub 2022 Aug 10.

DOI:10.1007/s11227-022-04708-9

PMID:35967462

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9364863/

Abstract

Query expansion is an important approach utilized to improve the efficiency of data retrieval tasks. Numerous works are carried out by the researchers to generate fair constructive results; however, they do not provide acceptable results for all kinds of queries particularly phrase and individual queries. The utilization of identical data sources and weighting strategies for expanding such terms are the major cause of this issue which leads the model unable to capture the comprehensive relationship between the query terms. In order to tackle this issue, we developed a novel approach for query expansion technique to analyze the different data sources namely WordNet, Wikipedia, and Text REtrieval Conference. This paper presents an Improved Aquila Optimization-based COOT(IAOCOOT) algorithm for query expansion which retrieves the semantic aspects that match the query term. The semantic heterogeneity associated with document retrieval mainly impacts the relevance matching between the query and the document. The main cause of this issue is that the similarity among the words is not evaluated correctly. To overcome this problem, we are using a Modified Needleman Wunsch algorithm algorithm to deal with the problems of uncertainty, imprecision in the information retrieval process, and semantic ambiguity of indexed terms in both the local and global perspectives. The k most similar word is determined and returned from a candidate set through the top-k words selection technique and it is widely utilized in different tasks. The proposed IAOCOOT model is evaluated using different standard Information Retrieval performance metrics to compute the validity of the proposed work by comparing it with other state-of-art techniques.

摘要

查询扩展是一种用于提高数据检索任务效率的重要方法。研究人员开展了大量工作以产生合理的建设性成果；然而，它们并不能为所有类型的查询（特别是短语查询和单个查询）提供可接受的结果。使用相同的数据源和加权策略来扩展此类术语是导致该问题的主要原因，这使得模型无法捕捉查询词之间的全面关系。为了解决这个问题，我们开发了一种新颖的查询扩展技术方法，以分析不同的数据源，即WordNet、维基百科和文本检索会议。本文提出了一种基于改进的天鹰座优化算法的查询扩展COOT（IAOCOOT）算法，该算法检索与查询词匹配的语义方面。与文档检索相关的语义异质性主要影响查询与文档之间的相关性匹配。这个问题的主要原因是单词之间的相似度没有得到正确评估。为了克服这个问题，我们使用一种改进的Needleman Wunsch算法来处理信息检索过程中的不确定性、不精确性以及索引词在局部和全局视角下的语义模糊性问题。通过top-k词选择技术从候选集中确定并返回k个最相似的词，它在不同任务中得到了广泛应用。使用不同的标准信息检索性能指标对所提出的IAOCOOT模型进行评估，通过与其他现有技术进行比较来计算所提出工作的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a4c0/9364863/d519b7a4a366/11227_2022_4708_Fig1_HTML.jpg

相似文献

Hybrid optimization and ontology-based semantic model for efficient text-based information retrieval.用于高效基于文本的信息检索的混合优化与基于本体的语义模型。

J Supercomput. 2023;79(2):2251-2280. doi: 10.1007/s11227-022-04708-9. Epub 2022 Aug 10.

Hybrid ontology for semantic information retrieval model using keyword matching indexing system.使用关键词匹配索引系统的语义信息检索模型的混合本体。

ScientificWorldJournal. 2015;2015:414910. doi: 10.1155/2015/414910. Epub 2015 Apr 1.

Document Retrieval for Precision Medicine Using a Deep Learning Ensemble Method.使用深度学习集成方法进行精准医学的文献检索

JMIR Med Inform. 2021 Jun 29;9(6):e28272. doi: 10.2196/28272.

An approach to semantic query expansion system based on Hepatitis ontology.一种基于肝炎本体的语义查询扩展系统方法。

J Biol Res (Thessalon). 2016 Jul 4;23(Suppl 1):11. doi: 10.1186/s40709-016-0044-9. eCollection 2016 May.

Document/query expansion based on selecting significant concepts for context based retrieval of medical images.基于选择显著概念的文档/查询扩展，用于基于上下文的医学图像检索。

J Biomed Inform. 2019 Jul;95:103210. doi: 10.1016/j.jbi.2019.103210. Epub 2019 May 17.

Learning to rank query expansion terms for COVID-19 scholarly search.学习对 COVID-19 学术搜索进行查询扩展词的排序。

J Biomed Inform. 2023 Jun;142:104386. doi: 10.1016/j.jbi.2023.104386. Epub 2023 May 12.

Relevance Feedback Based Query Expansion Model Using Borda Count and Semantic Similarity Approach.基于Borda计数和语义相似性方法的相关反馈查询扩展模型

Comput Intell Neurosci. 2015;2015:568197. doi: 10.1155/2015/568197. Epub 2015 Dec 7.

On the query reformulation technique for effective MEDLINE document retrieval.针对有效 MEDLINE 文档检索的查询改写技术。

J Biomed Inform. 2010 Oct;43(5):686-93. doi: 10.1016/j.jbi.2010.04.005. Epub 2010 Apr 13.

Identification of the Best Semantic Expansion to Query PubMed Through Automatic Performance Assessment of Four Search Strategies on All Medical Subject Heading Descriptors: Comparative Study.通过对所有医学主题词描述符的四种检索策略进行自动性能评估来确定查询PubMed的最佳语义扩展：比较研究

JMIR Med Inform. 2020 Jun 4;8(6):e12799. doi: 10.2196/12799.

Semantic-Enhanced Query Expansion System for Retrieving Medical Image Notes.用于检索医学图像注释的语义增强查询扩展系统。

J Med Syst. 2018 Apr 25;42(6):105. doi: 10.1007/s10916-018-0954-1.

引用本文的文献

Toward clearer recognition and easier usefulness: development of a cross-lingual atherosclerotic cerebrovascular disease ontology.迈向更清晰的认知与更便捷的应用：跨语言动脉粥样硬化性脑血管疾病本体的开发

Database (Oxford). 2024 Dec 5;2024. doi: 10.1093/database/baae117.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于高效基于文本的信息检索的混合优化与基于本体的语义模型。

Hybrid optimization and ontology-based semantic model for efficient text-based information retrieval.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献