• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用关键词提高数字图书馆中搜索代理的效率的精确方法。

Accurate Approach Towards Efficiency of Searching Agents in Digital Libraries Using Keywords.

机构信息

Department of Computer Science & Engineering, Karpagam University, Coimbatore, India.

Department of Electronics and Communication Engineering, Karpagam University, Coimbatore, India.

出版信息

J Med Syst. 2019 May 1;43(6):164. doi: 10.1007/s10916-019-1294-5.

DOI:10.1007/s10916-019-1294-5
PMID:31044273
Abstract

The age of information has done it simple for storing huge amount of data. In actual fact, a considerable segment of existing information is accumulated in the text databases that have huge set of documents from different sources like research articles, news articles, books, e-mail messages, web pages and digital libraries. In many text databases, stored data are in the semi-structured format in that they are neither entirely structured nor entirely unstructured. IR (Information Retrieval) field has been growing in parallel using database systems for several years. Contrasting to the databases system fields that have concentrated mainly on transaction and query processing of the structured data, IR is concerned with firm and retrieval of data from a huge quantity of text-oriented documents. Thus, IR tackles with unstructured and/or semi-structured databases. Information security requirements within a firm have experience major variations in the past some decades. By the establishment of computer, the necessary for automated equipment for securing files as well as other information that stored on the computer turned into evident. This is particularly in case of shared information resources via public network. This is the origin for having a secure computer system / the need for computer security. Computer Security can be achieved by Intrusion Detection Systems. In this paper, we address these issues by applying Similarity Search in two diversified fields: Digital Libraries and Computer Security. The paper discusses a fast and efficient similarity search technique for approximate retrieval of books metadata in Digital Libraries. In DLI the books retrieval takes place just by using metadata such as title, year, edition, author, publishing of a book. Though, if metadata is missing, incorrect or unfinished, then it creates the library retrieval system inefficient, incorrect leads too much confusion to the user. In this context even if the query from the user matches partially or fully with a stored pattern, the information related to that be retrieved. The paper talks about a method that functions rapid and effective, language independent, and flexible library retrieval system signature based similarity search. This system is able to retrieve not only the metadata that exactly matches the query but also fairly accurate identical because of missing words, jumbled words and spell mistakes. Fundamentally, signature file approach is used here. A signature file approach looks like the most capable for huge database as it has superior text retrieval features and requires little storage overhead.

摘要

信息时代使得存储大量数据变得简单。事实上,大量现有的信息都积累在文本数据库中,这些数据库包含来自不同来源的大量文档,如研究文章、新闻文章、书籍、电子邮件、网页和数字图书馆。在许多文本数据库中,存储的数据采用半结构化格式,它们既不是完全结构化的,也不是完全非结构化的。信息检索 (IR) 领域多年来一直与数据库系统并行发展。与主要集中在结构化数据的事务和查询处理的数据库系统不同,IR 关注的是从大量面向文本的文档中提取和检索数据。因此,IR 处理非结构化和/或半结构化数据库。过去几十年,公司内部的信息安全要求经历了重大变化。随着计算机的建立,对计算机文件以及存储在计算机上的其他信息进行自动保护的必要性变得显而易见。这在通过公共网络共享信息资源的情况下尤其如此。这就是拥有安全计算机系统/计算机安全需求的起源。计算机安全可以通过入侵检测系统来实现。在本文中,我们通过在两个不同领域应用相似性搜索来解决这些问题:数字图书馆和计算机安全。本文讨论了一种快速有效的相似性搜索技术,用于在数字图书馆中近似检索书籍元数据。在 DLI 中,书籍检索仅通过使用元数据(如标题、年份、版本、作者、出版)来进行。然而,如果元数据缺失、不正确或不完整,那么它会使图书馆检索系统效率低下,错误会给用户带来太多困惑。在这种情况下,即使用户的查询与存储的模式部分或完全匹配,也要检索相关信息。本文介绍了一种快速、有效、独立于语言且灵活的基于库检索系统签名的相似性搜索方法。该系统不仅能够检索与查询完全匹配的元数据,还能够检索由于缺少单词、单词混乱和拼写错误而相当准确的相同信息。从根本上讲,这里使用的是签名文件方法。签名文件方法看起来最适合于大型数据库,因为它具有卓越的文本检索功能,并且需要的存储开销很小。

相似文献

1
Accurate Approach Towards Efficiency of Searching Agents in Digital Libraries Using Keywords.利用关键词提高数字图书馆中搜索代理的效率的精确方法。
J Med Syst. 2019 May 1;43(6):164. doi: 10.1007/s10916-019-1294-5.
2
Font adaptive word indexing of modern printed documents.现代印刷文档的字体自适应词索引
IEEE Trans Pattern Anal Mach Intell. 2006 Aug;28(8):1187-99. doi: 10.1109/TPAMI.2006.162.
3
The BioPrompt-box: an ontology-based clustering tool for searching in biological databases.生物提示框:一种用于在生物数据库中搜索的基于本体的聚类工具。
BMC Bioinformatics. 2007 Mar 8;8 Suppl 1(Suppl 1):S8. doi: 10.1186/1471-2105-8-S1-S8.
4
A similarity learning approach to content-based image retrieval: application to digital mammography.一种基于内容的图像检索的相似性学习方法:应用于数字乳腺摄影
IEEE Trans Med Imaging. 2004 Oct;23(10):1233-44. doi: 10.1109/TMI.2004.834601.
5
NeuroExtract: facilitating neuroscience-oriented retrieval from broadly-focused bioscience databases using text-based query mediation.神经提取:通过基于文本的查询调解,促进从广泛的生物科学数据库中进行面向神经科学的检索。
J Am Med Inform Assoc. 2007 May-Jun;14(3):355-60. doi: 10.1197/jamia.M2321. Epub 2007 Feb 28.
6
Case retrieval in medical databases by fusing heterogeneous information.医学数据库中异构信息融合的病例检索。
IEEE Trans Med Imaging. 2011 Jan;30(1):108-18. doi: 10.1109/TMI.2010.2063711. Epub 2010 Aug 5.
7
A review of content-based image retrieval systems in medical applications-clinical benefits and future directions.医学应用中基于内容的图像检索系统综述——临床益处与未来方向
Int J Med Inform. 2004 Feb;73(1):1-23. doi: 10.1016/j.ijmedinf.2003.11.024.
8
A semantic approach for digital long-term preservation of electronic health documents.一种用于电子健康文档数字长期保存的语义方法。
Stud Health Technol Inform. 2012;180:265-9.
9
Querying EHRs with a Semantic and Entity-Oriented Query Language.使用语义和面向实体的查询语言查询电子健康记录。
Stud Health Technol Inform. 2017;235:121-125.
10
Development of an information retrieval tool for biomedical patents.生物医学专利信息检索工具的开发。
Comput Methods Programs Biomed. 2018 Jun;159:125-134. doi: 10.1016/j.cmpb.2018.03.012. Epub 2018 Mar 14.