• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Fast exact string pattern-matching algorithms adapted to the characteristics of the medical language.适应医学语言特点的快速精确字符串模式匹配算法。
J Am Med Inform Assoc. 2000 Jul-Aug;7(4):378-91. doi: 10.1136/jamia.2000.0070378.
2
Block-suffix shifting: fast, simultaneous medical concept set identification in large medical record corpora.块后缀移位:在大型医疗记录语料库中快速、同步识别医学概念集
AMIA Annu Symp Proc. 2008 Nov 6;2008:424-8.
3
Entropy-Based Approach in Selection Exact String-Matching Algorithms.基于熵的精确字符串匹配算法选择方法。
Entropy (Basel). 2020 Dec 28;23(1):31. doi: 10.3390/e23010031.
4
The WM-q multiple exact string matching algorithm for DNA sequences.用于 DNA 序列的 WM-q 多重精确字符串匹配算法。
Comput Biol Med. 2021 Sep;136:104656. doi: 10.1016/j.compbiomed.2021.104656. Epub 2021 Jul 17.
5
Improving hash-q exact string matching algorithm with perfect hashing for DNA sequences.利用完美哈希提高 DNA 序列的哈希-Q 精确字符串匹配算法。
Comput Biol Med. 2021 Apr;131:104292. doi: 10.1016/j.compbiomed.2021.104292. Epub 2021 Feb 22.
6
A design of a parallel architecture for solving exact matching problem on DNA molecules.一种用于解决DNA分子上精确匹配问题的并行架构设计。
Biomed Sci Instrum. 2007;43:170-5.
7
Assessment of approximate string matching in a biomedical text retrieval problem.生物医学文本检索问题中近似字符串匹配的评估
Comput Biol Med. 2005 Oct;35(8):717-24. doi: 10.1016/j.compbiomed.2004.06.002.
8
A fast Boyer-Moore type pattern matching algorithm for highly similar sequences.一种用于高度相似序列的快速Boyer-Moore型模式匹配算法。
Int J Data Min Bioinform. 2015;13(3):266-88. doi: 10.1504/ijdmb.2015.072101.
9
Discriminative application of string similarity methods to chemical and non-chemical names for biomedical abbreviation clustering.字符串相似性方法在生物医学缩写聚类的化学和非化学名称中的判别应用。
BMC Genomics. 2012 Jun 11;13 Suppl 3(Suppl 3):S8. doi: 10.1186/1471-2164-13-S3-S8.
10
A dual-bound algorithm for very fast and exact template matching.一种用于极快速和精确模板匹配的双边界算法。
IEEE Trans Pattern Anal Mach Intell. 2011 Mar;33(3):459-70. doi: 10.1109/TPAMI.2010.105.

引用本文的文献

1
A de-identifier for medical discharge summaries.一份用于出院小结的去标识信息。
Artif Intell Med. 2008 Jan;42(1):13-35. doi: 10.1016/j.artmed.2007.10.001. Epub 2007 Nov 28.
2
Identifying patient smoking status from medical discharge records.从医疗出院记录中识别患者的吸烟状况。
J Am Med Inform Assoc. 2008 Jan-Feb;15(1):14-24. doi: 10.1197/jamia.M2408. Epub 2007 Oct 18.
3
The contribution of morphological knowledge to French MeSH mapping for information retrieval.形态学知识对用于信息检索的法语医学主题词表映射的贡献。
Proc AMIA Symp. 2001:796-800.

本文引用的文献

1
Power of expression in the electronic patient record: structured data or narrative text?电子病历中的表达能力:结构化数据还是叙述性文本?
Int J Med Inform. 2000 Sep;58-59:101-10. doi: 10.1016/s1386-5056(00)00079-4.
2
Exploring the degree of concordance of coded and textual data in answering clinical queries from a clinical data repository.探索编码数据与文本数据在回答来自临床数据存储库的临床问题时的一致程度。
J Am Med Inform Assoc. 2000 Jan-Feb;7(1):42-54. doi: 10.1136/jamia.2000.0070042.
3
Morpho-semantic parsing of medical expressions.医学表达式的形态语义解析
Proc AMIA Symp. 1998:760-4.
4
Hospitexte: towards a document-based hypertextual electronic medical record.医院文本:迈向基于文档的超文本电子病历。
Proc AMIA Symp. 1998:713-7.
5
Representing information in patient reports using natural language processing and the extensible markup language.使用自然语言处理和可扩展标记语言在患者报告中呈现信息。
J Am Med Inform Assoc. 1999 Jan-Feb;6(1):76-87. doi: 10.1136/jamia.1999.0060076.
6
Internet integrated in the daily medical practice within an electronic patient record.互联网集成于电子病历中的日常医疗实践之中。
Comput Biol Med. 1998 Sep;28(5):567-79. doi: 10.1016/s0010-4825(98)00034-1.
7
Medical dictionaries for patient encoding systems: a methodology.用于患者编码系统的医学词典:一种方法
Artif Intell Med. 1998 Sep-Oct;14(1-2):201-14. doi: 10.1016/s0933-3657(98)00023-2.
8
COFFEE: an objective function for multiple sequence alignments.COFFEE:一种用于多序列比对的目标函数。
Bioinformatics. 1998 Jun;14(5):407-22. doi: 10.1093/bioinformatics/14.5.407.
9
Genetic algorithms: principles of natural selection applied to computation.遗传算法:应用于计算的自然选择原理。
Science. 1993 Aug 13;261(5123):872-8. doi: 10.1126/science.8346439.
10
Morphosemantic analysis of -ITIS forms in medical language.医学语言中-ITIS形式的形态语义分析。
Methods Inf Med. 1980 Apr;19(2):99-105.

适应医学语言特点的快速精确字符串模式匹配算法。

Fast exact string pattern-matching algorithms adapted to the characteristics of the medical language.

作者信息

Lovis C, Baud R H

机构信息

Puget Sound Health Care System, Seattle, Washington, USA.

出版信息

J Am Med Inform Assoc. 2000 Jul-Aug;7(4):378-91. doi: 10.1136/jamia.2000.0070378.

DOI:10.1136/jamia.2000.0070378
PMID:10887166
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC61442/
Abstract

OBJECTIVE

The authors consider the problem of exact string pattern matching using algorithms that do not require any preprocessing. To choose the most appropriate algorithm, distinctive features of the medical language must be taken into account. The characteristics of medical language are emphasized in this regard, the best algorithm of those reviewed is proposed, and detailed evaluations of time complexity for processing medical texts are provided.

DESIGN

The authors first illustrate and discuss the techniques of various string pattern-matching algorithms. Next, the source code and the behavior of representative exact string pattern-matching algorithms are presented in a comprehensive manner to promote their implementation. Detailed explanations of the use of various techniques to improve performance are given.

MEASUREMENTS

Real-time measures of time complexity with English medical texts are presented. They lead to results distinct from those found in the computer science literature, which are typically computed with normally distributed texts.

RESULTS

The Boyer-Moore-Horspool algorithm achieves the best overall results when used with medical texts. This algorithm usually performs at least twice as fast as the other algorithms tested.

CONCLUSION

The time performance of exact string pattern matching can be greatly improved if an efficient algorithm is used. Considering the growing amount of text handled in the electronic patient record, it is worth implementing this efficient algorithm.

摘要

目的

作者考虑使用无需任何预处理的算法来解决精确字符串模式匹配问题。为了选择最合适的算法,必须考虑医学语言的独特特征。在此方面强调了医学语言的特点,提出了所审查算法中最佳的算法,并提供了处理医学文本的时间复杂度的详细评估。

设计

作者首先说明并讨论各种字符串模式匹配算法的技术。接下来,全面展示代表性精确字符串模式匹配算法的源代码和行为,以促进其实现。给出了使用各种技术提高性能的详细解释。

测量

给出了对英文医学文本时间复杂度的实时测量结果。这些结果与计算机科学文献中的结果不同,后者通常是用正态分布文本计算得出的。

结果

Boyer-Moore-Horspool算法与医学文本一起使用时能取得最佳的总体结果。该算法的执行速度通常至少是其他测试算法的两倍。

结论

如果使用高效算法,精确字符串模式匹配的时间性能可以大大提高。考虑到电子病历中处理的文本量不断增加,值得实施这种高效算法。