Suppr超能文献

在生物医学文章中将缩写词映射为全称。

Mapping abbreviations to full forms in biomedical articles.

作者信息

Yu Hong, Hripcsak George, Friedman Carol

机构信息

Department of Medical Informatics, Columbia University, New York, New York 10032, USA.

出版信息

J Am Med Inform Assoc. 2002 May-Jun;9(3):262-72. doi: 10.1197/jamia.m0913.

Abstract

OBJECTIVE

To develop methods that automatically map abbreviations to their full forms in biomedical articles.

METHODS

The authors developed two methods of mapping defined and undefined abbreviations (defined abbreviations are paired with their full forms in the articles, whereas undefined ones are not). For defined abbreviations, they developed a set of pattern-matching rules to map an abbreviation to its full form and implemented the rules into a software program, AbbRE (for "abbreviation recognition and extraction"). Using the opinions of domain experts as a reference standard, they evaluated the recall and precision of AbbRE for defined abbreviations in ten biomedical articles randomly selected from the ten most frequently cited medical and biological journals. They also measured the percentage of undefined abbreviations in the same set of articles, and they investigated whether they could map undefined abbreviations to any of four public abbreviation databases (GenBank LocusLink, SWISSPROT, LRABR of the UMLS Specialist Lexicon, and BioABACUS).

RESULTS

AbbRE had an average 0.70 recall and 0.95 precision for the defined abbreviations. The authors found that an average of 25 percent of abbreviations were defined in biomedical articles and that of a randomly selected subset of undefined abbreviations, 68 percent could be mapped to any of four abbreviation databases. They also found that many abbreviations are ambiguous (i.e., they map to more than one full form in abbreviation databases).

CONCLUSION

AbbRE is efficient for mapping defined abbreviations. To couple AbbRE with abbreviation databases for the mapping of undefined abbreviations, not only exhaustive abbreviation databases but also a method to resolve the ambiguity of abbreviations in the databases are needed.

摘要

目的

开发能在生物医学文章中自动将缩写词映射为其全称的方法。

方法

作者开发了两种映射已定义和未定义缩写词的方法(已定义缩写词在文章中与它们的全称配对,而未定义的则没有)。对于已定义缩写词,他们制定了一组模式匹配规则,将缩写词映射为其全称,并将这些规则实现到一个软件程序AbbRE(“缩写词识别与提取”)中。以领域专家的意见作为参考标准,他们评估了AbbRE对从十份最常被引用的医学和生物学杂志中随机选取的十篇生物医学文章中已定义缩写词的召回率和精确率。他们还测量了同一组文章中未定义缩写词的百分比,并研究是否能将未定义缩写词映射到四个公共缩写词数据库(GenBank LocusLink、SWISSPROT、UMLS专业词典的LRABR和BioABACUS)中的任何一个。

结果

AbbRE对已定义缩写词的平均召回率为0.70,精确率为0.95。作者发现,生物医学文章中平均25%的缩写词是已定义的,在随机选取的未定义缩写词子集中,68%可以映射到四个缩写词数据库中的任何一个。他们还发现许多缩写词是模糊的(即它们在缩写词数据库中映射到不止一个全称)。

结论

AbbRE在映射已定义缩写词方面效率较高。要将AbbRE与缩写词数据库结合用于映射未定义缩写词,不仅需要详尽的缩写词数据库,还需要一种解决数据库中缩写词模糊性问题的方法。

相似文献

3
4
ADAM: another database of abbreviations in MEDLINE.ADAM:医学在线数据库(MEDLINE)中的另一个缩写词数据库。
Bioinformatics. 2006 Nov 15;22(22):2813-8. doi: 10.1093/bioinformatics/btl480. Epub 2006 Sep 18.
5
Resolving abbreviations to their senses in Medline.在医学文献数据库(Medline)中解析缩写词的含义。
Bioinformatics. 2005 Sep 15;21(18):3658-64. doi: 10.1093/bioinformatics/bti586. Epub 2005 Jul 21.

引用本文的文献

9
Automatically extracting information needs from complex clinical questions.从复杂的临床问题中自动提取信息需求。
J Biomed Inform. 2010 Dec;43(6):962-71. doi: 10.1016/j.jbi.2010.07.007. Epub 2010 Jul 27.
10
Building a high-quality sense inventory for improved abbreviation disambiguation.构建高质量的感观词库以提高缩写词消歧
Bioinformatics. 2010 May 1;26(9):1246-53. doi: 10.1093/bioinformatics/btq129. Epub 2010 Mar 25.

本文引用的文献

6
A short guide to abbreviations and their use in peptide science.肽科学中缩写词及其用法简短指南。
J Pept Sci. 1999 Nov;5(11):465-71. doi: 10.1002/(SICI)1099-1387(199911)5:11<465::AID-PSC224>3.0.CO;2-A.
7
The effect of abbreviations on MEDLINE searching.缩写词对医学文献数据库检索的影响。
Acad Emerg Med. 1999 Apr;6(4):292-6. doi: 10.1111/j.1553-2712.1999.tb00392.x.
9
Acronyms of clinical trials in cardiology--1998.1998年心脏病学临床试验的首字母缩略词
Am Heart J. 1999 Apr;137(4 Pt 1):726-65. doi: 10.1016/s0002-8703(99)70230-9.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验