• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

受医学主题词表(MeSH)和医学文献数据库(PubMed)启发,使用国际专利分类法(IPC)进行自动专利分类和引导式专利检索。

Automated Patent Categorization and Guided Patent Search using IPC as Inspired by MeSH and PubMed.

作者信息

Eisinger Daniel, Tsatsaronis George, Bundschus Markus, Wieneke Ulrich, Schroeder Michael

机构信息

TU Dresden, BIOTEC, Tatzberg 47/49, 01307 Dresden, Germany.

出版信息

J Biomed Semantics. 2013 Apr 15;4 Suppl 1(Suppl 1):S3. doi: 10.1186/2041-1480-4-S1-S3.

DOI:10.1186/2041-1480-4-S1-S3
PMID:23734562
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3632996/
Abstract

Document search on PubMed, the pre-eminent database for biomedical literature, relies on the annotation of its documents with relevant terms from the Medical Subject Headings ontology (MeSH) for improving recall through query expansion. Patent documents are another important information source, though they are considerably less accessible. One option to expand patent search beyond pure keywords is the inclusion of classification information: Since every patent is assigned at least one class code, it should be possible for these assignments to be automatically used in a similar way as the MeSH annotations in PubMed. In order to develop a system for this task, it is necessary to have a good understanding of the properties of both classification systems. This report describes our comparative analysis of MeSH and the main patent classification system, the International Patent Classification (IPC). We investigate the hierarchical structures as well as the properties of the terms/classes respectively, and we compare the assignment of IPC codes to patents with the annotation of PubMed documents with MeSH terms.Our analysis shows a strong structural similarity of the hierarchies, but significant differences of terms and annotations. The low number of IPC class assignments and the lack of occurrences of class labels in patent texts imply that current patent search is severely limited. To overcome these limits, we evaluate a method for the automated assignment of additional classes to patent documents, and we propose a system for guided patent search based on the use of class co-occurrence information and external resources.

摘要

在生物医学文献的卓越数据库PubMed上进行文献检索,依赖于用医学主题词表(MeSH)中的相关术语对文献进行标注,以通过查询扩展提高召回率。专利文献是另一个重要的信息来源,尽管获取难度要大得多。扩展专利检索范围使其超出纯关键词检索的一种方法是纳入分类信息:由于每项专利至少被分配一个分类代码,这些分类代码应该有可能以与PubMed中MeSH标注类似的方式被自动使用。为了开发用于此任务的系统,有必要深入了解这两种分类系统的特性。本报告描述了我们对MeSH和主要专利分类系统——国际专利分类(IPC)的比较分析。我们分别研究了层次结构以及术语/类别的特性,并将IPC代码对专利的分配与PubMed文献用MeSH术语进行的标注进行了比较。我们的分析表明,两者层次结构有很强的结构相似性,但术语和标注存在显著差异。IPC分类代码分配数量较少以及专利文本中类别标签出现次数不足,这意味着当前的专利检索受到严重限制。为克服这些限制,我们评估了一种为专利文献自动分配额外类别的方法,并提出了一种基于类别共现信息和外部资源使用的引导式专利检索系统。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c129/3632996/078a937477be/2041-1480-4-S1-S3-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c129/3632996/fff4f3802522/2041-1480-4-S1-S3-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c129/3632996/76036bace547/2041-1480-4-S1-S3-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c129/3632996/f64c02e6161d/2041-1480-4-S1-S3-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c129/3632996/6a23860f453c/2041-1480-4-S1-S3-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c129/3632996/55b19da9fd44/2041-1480-4-S1-S3-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c129/3632996/c8ec61618768/2041-1480-4-S1-S3-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c129/3632996/078a937477be/2041-1480-4-S1-S3-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c129/3632996/fff4f3802522/2041-1480-4-S1-S3-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c129/3632996/76036bace547/2041-1480-4-S1-S3-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c129/3632996/f64c02e6161d/2041-1480-4-S1-S3-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c129/3632996/6a23860f453c/2041-1480-4-S1-S3-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c129/3632996/55b19da9fd44/2041-1480-4-S1-S3-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c129/3632996/c8ec61618768/2041-1480-4-S1-S3-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c129/3632996/078a937477be/2041-1480-4-S1-S3-11.jpg

相似文献

1
Automated Patent Categorization and Guided Patent Search using IPC as Inspired by MeSH and PubMed.受医学主题词表(MeSH)和医学文献数据库(PubMed)启发,使用国际专利分类法(IPC)进行自动专利分类和引导式专利检索。
J Biomed Semantics. 2013 Apr 15;4 Suppl 1(Suppl 1):S3. doi: 10.1186/2041-1480-4-S1-S3.
2
A Maximum-Entropy approach for accurate document annotation in the biomedical domain.一种用于生物医学领域准确文档标注的最大熵方法。
J Biomed Semantics. 2012 Apr 24;3 Suppl 1(Suppl 1):S2. doi: 10.1186/2041-1480-3-S1-S2.
3
MeSH Up: effective MeSH text classification for improved document retrieval.医学主题词表升级:用于改进文档检索的有效医学主题词表文本分类。
Bioinformatics. 2009 Jun 1;25(11):1412-8. doi: 10.1093/bioinformatics/btp249. Epub 2009 Apr 17.
4
Identification of the Best Semantic Expansion to Query PubMed Through Automatic Performance Assessment of Four Search Strategies on All Medical Subject Heading Descriptors: Comparative Study.通过对所有医学主题词描述符的四种检索策略进行自动性能评估来确定查询PubMed的最佳语义扩展:比较研究
JMIR Med Inform. 2020 Jun 4;8(6):e12799. doi: 10.2196/12799.
5
The TREC 2004 genomics track categorization task: classifying full text biomedical documents.2004年文本检索会议(TREC)基因组学专题分类任务:对生物医学全文文档进行分类。
J Biomed Discov Collab. 2006 Mar 14;1:4. doi: 10.1186/1747-5333-1-4.
6
Search and Graph Database Technologies for Biomedical Semantic Indexing: Experimental Analysis.用于生物医学语义索引的搜索与图形数据库技术:实验分析
JMIR Med Inform. 2017 Dec 1;5(4):e48. doi: 10.2196/medinform.7059.
7
MeSH2Matrix: combining MeSH keywords and machine learning for biomedical relation classification based on PubMed.医学主题词表到矩阵:基于PubMed结合医学主题词表关键词与机器学习进行生物医学关系分类
J Biomed Semantics. 2024 Oct 2;15(1):18. doi: 10.1186/s13326-024-00319-w.
8
G-Bean: an ontology-graph based web tool for biomedical literature retrieval.G-Bean:基于本体图的生物医学文献检索网络工具。
BMC Bioinformatics. 2014;15 Suppl 12(Suppl 12):S1. doi: 10.1186/1471-2105-15-S12-S1. Epub 2014 Nov 6.
9
Research Trend Visualization by MeSH Terms from PubMed.基于 PubMed 的 MeSH 术语的研究趋势可视化。
Int J Environ Res Public Health. 2018 May 30;15(6):1113. doi: 10.3390/ijerph15061113.
10
Using cited references to improve the retrieval of related biomedical documents.利用被引参考文献提高生物医学文献的检索相关性。
BMC Bioinformatics. 2013 Mar 27;14:113. doi: 10.1186/1471-2105-14-113.

引用本文的文献

1
Patent bibliometric analysis for global trend of organoid technologies in the past decade.过去十年间类器官技术全球趋势的专利文献计量分析。
iScience. 2022 Jul 8;25(8):104728. doi: 10.1016/j.isci.2022.104728. eCollection 2022 Aug 19.
2
Patented technologies for schistosomiasis control and prevention filed by Chinese applicants.中国申请人申请的血吸虫病防治专利技术。
Infect Dis Poverty. 2021 Jun 12;10(1):84. doi: 10.1186/s40249-021-00869-6.
3
Predicting MeSH Beyond MEDLINE.超越医学文献数据库(MEDLINE)预测医学主题词(MeSH)

本文引用的文献

1
Analysing patent landscapes in plant biotechnology and new plant breeding techniques.分析植物生物技术和新植物育种技术的专利格局。
Transgenic Res. 2013 Feb;22(1):15-29. doi: 10.1007/s11248-012-9641-z. Epub 2012 Aug 17.
2
Improving information retrieval using Medical Subject Headings Concepts: a test case on rare and chronic diseases.利用医学主题词概念提高信息检索:罕见和慢性病的案例研究。
J Med Libr Assoc. 2012 Jul;100(3):176-83. doi: 10.3163/1536-5050.100.3.007.
3
A Maximum-Entropy approach for accurate document annotation in the biomedical domain.
Proc 1st Workshop Sch Web Min (2017). 2017 Feb;2017:49-56. doi: 10.1145/3057148.3057155.
4
Patterns of database citation in articles and patents indicate long-term scientific and industry value of biological data resources.文章和专利中数据库引用模式表明了生物数据资源的长期科学和产业价值。
F1000Res. 2016 Feb 11;5. doi: 10.12688/f1000research.7911.1. eCollection 2016.
5
SureChEMBL: a large-scale, chemically annotated patent document database.SureChEMBL:一个大规模的、经过化学注释的专利文献数据库。
Nucleic Acids Res. 2016 Jan 4;44(D1):D1220-8. doi: 10.1093/nar/gkv1253. Epub 2015 Nov 17.
6
Selected papers from the 15th Annual Bio-Ontologies Special Interest Group Meeting.第15届生物本体特别兴趣小组年度会议精选论文。
J Biomed Semantics. 2013 Apr 15;4 Suppl 1(Suppl 1):I1. doi: 10.1186/2041-1480-4-S1-I1.
一种用于生物医学领域准确文档标注的最大熵方法。
J Biomed Semantics. 2012 Apr 24;3 Suppl 1(Suppl 1):S2. doi: 10.1186/2041-1480-3-S1-S2.
4
Evaluation of Query Expansion Using MeSH in PubMed.在PubMed中使用医学主题词表(MeSH)进行查询扩展的评估
Inf Retr Boston. 2009;12(1):69-80. doi: 10.1007/s10791-008-9074-8.
5
MeSH Up: effective MeSH text classification for improved document retrieval.医学主题词表升级:用于改进文档检索的有效医学主题词表文本分类。
Bioinformatics. 2009 Jun 1;25(11):1412-8. doi: 10.1093/bioinformatics/btp249. Epub 2009 Apr 17.
6
Text processing through Web services: calling Whatizit.通过网络服务进行文本处理:调用Whatizit。
Bioinformatics. 2008 Jan 15;24(2):296-8. doi: 10.1093/bioinformatics/btm557. Epub 2007 Nov 15.
7
EBIMed--text crunching to gather facts for proteins from Medline.EBIMed——通过文本处理从医学在线数据库中收集蛋白质相关事实。
Bioinformatics. 2007 Jan 15;23(2):e237-44. doi: 10.1093/bioinformatics/btl302.
8
GoPubMed: exploring PubMed with the Gene Ontology.GoPubMed:利用基因本体论探索PubMed
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W783-6. doi: 10.1093/nar/gki470.
9
Distribution of information in biomedical abstracts and full-text publications.生物医学摘要和全文出版物中的信息分布。
Bioinformatics. 2004 Nov 1;20(16):2597-604. doi: 10.1093/bioinformatics/bth291. Epub 2004 May 6.
10
Information extraction from full text scientific articles: where are the keywords?从全文科学文章中提取信息:关键词在哪里?
BMC Bioinformatics. 2003 May 29;4:20. doi: 10.1186/1471-2105-4-20.