• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于医学主题词过度表达谱(MeSHOPs)的定量生物医学注释。

Quantitative biomedical annotation using medical subject heading over-representation profiles (MeSHOPs).

机构信息

Centre for Molecular Medicine and Therapeutics at the Child and Family Research Institute, Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada.

出版信息

BMC Bioinformatics. 2012 Sep 27;13:249. doi: 10.1186/1471-2105-13-249.

DOI:10.1186/1471-2105-13-249
PMID:23017167
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3564935/
Abstract

BACKGROUND

MEDLINE®/PubMed® indexes over 20 million biomedical articles, providing curated annotation of its contents using a controlled vocabulary known as Medical Subject Headings (MeSH). The MeSH vocabulary, developed over 50+ years, provides a broad coverage of topics across biomedical research. Distilling the essential biomedical themes for a topic of interest from the relevant literature is important to both understand the importance of related concepts and discover new relationships.

RESULTS

We introduce a novel method for determining enriched curator-assigned MeSH annotations in a set of papers associated to a topic, such as a gene, an author or a disease. We generate MeSH Over-representation Profiles (MeSHOPs) to quantitatively summarize the annotations in a form convenient for further computational analysis and visualization. Based on a hypergeometric distribution of assigned terms, MeSHOPs statistically account for the prevalence of the associated biomedical annotation while highlighting unusually prevalent terms based on a specified background. MeSHOPs can be visualized using word clouds, providing a succinct quantitative graphical representation of the relative importance of terms. Using the publication dates of articles, MeSHOPs track changing patterns of annotation over time. Since MeSHOPs are quantitative vectors, MeSHOPs can be compared using standard techniques such as hierarchical clustering. The reliability of MeSHOP annotations is assessed based on the capacity to re-derive the subset of the Gene Ontology annotations with equivalent MeSH terms.

CONCLUSIONS

MeSHOPs allows quantitative measurement of the degree of association between any entity and the annotated medical concepts, based directly on relevant primary literature. Comparison of MeSHOPs allows entities to be related based on shared medical themes in their literature. A web interface is provided for generating and visualizing MeSHOPs.

摘要

背景

MEDLINE®/PubMed® 索引了超过 2000 万篇生物医学文章,使用称为医学主题词 (MeSH) 的受控词汇表对其内容进行精心注释。MeSH 词汇表经过 50 多年的发展,提供了对生物医学研究各个主题的广泛覆盖。从相关文献中提取出与感兴趣的主题相关的基本生物医学主题,对于理解相关概念的重要性和发现新的关系都很重要。

结果

我们介绍了一种新颖的方法,用于确定与主题(如基因、作者或疾病)相关的一组论文中丰富的编目分配 MeSH 注释。我们生成 MeSH 过度表达谱 (MeSHOPs) ,以定量总结注释形式,方便进一步的计算分析和可视化。基于分配术语的超几何分布,MeSHOPs 统计上考虑了相关生物医学注释的流行程度,同时根据指定的背景突出显示异常流行的术语。MeSHOPs 可以使用词云进行可视化,提供术语相对重要性的简洁定量图形表示。使用文章的出版日期,MeSHOPs 可以跟踪随时间变化的注释模式。由于 MeSHOPs 是定量向量,因此可以使用标准技术(如层次聚类)进行比较。MeSHOP 注释的可靠性基于重新推导出具有等效 MeSH 术语的基因本体论注释子集的能力来评估。

结论

MeSHOPs 允许根据相关原始文献,直接对任何实体与注释的医学概念之间的关联程度进行定量测量。MeSHOPs 的比较允许基于其文献中的共享医学主题来关联实体。提供了一个网络界面,用于生成和可视化 MeSHOPs。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/3564935/2844fc09ad0d/1471-2105-13-249-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/3564935/bf2fa4e45ef7/1471-2105-13-249-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/3564935/18146dc53d95/1471-2105-13-249-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/3564935/e52c9c5b8911/1471-2105-13-249-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/3564935/2e6a2654dfc1/1471-2105-13-249-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/3564935/2844fc09ad0d/1471-2105-13-249-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/3564935/bf2fa4e45ef7/1471-2105-13-249-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/3564935/18146dc53d95/1471-2105-13-249-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/3564935/e52c9c5b8911/1471-2105-13-249-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/3564935/2e6a2654dfc1/1471-2105-13-249-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4b5b/3564935/2844fc09ad0d/1471-2105-13-249-5.jpg

相似文献

1
Quantitative biomedical annotation using medical subject heading over-representation profiles (MeSHOPs).基于医学主题词过度表达谱(MeSHOPs)的定量生物医学注释。
BMC Bioinformatics. 2012 Sep 27;13:249. doi: 10.1186/1471-2105-13-249.
2
Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles.利用医学主题词过度表达谱推断新的基因-疾病关联。
Genome Med. 2012 Sep 28;4(9):75. doi: 10.1186/gm376. eCollection 2012.
3
Compensating for literature annotation bias when predicting novel drug-disease relationships through Medical Subject Heading Over-representation Profile (MeSHOP) similarity.通过主题词过度表达谱(MeSHOP)相似性预测新型药物-疾病关系时补偿文献标注偏倚。
BMC Med Genomics. 2013;6 Suppl 2(Suppl 2):S3. doi: 10.1186/1755-8794-6-S2-S3. Epub 2013 May 7.
4
Context-driven automatic subgraph creation for literature-based discovery.用于基于文献的发现的上下文驱动自动子图创建
J Biomed Inform. 2015 Apr;54:141-57. doi: 10.1016/j.jbi.2015.01.014. Epub 2015 Feb 7.
5
MeSH ORA framework: R/Bioconductor packages to support MeSH over-representation analysis.医学主题词表ORA框架:用于支持医学主题词表过度表达分析的R/Bioconductor软件包。
BMC Bioinformatics. 2015 Feb 15;16:45. doi: 10.1186/s12859-015-0453-z.
6
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
7
Cross-organism learning method to discover new gene functionalities.跨生物学习方法发现新基因功能。
Comput Methods Programs Biomed. 2016 Apr;126:20-34. doi: 10.1016/j.cmpb.2015.12.002. Epub 2015 Dec 17.
8
Ontology annotation treebrowser : an interactive tool where the complementarity of medical subject headings and gene ontology improves the interpretation of gene lists.本体注释树浏览器:一种交互式工具,其中医学主题词表和基因本体的互补性提高了基因列表的解读。
Appl Bioinformatics. 2006;5(4):225-36. doi: 10.2165/00822942-200605040-00005.
9
Comparison of automated and human assignment of MeSH terms on publicly-available molecular datasets.比较公共分子数据集上的自动和人工 MeSH 术语分配。
J Biomed Inform. 2011 Dec;44 Suppl 1(Suppl 1):S39-S43. doi: 10.1016/j.jbi.2011.03.007. Epub 2011 Mar 21.
10
Analysis of MeSH Indexing Patterns and Frequency of Predicates.医学主题词表(MeSH)标引模式及谓词频率分析
Stud Health Technol Inform. 2018;247:666-670.

引用本文的文献

1
Novel application of normalized pointwise mutual information (NPMI) to mine biomedical literature for gene sets associated with disease: use case in breast carcinogenesis.归一化逐点互信息(NPMI)在挖掘与疾病相关基因集的生物医学文献中的新应用:乳腺癌发生的案例分析
Comput Toxicol. 2018 Aug;7:46-57. doi: 10.1016/j.comtox.2018.06.003. Epub 2018 Jun 19.
2
BCScreen: A gene panel to test for breast carcinogenesis in chemical safety screening.BCScreen:一种用于化学安全性筛查中检测乳腺癌发生的基因检测组合。
Comput Toxicol. 2018 Feb;5:16-24. doi: 10.1016/j.comtox.2017.11.003. Epub 2017 Nov 21.
3
Glutaminase Deficiency Caused by Short Tandem Repeat Expansion in .

本文引用的文献

1
Mining the Gene Wiki for functional genomic knowledge.从基因维基中挖掘功能基因组学知识。
BMC Genomics. 2011 Dec 13;12:603. doi: 10.1186/1471-2164-12-603.
2
Genes2WordCloud: a quick way to identify biological themes from gene lists and free text.Genes2WordCloud:一种从基因列表和自由文本中识别生物学主题的快速方法。
Source Code Biol Med. 2011 Oct 13;6:15. doi: 10.1186/1751-0473-6-15.
3
Enabling enrichment analysis with the Human Disease Ontology.利用人类疾病本体进行富集分析。
谷氨酸酶缺乏症由. 中的短串联重复扩展引起
N Engl J Med. 2019 Apr 11;380(15):1433-1441. doi: 10.1056/NEJMoa1806627.
4
Aberration hubs in protein interaction networks highlight actionable targets in cancer.蛋白质相互作用网络中的畸变中心突出了癌症中可操作的靶点。
Oncotarget. 2018 May 18;9(38):25166-25180. doi: 10.18632/oncotarget.25382.
5
Leveraging Population-Based Clinical Quantitative Phenotyping for Drug Repositioning.利用基于人群的临床定量表型分析进行药物重定位。
CPT Pharmacometrics Syst Pharmacol. 2018 Feb;7(2):124-129. doi: 10.1002/psp4.12258. Epub 2018 Jan 24.
6
A standard database for drug repositioning.一个药物重定位的标准数据库。
Sci Data. 2017 Mar 14;4:170029. doi: 10.1038/sdata.2017.29.
7
A review of validation strategies for computational drug repositioning.计算药物重定位的验证策略综述。
Brief Bioinform. 2018 Jan 1;19(1):174-177. doi: 10.1093/bib/bbw110.
8
MeSHDD: Literature-based drug-drug similarity for drug repositioning.医学主题词表驱动的药物-药物相似性用于药物重新定位
J Am Med Inform Assoc. 2017 May 1;24(3):614-618. doi: 10.1093/jamia/ocw142.
9
Exploring the cellular basis of human disease through a large-scale mapping of deleterious genes to cell types.通过对有害基因进行大规模细胞类型定位来探索人类疾病的细胞基础。
Genome Med. 2015 Sep 1;7(1):95. doi: 10.1186/s13073-015-0212-9.
10
An application of MeSH enrichment analysis in livestock.医学主题词表(MeSH)富集分析在畜牧领域的应用。
Anim Genet. 2015 Aug;46(4):381-7. doi: 10.1111/age.12307. Epub 2015 Jun 2.
J Biomed Inform. 2011 Dec;44 Suppl 1(Suppl 1):S31-S38. doi: 10.1016/j.jbi.2011.04.007. Epub 2011 Apr 29.
4
Omics and literature mining.组学与文献挖掘
Methods Mol Biol. 2011;719:457-77. doi: 10.1007/978-1-61779-027-0_21.
5
An ontology-neutral framework for enrichment analysis.一种用于富集分析的本体中立框架。
AMIA Annu Symp Proc. 2010 Nov 13;2010:797-801.
6
Visual presentation as a welcome alternative to textual presentation of gene annotation information.将基因注释信息以可视化的方式呈现,是一种受欢迎的替代文本呈现方式。
Adv Exp Med Biol. 2010;680:709-15. doi: 10.1007/978-1-4419-5913-3_79.
7
GeneMesh: a web-based microarray analysis tool for relating differentially expressed genes to MeSH terms.GeneMesh:一个基于网络的微阵列分析工具,用于将差异表达基因与 MeSH 术语相关联。
BMC Bioinformatics. 2010 Apr 1;11:166. doi: 10.1186/1471-2105-11-166.
8
LigerCat: using "MeSH Clouds" from journal, article, or gene citations to facilitate the identification of relevant biomedical literature.LigerCat:利用来自期刊、文章或基因引用的“医学主题词云”来促进相关生物医学文献的识别。
AMIA Annu Symp Proc. 2009 Nov 14;2009:563-7.
9
Can literature analysis identify innovation drivers in drug discovery?文献分析能否识别药物研发中的创新驱动因素?
Nat Rev Drug Discov. 2009 Nov;8(11):865-78. doi: 10.1038/nrd2973.
10
Circos: an information aesthetic for comparative genomics.Circos:一种用于比较基因组学的信息美学。
Genome Res. 2009 Sep;19(9):1639-45. doi: 10.1101/gr.092759.109. Epub 2009 Jun 18.