• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于构建一组疾病相关医学主题词(MeSH®)来收集疾病候选基因并对其进行优先级排序的新方法。

A novel method for gathering and prioritizing disease candidate genes based on construction of a set of disease-related MeSH® terms.

作者信息

Ono Toshihide, Kuhara Satoru

机构信息

Department of Genetic Resources Technology, Faculty of Agriculture, Kyushu University, 6-10-1 Hakozaki Higashi-ku, Fukuoka 812-8581, Japan.

出版信息

BMC Bioinformatics. 2014 Jun 10;15:179. doi: 10.1186/1471-2105-15-179.

DOI:10.1186/1471-2105-15-179
PMID:24917541
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4068192/
Abstract

BACKGROUND

Understanding the molecular mechanisms involved in disease is critical for the development of more effective and individualized strategies for prevention and treatment. The amount of disease-related literature, including new genetic information on the molecular mechanisms of disease, is rapidly increasing. Extracting beneficial information from literature can be facilitated by computational methods such as the knowledge-discovery approach. Several methods for mining gene-disease relationships using computational methods have been developed, however, there has been a lack of research evaluating specific disease candidate genes.

RESULTS

We present a novel method for gathering and prioritizing specific disease candidate genes. Our approach involved the construction of a set of Medical Subject Headings (MeSH) terms for the effective retrieval of publications related to a disease candidate gene. Information regarding the relationships between genes and publications was obtained from the gene2pubmed database. The set of genes was prioritized using a "weighted literature score" based on the number of publications and weighted by the number of genes occurring in a publication. Using our method for the disease states of pain and Alzheimer's disease, a total of 1101 pain candidate genes and 2810 Alzheimer's disease candidate genes were gathered and prioritized. The precision was 0.30 and the recall was 0.89 in the case study of pain. The precision was 0.04 and the recall was 0.6 in the case study of Alzheimer's disease. The precision-recall curve indicated that the performance of our method was superior to that of other publicly available tools.

CONCLUSIONS

Our method, which involved the use of a set of MeSH terms related to disease candidate genes and a novel weighted literature score, improved the accuracy of gathering and prioritizing candidate genes by focusing on a specific disease.

摘要

背景

了解疾病相关的分子机制对于制定更有效和个性化的预防及治疗策略至关重要。包括疾病分子机制新遗传信息在内的疾病相关文献数量正在迅速增加。知识发现等计算方法有助于从文献中提取有益信息。已经开发了几种使用计算方法挖掘基因与疾病关系的方法,然而,缺乏对特定疾病候选基因进行评估的研究。

结果

我们提出了一种收集特定疾病候选基因并对其进行优先级排序的新方法。我们的方法包括构建一组医学主题词(MeSH)术语,以有效检索与疾病候选基因相关的出版物。基因与出版物之间关系的信息来自gene2pubmed数据库。基于出版物数量并根据出版物中出现的基因数量进行加权,使用“加权文献分数”对基因集进行优先级排序。使用我们的方法针对疼痛和阿尔茨海默病的疾病状态,共收集并优先排序了1101个疼痛候选基因和2810个阿尔茨海默病候选基因。在疼痛案例研究中,精确率为0.30,召回率为0.89。在阿尔茨海默病案例研究中,精确率为0.04,召回率为0.6。精确率-召回率曲线表明我们方法的性能优于其他公开可用工具。

结论

我们的方法涉及使用一组与疾病候选基因相关的MeSH术语和一种新的加权文献分数,通过关注特定疾病提高了收集和优先排序候选基因的准确性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfbf/4068192/44373a7fdd02/1471-2105-15-179-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfbf/4068192/57b68ba49032/1471-2105-15-179-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfbf/4068192/da3a20c3dd7b/1471-2105-15-179-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfbf/4068192/ce75b36a8fc4/1471-2105-15-179-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfbf/4068192/d0332347bc46/1471-2105-15-179-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfbf/4068192/a347ef44cd9b/1471-2105-15-179-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfbf/4068192/44373a7fdd02/1471-2105-15-179-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfbf/4068192/57b68ba49032/1471-2105-15-179-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfbf/4068192/da3a20c3dd7b/1471-2105-15-179-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfbf/4068192/ce75b36a8fc4/1471-2105-15-179-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfbf/4068192/d0332347bc46/1471-2105-15-179-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfbf/4068192/a347ef44cd9b/1471-2105-15-179-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfbf/4068192/44373a7fdd02/1471-2105-15-179-6.jpg

相似文献

1
A novel method for gathering and prioritizing disease candidate genes based on construction of a set of disease-related MeSH® terms.一种基于构建一组疾病相关医学主题词(MeSH®)来收集疾病候选基因并对其进行优先级排序的新方法。
BMC Bioinformatics. 2014 Jun 10;15:179. doi: 10.1186/1471-2105-15-179.
2
Reengineering of MeSH thesauri for term selection to optimize literature retrieval and knowledge reconstruction in support of stem cell research.重新设计医学主题词表以进行术语选择,以优化文献检索和知识重建,支持干细胞研究。
BMC Med Inform Decis Mak. 2016 May 23;16:54. doi: 10.1186/s12911-016-0298-z.
3
Text mining facilitates database curation - extraction of mutation-disease associations from Bio-medical literature.文本挖掘有助于数据库管理——从生物医学文献中提取突变与疾病的关联。
BMC Bioinformatics. 2015 Jun 6;16:185. doi: 10.1186/s12859-015-0609-x.
4
Generalized enrichment analysis improves the detection of adverse drug events from the biomedical literature.广义富集分析可提高从生物医学文献中检测药物不良事件的能力。
BMC Bioinformatics. 2016 Jun 23;17:250. doi: 10.1186/s12859-016-1080-z.
5
Prioritization of candidate disease genes by enlarging the seed set and fusing information of the network topology and gene expression.通过扩大种子集并融合网络拓扑结构和基因表达信息来对候选疾病基因进行优先级排序。
Mol Biosyst. 2014 Jun;10(6):1400-8. doi: 10.1039/c3mb70588a. Epub 2014 Apr 3.
6
MeSHDD: Literature-based drug-drug similarity for drug repositioning.医学主题词表驱动的药物-药物相似性用于药物重新定位
J Am Med Inform Assoc. 2017 May 1;24(3):614-618. doi: 10.1093/jamia/ocw142.
7
Methodologies for extracting functional pharmacogenomic experiments from international repository.从国际数据库中提取功能性药物基因组学实验的方法。
AMIA Annu Symp Proc. 2007 Oct 11;2007:463-7.
8
HypertenGene: extracting key hypertension genes from biomedical literature with position and automatically-generated template features.HypertensionGene:从生物医学文献中提取关键高血压基因,使用位置和自动生成的模板特征。
BMC Bioinformatics. 2009 Dec 3;10 Suppl 15(Suppl 15):S9. doi: 10.1186/1471-2105-10-S15-S9.
9
SciMiner: web-based literature mining tool for target identification and functional enrichment analysis.SciMiner:用于靶点识别和功能富集分析的基于网络的文献挖掘工具。
Bioinformatics. 2009 Mar 15;25(6):838-40. doi: 10.1093/bioinformatics/btp049. Epub 2009 Feb 2.
10
BioLitMine: Advanced Mining of Biomedical and Biological Literature About Human Genes and Genes from Major Model Organisms.生物文献挖掘:人类基因和主要模式生物基因的生物医学和生物学文献的高级挖掘。
G3 (Bethesda). 2020 Dec 3;10(12):4531-4539. doi: 10.1534/g3.120.401775.

引用本文的文献

1
Automated Recommendation of Research Keywords from PubMed That Suggest the Molecular Mechanism Associated with Biomarker Metabolites.从PubMed中自动推荐研究关键词,这些关键词揭示与生物标志物代谢物相关的分子机制。
Metabolites. 2022 Feb 1;12(2):133. doi: 10.3390/metabo12020133.
2
MeSH Now: automatic MeSH indexing at PubMed scale via learning to rank.医学主题词表现状:通过学习排序实现PubMed规模的自动医学主题词表索引编制。
J Biomed Semantics. 2017 Apr 17;8(1):15. doi: 10.1186/s13326-017-0123-3.

本文引用的文献

1
Prioritization of candidate disease genes by enlarging the seed set and fusing information of the network topology and gene expression.通过扩大种子集并融合网络拓扑结构和基因表达信息来对候选疾病基因进行优先级排序。
Mol Biosyst. 2014 Jun;10(6):1400-8. doi: 10.1039/c3mb70588a. Epub 2014 Apr 3.
2
PainNetworks: a web-based resource for the visualisation of pain-related genes in the context of their network associations.PainNetworks:一个基于网络的资源,用于在其网络关联的背景下可视化与疼痛相关的基因。
Pain. 2013 Dec;154(12):2586e1-2586e12. doi: 10.1016/j.pain.2013.09.003. Epub 2013 Sep 11.
3
Cataloging the biomedical world of pain through semi-automated curation of molecular interactions.
通过分子相互作用的半自动策展来编目疼痛的生物医学世界。
Database (Oxford). 2013 May 23;2013:bat033. doi: 10.1093/database/bat033. Print 2013.
4
Functional genomics of pain in analgesic drug development and therapy.镇痛药物研发与治疗中的疼痛功能基因组学
Pharmacol Ther. 2013 Jul;139(1):60-70. doi: 10.1016/j.pharmthera.2013.04.004. Epub 2013 Apr 6.
5
Inferring novel gene-disease associations using Medical Subject Heading Over-representation Profiles.利用医学主题词过度表达谱推断新的基因-疾病关联。
Genome Med. 2012 Sep 28;4(9):75. doi: 10.1186/gm376. eCollection 2012.
6
Candidate gene prioritization.候选基因优先级排序。
Mol Genet Genomics. 2012 Sep;287(9):679-98. doi: 10.1007/s00438-012-0710-z. Epub 2012 Aug 15.
7
Computational tools for prioritizing candidate genes: boosting disease gene discovery.计算工具在候选基因优先级排序中的应用:提高疾病基因发现的效率。
Nat Rev Genet. 2012 Jul 3;13(8):523-36. doi: 10.1038/nrg3253.
8
Biomarker identification for prostate cancer and lymph node metastasis from microarray data and protein interaction network using gene prioritization method.利用基因优先级排序方法从微阵列数据和蛋白质相互作用网络中鉴定前列腺癌和淋巴结转移的生物标志物。
ScientificWorldJournal. 2012;2012:842727. doi: 10.1100/2012/842727. Epub 2012 May 2.
9
Inhibition of Bax protects neuronal cells from oligomeric Aβ neurotoxicity.Bax 的抑制可保护神经元细胞免受寡聚体 Aβ 的神经毒性。
Cell Death Dis. 2012 May 17;3(5):e309. doi: 10.1038/cddis.2012.43.
10
Comprehensive research synopsis and systematic meta-analyses in Parkinson's disease genetics: The PDGene database.帕金森病遗传学的综合研究综述和系统荟萃分析:PDGene 数据库。
PLoS Genet. 2012;8(3):e1002548. doi: 10.1371/journal.pgen.1002548. Epub 2012 Mar 15.