• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于精准医学的全球文本挖掘与药物基因组学知识资源开发。

Global Text Mining and Development of Pharmacogenomic Knowledge Resource for Precision Medicine.

作者信息

Guin Debleena, Rani Jyoti, Singh Priyanka, Grover Sandeep, Bora Shivangi, Talwar Puneet, Karthikeyan Muthusamy, Satyamoorthy K, Adithan C, Ramachandran S, Saso Luciano, Hasija Yasha, Kukreti Ritushree

机构信息

Genomics and Molecular Medicine Unit, Council of Scientific and Industrial Research (CSIR)-Institute of Genomics and Integrative Biology (IGIB), New Delhi, India.

Department of Biotechnology, Delhi Technological University, Delhi, India.

出版信息

Front Pharmacol. 2019 Aug 7;10:839. doi: 10.3389/fphar.2019.00839. eCollection 2019.

DOI:10.3389/fphar.2019.00839
PMID:31447668
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6692532/
Abstract

Understanding patients' genomic variations and their effect in protecting or predisposing them to drug response phenotypes is important for providing personalized healthcare. Several studies have manually curated such genotype-phenotype relationships into organized databases from clinical trial data or published literature. However, there are no text mining tools available to extract high-accuracy information from such existing knowledge. In this work, we used a semiautomated text mining approach to retrieve a complete pharmacogenomic (PGx) resource integrating disease-drug-gene-polymorphism relationships to derive a global perspective for ease in therapeutic approaches. We used an R package, pubmed.mineR, to automatically retrieve PGx-related literature. We identified 1,753 disease types, and 666 drugs, associated with 4,132 genes and 33,942 polymorphisms collated from 180,088 publications. With further manual curation, we obtained a total of 2,304 PGx relationships. We evaluated our approach by performance (precision = 0.806) with benchmark datasets like Pharmacogenomic Knowledgebase (PharmGKB) (0.904), Online Mendelian Inheritance in Man (OMIM) (0.600), and The Comparative Toxicogenomics Database (CTD) (0.729). We validated our study by comparing our results with 362 commercially used the US- Food and drug administration (FDA)-approved drug labeling biomarkers. Of the 2,304 PGx relationships identified, 127 belonged to the FDA list of 362 approved pharmacogenomic markers, indicating that our semiautomated text mining approach may reveal significant PGx information with markers for drug response prediction. In addition, it is a scalable and state-of-art approach in curation for PGx clinical utility.

摘要

了解患者的基因组变异及其在保护或使他们易患药物反应表型方面的作用,对于提供个性化医疗保健至关重要。几项研究已从临床试验数据或已发表的文献中手动整理了此类基因型-表型关系,并将其纳入有组织的数据库中。然而,目前还没有文本挖掘工具可用于从此类现有知识中提取高精度信息。在这项工作中,我们使用了一种半自动文本挖掘方法来检索一个完整的药物基因组学(PGx)资源,该资源整合了疾病-药物-基因-多态性之间的关系,以便从全局角度轻松制定治疗方法。我们使用一个R包pubmed.mineR自动检索与PGx相关的文献。我们从180,088篇出版物中整理出1,753种疾病类型、666种药物,这些与4,132个基因和33,942个多态性相关。经过进一步的人工整理,我们总共获得了2,304个PGx关系。我们使用诸如药物基因组学知识库(PharmGKB)(精度 = 0.904)、《人类孟德尔遗传在线》(OMIM)(精度 = 0.600)和比较毒理基因组学数据库(CTD)(精度 = 0.729)等基准数据集来评估我们方法的性能(精度 = 0.806)。我们通过将我们的结果与362种商业使用的美国食品药品监督管理局(FDA)批准的药物标签生物标志物进行比较来验证我们的研究。在确定的2,304个PGx关系中,有127个属于FDA列出的362个批准的药物基因组学标记,这表明我们的半自动文本挖掘方法可能通过药物反应预测标记揭示重要的PGx信息。此外,它是一种用于PGx临床应用整理的可扩展且先进的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8db0/6692532/4ee2749e2adb/fphar-10-00839-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8db0/6692532/ec009c10beab/fphar-10-00839-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8db0/6692532/0e864efbf4da/fphar-10-00839-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8db0/6692532/4ee2749e2adb/fphar-10-00839-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8db0/6692532/ec009c10beab/fphar-10-00839-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8db0/6692532/0e864efbf4da/fphar-10-00839-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8db0/6692532/4ee2749e2adb/fphar-10-00839-g003.jpg

相似文献

1
Global Text Mining and Development of Pharmacogenomic Knowledge Resource for Precision Medicine.用于精准医学的全球文本挖掘与药物基因组学知识资源开发。
Front Pharmacol. 2019 Aug 7;10:839. doi: 10.3389/fphar.2019.00839. eCollection 2019.
2
Text Mining Genotype-Phenotype Relationships from Biomedical Literature for Database Curation and Precision Medicine.从生物医学文献中挖掘基因型-表型关系以用于数据库管理和精准医学。
PLoS Comput Biol. 2016 Nov 30;12(11):e1005017. doi: 10.1371/journal.pcbi.1005017. eCollection 2016 Nov.
3
Systematic identification of pharmacogenomics information from clinical trials.从临床试验中系统地识别药物基因组学信息。
J Biomed Inform. 2012 Oct;45(5):870-8. doi: 10.1016/j.jbi.2012.04.005. Epub 2012 Apr 24.
4
Text mining and manual curation of chemical-gene-disease networks for the comparative toxicogenomics database (CTD).文本挖掘和化学-基因-疾病网络的人工整理用于比较毒理学基因组数据库(CTD)。
BMC Bioinformatics. 2009 Oct 8;10:326. doi: 10.1186/1471-2105-10-326.
5
PGxO and PGxLOD: a reconciliation of pharmacogenomic knowledge of various provenances, enabling further comparison.PGxO 和 PGxLOD:对各种来源的药物基因组学知识进行协调,从而实现进一步比较。
BMC Bioinformatics. 2019 Apr 18;20(Suppl 4):139. doi: 10.1186/s12859-019-2693-9.
6
PharmGKB: A worldwide resource for pharmacogenomic information.PharmGKB:一个全球性的药物基因组学信息资源。
Wiley Interdiscip Rev Syst Biol Med. 2018 Jul;10(4):e1417. doi: 10.1002/wsbm.1417. Epub 2018 Feb 23.
7
Pharmacogenomic Biomarkers in US FDA-Approved Drug Labels (2000-2020).美国食品药品监督管理局批准的药品标签中的药物基因组生物标志物(2000 - 2020年)
J Pers Med. 2021 Mar 4;11(3):179. doi: 10.3390/jpm11030179.
8
Text mining effectively scores and ranks the literature for improving chemical-gene-disease curation at the comparative toxicogenomics database.文本挖掘有效地对文献进行评分和排序,以提高比较毒理学基因组学数据库中的化学物质-基因-疾病的编纂工作。
PLoS One. 2013 Apr 17;8(4):e58201. doi: 10.1371/journal.pone.0058201. Print 2013.
9
Text-mining clinically relevant cancer biomarkers for curation into the CIViC database.从临床相关癌症生物标志物文本中挖掘信息,将其纳入 CIViC 数据库。
Genome Med. 2019 Dec 3;11(1):78. doi: 10.1186/s13073-019-0686-y.
10
Text Mining for Precision Medicine: Bringing Structure to EHRs and Biomedical Literature to Understand Genes and Health.精准医学的文本挖掘:为电子健康记录和生物医学文献构建结构以理解基因与健康。
Adv Exp Med Biol. 2016;939:139-166. doi: 10.1007/978-981-10-1503-8_7.

引用本文的文献

1
Mini review on skin biopsy: traditional and modern techniques.皮肤活检综述:传统与现代技术
Front Med (Lausanne). 2025 Mar 5;12:1476685. doi: 10.3389/fmed.2025.1476685. eCollection 2025.
2
Pharmacogenomics Tools for Precision Public Health and Lessons for Low- and Middle-Income Countries: A Scoping Review.用于精准公共卫生的药物基因组学工具及对低收入和中等收入国家的启示:一项范围综述
Pharmgenomics Pers Med. 2025 Jan 30;18:19-34. doi: 10.2147/PGPM.S490135. eCollection 2025.
3
Text Mining Protocol to Retrieve Significant Drug-Gene Interactions from PubMed Abstracts.

本文引用的文献

1
PreMedKB: an integrated precision medicine knowledgebase for interpreting relationships between diseases, genes, variants and drugs.PreMedKB:一个综合的精准医学知识库,用于解释疾病、基因、变异和药物之间的关系。
Nucleic Acids Res. 2019 Jan 8;47(D1):D1090-D1101. doi: 10.1093/nar/gky1042.
2
A global network of biomedical relationships derived from text.从文本中提取的生物医学关系的全球网络。
Bioinformatics. 2018 Aug 1;34(15):2614-2624. doi: 10.1093/bioinformatics/bty114.
3
Pharmacogenomics of GPCR Drug Targets.GPCR 药物靶点的药物基因组学。
从 PubMed 摘要中检索重要药物-基因相互作用的文本挖掘方案。
Methods Mol Biol. 2022;2496:17-39. doi: 10.1007/978-1-0716-2305-3_2.
4
PharmGKB, an Integrated Resource of Pharmacogenomic Knowledge.PharmGKB,一个综合性的药物基因组学知识库。
Curr Protoc. 2021 Aug;1(8):e226. doi: 10.1002/cpz1.226.
5
Automated Extraction of Information From Texts of Scientific Publications: Insights Into HIV Treatment Strategies.从科学出版物文本中自动提取信息:对HIV治疗策略的见解
Front Genet. 2020 Dec 22;11:618862. doi: 10.3389/fgene.2020.618862. eCollection 2020.
6
A Novel Text-Mining Approach for Retrieving Pharmacogenomics Associations From the Literature.一种从文献中检索药物基因组学关联的新型文本挖掘方法。
Front Pharmacol. 2020 Nov 10;11:602030. doi: 10.3389/fphar.2020.602030. eCollection 2020.
Cell. 2018 Jan 11;172(1-2):41-54.e19. doi: 10.1016/j.cell.2017.11.033. Epub 2017 Dec 14.
4
Text Mining Genotype-Phenotype Relationships from Biomedical Literature for Database Curation and Precision Medicine.从生物医学文献中挖掘基因型-表型关系以用于数据库管理和精准医学。
PLoS Comput Biol. 2016 Nov 30;12(11):e1005017. doi: 10.1371/journal.pcbi.1005017. eCollection 2016 Nov.
5
Genenames.org: the HGNC and VGNC resources in 2017.Genenames.org:2017年的HGNC和VGNC资源。
Nucleic Acids Res. 2017 Jan 4;45(D1):D619-D625. doi: 10.1093/nar/gkw1033. Epub 2016 Oct 30.
6
The Comparative Toxicogenomics Database: update 2017.比较毒理基因组学数据库:2017年更新版
Nucleic Acids Res. 2017 Jan 4;45(D1):D972-D978. doi: 10.1093/nar/gkw838. Epub 2016 Sep 19.
7
pubmed.mineR: an R package with text-mining algorithms to analyse PubMed abstracts.pubmed.mineR:一个带有文本挖掘算法的R包,用于分析PubMed摘要。
J Biosci. 2015 Oct;40(4):671-82. doi: 10.1007/s12038-015-9552-2.
8
A global reference for human genetic variation.人类遗传变异的全球参考。
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.
9
GNormPlus: An Integrative Approach for Tagging Genes, Gene Families, and Protein Domains.GNormPlus:一种用于标记基因、基因家族和蛋白质结构域的综合方法。
Biomed Res Int. 2015;2015:918710. doi: 10.1155/2015/918710. Epub 2015 Aug 25.
10
Text mining facilitates database curation - extraction of mutation-disease associations from Bio-medical literature.文本挖掘有助于数据库管理——从生物医学文献中提取突变与疾病的关联。
BMC Bioinformatics. 2015 Jun 6;16:185. doi: 10.1186/s12859-015-0609-x.