• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TrendyGenes,一个用于检测学术界和药物发现文献趋势的计算管道。

TrendyGenes, a computational pipeline for the detection of literature trends in academia and drug discovery.

机构信息

Division of Cell and Developmental Biology, School of Life Sciences, University of Dundee, Dundee, DD1 5EH, UK.

Division of Population Health and Genomics, Ninewells Hospital, School of Medicine, University of Dundee, Dundee, DD1 9SY, UK.

出版信息

Sci Rep. 2021 Aug 3;11(1):15747. doi: 10.1038/s41598-021-94897-9.

DOI:10.1038/s41598-021-94897-9
PMID:34344904
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8333311/
Abstract

Target identification and prioritisation are prominent first steps in modern drug discovery. Traditionally, individual scientists have used their expertise to manually interpret scientific literature and prioritise opportunities. However, increasing publication rates and the wider routine coverage of human genes by omic-scale research make it difficult to maintain meaningful overviews from which to identify promising new trends. Here we propose an automated yet flexible pipeline that identifies trends in the scientific corpus which align with the specific interests of a researcher and facilitate an initial prioritisation of opportunities. Using a procedure based on co-citation networks and machine learning, genes and diseases are first parsed from PubMed articles using a novel named entity recognition system together with publication date and supporting information. Then recurrent neural networks are trained to predict the publication dynamics of all human genes. For a user-defined therapeutic focus, genes generating more publications or citations are identified as high-interest targets. We also used topic detection routines to help understand why a gene is trendy and implement a system to propose the most prominent review articles for a potential target. This TrendyGenes pipeline detects emerging targets and pathways and provides a new way to explore the literature for individual researchers, pharmaceutical companies and funding agencies.

摘要

目标识别和优先级确定是现代药物发现的重要第一步。传统上,个别科学家利用自己的专业知识来手动解释科学文献并确定优先级。然而,随着出版物数量的增加以及组学规模的研究更广泛地涵盖人类基因,从这些文献中很难获得有意义的概览,从而难以确定有前途的新趋势。在这里,我们提出了一个自动化但灵活的流程,该流程可以识别与研究人员特定兴趣一致的科学语料库中的趋势,并促进机会的初步优先级排序。使用基于共引网络和机器学习的程序,首先使用一种新颖的命名实体识别系统以及出版物日期和支持信息,从 PubMed 文章中解析基因和疾病。然后,训练递归神经网络以预测所有人类基因的出版动态。对于用户定义的治疗重点,生成更多出版物或引文的基因被识别为高兴趣目标。我们还使用主题检测例程来帮助了解为什么某个基因是趋势,并实施了一个系统,为潜在目标提出最突出的综述文章。TrendyGenes 流程可检测新兴目标和途径,并为个别研究人员、制药公司和资助机构提供了一种探索文献的新方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35b4/8333311/e59e1e981e68/41598_2021_94897_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35b4/8333311/313c3b310994/41598_2021_94897_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35b4/8333311/0b99771cebe8/41598_2021_94897_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35b4/8333311/a5a87b7df165/41598_2021_94897_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35b4/8333311/e59e1e981e68/41598_2021_94897_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35b4/8333311/313c3b310994/41598_2021_94897_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35b4/8333311/0b99771cebe8/41598_2021_94897_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35b4/8333311/a5a87b7df165/41598_2021_94897_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35b4/8333311/e59e1e981e68/41598_2021_94897_Fig4_HTML.jpg

相似文献

1
TrendyGenes, a computational pipeline for the detection of literature trends in academia and drug discovery.TrendyGenes,一个用于检测学术界和药物发现文献趋势的计算管道。
Sci Rep. 2021 Aug 3;11(1):15747. doi: 10.1038/s41598-021-94897-9.
2
In silico prediction of novel therapeutic targets using gene-disease association data.利用基因-疾病关联数据进行新型治疗靶点的计算机预测。
J Transl Med. 2017 Aug 29;15(1):182. doi: 10.1186/s12967-017-1285-6.
3
Combining Literature Mining and Machine Learning for Predicting Biomedical Discoveries.结合文献挖掘和机器学习预测生物医学发现。
Methods Mol Biol. 2022;2496:123-140. doi: 10.1007/978-1-0716-2305-3_7.
4
Text mining-based word representations for biomedical data analysis and protein-protein interaction networks in machine learning tasks.基于文本挖掘的词表示在生物医学数据分析和机器学习任务中的蛋白质-蛋白质相互作用网络。
PLoS One. 2021 Oct 15;16(10):e0258623. doi: 10.1371/journal.pone.0258623. eCollection 2021.
5
Data-driven modeling and prediction of blood glucose dynamics: Machine learning applications in type 1 diabetes.基于数据驱动的血糖动力学建模与预测:机器学习在 1 型糖尿病中的应用。
Artif Intell Med. 2019 Jul;98:109-134. doi: 10.1016/j.artmed.2019.07.007. Epub 2019 Jul 26.
6
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
7
Can literature analysis identify innovation drivers in drug discovery?文献分析能否识别药物研发中的创新驱动因素?
Nat Rev Drug Discov. 2009 Nov;8(11):865-78. doi: 10.1038/nrd2973.
8
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
9
Academic-industrial partnerships in drug discovery in the age of genomics.基因组学时代药物研发中的产学研合作。
Trends Biotechnol. 2015 Jun;33(6):320-2. doi: 10.1016/j.tibtech.2015.02.009.
10
Academia and industry: allocating credit for discovery and development of new therapies.学术界与产业界:为新疗法的发现与开发分配功劳。
J Clin Invest. 2019 May 20;129(6):2172-2174. doi: 10.1172/JCI129122.

引用本文的文献

1
pubCounteR: an R package for interrogating published literature for experimentally-derived gene lists within a user-defined biological context.pubCounteR:一个R软件包,用于在用户定义的生物学背景下,针对通过实验得出的基因列表查询已发表文献。
Front Bioinform. 2025 May 6;5:1523184. doi: 10.3389/fbinf.2025.1523184. eCollection 2025.
2
What's next? Forecasting scientific research trends.接下来是什么?预测科学研究趋势。
Heliyon. 2023 Dec 15;10(1):e23781. doi: 10.1016/j.heliyon.2023.e23781. eCollection 2024 Jan 15.
3
A publication-wide association study (PWAS), historical language models to prioritise novel therapeutic drug targets.

本文引用的文献

1
Moving targets in drug discovery.药物发现中的移动目标。
Sci Rep. 2020 Nov 19;10(1):20213. doi: 10.1038/s41598-020-77033-x.
2
Formation and Function of Mammalian Epithelia: Roles for Mechanosensitive PIEZO1 Ion Channels.哺乳动物上皮细胞的形成与功能:机械敏感型PIEZO1离子通道的作用
Front Cell Dev Biol. 2019 Oct 31;7:260. doi: 10.3389/fcell.2019.00260. eCollection 2019.
3
Targeting the TIGIT-PVR immune checkpoint axis as novel therapeutic option in breast cancer.靶向TIGIT-PVR免疫检查点轴作为乳腺癌的新型治疗选择。
全领域关联研究(PWAS),历史语言模型,以确定新型治疗药物靶点的优先级。
Sci Rep. 2023 May 24;13(1):8366. doi: 10.1038/s41598-023-35597-4.
4
Protection of the human gene research literature from contract cheating organizations known as research paper mills.保护人类基因研究文献免受被称为论文工厂的合同作弊组织的侵害。
Nucleic Acids Res. 2022 Nov 28;50(21):12058-12070. doi: 10.1093/nar/gkac1139.
5
Use of Real-World Evidence to Drive Drug Development Strategy and Inform Clinical Trial Design.利用真实世界证据推动药物研发策略并为临床试验设计提供信息。
Clin Pharmacol Ther. 2022 Jan;111(1):77-89. doi: 10.1002/cpt.2480. Epub 2021 Nov 28.
Oncoimmunology. 2019 Oct 12;8(12):e1674605. doi: 10.1080/2162402X.2019.1674605. eCollection 2019.
4
The gasdermins, a protein family executing cell death and inflammation.gasdermins,一个执行细胞死亡和炎症的蛋白家族。
Nat Rev Immunol. 2020 Mar;20(3):143-157. doi: 10.1038/s41577-019-0228-2. Epub 2019 Nov 5.
5
BioBERT: a pre-trained biomedical language representation model for biomedical text mining.BioBERT:一种用于生物医学文本挖掘的预训练生物医学语言表示模型。
Bioinformatics. 2020 Feb 15;36(4):1234-1240. doi: 10.1093/bioinformatics/btz682.
6
Artificial intelligence and machine learning in clinical development: a translational perspective.临床开发中的人工智能与机器学习:转化医学视角
NPJ Digit Med. 2019 Jul 26;2:69. doi: 10.1038/s41746-019-0148-3. eCollection 2019.
7
DNA sensing by the cGAS-STING pathway in health and disease.cGAS-STING 通路在健康和疾病中的 DNA 感应。
Nat Rev Genet. 2019 Nov;20(11):657-674. doi: 10.1038/s41576-019-0151-1. Epub 2019 Jul 29.
8
Discovery of potent necroptosis inhibitors targeting RIPK1 kinase activity for the treatment of inflammatory disorder and cancer metastasis.发现针对 RIPK1 激酶活性的强效坏死性凋亡抑制剂,用于治疗炎症性疾病和癌症转移。
Cell Death Dis. 2019 Jun 24;10(7):493. doi: 10.1038/s41419-019-1735-6.
9
Astrocytes and Microglia as Potential Contributors to the Pathogenesis of Repeat Expansion-Associated FTLD and ALS.星形胶质细胞和小胶质细胞作为重复序列扩增相关的额颞叶痴呆和肌萎缩侧索硬化症发病机制的潜在促成因素。
Front Neurosci. 2019 May 15;13:486. doi: 10.3389/fnins.2019.00486. eCollection 2019.
10
Targeting RIPK1 for the treatment of human diseases.靶向 RIPK1 治疗人类疾病。
Proc Natl Acad Sci U S A. 2019 May 14;116(20):9714-9722. doi: 10.1073/pnas.1901179116. Epub 2019 May 2.