• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用生物主题模型揭示生物医学文献中的主题及其演变:以人参为例的研究

Revealing topics and their evolution in biomedical literature using Bio-DTM: a case study of ginseng.

作者信息

Chen Qian, Ai Ni, Liao Jie, Shao Xin, Liu Yufeng, Fan Xiaohui

机构信息

School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou, 325035 China.

Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058 China.

出版信息

Chin Med. 2017 Sep 12;12:27. doi: 10.1186/s13020-017-0148-7. eCollection 2017.

DOI:10.1186/s13020-017-0148-7
PMID:28919923
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5596940/
Abstract

BACKGROUND

Valuable scientific results on biomedicine are very rich, but they are widely scattered in the literature. Topic modeling enables researchers to discover themes from an unstructured collection of documents without any prior annotations or labels. In this paper, taking ginseng as an example, biological dynamic topic model (Bio-DTM) was proposed to conduct a retrospective study and interpret the temporal evolution of the research of ginseng.

METHODS

The system of Bio-DTM mainly includes four components, documents pre-processing, bio-dictionary construction, dynamic topic models, topics analysis and visualization. Scientific articles pertaining to ginseng were retrieved through text mining from PubMed. The bio-dictionary integrates MedTerms medical dictionary, the second edition of side effect resource, a dictionary of biology and HGNC database of human gene names (HGNC). A dynamic topic model, a text mining technique, was used to emphasize on capturing the development trends of topics in a sequentially collected documents. Besides the contents of topics taken on, the evolution of topics was visualized over time using ThemeRiver.

RESULTS

From the topic 9, ginseng was used in dietary supplements and complementary and integrative health practices, and became very popular since the early twentieth century. Topic 6 reminded that the planting of ginseng is a major area of research and symbiosis and allelopathy of ginseng became a research hotspot in 2007. In addition, the Bio-DTM model gave an insight into the main pharmacologic effects of ginseng, such as anti-metabolic disorder effect, cardioprotective effect, anti-cancer effect, hepatoprotective effect, anti-thrombotic effect and neuroprotective effect.

CONCLUSION

The Bio-DTM model not only discovers what ginseng's research involving in but also displays how these topics evolving over time. This approach can be applied to the biomedical field to conduct a retrospective study and guide future studies.

摘要

背景

生物医学领域有价值的科学成果非常丰富,但它们广泛分散在文献中。主题建模使研究人员能够从非结构化的文档集合中发现主题,而无需任何预先注释或标签。本文以人参为例,提出生物动态主题模型(Bio-DTM)进行回顾性研究,并解读人参研究的时间演变。

方法

Bio-DTM系统主要包括四个组件,即文档预处理、生物词典构建、动态主题模型、主题分析与可视化。通过文本挖掘从PubMed检索与人参相关的科学文章。生物词典整合了医学术语词典、副作用资源第二版、生物学词典和人类基因名称HGNC数据库(HGNC)。动态主题模型是一种文本挖掘技术,用于强调捕捉顺序收集文档中主题的发展趋势。除了主题所涵盖的内容外,还使用主题河流(ThemeRiver)随时间可视化主题的演变。

结果

从主题9来看,人参被用于膳食补充剂以及补充和综合健康实践,自20世纪初以来变得非常流行。主题6提醒人们,人参种植是一个主要研究领域,人参的共生和化感作用在2007年成为研究热点。此外,Bio-DTM模型深入了解了人参的主要药理作用,如抗代谢紊乱作用、心脏保护作用、抗癌作用、肝脏保护作用、抗血栓作用和神经保护作用。

结论

Bio-DTM模型不仅发现了人参研究涉及的内容,还展示了这些主题如何随时间演变。这种方法可应用于生物医学领域进行回顾性研究并指导未来研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1712/5596940/17fa46aeca96/13020_2017_148_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1712/5596940/4548d7debd71/13020_2017_148_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1712/5596940/ecc0fa45f4f7/13020_2017_148_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1712/5596940/17fa46aeca96/13020_2017_148_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1712/5596940/4548d7debd71/13020_2017_148_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1712/5596940/ecc0fa45f4f7/13020_2017_148_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1712/5596940/17fa46aeca96/13020_2017_148_Fig3_HTML.jpg

相似文献

1
Revealing topics and their evolution in biomedical literature using Bio-DTM: a case study of ginseng.使用生物主题模型揭示生物医学文献中的主题及其演变:以人参为例的研究
Chin Med. 2017 Sep 12;12:27. doi: 10.1186/s13020-017-0148-7. eCollection 2017.
2
Extracting information and inferences from a large text corpus.从大型文本语料库中提取信息和推论。
Int J Inf Technol. 2023;15(1):435-445. doi: 10.1007/s41870-022-01123-4. Epub 2022 Nov 20.
3
Application of dynamic topic models to toxicogenomics data.动态主题模型在毒理基因组学数据中的应用。
BMC Bioinformatics. 2016 Oct 6;17(Suppl 13):368. doi: 10.1186/s12859-016-1225-0.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
6
Natural language processing reveals research trends and topics in The Spine Journal over two decades: a topic modeling study.自然语言处理揭示了《脊柱杂志》二十多年来的研究趋势和主题:一项主题建模研究。
Spine J. 2024 Mar;24(3):397-405. doi: 10.1016/j.spinee.2023.09.024. Epub 2023 Oct 4.
7
Technological Innovations in Disease Management: Text Mining US Patent Data From 1995 to 2017.疾病管理中的技术创新:挖掘1995年至2017年的美国专利数据
J Med Internet Res. 2019 Apr 30;21(4):e13316. doi: 10.2196/13316.
8
Machine Learning-Based Approach for Identifying Research Gaps: COVID-19 as a Case Study.基于机器学习的研究空白识别方法:以COVID-19为例
JMIR Form Res. 2024 Mar 5;8:e49411. doi: 10.2196/49411.
9
Mining FDA drug labels using an unsupervised learning technique--topic modeling.利用无监督学习技术——主题建模挖掘 FDA 药物标签。
BMC Bioinformatics. 2011 Oct 18;12 Suppl 10(Suppl 10):S11. doi: 10.1186/1471-2105-12-S10-S11.
10
Exploring Trends in Environmental, Social, and Governance Themes and Their Sentimental Value Over Time.探索环境、社会和治理主题的趋势及其随时间的情感价值。
Front Psychol. 2022 Jun 28;13:890435. doi: 10.3389/fpsyg.2022.890435. eCollection 2022.

引用本文的文献

1
RegenX: an NLP recommendation engine for neuroregeneration topics over time.RegenX:一个随时间推移用于神经再生主题的自然语言处理推荐引擎。
Ann Eye Sci. 2022 Mar;7. doi: 10.21037/aes-21-29. Epub 2022 Mar 15.
2
Tracking Knowledge Evolution Based on the Terminology Dynamics in 4P-Medicine.基于 4P 医学术语动态的知识演化追踪。
Int J Environ Res Public Health. 2020 Oct 13;17(20):7444. doi: 10.3390/ijerph17207444.
3
Open Agile text mining for bioinformatics: the PubAnnotation ecosystem.开放的生物信息学敏捷文本挖掘:PubAnnotation 生态系统。

本文引用的文献

1
Quality and characteristics of fermented ginseng seed oil based on bacterial strain and extraction method.基于菌株和提取方法的发酵人参籽油的品质与特性
J Ginseng Res. 2017 Jul;41(3):428-433. doi: 10.1016/j.jgr.2017.03.003. Epub 2017 Mar 19.
2
Integrated Transcriptomic and Metabolomic Analysis of Five Cultivars Reveals the Dynamics of Ginsenoside Biosynthesis.五个品种的转录组学与代谢组学综合分析揭示人参皂苷生物合成的动态变化
Front Plant Sci. 2017 Jun 19;8:1048. doi: 10.3389/fpls.2017.01048. eCollection 2017.
3
SparkText: Biomedical Text Mining on Big Data Framework.
Bioinformatics. 2019 Nov 1;35(21):4372-4380. doi: 10.1093/bioinformatics/btz227.
SparkText:大数据框架下的生物医学文本挖掘
PLoS One. 2016 Sep 29;11(9):e0162721. doi: 10.1371/journal.pone.0162721. eCollection 2016.
4
An overview of topic modeling and its current applications in bioinformatics.主题建模概述及其在生物信息学中的当前应用。
Springerplus. 2016 Sep 20;5(1):1608. doi: 10.1186/s40064-016-3252-8. eCollection 2016.
5
SWIFT-Review: a text-mining workbench for systematic review.SWIFT-Review:一个用于系统评价的文本挖掘工作台。
Syst Rev. 2016 May 23;5:87. doi: 10.1186/s13643-016-0263-z.
6
Trends in the use of complementary health approaches among adults: United States, 2002-2012.2002 - 2012年美国成年人使用补充健康方法的趋势
Natl Health Stat Report. 2015 Feb 10(79):1-16.
7
Use of complementary health approaches among children aged 4-17 years in the United States: National Health Interview Survey, 2007-2012.美国4至17岁儿童补充健康方法的使用情况:2007 - 2012年国家健康访谈调查
Natl Health Stat Report. 2015 Feb 10(78):1-19.
8
Identification of constituent herbs in ginseng decoctions by DNA markers.采用 DNA 标记鉴定人参汤中的组成草药。
Chin Med. 2015 Jan 30;10(1):1. doi: 10.1186/s13020-015-0029-x. eCollection 2015.
9
Application of text mining in the biomedical domain.文本挖掘在生物医学领域的应用。
Methods. 2015 Mar;74:97-106. doi: 10.1016/j.ymeth.2015.01.015. Epub 2015 Jan 30.
10
Retro: concept-based clustering of biomedical topical sets.回溯:基于概念的生物医学主题集聚类。
Bioinformatics. 2014 Nov 15;30(22):3240-8. doi: 10.1093/bioinformatics/btu514. Epub 2014 Jul 29.