• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

运用多方面主题建模技术分析生物信息学领域。

Analyzing the field of bioinformatics with the multi-faceted topic modeling technique.

作者信息

Heo Go Eun, Kang Keun Young, Song Min, Lee Jeong-Hoon

机构信息

Department of Library and Information Science, Yonsei University, 50 Yonsei-ro Seodaemun-gu, Seoul, 03722, Republic of Korea.

Department of Creative IT Engineering, POSTECH, 77 Cheongam-ro Nam-gu, Pohang, Gyeongbuk, 37673, Republic of Korea.

出版信息

BMC Bioinformatics. 2017 May 31;18(Suppl 7):251. doi: 10.1186/s12859-017-1640-x.

DOI:10.1186/s12859-017-1640-x
PMID:28617229
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5471940/
Abstract

BACKGROUND

Bioinformatics is an interdisciplinary field at the intersection of molecular biology and computing technology. To characterize the field as convergent domain, researchers have used bibliometrics, augmented with text-mining techniques for content analysis. In previous studies, Latent Dirichlet Allocation (LDA) was the most representative topic modeling technique for identifying topic structure of subject areas. However, as opposed to revealing the topic structure in relation to metadata such as authors, publication date, and journals, LDA only displays the simple topic structure.

METHODS

In this paper, we adopt the Tang et al.'s Author-Conference-Topic (ACT) model to study the field of bioinformatics from the perspective of keyphrases, authors, and journals. The ACT model is capable of incorporating the paper, author, and conference into the topic distribution simultaneously. To obtain more meaningful results, we use journals and keyphrases instead of conferences and bag-of-words.. For analysis, we use PubMed to collected forty-six bioinformatics journals from the MEDLINE database. We conducted time series topic analysis over four periods from 1996 to 2015 to further examine the interdisciplinary nature of bioinformatics.

RESULTS

We analyze the ACT Model results in each period. Additionally, for further integrated analysis, we conduct a time series analysis among the top-ranked keyphrases, journals, and authors according to their frequency. We also examine the patterns in the top journals by simultaneously identifying the topical probability in each period, as well as the top authors and keyphrases. The results indicate that in recent years diversified topics have become more prevalent and convergent topics have become more clearly represented.

CONCLUSION

The results of our analysis implies that overtime the field of bioinformatics becomes more interdisciplinary where there is a steady increase in peripheral fields such as conceptual, mathematical, and system biology. These results are confirmed by integrated analysis of topic distribution as well as top ranked keyphrases, authors, and journals.

摘要

背景

生物信息学是分子生物学与计算技术交叉的跨学科领域。为了将该领域描述为一个融合领域,研究人员使用了文献计量学,并辅以文本挖掘技术进行内容分析。在先前的研究中,潜在狄利克雷分配(LDA)是识别学科领域主题结构最具代表性的主题建模技术。然而,与揭示与作者、出版日期和期刊等元数据相关的主题结构不同,LDA仅显示简单的主题结构。

方法

在本文中,我们采用唐等人的作者 - 会议 - 主题(ACT)模型,从关键词、作者和期刊的角度研究生物信息学领域。ACT模型能够将论文、作者和会议同时纳入主题分布。为了获得更有意义的结果,我们使用期刊和关键词代替会议和词袋。为了进行分析,我们使用PubMed从MEDLINE数据库中收集了46种生物信息学期刊。我们对1996年至2015年的四个时期进行了时间序列主题分析,以进一步研究生物信息学的跨学科性质。

结果

我们分析了每个时期的ACT模型结果。此外,为了进行进一步的综合分析,我们根据频率对排名靠前的关键词、期刊和作者进行了时间序列分析。我们还通过同时识别每个时期的主题概率以及顶级作者和关键词,研究了顶级期刊中的模式。结果表明,近年来多样化的主题变得更加普遍,而趋同的主题也得到了更清晰的体现。

结论

我们的分析结果表明,随着时间的推移,生物信息学领域变得更加跨学科,概念生物学、数学和系统生物学等外围领域稳步增加。这些结果通过主题分布以及排名靠前的关键词、作者和期刊的综合分析得到了证实。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8ed/5471940/36e16c51ad24/12859_2017_1640_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8ed/5471940/446ccf8f42bc/12859_2017_1640_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8ed/5471940/a57d06a0e089/12859_2017_1640_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8ed/5471940/79d9dec4f1eb/12859_2017_1640_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8ed/5471940/8507aaade346/12859_2017_1640_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8ed/5471940/36e16c51ad24/12859_2017_1640_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8ed/5471940/446ccf8f42bc/12859_2017_1640_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8ed/5471940/a57d06a0e089/12859_2017_1640_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8ed/5471940/79d9dec4f1eb/12859_2017_1640_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8ed/5471940/8507aaade346/12859_2017_1640_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c8ed/5471940/36e16c51ad24/12859_2017_1640_Fig5_HTML.jpg

相似文献

1
Analyzing the field of bioinformatics with the multi-faceted topic modeling technique.运用多方面主题建模技术分析生物信息学领域。
BMC Bioinformatics. 2017 May 31;18(Suppl 7):251. doi: 10.1186/s12859-017-1640-x.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Analysis of research topics and trends in investigator-initiated research/trials (IIRs/IITs): A topic modeling study.研究者发起的研究/试验(IIRs/IITs)研究主题和趋势分析:主题建模研究。
Medicine (Baltimore). 2024 Mar 8;103(10):e37375. doi: 10.1097/MD.0000000000037375.
4
Analysis of Persian Bioinformatics Research with Topic Modeling.基于主题建模的波斯文生物信息学研究分析。
Biomed Res Int. 2023 Apr 17;2023:3728131. doi: 10.1155/2023/3728131. eCollection 2023.
5
Visualizing the knowledge structure and evolution of bioinformatics.生物信息学的知识结构和演化可视化。
BMC Bioinformatics. 2022 Sep 30;23(Suppl 8):404. doi: 10.1186/s12859-022-04948-9.
6
Research topics and hotspot trends of lumbar spondylolisthesis: A text-mining study with machine learning.腰椎滑脱的研究主题与热点趋势:一项基于机器学习的文本挖掘研究
Front Surg. 2023 Jan 6;9:1037978. doi: 10.3389/fsurg.2022.1037978. eCollection 2022.
7
The Top 100 Articles in the Medical Informatics: a Bibliometric Analysis.医学信息学领域的100篇顶尖文章:文献计量分析
J Med Syst. 2017 Aug 19;41(10):150. doi: 10.1007/s10916-017-0794-4.
8
Promoting Pro-Environmental Behavior in Policymaking: A Text-Mining Approach for Literature Review.在政策制定中促进亲环境行为:一种用于文献综述的文本挖掘方法
Eval Rev. 2024 Apr;48(2):370-398. doi: 10.1177/0193841X231176869. Epub 2023 May 17.
9
A Survey of Bioinformatics Database and Software Usage through Mining the Literature.通过文献挖掘对生物信息学数据库和软件使用情况的调查
PLoS One. 2016 Jun 22;11(6):e0157989. doi: 10.1371/journal.pone.0157989. eCollection 2016.
10
A heuristic approach to determine an appropriate number of topics in topic modeling.一种用于确定主题建模中合适主题数量的启发式方法。
BMC Bioinformatics. 2015;16 Suppl 13(Suppl 13):S8. doi: 10.1186/1471-2105-16-S13-S8. Epub 2015 Sep 25.

引用本文的文献

1
Mechanisms of exercise intervention in type 2 diabetes: a bibliometric and visualization analysis based on CiteSpace.基于 CiteSpace 的 2 型糖尿病运动干预机制的文献计量与可视化分析。
Front Endocrinol (Lausanne). 2024 Aug 1;15:1401342. doi: 10.3389/fendo.2024.1401342. eCollection 2024.
2
Application of convolutional neural networks in medical images: a bibliometric analysis.卷积神经网络在医学图像中的应用:一项文献计量分析。
Quant Imaging Med Surg. 2024 May 1;14(5):3501-3518. doi: 10.21037/qims-23-1600. Epub 2024 Apr 11.
3
Analysis of Persian Bioinformatics Research with Topic Modeling.

本文引用的文献

1
Type 2 Diabetes Research Yield, 1951-2012: Bibliometrics Analysis and Density-Equalizing Mapping.1951 - 2012年2型糖尿病研究产出:文献计量学分析与密度均衡映射
PLoS One. 2015 Jul 24;10(7):e0133009. doi: 10.1371/journal.pone.0133009. eCollection 2015.
2
A bibliometric analysis on tobacco regulation investigators.烟草监管调查人员的文献计量分析。
BioData Min. 2015 Mar 21;8:11. doi: 10.1186/s13040-015-0043-7. eCollection 2015.
3
A Bird's-Eye View of Alzheimer's Disease Research: Reflecting Different Perspectives of Indexers, Authors, or Citers in Mapping the Field.
基于主题建模的波斯文生物信息学研究分析。
Biomed Res Int. 2023 Apr 17;2023:3728131. doi: 10.1155/2023/3728131. eCollection 2023.
4
Visualizing the knowledge structure and evolution of bioinformatics.生物信息学的知识结构和演化可视化。
BMC Bioinformatics. 2022 Sep 30;23(Suppl 8):404. doi: 10.1186/s12859-022-04948-9.
5
Comparing cefotaxime and ceftriaxone in combating meningitis through nose-to-brain delivery using bio/chemoinformatics tools.通过生物/化学信息学工具比较经鼻脑递药治疗脑膜炎时头孢噻肟和头孢曲松的效果。
Sci Rep. 2020 Dec 4;10(1):21250. doi: 10.1038/s41598-020-78327-w.
6
Tracking knowledge evolution, hotspots and future directions of emerging technologies in cancers research: a bibliometrics review.追踪癌症研究中新兴技术的知识演进、热点及未来方向:一项文献计量学综述
J Cancer. 2019 Jun 2;10(12):2643-2653. doi: 10.7150/jca.32739. eCollection 2019.
7
The research topic landscape in the literature of social class and inequality.社会阶级和不平等文献中的研究主题景观。
PLoS One. 2018 Jul 2;13(7):e0199510. doi: 10.1371/journal.pone.0199510. eCollection 2018.
8
A bibliometric analysis of natural language processing in medical research.自然语言处理在医学研究中的文献计量分析。
BMC Med Inform Decis Mak. 2018 Mar 22;18(Suppl 1):14. doi: 10.1186/s12911-018-0594-x.
阿尔茨海默病研究的鸟瞰:反映索引者、作者或引用者在描绘该领域时的不同视角
J Alzheimers Dis. 2015;45(4):1207-22. doi: 10.3233/JAD-142688.
4
Entitymetrics: measuring the impact of entities.实体计量学:衡量实体的影响力。
PLoS One. 2013 Aug 29;8(8):e71416. doi: 10.1371/journal.pone.0071416. eCollection 2013.
5
A bibliometric analysis of tuberculosis research indexed in PubMed, 1997-2006.1997 - 2006年PubMed收录的结核病研究文献计量分析
Int J Tuberc Lung Dis. 2008 Dec;12(12):1461-8.
6
A bibliometric analysis of scientific production in cancer molecular epidemiology.癌症分子流行病学领域科研产出的文献计量分析
Carcinogenesis. 2007 Aug;28(8):1774-9. doi: 10.1093/carcin/bgm129. Epub 2007 Jun 4.
7
A bibliometric analysis in the fields of preventive medicine, occupational and environmental medicine, epidemiology, and public health.预防医学、职业与环境医学、流行病学和公共卫生领域的文献计量分析。
BMC Public Health. 2006 Dec 15;6:301. doi: 10.1186/1471-2458-6-301.