Feng Chenzhe, Chen Haolin, Huang Leyi, Feng Yeqian, Chang Shi
Department of Oncology, The Second Xiangya Hospital of Central South University, Changsha, China.
Department of General Surgery, Xiangya Hospital of Central South University, Changsha, China.
Front Med (Lausanne). 2022 Apr 8;9:832662. doi: 10.3389/fmed.2022.832662. eCollection 2022.
This study aimed to investigate the landscape of Multiple Endocrine Neoplasia Type 1 research during the last 22 years using machine learning and text analysis.
In December 2021, all publications indexed under the MeSH term "Multiple Endocrine Neoplasia Type 1" were obtained from PubMed. The whole set of search results was downloaded in XML format, and metadata such as title, abstract, keywords, mesh words, and year of publication were extracted from the original XML files for bibliometric evaluation. The Latent Dirichlet allocation (LDA) topic modeling method was used to analyze specific themes.
This study eventually contained 1,407 publications. Among them, there are 768 (54.58%) case reports and reviews. Text analysis based on MeSH words revealed that the most often studied clinical areas include therapy efficacy, prognosis, and genetic diagnosis. The majority of basic study is focused on genetic alterations. The LDA topic model further identifies three topic clusters include basic research, treatment cluster, and diagnosis cluster. In the basic research cluster, many studies are focused on the expression of Menin. The primary focus of the therapy cluster is pancreatic resections and parathyroidectomy. In the diagnose cluster, the main focus is on Genetic Diagnosis and screening strategies for Hereditary Cancer Syndrome.
The current state of research on MEN1 is far from adequate. Research on rare diseases MEN1 necessitates implementing a broad research program involving multiple centers to advance MEN1 research together.
本研究旨在利用机器学习和文本分析,调查过去22年中1型多发性内分泌肿瘤的研究概况。
2021年12月,从PubMed获取所有以医学主题词“1型多发性内分泌肿瘤”编入索引的出版物。整套检索结果以XML格式下载,并从原始XML文件中提取标题、摘要、关键词、医学主题词和出版年份等元数据用于文献计量评估。使用潜在狄利克雷分配(LDA)主题建模方法分析特定主题。
本研究最终纳入1407篇出版物。其中,病例报告和综述有768篇(54.58%)。基于医学主题词的文本分析显示,最常研究的临床领域包括治疗效果、预后和基因诊断。大多数基础研究集中在基因改变方面。LDA主题模型进一步识别出三个主题集群,包括基础研究集群、治疗集群和诊断集群。在基础研究集群中,许多研究集中在Menin的表达上。治疗集群的主要重点是胰腺切除术和甲状旁腺切除术。在诊断集群中,主要重点是遗传性癌症综合征的基因诊断和筛查策略。
MEN1的当前研究状况远远不够。对罕见病MEN1的研究需要实施一项涉及多个中心的广泛研究计划,以共同推进MEN1的研究。