Plikus Maksim V, Zhang Zina, Chuong Cheng-Ming
Department of Pathology, Keck School of Medicine, University of Southern California, Los Angeles, California, USA.
BMC Bioinformatics. 2006 Oct 2;7:424. doi: 10.1186/1471-2105-7-424.
Understanding research activity within any given biomedical field is important. Search outputs generated by MEDLINE/PubMed are not well classified and require lengthy manual citation analysis. Automation of citation analytics can be very useful and timesaving for both novices and experts.
PubFocus web server automates analysis of MEDLINE/PubMed search queries by enriching them with two widely used human factor-based bibliometric indicators of publication quality: journal impact factor and volume of forward references. In addition to providing basic volumetric statistics, PubFocus also prioritizes citations and evaluates authors' impact on the field of search. PubFocus also analyses presence and occurrence of biomedical key terms within citations by utilizing controlled vocabularies.
We have developed citations' prioritisation algorithm based on journal impact factor, forward referencing volume, referencing dynamics, and author's contribution level. It can be applied either to the primary set of PubMed search results or to the subsets of these results identified through key terms from controlled biomedical vocabularies and ontologies. NCI (National Cancer Institute) thesaurus and MGD (Mouse Genome Database) mammalian gene orthology have been implemented for key terms analytics. PubFocus provides a scalable platform for the integration of multiple available ontology databases. PubFocus analytics can be adapted for input sources of biomedical citations other than PubMed.
了解任何特定生物医学领域内的研究活动都很重要。MEDLINE/PubMed生成的搜索结果分类不佳,需要冗长的手动引文分析。引文分析自动化对新手和专家都非常有用且节省时间。
PubFocus网络服务器通过用两个广泛使用的基于人为因素的出版物质量文献计量指标(期刊影响因子和正向引用量)丰富MEDLINE/PubMed搜索查询,实现了搜索查询的自动化分析。除了提供基本的数量统计外,PubFocus还对引文进行优先级排序,并评估作者对搜索领域的影响。PubFocus还通过使用受控词汇表分析引文中生物医学关键词的存在和出现情况。
我们基于期刊影响因子、正向引用量、引用动态和作者贡献水平开发了引文优先级排序算法。它既可以应用于PubMed搜索结果的主要集合,也可以应用于通过受控生物医学词汇表和本体中的关键词识别出的这些结果的子集。已将美国国立癌症研究所(NCI)词库和小鼠基因组数据库(MGD)哺乳动物基因直系同源关系用于关键词分析。PubFocus为集成多个可用的本体数据库提供了一个可扩展的平台。PubFocus分析可以适用于除PubMed之外的生物医学引文输入源。