Tecnologico de Monterrey, Escuela de Medicina y Ciencias de la Salud. Ave. Morones Prieto 3000, Monterrey, N.L., México.
Centro de Investigación Biomédica, Hospital Zambrano-Hellion, Tec Salud, Tecnologico de Monterrey, Batallón San Patricio 112 Col. Real de San Agustín, San Pedro Garza García, N.L., México.
Database (Oxford). 2019 Jan 1;2019:bay137. doi: 10.1093/database/bay137.
Analysis, annotation and curation of biomedical scientific literature is a recurrent task in biomedical research, database curation and clinics. Commonly, the reading is centered on concepts such as genes, diseases or molecules. Database curators may also need to annotate published abstracts related to a specific topic. However, few free and intuitive tools exist to assist users in this context. Therefore, we developed PubTerm, a web tool to organize, categorize, curate and annotate a large number of PubMed abstracts related to biological entities such as genes, diseases, chemicals, species, sequence variants and other related information.
A variety of interfaces were implemented to facilitate curation and annotation, including the organization of abstracts by terms, by the co-occurrence of terms or by specific phrases. Information includes statistics on the occurrence of terms. The abstracts, terms and other related information can be annotated and categorized using user-defined categories. The session information can be saved and restored, and the data can be exported to other formats.
The pipeline in PubTerm starts by specifying a PubMed query or list of PubMed identifiers. Then, the user can specify three lists of categories and specify what information will be highlighted in which colors. The user then utilizes the `term view' to organize the abstracts by gene, disease, species or other information to facilitate the annotation and categorization of terms or abstracts. Other views also facilitate the exploration of abstracts and connections between terms. We have used PubTerm to quickly and efficiently curate collections of more than 400 abstracts that mention more than 350 genes to generate revised lists of susceptibility genes for diseases. An example is provided for pulmonary arterial hypertension.
PubTerm saves time for literature revision by assisting with annotation organization and knowledge acquisition.
生物医学科学文献的分析、注释和管理是生物医学研究、数据库管理和临床工作中的一项经常性任务。通常,阅读的重点是基因、疾病或分子等概念。数据库管理员还可能需要注释与特定主题相关的已发表摘要。然而,在这种情况下,很少有免费且直观的工具来帮助用户。因此,我们开发了 PubTerm,这是一种用于组织、分类、管理和注释与生物实体(如基因、疾病、化学物质、物种、序列变体和其他相关信息)相关的大量 PubMed 摘要的网络工具。
实现了多种界面来促进管理和注释,包括按术语、术语共现或特定短语组织摘要。信息包括术语出现的统计信息。可以使用用户定义的类别对摘要、术语和其他相关信息进行注释和分类。会话信息可以保存和恢复,并且可以将数据导出到其他格式。
PubTerm 中的管道首先指定一个 PubMed 查询或 PubMed 标识符列表。然后,用户可以指定三个类别列表,并指定将哪些信息突出显示为哪些颜色。用户然后利用“术语视图”按基因、疾病、物种或其他信息组织摘要,以方便术语或摘要的注释和分类。其他视图也便于探索摘要和术语之间的联系。我们已经使用 PubTerm 快速有效地管理了超过 400 篇提及超过 350 个基因的摘要的集合,以生成疾病易感性基因的修订列表。以肺动脉高压为例提供了一个示例。
PubTerm 通过协助注释组织和知识获取,为文献修订节省了时间。