a State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing, China.
b School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China.
Int J Radiat Biol. 2019 Aug;95(8):1172-1177. doi: 10.1080/09553002.2019.1609127. Epub 2019 May 13.
Exposure to ultraviolet radiation for a certain time triggers significant molecular biological effects in an organism. Over the past few decades, biologists have discovered a variety of ultraviolet-associated biological effects and the genes related to them. However, information about ultraviolet-related genes is dispersed across thousands of scientific papers, and no study has yet focused on collecting these genes systematically. We collected ultraviolet-related genes and built the gene-centric database UVGD through literature mining and manual curation. Literature mining was based on ultraviolet-related abstracts downloaded from PubMed; using a bio-entity recognizer, we extracted sentences in which ultraviolet keywords and genes co-occur at the single-sentence level. Manual curation was then performed to determine whether each gene is related to ultraviolet. The resulting knowledge base, UVGD 1.0 (URL: http://biokb.ncpsb.org/UVGD/), contains 663 ultraviolet-related genes, together with 17 associated biological processes, 117 associated phenotypes, and 2628 MeSH terms. UVGD helps in understanding ultraviolet-related biological processes in organisms, and we believe it will be useful for biologists studying the mechanisms of response to ultraviolet.
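The sentence-level co-occurrence step described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the abstract does not name the bio-entity recognizer or list the keywords used, so the keyword tuple `UV_KEYWORDS`, the small `GENE_LEXICON` (a plain dictionary lookup standing in for a real recognizer), and the function names are all assumptions made for illustration. Sentences returned by such a filter would still need the manual curation step.

```python
import re

# Hypothetical keyword list and gene lexicon, for illustration only.
# The abstract does not specify the actual keywords or recognizer.
UV_KEYWORDS = ("ultraviolet", "uv", "uva", "uvb", "uvc")
GENE_LEXICON = {"TP53", "XPA", "ATR"}

_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")  # naive sentence boundary
_TOKEN = re.compile(r"[A-Za-z0-9]+")


def split_sentences(abstract):
    """Split an abstract into sentences on ., ! or ? followed by whitespace."""
    return [s.strip() for s in _SENTENCE_END.split(abstract) if s.strip()]


def cooccurring_sentences(abstract):
    """Return (sentence, genes) pairs where a UV keyword and at least one
    lexicon gene appear in the same sentence."""
    hits = []
    for sentence in split_sentences(abstract):
        tokens = _TOKEN.findall(sentence)
        has_uv = any(t.lower() in UV_KEYWORDS for t in tokens)
        # Gene symbols are matched case-sensitively against the lexicon.
        genes = sorted({t for t in tokens if t in GENE_LEXICON})
        if has_uv and genes:
            hits.append((sentence, genes))
    return hits


if __name__ == "__main__":
    text = (
        "UVB irradiation stabilizes TP53 in keratinocytes. "
        "XPA is required for nucleotide excision repair. "
        "Cells were cultured overnight."
    )
    for sentence, genes in cooccurring_sentences(text):
        print(genes, sentence)
```

In this toy input only the first sentence is kept, because the second mentions a gene without a UV keyword and the third mentions neither. A real recognizer would also handle gene synonyms and ambiguous symbols, which a fixed lexicon cannot.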