[The BIG Data Center's database resources].
Author information
Zhang Yuan Sheng, Xia Lin, Sang Jian, Li Man, Liu Lin, Li Meng Wei, Niu Guang Yi, Cao Jia Bao, Teng Xu Fei, Zhou Qing, Zhang Zhang
Affiliations
BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
CAS Key Laboratory of Genomics and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
Publication information
Yi Chuan. 2018 Nov 20;40(11):1039-1043. doi: 10.16288/j.yczz.18-190.
Omics data in the life and health sciences are fundamental to scientific research and to the development of biomedical technology. However, China has lacked a platform for managing and sharing biological data, which makes it difficult to meet the needs of biomedicine and related fields and has led to serious problems in big data management, sharing and translation. To address these problems, the Beijing Institute of Genomics (BIG) of the Chinese Academy of Sciences founded the BIG Data Center (BIGD) in 2016. BIGD is dedicated to establishing a biological big data management platform and multi-omics databases, with a particular focus on national population health and important strategic biological resources. In this paper, we describe the core database resources of BIGD, including GSA (Genome Sequence Archive), GWH (Genome Warehouse), GVM (Genome Variation Map), GEN (Gene Expression Nebulas), MethBank (Methylation Bank), BioCode and Science Wikis. Together, these resources provide services for data deposition, integration and sharing, laying a solid foundation for strengthening national management of biological science data and for promoting the construction of a national bioinformatics center.