Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Pampulha, Belo Horizonte, Brazil.
Genet Epidemiol. 2012 May;36(4):360-7. doi: 10.1002/gepi.21629. Epub 2012 Apr 16.
Large-scale genomics initiatives such as the HapMap Project and the 1000 Genomes Project rely on powerful bioinformatics support for data production and analysis. In contrast, few bioinformatics platforms are aimed at smaller research groups that need to store, handle, share, and integrate data from different sources and to perform their analyses efficiently. We developed such a platform, DIVERGENOME, to assist population genetics and genetic epidemiology studies performed by small- to medium-sized research groups. The platform has two integrated components: a relational database (DIVERGENOMEdb) and a set of tools (DIVERGENOMEtools) that convert data into the formats required by popular population genetics and genetic epidemiology software. In DIVERGENOMEdb, information on genotypes, polymorphisms, laboratory protocols, individuals, populations, and phenotypes is organized into projects, which can be queried according to user permissions. We validated DIVERGENOME through a use case involving the analysis of SLC2A4 genetic diversity in human populations. With its intuitive Web interface and automatic data loading, DIVERGENOME can be used by individuals without a bioinformatics background: complex queries are easy to run, and data format conversions (not available in similar platforms) are straightforward. DIVERGENOME is open source, freely available, and can be accessed online (pggenetica.icb.ufmg.br/divergenome) or hosted locally.
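To make the two-component design concrete, the following is a minimal sketch of how genotypes stored in a relational database might be exported to a format read by downstream software. It is not DIVERGENOME's actual schema or code: the table and column names (`project`, `individual`, `genotype`), the helper functions `build_demo_db` and `to_ped_lines`, and the demo data are all hypothetical, and the PLINK PED layout is assumed here only as one example of a target format, since the abstract does not name specific programs.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a hypothetical, simplified schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE project (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE individual (
            id INTEGER PRIMARY KEY,
            project_id INTEGER REFERENCES project(id),
            label TEXT,
            population TEXT,
            sex INTEGER              -- 1 = male, 2 = female (PED convention)
        );
        CREATE TABLE genotype (
            individual_id INTEGER REFERENCES individual(id),
            marker TEXT,
            allele1 TEXT,
            allele2 TEXT
        );
    """)
    conn.execute("INSERT INTO project VALUES (1, 'demo')")
    conn.executemany(
        "INSERT INTO individual VALUES (?, ?, ?, ?, ?)",
        [(1, 1, "ind1", "POP_A", 1), (2, 1, "ind2", "POP_B", 2)],
    )
    conn.executemany(
        "INSERT INTO genotype VALUES (?, ?, ?, ?)",
        [
            (1, "snp1", "A", "G"),
            (1, "snp2", "C", "C"),
            (2, "snp1", "G", "G"),  # ind2 has no call for snp2
        ],
    )
    return conn


def to_ped_lines(conn, project_id, markers):
    """Return one PED-style line per individual in the given project.

    Assumption: the population label is reused as the family ID, and
    phenotype is written as missing (-9). Missing genotypes become "0 0".
    """
    lines = []
    individuals = conn.execute(
        "SELECT id, label, population, sex FROM individual "
        "WHERE project_id = ? ORDER BY id",
        (project_id,),
    ).fetchall()
    for ind_id, label, population, sex in individuals:
        calls = {
            marker: (a1, a2)
            for marker, a1, a2 in conn.execute(
                "SELECT marker, allele1, allele2 FROM genotype "
                "WHERE individual_id = ?",
                (ind_id,),
            )
        }
        # PED columns: family ID, individual ID, father, mother, sex, phenotype
        fields = [population, label, "0", "0", str(sex), "-9"]
        for marker in markers:
            fields.extend(calls.get(marker, ("0", "0")))
        lines.append(" ".join(fields))
    return lines


conn = build_demo_db()
ped_lines = to_ped_lines(conn, 1, ["snp1", "snp2"])
print("\n".join(ped_lines))
```

Restricting the query by `project_id` loosely mirrors the project-based organization described above; a real deployment would also check user permissions before returning any rows.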