Parks Donovan H, Porter Michael, Churcher Sylvia, Wang Suwen, Blouin Christian, Whalley Jacqueline, Brooks Stephen, Beiko Robert G
Dalhousie University, Halifax, Nova Scotia, Canada.
Genome Res. 2009 Oct;19(10):1896-904. doi: 10.1101/gr.095612.109. Epub 2009 Jul 27.
The increasing availability of genetic sequence data associated with explicit geographic and ecological information is offering new opportunities to study the processes that shape biodiversity. The generation and testing of hypotheses using these data sets requires effective tools for mathematical and visual analysis that can integrate digital maps, ecological data, and large genetic, genomic, or metagenomic data sets. GenGIS is a free and open-source software package that supports the integration of digital map data with genetic sequences and environmental information from multiple sample sites. Essential bioinformatic and statistical tools are integrated into the software, allowing the user a wide range of analysis options for their sequence data. Data visualizations are combined with the cartographic display to yield a clear view of the relationship between geography and genomic diversity, with a particular focus on the hierarchical clustering of sites based on their similarity or phylogenetic proximity. Here we outline the features of GenGIS and demonstrate its application to georeferenced microbial metagenomic, HIV-1, and human mitochondrial DNA data sets.
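As a minimal sketch of what "hierarchical clustering of sites based on their similarity" can look like, the example below groups sample sites by the dissimilarity of their taxon-count profiles. It is not GenGIS code and does not reflect the GenGIS API. The choice of Bray-Curtis dissimilarity and average-linkage (UPGMA) clustering, the function names `bray_curtis` and `upgma`, and the site names and counts are all assumptions made for illustration.

```python
from itertools import combinations


def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two abundance vectors."""
    num = sum(abs(x - y) for x, y in zip(a, b))
    den = sum(x + y for x, y in zip(a, b))
    return num / den if den else 0.0


def upgma(profiles):
    """Average-linkage clustering of sites; returns a nested-tuple tree."""
    # Pairwise dissimilarities between the original sites.
    dist = {
        frozenset((p, q)): bray_curtis(profiles[p], profiles[q])
        for p, q in combinations(profiles, 2)
    }

    def avg(m1, m2):
        # Mean dissimilarity over all cross-cluster pairs of sites.
        total = sum(dist[frozenset((i, j))] for i in m1 for j in m2)
        return total / (len(m1) * len(m2))

    # Each active cluster is (subtree, list of member site names).
    active = [(name, [name]) for name in profiles]
    while len(active) > 1:
        # Find the two closest clusters and merge them.
        i, j = min(
            combinations(range(len(active)), 2),
            key=lambda ij: avg(active[ij[0]][1], active[ij[1]][1]),
        )
        (t1, m1), (t2, m2) = active[i], active[j]
        active = [c for k, c in enumerate(active) if k not in (i, j)]
        active.append(((t1, t2), m1 + m2))
    return active[0][0]


# Hypothetical taxon counts per sample site (illustrative data only).
sites = {
    "site_A": [10, 0, 5],
    "site_B": [9, 1, 5],
    "site_C": [0, 12, 3],
    "site_D": [1, 11, 2],
}

tree = upgma(sites)
print(tree)
```

The resulting nested tuples describe a site tree of the kind that could be drawn alongside a map, with each leaf tied to the coordinates of its sample site. For real data sets, an established library implementation of hierarchical clustering would likely be preferable to this hand-rolled version.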