Tahsin Tasnia, Weissenbacher Davy, O'Connor Karen, Magge Arjun, Scotch Matthew, Gonzalez-Hernandez Graciela
Department of Biomedical Informatics, Arizona State University, Scottsdale, AZ 85259, USA.
Institute of Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.
Bioinformatics. 2018 May 1;34(9):1606-1608. doi: 10.1093/bioinformatics/btx799.
GeoBoost is a command-line software package developed to address sparse or incomplete metadata in GenBank sequence records that relate to the location of the infected host (LOIH) of viruses. Given a set of GenBank accession numbers corresponding to virus GenBank records, GeoBoost extracts, integrates and normalizes geographic information reflecting the LOIH of the viruses using integrated information from GenBank metadata and related full-text publications. In addition, to facilitate probabilistic geospatial modeling, GeoBoost assigns probability scores for each possible LOIH.
Binaries and resources required for running GeoBoost are packed into a single zipped file and freely available for download at https://tinyurl.com/geoboost. A video tutorial is included to help users quickly and easily install and run the software. The software is implemented in Java 1.8, and supported on MS Windows and Linux platforms.
Supplementary data are available at Bioinformatics online.
GeoBoost是一个命令行软件包,旨在处理GenBank序列记录中与病毒感染宿主位置(LOIH)相关的稀疏或不完整元数据。给定一组与病毒GenBank记录对应的GenBank登录号,GeoBoost利用来自GenBank元数据和相关全文出版物的综合信息,提取、整合并规范反映病毒LOIH的地理信息。此外,为便于进行概率地理空间建模,GeoBoost为每个可能的LOIH分配概率分数。
运行GeoBoost所需的二进制文件和资源被打包成一个压缩文件,可在https://tinyurl.com/geoboost免费下载。包含一个视频教程,以帮助用户快速轻松地安装和运行该软件。该软件用Java 1.8实现,支持MS Windows和Linux平台。
补充数据可在《生物信息学》在线获取。