Kohl Jochen, Paulsen Ingo, Laubach Thomas, Radtke Achim, von Haeseler Arndt
Heinrich-Heine-University Duesseldorf, Universitaetsstrasse 1, 40225 Duesseldorf, Germany.
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D700-4. doi: 10.1093/nar/gkj030.
HvrBase++ is the improved and extended version of HvrBase. Extensions are made by adding more population-based sequence samples from all primates including humans. The current collection comprises 13,873 hypervariable region I (HVRI) sequences and 4940 hypervariable region II (HVRII) sequences. In addition, we included 1376 complete mitochondrial genomes, 205 sequences from X-chromosomal loci and 202 sequences from autosomal chromosomes 1, 8, 11 and 16. In order to reduce the introduction of erroneous data into HvrBase++, we have developed a procedure that monitors GenBank for new versions of the current data in HvrBase++ and automatically updates the collection if necessary. For the stored sequences, supplementary information such as geographic origin, population affiliation and language of the sequence donor can be retrieved. HvrBase++ is Oracle based and easily accessible by a web interface (http://www.hvrbase.org). As a new key feature, HvrBase++ provides an interactive graphical tool to easily access data from dynamically created geographical maps.
HvrBase++是HvrBase的改进和扩展版本。扩展是通过添加包括人类在内的所有灵长类动物中更多基于群体的序列样本来实现的。当前的数据集包含13873个高变区I(HVRI)序列和4940个高变区II(HVRII)序列。此外,我们还纳入了1376个完整的线粒体基因组、205个来自X染色体位点的序列以及202个来自常染色体1、8、11和16的序列。为了减少错误数据引入HvrBase++,我们开发了一种程序,该程序会监测GenBank中HvrBase++当前数据的新版本,并在必要时自动更新数据集。对于存储的序列,可以检索诸如地理来源、群体归属和序列提供者语言等补充信息。HvrBase++基于甲骨文数据库,可通过网络界面(http://www.hvrbase.org)轻松访问。作为一项新的关键特性,HvrBase++提供了一个交互式图形工具,可轻松从动态创建的地理地图中访问数据。