Division of Genetics and Genomics, The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian EH25 9RG, UK.
Bioinformatics. 2012 Nov 1;28(21):2724-31. doi: 10.1093/bioinformatics/bts525. Epub 2012 Sep 3.
The Ensembl Project provides release-specific Perl APIs for efficient high-level programmatic access to data stored in various Ensembl database schema. Although Perl scripts are perfectly suited for processing large volumes of text-based data, Perl is not ideal for developing large-scale software applications nor embedding in graphical interfaces. The provision of a novel Java API would facilitate type-safe, modular, object-orientated development of new Bioinformatics tools with which to access, analyse and visualize Ensembl data.
The JEnsembl API implementation provides basic data retrieval and manipulation functionality from the Core, Compara and Variation databases for all species in Ensembl and EnsemblGenomes and is a platform for the development of a richer API to Ensembl datasources. The JEnsembl architecture uses a text-based configuration module to provide evolving, versioned mappings from database schema to code objects. A single installation of the JEnsembl API can therefore simultaneously and transparently connect to current and previous database instances (such as those in the public archive) thus facilitating better analysis repeatability and allowing 'through time' comparative analyses to be performed.
Project development, released code libraries, Maven repository and documentation are hosted at SourceForge (http://jensembl.sourceforge.net).
Ensembl 项目提供了特定于版本的 Perl API,用于高效地以高级编程方式访问存储在各种 Ensembl 数据库模式中的数据。虽然 Perl 脚本非常适合处理大量基于文本的数据,但 Perl 并不适合开发大规模软件应用程序,也不适合嵌入图形界面。提供新的 Java API 将有助于使用安全的、模块化的、面向对象的方法来开发新的生物信息学工具,以访问、分析和可视化 Ensembl 数据。
JEnsembl API 实现提供了来自 Core、Compara 和 Variation 数据库的所有 Ensembl 和 EnsemblGenomes 物种的基本数据检索和操作功能,是开发更丰富的 Ensembl 数据源 API 的平台。JEnsembl 架构使用基于文本的配置模块,提供从数据库模式到代码对象的可进化、版本化映射。因此,单个 JEnsembl API 安装可以同时透明地连接到当前和以前的数据库实例(如公共档案中的那些),从而促进更好的分析可重复性,并允许执行“随时间”比较分析。
项目开发、发布的代码库、Maven 存储库和文档托管在 SourceForge(http://jensembl.sourceforge.net)上。