Suppr超能文献

Ruby UCSC API:使用 Ruby 访问 UCSC 基因组数据库。

The Ruby UCSC API: accessing the UCSC genome database using Ruby.

机构信息

Department of Human Genetics, Nagasaki University Graduate School of Biomedical Sciences, 1-12-4 Sakamoto, Nagasaki, Nagasaki, 852-8523, Japan.

出版信息

BMC Bioinformatics. 2012 Sep 21;13:240. doi: 10.1186/1471-2105-13-240.

Abstract

BACKGROUND

The University of California, Santa Cruz (UCSC) genome database is among the most used sources of genomic annotation in human and other organisms. The database offers an excellent web-based graphical user interface (the UCSC genome browser) and several means for programmatic queries. A simple application programming interface (API) in a scripting language aimed at the biologist was however not yet available. Here, we present the Ruby UCSC API, a library to access the UCSC genome database using Ruby.

RESULTS

The API is designed as a BioRuby plug-in and built on the ActiveRecord 3 framework for the object-relational mapping, making writing SQL statements unnecessary. The current version of the API supports databases of all organisms in the UCSC genome database including human, mammals, vertebrates, deuterostomes, insects, nematodes, and yeast.The API uses the bin index-if available-when querying for genomic intervals. The API also supports genomic sequence queries using locally downloaded *.2bit files that are not stored in the official MySQL database. The API is implemented in pure Ruby and is therefore available in different environments and with different Ruby interpreters (including JRuby).

CONCLUSIONS

Assisted by the straightforward object-oriented design of Ruby and ActiveRecord, the Ruby UCSC API will facilitate biologists to query the UCSC genome database programmatically. The API is available through the RubyGem system. Source code and documentation are available at https://github.com/misshie/bioruby-ucsc-api/ under the Ruby license. Feedback and help is provided via the website at http://rubyucscapi.userecho.com/.

摘要

背景

加利福尼亚大学圣克鲁兹分校(UCSC)基因组数据库是人类和其他生物中使用最多的基因组注释来源之一。该数据库提供了出色的基于网络的图形用户界面(UCSC 基因组浏览器)和几种用于编程查询的方法。但是,它还没有针对生物学家的简单脚本语言应用程序编程接口(API)。在这里,我们介绍了 Ruby UCSC API,这是一种用于使用 Ruby 访问 UCSC 基因组数据库的库。

结果

API 设计为 BioRuby 插件,并构建在用于对象关系映射的 ActiveRecord 3 框架上,从而无需编写 SQL 语句。当前版本的 API 支持 UCSC 基因组数据库中所有生物的数据库,包括人类、哺乳动物、脊椎动物、后口动物、昆虫、线虫和酵母。API 在查询基因组区间时使用二进制索引(如果可用)。API 还支持使用本地下载的 *.2bit 文件进行基因组序列查询,这些文件未存储在官方 MySQL 数据库中。API 完全用 Ruby 实现,因此可在不同的环境和具有不同的 Ruby 解释器(包括 JRuby)中使用。

结论

在 Ruby 和 ActiveRecord 的简单面向对象设计的辅助下,Ruby UCSC API 将使生物学家能够以编程方式查询 UCSC 基因组数据库。API 通过 RubyGem 系统提供。源代码和文档可在 Ruby 许可证下的 https://github.com/misshie/bioruby-ucsc-api/ 获得。反馈和帮助可通过 http://rubyucscapi.userecho.com/ 网站获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7dca/3542311/f787dd88ff2e/1471-2105-13-240-1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验