Ensembl核心软件资源：用于DNA序列和基因组注释的存储及编程访问。

Ensembl core software resources: storage and programmatic access for DNA sequence and genome annotation.

作者信息

Ruffier Magali, Kähäri Andreas, Komorowska Monika, Keenan Stephen, Laird Matthew, Longden Ian, Proctor Glenn, Searle Steve, Staines Daniel, Taylor Kieron, Vullo Alessandro, Yates Andrew, Zerbino Daniel, Flicek Paul

机构信息

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK.

Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK.

出版信息

Database (Oxford). 2017 Jan 1;2017(1). doi: 10.1093/database/bax020.

DOI:10.1093/database/bax020

PMID:28365736

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5467575/

Abstract

UNLABELLED

The Ensembl software resources are a stable infrastructure to store, access and manipulate genome assemblies and their functional annotations. The Ensembl 'Core' database and Application Programming Interface (API) was our first major piece of software infrastructure and remains at the centre of all of our genome resources. Since its initial design more than fifteen years ago, the number of publicly available genomic, transcriptomic and proteomic datasets has grown enormously, accelerated by continuous advances in DNA-sequencing technology. Initially intended to provide annotation for the reference human genome, we have extended our framework to support the genomes of all species as well as richer assembly models. Cross-referenced links to other informatics resources facilitate searching our database with a variety of popular identifiers such as UniProt and RefSeq. Our comprehensive and robust framework storing a large diversity of genome annotations in one location serves as a platform for other groups to generate and maintain their own tailored annotation. We welcome reuse and contributions: our databases and APIs are publicly available, all of our source code is released with a permissive Apache v2.0 licence at http://github.com/Ensembl and we have an active developer mailing list ( http://www.ensembl.org/info/about/contact/index.html ).

DATABASE URL

http://www.ensembl.org.

摘要

未标注

Ensembl软件资源是用于存储、访问和操作基因组组装及其功能注释的稳定基础设施。Ensembl“核心”数据库和应用程序编程接口（API）是我们的首个主要软件基础设施，并且仍然是我们所有基因组资源的核心。自15年多前最初设计以来，在DNA测序技术不断进步的推动下，公开可用的基因组、转录组和蛋白质组数据集数量大幅增长。最初旨在为参考人类基因组提供注释，我们已扩展框架以支持所有物种的基因组以及更丰富的组装模型。与其他信息学资源的交叉引用链接便于使用诸如UniProt和RefSeq等各种流行标识符搜索我们的数据库。我们全面且强大的框架在一个位置存储了大量多样的基因组注释，为其他团队生成和维护自己定制的注释提供了一个平台。我们欢迎重用和贡献：我们的数据库和API是公开可用的，我们所有的源代码都根据宽松的Apache v2.0许可在http://github.com/Ensembl上发布，并且我们有一个活跃的开发者邮件列表（http://www.ensembl.org/info/about/contact/index.html）。

数据库网址

http://www.ensembl.org。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05f7/5467575/1a3476bccf6d/bax020f1.jpg

相似文献

Ensembl core software resources: storage and programmatic access for DNA sequence and genome annotation.

Database (Oxford). 2017 Jan 1;2017(1). doi: 10.1093/database/bax020.

Ensembl 2021.

Nucleic Acids Res. 2021 Jan 8;49(D1):D884-D891. doi: 10.1093/nar/gkaa942.

Ensembl 2015.

Nucleic Acids Res. 2015 Jan;43(Database issue):D662-9. doi: 10.1093/nar/gku1010. Epub 2014 Oct 28.

Using the Ensembl genome server to browse genomic sequence data.

Curr Protoc Bioinformatics. 2007 Jan;Chapter 1:Unit 1.15. doi: 10.1002/0471250953.bi0115s16.

Ensembl 2022.

Nucleic Acids Res. 2022 Jan 7;50(D1):D988-D995. doi: 10.1093/nar/gkab1049.

GenomeHubs: simple containerized setup of a custom Ensembl database and web server for any species.

Database (Oxford). 2017 Jan 1;2017. doi: 10.1093/database/bax039.

UniqTag: Content-Derived Unique and Stable Identifiers for Gene Annotation.

PLoS One. 2015 May 28;10(5):e0128026. doi: 10.1371/journal.pone.0128026. eCollection 2015.

The Ensembl gene annotation system.

Database (Oxford). 2016 Jun 23;2016. doi: 10.1093/database/baw093. Print 2016.

Ensembl regulation resources.

Database (Oxford). 2016 Feb 17;2016. doi: 10.1093/database/bav119. Print 2016.

A database and API for variation, dense genotyping and resequencing data.

BMC Bioinformatics. 2010 May 11;11:238. doi: 10.1186/1471-2105-11-238.

引用本文的文献

A deep ensemble framework for human essential gene prediction by integrating multi-omics data.

Sci Rep. 2025 Jul 21;15(1):26407. doi: 10.1038/s41598-025-99164-9.

Exon-variant interplay and multi-modal evidence identify endocrine dysregulation in severe psychiatric disorders impacting excitatory neurons.

Transl Psychiatry. 2025 Apr 19;15(1):153. doi: 10.1038/s41398-025-03366-8.

GrameneOryza: a comprehensive resource for Oryza genomes, genetic variation, and functional data.

Database (Oxford). 2025 Apr 4;2025. doi: 10.1093/database/baaf021.

Probabilistic pathway-based multimodal factor analysis.

Bioinformatics. 2024 Jun 28;40(Suppl 1):i189-i198. doi: 10.1093/bioinformatics/btae216.

ReUseData: an R/Bioconductor tool for reusable and reproducible genomic data management.

BMC Bioinformatics. 2024 Jan 3;25(1):8. doi: 10.1186/s12859-023-05626-0.

Specimen, biological structure, and spatial ontologies in support of a Human Reference Atlas.

Sci Data. 2023 Mar 27;10(1):171. doi: 10.1038/s41597-023-01993-8.

Cardiac copper content and its relationship with heart physiology: Insights based on quantitative genetic and functional analyses using BXD family mice.

Front Cardiovasc Med. 2023 Feb 2;10:1089963. doi: 10.3389/fcvm.2023.1089963. eCollection 2023.

PKD1 and PKD2 mRNA cis-inhibition drives polycystic kidney disease progression.

Nat Commun. 2022 Aug 15;13(1):4765. doi: 10.1038/s41467-022-32543-2.

Effects of Hypoxia on RNA Cargo in Extracellular Vesicles from Human Adipose-Derived Stromal/Stem Cells.

Int J Mol Sci. 2022 Jul 2;23(13):7384. doi: 10.3390/ijms23137384.

Transcriptomic signals of mitochondrial dysfunction and OXPHOS dynamics in fast-growth chicken.

PeerJ. 2022 May 4;10:e13364. doi: 10.7717/peerj.13364. eCollection 2022.

本文引用的文献

The Ensembl gene annotation system.

Database (Oxford). 2016 Jun 23;2016. doi: 10.1093/database/baw093. Print 2016.

Ensembl comparative genomics resources.

Database (Oxford). 2016 Feb 20;2016. doi: 10.1093/database/bav096. Print 2016.

Ensembl regulation resources.

Database (Oxford). 2016 Feb 17;2016. doi: 10.1093/database/bav119. Print 2016.

The 2016 database issue of Nucleic Acids Research and an updated molecular biology database collection.

Nucleic Acids Res. 2016 Jan 4;44(D1):D1-6. doi: 10.1093/nar/gkv1356.

Ensembl 2016.

Nucleic Acids Res. 2016 Jan 4;44(D1):D710-6. doi: 10.1093/nar/gkv1157. Epub 2015 Dec 19.

The International Nucleotide Sequence Database Collaboration.

Nucleic Acids Res. 2016 Jan 4;44(D1):D48-50. doi: 10.1093/nar/gkv1323. Epub 2015 Dec 10.

Wasabi: An Integrated Platform for Evolutionary Sequence Analysis and Data Visualization.

Mol Biol Evol. 2016 Apr;33(4):1126-30. doi: 10.1093/molbev/msv333. Epub 2015 Dec 3.

AREsite2: an enhanced database for the comprehensive investigation of AU/GU/U-rich elements.

Nucleic Acids Res. 2016 Jan 4;44(D1):D90-5. doi: 10.1093/nar/gkv1238. Epub 2015 Nov 23.

The UCSC Genome Browser database: 2016 update.

Nucleic Acids Res. 2016 Jan 4;44(D1):D717-25. doi: 10.1093/nar/gkv1275. Epub 2015 Nov 20.

Ensembl Genomes 2016: more genomes, more complexity.

Nucleic Acids Res. 2016 Jan 4;44(D1):D574-80. doi: 10.1093/nar/gkv1209. Epub 2015 Nov 17.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Ensembl核心软件资源：用于DNA序列和基因组注释的存储及编程访问。

Ensembl core software resources: storage and programmatic access for DNA sequence and genome annotation.

作者信息

机构信息

出版信息

UNLABELLED

DATABASE URL

未标注

数据库网址

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献