Suppr超能文献

欧松数据库:一个高覆盖度的欧洲赤松转录组学网络数据库。

EuroPineDB: a high-coverage web database for maritime pine transcriptome.

机构信息

Departamento de Biología Molecular y Bioquímica, Facultad de Ciencias, Campus de Teatinos s/n, Universidad de Málaga, 29071 Málaga, Spain.

出版信息

BMC Genomics. 2011 Jul 15;12:366. doi: 10.1186/1471-2164-12-366.

Abstract

BACKGROUND

Pinus pinaster is an economically and ecologically important species that is becoming a woody gymnosperm model. Its enormous genome size makes whole-genome sequencing approaches are hard to apply. Therefore, the expressed portion of the genome has to be characterised and the results and annotations have to be stored in dedicated databases.

DESCRIPTION

EuroPineDB is the largest sequence collection available for a single pine species, Pinus pinaster (maritime pine), since it comprises 951 641 raw sequence reads obtained from non-normalised cDNA libraries and high-throughput sequencing from adult (xylem, phloem, roots, stem, needles, cones, strobili) and embryonic (germinated embryos, buds, callus) maritime pine tissues. Using open-source tools, sequences were optimally pre-processed, assembled, and extensively annotated (GO, EC and KEGG terms, descriptions, SNPs, SSRs, ORFs and InterPro codes). As a result, a 10.5× P. pinaster genome was covered and assembled in 55 322 UniGenes. A total of 32 919 (59.5%) of P. pinaster UniGenes were annotated with at least one description, revealing at least 18 466 different genes. The complete database, which is designed to be scalable, maintainable, and expandable, is freely available at: http://www.scbi.uma.es/pindb/. It can be retrieved by gene libraries, pine species, annotations, UniGenes and microarrays (i.e., the sequences are distributed in two-colour microarrays; this is the only conifer database that provides this information) and will be periodically updated. Small assemblies can be viewed using a dedicated visualisation tool that connects them with SNPs. Any sequence or annotation set shown on-screen can be downloaded. Retrieval mechanisms for sequences and gene annotations are provided.

CONCLUSIONS

The EuroPineDB with its integrated information can be used to reveal new knowledge, offers an easy-to-use collection of information to directly support experimental work (including microarray hybridisation), and provides deeper knowledge on the maritime pine transcriptome.

摘要

背景

欧洲赤松是一种具有重要经济和生态意义的物种,正在成为一种木本裸子植物模式生物。其巨大的基因组大小使得全基因组测序方法难以应用。因此,必须对基因组的表达部分进行特征描述,并将结果和注释存储在专用数据库中。

描述

EuroPineDB 是迄今为止可用于单一松树物种(欧洲赤松,即沿海松)的最大序列集合,因为它包含了 951641 个从非标准化 cDNA 文库和高通量测序获得的原始序列读段,这些序列来自成年(木质部、韧皮部、根、茎、针叶、球果、雄球花)和胚胎(萌发的胚胎、芽、愈伤组织)的欧洲赤松组织。使用开源工具,对序列进行了最佳的预处理、组装和广泛的注释(GO、EC 和 KEGG 术语、描述、SNP、SSR、ORF 和 InterPro 代码)。结果,覆盖了 10.5 倍的欧洲赤松基因组,并组装成 55322 个 UniGenes。共有 32919 个(59.5%)欧洲赤松 UniGenes 至少被注释了一个描述,揭示了至少 18466 个不同的基因。该完整数据库旨在实现可扩展、可维护和可扩展,可免费在以下网址获得:http://www.scbi.uma.es/pindb/。它可以通过基因文库、松树物种、注释、UniGenes 和微阵列(即,这些序列分布在双色微阵列中;这是唯一提供此信息的针叶树数据库)进行检索,并将定期更新。可以使用专用可视化工具查看小的组装,该工具将它们与 SNPs 连接起来。屏幕上显示的任何序列或注释集都可以下载。还提供了用于序列和基因注释的检索机制。

结论

具有集成信息的 EuroPineDB 可用于揭示新知识,为直接支持实验工作(包括微阵列杂交)提供易于使用的信息集合,并提供关于沿海松转录组的更深入知识。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7281/3152544/8b69fc822202/1471-2164-12-366-1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验