EuroPineDB: a high-coverage web database for maritime pine transcriptome.

Affiliation

Departamento de Biología Molecular y Bioquímica, Facultad de Ciencias, Campus de Teatinos s/n, Universidad de Málaga, 29071 Málaga, Spain.

Publication information

BMC Genomics. 2011 Jul 15;12:366. doi: 10.1186/1471-2164-12-366.

DOI: 10.1186/1471-2164-12-366

PMID: 21762488

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3152544/

Abstract

BACKGROUND

Pinus pinaster is an economically and ecologically important species that is becoming a woody gymnosperm model. Its enormous genome size makes whole-genome sequencing approaches hard to apply. Therefore, the expressed portion of the genome has to be characterised, and the results and annotations have to be stored in dedicated databases.

DESCRIPTION

EuroPineDB is the largest sequence collection available for a single pine species, Pinus pinaster (maritime pine). It comprises 951 641 raw sequence reads obtained from non-normalised cDNA libraries and high-throughput sequencing of adult (xylem, phloem, roots, stem, needles, cones, strobili) and embryonic (germinated embryos, buds, callus) maritime pine tissues. Using open-source tools, sequences were optimally pre-processed, assembled, and extensively annotated (GO, EC and KEGG terms, descriptions, SNPs, SSRs, ORFs and InterPro codes). As a result, 10.5× coverage of the P. pinaster genome was obtained and assembled into 55 322 UniGenes. A total of 32 919 P. pinaster UniGenes (59.5%) were annotated with at least one description, revealing at least 18 466 different genes. The complete database, which is designed to be scalable, maintainable, and expandable, is freely available at: http://www.scbi.uma.es/pindb/. Its contents can be retrieved by gene libraries, pine species, annotations, UniGenes and microarrays (i.e., the sequences are distributed in two-colour microarrays; this is the only conifer database that provides this information), and it will be periodically updated. Small assemblies can be viewed using a dedicated visualisation tool that connects them with SNPs. Any sequence or annotation set shown on-screen can be downloaded. Retrieval mechanisms for sequences and gene annotations are provided.

CONCLUSIONS

The EuroPineDB with its integrated information can be used to reveal new knowledge, offers an easy-to-use collection of information to directly support experimental work (including microarray hybridisation), and provides deeper knowledge on the maritime pine transcriptome.
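The annotation figures quoted in the Description can be checked with a few lines of arithmetic. The sketch below uses only the counts stated in the abstract; the constant and function names are illustrative and are not part of EuroPineDB or its tooling.

```python
# Counts as quoted in the abstract (not recomputed from the database itself).
UNIGENES = 55_322            # assembled UniGenes
ANNOTATED_UNIGENES = 32_919  # UniGenes with at least one description


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Share of UniGenes carrying at least one description.
annotated_pct = percent(ANNOTATED_UNIGENES, UNIGENES)

# UniGenes left without any description.
unannotated = UNIGENES - ANNOTATED_UNIGENES

print(f"Annotated UniGenes: {annotated_pct}%")
print(f"UniGenes without a description: {unannotated}")
```

The computed share agrees with the 59.5% reported in the abstract, which leaves 22 403 UniGenes without a description.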

Similar articles

EuroPineDB: a high-coverage web database for maritime pine transcriptome.

BMC Genomics. 2011 Jul 15;12:366. doi: 10.1186/1471-2164-12-366.

De novo assembly of maritime pine transcriptome: implications for forest breeding and biotechnology.

Plant Biotechnol J. 2014 Apr;12(3):286-99. doi: 10.1111/pbi.12136. Epub 2013 Nov 21.

Comprehensive assembly and analysis of the transcriptome of maritime pine developing embryos.

BMC Plant Biol. 2018 Dec 29;18(1):379. doi: 10.1186/s12870-018-1564-2.

Development and implementation of a highly-multiplexed SNP array for genetic mapping in maritime pine and comparative mapping with loblolly pine.

BMC Genomics. 2011 Jul 18;12:368. doi: 10.1186/1471-2164-12-368.

Generation and analysis of expressed sequence tags from six developing xylem libraries in Pinus radiata D. Don.

BMC Genomics. 2009 Jan 21;10:41. doi: 10.1186/1471-2164-10-41.

Transcriptome analysis in maritime pine using laser capture microdissection and 454 pyrosequencing.

Tree Physiol. 2014 Nov;34(11):1278-88. doi: 10.1093/treephys/tpt113. Epub 2014 Jan 3.

The gene expression landscape of pine seedling tissues.

Plant J. 2017 Sep;91(6):1064-1087. doi: 10.1111/tpj.13617. Epub 2017 Aug 4.

Genetic mapping of Pinus flexilis major gene (Cr4) for resistance to white pine blister rust using transcriptome-based SNP genotyping.

BMC Genomics. 2016 Sep 23;17(1):753. doi: 10.1186/s12864-016-3079-2.

Transcriptional analysis of differentially expressed genes in response to stem inclination in young seedlings of pine.

Plant Biol (Stuttg). 2012 Nov;14(6):923-33. doi: 10.1111/j.1438-8677.2012.00572.x. Epub 2012 May 30.

Combined de novo and genome guided assembly and annotation of the Pinus patula juvenile shoot transcriptome.

BMC Genomics. 2015 Dec 12;16:1057. doi: 10.1186/s12864-015-2277-7.

Cited by

Stone Pine (Pinus pinea L.) High-Added-Value Genetics: An Overview.

Genes (Basel). 2024 Jan 10;15(1):84. doi: 10.3390/genes15010084.

Evolutionary history of the Mediterranean Pinus halepensis-brutia species complex using gene-resequencing and transcriptomic approaches.

Plant Mol Biol. 2021 Jul;106(4-5):367-380. doi: 10.1007/s11103-021-01155-7. Epub 2021 May 1.

TransFlow: a modular framework for assembling and assessing accurate de novo transcriptomes in non-model organisms.

BMC Bioinformatics. 2018 Nov 20;19(Suppl 14):416. doi: 10.1186/s12859-018-2384-y.

Approaches to variant discovery for conifer transcriptome sequencing.

PLoS One. 2018 Nov 5;13(11):e0205835. doi: 10.1371/journal.pone.0205835. eCollection 2018.

Transcriptome sequencing of Pinus kesiya var. langbianensis and comparative analysis in the Pinus phylogeny.

BMC Genomics. 2018 Oct 3;19(1):725. doi: 10.1186/s12864-018-5127-6.

Molecular basis of the evolution of alternative tyrosine biosynthetic routes in plants.

Nat Chem Biol. 2017 Sep;13(9):1029-1035. doi: 10.1038/nchembio.2414. Epub 2017 Jun 26.

Combined de novo and genome guided assembly and annotation of the Pinus patula juvenile shoot transcriptome.

BMC Genomics. 2015 Dec 12;16:1057. doi: 10.1186/s12864-015-2277-7.

ReprOlive: a database with linked data for the olive tree (Olea europaea L.) reproductive transcriptome.

Front Plant Sci. 2015 Aug 11;6:625. doi: 10.3389/fpls.2015.00625. eCollection 2015.

De novo assembly, characterization and functional annotation of Senegalese sole (Solea senegalensis) and common sole (Solea solea) transcriptomes: integration in a database and design of a microarray.

BMC Genomics. 2014 Nov 3;15(1):952. doi: 10.1186/1471-2164-15-952.

Why assembling plant genome sequences is so challenging.

Biology (Basel). 2012 Sep 18;1(2):439-59. doi: 10.3390/biology1020439.
