De novo transcriptome assembly databases for the central nervous system of the medicinal leech.

Affiliations

Faculty of Life Sciences, Bar Ilan University, Ramat Gan 5290002, Israel; Faculty of Engineering, Bar Ilan University, Ramat Gan 5290002, Israel; Institute of Nanotechnologies and Advanced Materials, Bar Ilan University, Ramat Gan 5290002, Israel.

Faculty of Engineering, Bar Ilan University, Ramat Gan 5290002, Israel; Institute of Nanotechnologies and Advanced Materials, Bar Ilan University, Ramat Gan 5290002, Israel.

Publication information

Sci Data. 2015 Apr 28;2:150015. doi: 10.1038/sdata.2015.15. eCollection 2015.

Abstract

The study of non-model organisms stands to benefit greatly from genetic and genomic data. To better understand the molecular mechanisms driving neuronal development, and to characterize the entire central nervous system (CNS) transcriptome of the leech Hirudo medicinalis, we combined Illumina HiSeq2000 RNA-Seq with de novo assembly using Trinity. We present a set of 73,493 de novo assembled transcripts for the leech, reconstructed from RNA collected from the CNS at single-ganglion resolution. This set of transcripts greatly enriches the available data for the leech. Here, we share two databases, each allowing a different type of search for candidate homologues. The first is the raw set of assembled transcripts, which allows a sequence-based search. A comprehensive analysis of this set, aligned against the Swiss-Prot database, revealed 22,604 contigs with high e-values. This analysis enabled the production of the second database, which links sequences to annotated transcript names, with the confidence of the BLAST best hit.
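As an illustration of the "BLAST best hit" idea behind the second database, the following is a minimal sketch, not the authors' pipeline. It assumes the alignment results are available in the standard 12-column BLAST tabular format (`-outfmt 6`) and keeps, for each contig, the hit with the lowest e-value. The function name `best_hits`, the `max_evalue` cutoff, and the example rows are all hypothetical; the abstract does not state which thresholds or file formats were actually used.

```python
import csv
import io

# Standard 12 columns of BLAST tabular output (-outfmt 6).
COLUMNS = ["qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
           "qstart", "qend", "sstart", "send", "evalue", "bitscore"]


def best_hits(handle, max_evalue=1e-5):
    """Return {contig_id: (subject_id, evalue, bitscore)}, one best hit per contig.

    max_evalue is a hypothetical cutoff chosen for illustration only.
    """
    best = {}
    for row in csv.reader(handle, delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and comment lines
        rec = dict(zip(COLUMNS, row))
        evalue = float(rec["evalue"])
        bitscore = float(rec["bitscore"])
        if evalue > max_evalue:
            continue  # hit is not significant under the assumed cutoff
        current = best.get(rec["qseqid"])
        # Lower e-value wins; ties are broken by the higher bit score.
        if current is None or (evalue, -bitscore) < (current[1], -current[2]):
            best[rec["qseqid"]] = (rec["sseqid"], evalue, bitscore)
    return best


# Made-up example rows (not taken from the published databases).
EXAMPLE = (
    "contig_1\tsp|X00001|FAKE1_HUMAN\t85.0\t200\t30\t0\t1\t200\t1\t200\t1e-50\t180\n"
    "contig_1\tsp|X00002|FAKE2_MOUSE\t60.0\t150\t60\t0\t1\t150\t1\t150\t1e-10\t70\n"
    "contig_2\tsp|X00003|FAKE3_DROME\t40.0\t50\t30\t0\t1\t50\t1\t50\t0.5\t25\n"
)

if __name__ == "__main__":
    for contig, (subject, evalue, bitscore) in best_hits(io.StringIO(EXAMPLE)).items():
        print(contig, subject, evalue, bitscore)
```

In this sketch a contig with no hit under the cutoff is simply left unannotated, which mirrors the distinction between the raw transcript set and the annotated subset.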

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbd2/4412018/e85f91303a26/sdata201515-f1.jpg
