ESTuber db: an online database for Tuber borchii EST sequences.

Authors

Lazzari Barbara, Caprera Andrea, Cosentino Cristian, Stella Alessandra, Milanesi Luciano, Viotti Angelo

Affiliation

Istituto di Biologia e Biotecnologia Agraria, via Bassini 15, 20133 Milan, Italy.

Publication information

BMC Bioinformatics. 2007 Mar 8;8 Suppl 1(Suppl 1):S13. doi: 10.1186/1471-2105-8-S1-S13.

DOI:10.1186/1471-2105-8-S1-S13

PMID:17430557

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC1885842/

Abstract

BACKGROUND

The ESTuber database (http://www.itb.cnr.it/estuber) includes 3,271 Tuber borchii expressed sequence tags (ESTs). The dataset consists of 2,389 sequences from a cDNA library prepared in-house from truffle vegetative hyphae, and 882 sequences downloaded from GenBank, representing four libraries from white truffle mycelia and ascocarps at different developmental stages. An automated pipeline was built to process the EST sequences, combining public software with in-house Perl scripts. The data are stored in a MySQL database, which can be queried through a PHP-based web interface.
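The storage and query layer described above can be illustrated with a minimal sketch. It is not the ESTuber schema: the table name `est`, its columns, the library labels, and the toy records are all hypothetical, and Python's built-in `sqlite3` module stands in for the MySQL server so that the example runs without any setup.

```python
import sqlite3


def build_demo_db(records):
    """Create an in-memory table of EST records (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE est ("
        "est_id TEXT PRIMARY KEY, "
        "library TEXT NOT NULL, "
        "sequence TEXT NOT NULL)"
    )
    conn.executemany("INSERT INTO est VALUES (?, ?, ?)", records)
    conn.commit()
    return conn


def count_by_library(conn):
    """Return a {library: number of ESTs} mapping."""
    cursor = conn.execute("SELECT library, COUNT(*) FROM est GROUP BY library")
    return dict(cursor.fetchall())


def get_sequence(conn, est_id):
    """Return the sequence stored for one EST id, or None if it is absent."""
    row = conn.execute(
        "SELECT sequence FROM est WHERE est_id = ?", (est_id,)
    ).fetchone()
    return row[0] if row else None


# Toy records: identifiers, library labels and sequences are placeholders.
demo_records = [
    ("EST0001", "in_house_hyphae", "ATGGCTAGCTAGGATC"),
    ("EST0002", "in_house_hyphae", "GGCTTAACGTTAGCAA"),
    ("EST0003", "genbank_public", "TTGACCGATAGGCTAA"),
]
conn = build_demo_db(demo_records)
print(count_by_library(conn))
print(get_sequence(conn, "EST0003"))
```

A web front end would issue the same kind of parameterized `SELECT` statements; the placeholders (`?`) keep user-supplied search terms out of the SQL text.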

RESULTS

Sequences included in the ESTuber db were clustered and annotated against three databases: the GenBank nr database, the UniProtKB database, and a third database of fungal genomic sequences prepared in-house. An algorithm was implemented to infer a statistical classification among Gene Ontology categories from the ontology occurrences deduced from the annotation against the UniProtKB database. Ontologies were also deduced from the annotation of more than 130,000 EST sequences from five filamentous fungi, for intra-species comparison purposes. Further analyses were performed on the ESTuber db dataset, including a tandem repeat search and a comparison of the putative protein dataset inferred from the EST sequences against the PROSITE database to identify protein patterns. All analyses were performed both on the complete sequence dataset and on the contig consensus sequences generated by the EST assembly procedure.
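To make the PROSITE comparison step concrete, the sketch below translates a PROSITE pattern into a regular expression and scans a protein sequence for it. This is an illustration only, not the tool used by the authors; the function names and the toy sequence are hypothetical. It handles the common pattern elements (`x`, `[..]`, `{..}`, repetition counts, and `<` / `>` anchors at the ends of a pattern) and ignores rarer cases such as an anchor inside square brackets.

```python
import re


def prosite_to_regex(pattern):
    """Translate a PROSITE pattern string into a Python regular expression."""
    pattern = pattern.rstrip(".")  # PROSITE entries end patterns with a period
    parts = []
    for element in pattern.split("-"):
        prefix = suffix = ""
        if element.startswith("<"):  # pattern anchored at the N-terminus
            prefix, element = "^", element[1:]
        if element.endswith(">"):  # pattern anchored at the C-terminus
            suffix, element = "$", element[:-1]
        # Split off an optional repetition such as (3) or (2,4).
        match = re.fullmatch(r"(.+?)(?:\((\d+(?:,\d+)?)\))?", element)
        core, repeat = match.group(1), match.group(2)
        if core == "x":  # any residue
            core = "."
        elif core.startswith("{") and core.endswith("}"):  # excluded residues
            core = "[^" + core[1:-1] + "]"
        # Bracketed sets such as [ST] and single residues pass through as-is.
        quantifier = "{" + repeat + "}" if repeat else ""
        parts.append(prefix + core + quantifier + suffix)
    return "".join(parts)


def find_sites(pattern, protein):
    """Return (start index, matched text) pairs, including overlapping hits."""
    regex = re.compile("(?=(" + prosite_to_regex(pattern) + "))")
    return [(m.start(), m.group(1)) for m in regex.finditer(protein)]


# PS00001, the N-glycosylation site pattern, on a made-up peptide.
print(prosite_to_regex("N-{P}-[ST]-{P}."))
print(find_sites("N-{P}-[ST]-{P}.", "MKNGSAANPTQ"))
```

The lookahead wrapper in `find_sites` reports overlapping occurrences, which a plain left-to-right scan would skip. Start positions are 0-based.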

CONCLUSION

The resulting web site is a resource of data and links related to truffle expressed genes. The Sequence Report and Contig Report pages are the core structures of the web interface; together with the Text search utility and the Blast utility, they allow easy access to the data stored in the database.
