Suppr超能文献

从日本真珠贝(Mizuhopecten yessoensis)的闭壳肌中生成和分析表达序列标签。

Generation and analysis of expressed sequence tags from adductor muscle of Japanese scallop Mizuhopecten yessoensis.

机构信息

College of Life Science and Biotechnology, Dalian Ocean University, Dalian 116023, China.

出版信息

Comp Biochem Physiol Part D Genomics Proteomics. 2010 Dec;5(4):288-94. doi: 10.1016/j.cbd.2010.08.002. Epub 2010 Aug 19.

Abstract

A normalized cDNA library was constructed from the adductor muscle of M. yessoensis and acquired 4595 high quality expressed sequence tags (ESTs). After clustering and assembly of the ESTs, 3061 unigenes containing 654 contigs and 2407 singletons were identified. The contig length ranged from 266 bp to 2364 bp and the average length of these contigs was 544 bp. Blastx nonredundant protein database analysis showed that 1522 unigenes had significant homology to known genes (E value ≤ 10⁻⁵). By comparing to Clusters of Orthologous Groups (COG) categories, 460 unigenes were annotated (E value ≤10(-10)). Using Kyoto Encyclopedia of Genes and Genomes (KEGG), 345 of 3061 unigenes were assigned into 103 pathways (E value ≤ 10⁻⁵). For InterProScan searches, 1237 unigenes were annotated containing 727 different types of protein domains. 941 of the 1237 unigenes were annotated for Gene Ontology (GO) classification using Uniprot2GO associations in any category (biological, cellular, and molecular). By sequences comparability and analysis of Blastx NCBI nonredundant protein database and KEGG, 66 unigenes were identified that may be involved in genetic information processing based on the known knowledge. The study provides a material basis as useful information for the genomic analysis of shellfish.

摘要

从太平洋牡蛎的肌肉组织构建了一个归一化 cDNA 文库,共获得 4595 个高质量的表达序列标签(EST)。对 EST 进行聚类和组装后,鉴定出 3061 个包含 654 个连续序列和 2407 个单序列的 unigene。连续序列的长度范围为 266 bp 至 2364 bp,这些连续序列的平均长度为 544 bp。Blastx 非冗余蛋白质数据库分析表明,1522 个 unigene 与已知基因具有显著同源性(E 值≤10⁻⁵)。通过与同源基因簇(COG)类别比较,对 460 个 unigene 进行了注释(E 值≤10⁻¹⁰)。利用京都基因与基因组百科全书(KEGG),将 3061 个 unigene 中的 345 个分配到 103 条途径中(E 值≤10⁻⁵)。通过 InterProScan 搜索,对 1237 个 unigene 进行了注释,其中包含 727 种不同类型的蛋白质结构域。使用 Uniprot2GO 关联,在任何类别(生物、细胞和分子)中对 1237 个 unigene 中的 941 个进行了基因本体论(GO)分类注释。通过序列可比性和 Blastx NCBI 非冗余蛋白质数据库以及 KEGG 的分析,鉴定出 66 个可能参与遗传信息处理的 unigene,这是基于已知知识的。该研究为贝类基因组分析提供了有用的信息和物质基础。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验