• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Identitag,一个用于SAGE标签识别和SAGE文库种间比较的关系数据库。

Identitag, a relational database for SAGE tag identification and interspecies comparison of SAGE libraries.

作者信息

Keime Céline, Damiola Francesca, Mouchiroud Dominique, Duret Laurent, Gandrillon Olivier

机构信息

Equipe Signalisation et identités cellulaires, Centre de Génétique Moléculaire et Cellulaire CNRS UMR 5534, Université Claude Bernard Lyon 1, bâtiment Gregor Mendel, 16 rue Raphaël Dubois 69622 Villeurbanne cedex France.

出版信息

BMC Bioinformatics. 2004 Oct 6;5:143. doi: 10.1186/1471-2105-5-143.

DOI:10.1186/1471-2105-5-143
PMID:15469608
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC535903/
Abstract

BACKGROUND

Serial Analysis of Gene Expression (SAGE) is a method of large-scale gene expression analysis that has the potential to generate the full list of mRNAs present within a cell population at a given time and their frequency. An essential step in SAGE library analysis is the unambiguous assignment of each 14 bp tag to the transcript from which it was derived. This process, called tag-to-gene mapping, represents a step that has to be improved in the analysis of SAGE libraries. Indeed, the existing web sites providing correspondence between tags and transcripts do not concern all species for which numerous EST and cDNA have already been sequenced.

RESULTS

This is the reason why we designed and implemented a freely available tool called Identitag for tag identification that can be used in any species for which transcript sequences are available. Identitag is based on a relational database structure in order to allow rapid and easy storage and updating of data and, most importantly, in order to be able to precisely define identification parameters. This structure can be seen like three interconnected modules : the first one stores virtual tags extracted from a given list of transcript sequences, the second stores experimental tags observed in SAGE experiments, and the third allows the annotation of the transcript sequences used for virtual tag extraction. It therefore connects an observed tag to a virtual tag and to the sequence it comes from, and then to its functional annotation when available. Databases made from different species can be connected according to orthology relationship thus allowing the comparison of SAGE libraries between species. We successfully used Identitag to identify tags from our chicken SAGE libraries and for chicken to human SAGE tags interspecies comparison. Identitag sources are freely available on http://pbil.univ-lyon1.fr/software/identitag/ web site.

CONCLUSIONS

Identitag is a flexible and powerful tool for tag identification in any single species and for interspecies comparison of SAGE libraries. It opens the way to comparative transcriptomic analysis, an emerging branch of biology.

摘要

背景

基因表达序列分析(SAGE)是一种大规模基因表达分析方法,它有潜力生成特定时间内细胞群体中存在的所有mRNA及其频率的完整列表。SAGE文库分析中的一个关键步骤是将每个14bp标签明确地分配到其来源的转录本。这个过程称为标签到基因的映射,是SAGE文库分析中有待改进的一个步骤。实际上,现有的提供标签与转录本对应关系的网站并不涵盖所有已有大量EST和cDNA测序的物种。

结果

这就是我们设计并实现了一个名为Identitag的免费工具用于标签识别的原因,该工具可用于任何有转录本序列的物种。Identitag基于关系数据库结构,以便能够快速、轻松地存储和更新数据,最重要的是,能够精确地定义识别参数。这种结构可以看作是三个相互连接的模块:第一个模块存储从给定转录本序列列表中提取的虚拟标签,第二个模块存储在SAGE实验中观察到的实验标签,第三个模块允许对用于提取虚拟标签的转录本序列进行注释。因此,它将一个观察到的标签与一个虚拟标签及其来源序列连接起来,然后在有可用功能注释时将其与功能注释连接起来。根据直系同源关系可以连接来自不同物种的数据库,从而允许比较不同物种之间的SAGE文库。我们成功地使用Identitag从我们的鸡SAGE文库中识别标签,并用于鸡与人的SAGE标签的种间比较。Identitag的源代码可在http://pbil.univ-lyon1.fr/software/identitag/网站上免费获取。

结论

Identitag是一种灵活且强大的工具,可用于任何单个物种的标签识别以及SAGE文库的种间比较。它为比较转录组学分析这一生物学新兴分支开辟了道路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ddf/535903/0d0b4a8fdeb3/1471-2105-5-143-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ddf/535903/718bb619d14c/1471-2105-5-143-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ddf/535903/fd6a61edc5d9/1471-2105-5-143-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ddf/535903/825c6f8ab920/1471-2105-5-143-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ddf/535903/7156dddd47e5/1471-2105-5-143-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ddf/535903/0d0b4a8fdeb3/1471-2105-5-143-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ddf/535903/718bb619d14c/1471-2105-5-143-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ddf/535903/fd6a61edc5d9/1471-2105-5-143-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ddf/535903/825c6f8ab920/1471-2105-5-143-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ddf/535903/7156dddd47e5/1471-2105-5-143-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ddf/535903/0d0b4a8fdeb3/1471-2105-5-143-5.jpg

相似文献

1
Identitag, a relational database for SAGE tag identification and interspecies comparison of SAGE libraries.Identitag,一个用于SAGE标签识别和SAGE文库种间比较的关系数据库。
BMC Bioinformatics. 2004 Oct 6;5:143. doi: 10.1186/1471-2105-5-143.
2
[Transcriptomes for serial analysis of gene expression].[用于基因表达序列分析的转录组]
J Soc Biol. 2002;196(4):303-7.
3
Characterization of 954 bovine full-CDS cDNA sequences.954条牛全长编码序列(CDS)cDNA序列的特征分析
BMC Genomics. 2005 Nov 23;6:166. doi: 10.1186/1471-2164-6-166.
4
Statistical modeling of sequencing errors in SAGE libraries.SAGE文库中测序错误的统计建模
Bioinformatics. 2004 Aug 4;20 Suppl 1:i31-9. doi: 10.1093/bioinformatics/bth924.
5
Gene Class expression: analysis tool of Gene Ontology terms with gene expression data.基因类表达:结合基因表达数据的基因本体术语分析工具。
Genet Mol Res. 2006 Mar 31;5(1):108-14.
6
WEBSAGE: a web tool for visual analysis of differentially expressed human SAGE tags.WEBSAGE:用于差异表达的人类SAGE标签可视化分析的网络工具。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W693-5. doi: 10.1093/nar/gki444.
7
The Mouse SAGE Site: database of public mouse SAGE libraries.小鼠基因表达连续分析标签位点数据库:公共小鼠基因表达连续分析标签文库数据库。
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D482-3. doi: 10.1093/nar/gkh058.
8
Cloning of tissue-specific genes using serial analysis of gene expression and a novel computational substraction approach.利用基因表达序列分析和一种新型计算扣除法克隆组织特异性基因。
Genomics. 2001 Jul;75(1-3):70-6. doi: 10.1006/geno.2001.6586.
9
Small amplified RNA-SAGE.小扩增RNA-基因表达序列分析
Methods Mol Biol. 2004;258:135-52. doi: 10.1385/1-59259-751-3:135.
10
Annotating nonspecific SAGE tags with microarray data.用微阵列数据注释非特异性SAGE标签。
Genomics. 2006 Jan;87(1):173-80. doi: 10.1016/j.ygeno.2005.08.014. Epub 2005 Nov 28.

引用本文的文献

1
In-depth global analysis of transcript abundance levels in porcine alveolar macrophages following infection with porcine reproductive and respiratory syndrome virus.猪繁殖与呼吸综合征病毒感染后猪肺泡巨噬细胞中转录本丰度水平的深入全球分析。
Adv Virol. 2010;2010:864181. doi: 10.1155/2010/864181. Epub 2011 Jan 12.
2
An atlas of bovine gene expression reveals novel distinctive tissue characteristics and evidence for improving genome annotation.牛基因表达图谱揭示了新的独特组织特征,并为改进基因组注释提供了证据。
Genome Biol. 2010;11(10):R102. doi: 10.1186/gb-2010-11-10-r102. Epub 2010 Oct 20.
3
Gill transcriptome response to changes in environmental calcium in the green spotted puffer fish.

本文引用的文献

1
Global transcription analysis of immature avian erythrocytic progenitors: from self-renewal to differentiation.未成熟禽类红细胞祖细胞的全转录组分析:从自我更新到分化
Oncogene. 2004 Oct 7;23(46):7628-43. doi: 10.1038/sj.onc.1208061.
2
A neutral model of transcriptome evolution.转录组进化的中性模型。
PLoS Biol. 2004 May;2(5):E132. doi: 10.1371/journal.pbio.0020132. Epub 2004 May 11.
3
Incongruent expression profiles between human and mouse orthologous genes suggest widespread neutral evolution of transcription control.
绿斑河豚对环境钙变化的 Gill 转录组反应。
BMC Genomics. 2010 Aug 17;11:476. doi: 10.1186/1471-2164-11-476.
4
Gene expression profiling via LongSAGE in a non-model plant species: a case study in seeds of Brassica napus.通过LongSAGE技术对非模式植物物种进行基因表达谱分析:以甘蓝型油菜种子为例的研究
BMC Genomics. 2009 Jul 3;10:295. doi: 10.1186/1471-2164-10-295.
5
A score system for quality evaluation of RNA sequence tags: an improvement for gene expression profiling.一种用于RNA序列标签质量评估的评分系统:基因表达谱分析的改进方法
BMC Bioinformatics. 2009 Jun 6;10:170. doi: 10.1186/1471-2105-10-170.
6
Clustering-based approaches to SAGE data mining.基于聚类的 SAGE 数据挖掘方法。
BioData Min. 2008 Jul 17;1(1):5. doi: 10.1186/1756-0381-1-5.
7
SQUAT: A web tool to mine human, murine and avian SAGE data.SQUAT:一种挖掘人类、小鼠和禽类SAGE数据的网络工具。
BMC Bioinformatics. 2008 Sep 18;9:378. doi: 10.1186/1471-2105-9-378.
8
Large-scale analysis by SAGE reveals new mechanisms of v-erbA oncogene action.SAGE的大规模分析揭示了v-erbA癌基因作用的新机制。
BMC Genomics. 2007 Oct 26;8:390. doi: 10.1186/1471-2164-8-390.
9
SAGExplore: a web server for unambiguous tag mapping in serial analysis of gene expression oriented to gene discovery and annotation.SAGExplore:一个用于基因表达序列分析中明确标签映射的网络服务器,旨在进行基因发现和注释。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W163-8. doi: 10.1093/nar/gkm429. Epub 2007 Jul 10.
10
Unexpected observations after mapping LongSAGE tags to the human genome.将长链SAGE标签定位到人类基因组后出现的意外观察结果。
BMC Bioinformatics. 2007 May 15;8:154. doi: 10.1186/1471-2105-8-154.
人类和小鼠直系同源基因之间不一致的表达谱表明转录调控存在广泛的中性进化。
OMICS. 2004 Spring;8(1):15-24. doi: 10.1089/153623104773547462.
4
A comprehensive collection of chicken cDNAs.鸡cDNA的全面集合。
Curr Biol. 2002 Nov 19;12(22):1965-9. doi: 10.1016/s0960-9822(02)01296-4.
5
Transcriptome analysis of monocytic leukemia cell differentiation.单核细胞白血病细胞分化的转录组分析
Genomics. 2002 Sep;80(3):361-71. doi: 10.1006/geno.2002.6836.
6
Using the transcriptome to annotate the genome.利用转录组注释基因组。
Nat Biotechnol. 2002 May;20(5):508-12. doi: 10.1038/nbt0502-508.
7
TGF-beta cooperates with TGF-alpha to induce the self-renewal of normal erythrocytic progenitors: evidence for an autocrine mechanism.转化生长因子-β与转化生长因子-α协同诱导正常红细胞祖细胞的自我更新:自分泌机制的证据。
EMBO J. 1999 May 17;18(10):2764-81. doi: 10.1093/emboj/18.10.2764.
8
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.空位BLAST和位置特异性迭代BLAST:新一代蛋白质数据库搜索程序。
Nucleic Acids Res. 1997 Sep 1;25(17):3389-402. doi: 10.1093/nar/25.17.3389.
9
WWW-query: an on-line retrieval system for biological sequence banks.万维网查询:一种用于生物序列库的在线检索系统。
Biochimie. 1996;78(5):364-9. doi: 10.1016/0300-9084(96)84768-7.
10
Serial analysis of gene expression.基因表达序列分析
Science. 1995 Oct 20;270(5235):484-7. doi: 10.1126/science.270.5235.484.