• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

构建人、小鼠和大鼠的代表性转录本和蛋白质集,作为其转录组和蛋白质组分析的平台。

Construction of representative transcript and protein sets of human, mouse, and rat as a platform for their transcriptome and proteome analysis.

作者信息

Kasukawa Takeya, Katayama Shintaro, Kawaji Hideya, Suzuki Harukazu, Hume David A, Hayashizaki Yoshihide

机构信息

Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Kanagawa 230-0045, Japan.

出版信息

Genomics. 2004 Dec;84(6):913-21. doi: 10.1016/j.ygeno.2004.08.011.

DOI:10.1016/j.ygeno.2004.08.011
PMID:15533708
Abstract

The number of mammalian transcripts identified by full-length cDNA projects and genome sequencing projects is increasing remarkably. Clustering them into a strictly nonredundant and comprehensive set provides a platform for functional analysis of the transcriptome and proteome, but the quality of the clustering and predictive usefulness have previously required manual curation to identify truncated transcripts and inappropriate clustering of closely related sequences. A Representative Transcript and Protein Sets (RTPS) pipeline was previously designed to identify the nonredundant and comprehensive set of mouse transcripts based on clustering of a large mouse full-length cDNA set (FANTOM2). Here we propose an alternative method that is more robust, requires less manual curation, and is applicable to other organisms in addition to mouse. RTPSs of human, mouse, and rat have been produced by this method and used for validation. Their comprehensiveness and quality are discussed by comparison with other clustering approaches. The RTPSs are available at .

摘要

通过全长cDNA项目和基因组测序项目鉴定出的哺乳动物转录本数量正在显著增加。将它们聚类成一个严格非冗余且全面的集合,为转录组和蛋白质组的功能分析提供了一个平台,但聚类的质量和预测实用性此前需要人工整理,以识别截短的转录本和密切相关序列的不恰当聚类。此前设计了一种代表性转录本和蛋白质集(RTPS)流程,基于对大量小鼠全长cDNA集(FANTOM2)的聚类来鉴定小鼠转录本的非冗余且全面的集合。在此,我们提出一种更稳健、所需人工整理更少且除小鼠外还适用于其他生物的替代方法。通过此方法已生成了人、小鼠和大鼠的RTPS并用于验证。通过与其他聚类方法比较,讨论了它们的全面性和质量。RTPS可在……获取。

相似文献

1
Construction of representative transcript and protein sets of human, mouse, and rat as a platform for their transcriptome and proteome analysis.构建人、小鼠和大鼠的代表性转录本和蛋白质集,作为其转录组和蛋白质组分析的平台。
Genomics. 2004 Dec;84(6):913-21. doi: 10.1016/j.ygeno.2004.08.011.
2
Transcript annotation in FANTOM3: mouse gene catalog based on physical cDNAs.FANTOM3中的转录本注释:基于物理cDNA的小鼠基因目录。
PLoS Genet. 2006 Apr;2(4):e62. doi: 10.1371/journal.pgen.0020062.
3
Human disease genes and their cloned mouse orthologs: exploration of the FANTOM2 cDNA sequence data set.人类疾病基因及其克隆的小鼠直系同源基因:FANTOM2 cDNA序列数据集的探索
Genome Res. 2003 Jun;13(6B):1496-500. doi: 10.1101/gr.979503.
4
Identification of "pathologs" (disease-related genes) from the RIKEN mouse cDNA dataset using human curation plus FACTS, a new biological information extraction system.利用人工筛选加上FACTS(一种新型生物信息提取系统),从理化学研究所小鼠cDNA数据集中鉴定“病理同源基因”(疾病相关基因)。
BMC Genomics. 2004 Apr 29;5(1):28. doi: 10.1186/1471-2164-5-28.
5
Characterization of 954 bovine full-CDS cDNA sequences.954条牛全长编码序列(CDS)cDNA序列的特征分析
BMC Genomics. 2005 Nov 23;6:166. doi: 10.1186/1471-2164-6-166.
6
[Transcriptomes for serial analysis of gene expression].[用于基因表达序列分析的转录组]
J Soc Biol. 2002;196(4):303-7.
7
Transcriptome analyses of human genes and applications for proteome analyses.人类基因的转录组分析及其在蛋白质组分析中的应用。
Curr Protein Pept Sci. 2006 Apr;7(2):147-63. doi: 10.2174/138920306776359795.
8
Development and evaluation of an automated annotation pipeline and cDNA annotation system.自动化注释流程及cDNA注释系统的开发与评估
Genome Res. 2003 Jun;13(6B):1542-51. doi: 10.1101/gr.992803.
9
Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs.基于60770个全长cDNA功能注释的小鼠转录组分析。
Nature. 2002 Dec 5;420(6915):563-73. doi: 10.1038/nature01266.
10
Mouse proteome analysis.小鼠蛋白质组分析。
Genome Res. 2003 Jun;13(6B):1335-44. doi: 10.1101/gr.978703.

引用本文的文献

1
Gateways to the FANTOM5 promoter level mammalian expression atlas.通向FANTOM5启动子水平哺乳动物表达图谱的途径。
Genome Biol. 2015 Jan 5;16(1):22. doi: 10.1186/s13059-014-0560-6.
2
Projection of gene-protein networks to the functional space of the proteome and its application to analysis of organism complexity.将基因-蛋白质网络投射到蛋白质组的功能空间及其在分析生物复杂性中的应用。
BMC Genomics. 2010 Feb 10;11 Suppl 1(Suppl 1):S4. doi: 10.1186/1471-2164-11-S1-S4.
3
Tissue-specific functions based on information content of gene ontology using cap analysis gene expression.
基于基因本体信息内容,利用帽分析基因表达的组织特异性功能。
Med Biol Eng Comput. 2007 Nov;45(11):1029-36. doi: 10.1007/s11517-007-0274-y. Epub 2007 Oct 30.
4
Large-scale clustering of CAGE tag expression data.CAGE标签表达数据的大规模聚类
BMC Bioinformatics. 2007 May 21;8:161. doi: 10.1186/1471-2105-8-161.
5
A method for similarity search of genomic positional expression using CAGE.一种使用CAGE进行基因组位置表达相似性搜索的方法。
PLoS Genet. 2006 Apr;2(4):e44. doi: 10.1371/journal.pgen.0020044. Epub 2006 Apr 28.
6
Clusters of internally primed transcripts reveal novel long noncoding RNAs.内部引发转录本簇揭示了新型长链非编码RNA。
PLoS Genet. 2006 Apr;2(4):e37. doi: 10.1371/journal.pgen.0020037. Epub 2006 Apr 28.
7
The 3of5 web application for complex and comprehensive pattern matching in protein sequences.用于蛋白质序列中复杂全面模式匹配的3of5网络应用程序。
BMC Bioinformatics. 2006 Mar 16;7:144. doi: 10.1186/1471-2105-7-144.
8
CAGE Basic/Analysis Databases: the CAGE resource for comprehensive promoter analysis.CAGE基础/分析数据库:用于全面启动子分析的CAGE资源。
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D632-6. doi: 10.1093/nar/gkj034.