70万个开放阅读框序列标签对人类转录组定义的贡献。

The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome.

作者信息

Camargo A A, Samaia H P, Dias-Neto E, Simão D F, Migotto I A, Briones M R, Costa F F, Nagai M A, Verjovski-Almeida S, Zago M A, Andrade L E, Carrer H, El-Dorry H F, Espreafico E M, Habr-Gama A, Giannella-Neto D, Goldman G H, Gruber A, Hackel C, Kimura E T, Maciel R M, Marie S K, Martins E A, Nobrega M P, Paco-Larson M L, Pardini M I, Pereira G G, Pesquero J B, Rodrigues V, Rogatto S R, da Silva I D, Sogayar M C, Sonati M F, Tajara E H, Valentini S R, Alberto F L, Amaral M E, Aneas I, Arnaldi L A, de Assis A M, Bengtson M H, Bergamo N A, Bombonato V, de Camargo M E, Canevari R A, Carraro D M, Cerutti J M, Correa M L, Correa R F, Costa M C, Curcio C, Hokama P O, Ferreira A J, Furuzawa G K, Gushiken T, Ho P L, Kimura E, Krieger J E, Leite L C, Majumder P, Marins M, Marques E R, Melo A S, Melo M B, Mestriner C A, Miracca E C, Miranda D C, Nascimento A L, Nobrega F G, Ojopi E P, Pandolfi J R, Pessoa L G, Prevedel A C, Rahal P, Rainho C A, Reis E M, Ribeiro M L, da Ros N, de Sa R G, Sales M M, Sant'anna S C, dos Santos M L, da Silva A M, da Silva N P, Silva W A, da Silveira R A, Sousa J F, Stecconi D, Tsukumo F, Valente V, Soares F, Moreira E S, Nunes D N, Correa R G, Zalcberg H, Carvalho A F, Reis L F, Brentani R R, Simpson A J, de Souza S J

机构信息

Ludwig Institute for Cancer Research, 01509-010, São Paulo, Brazil.

出版信息

Proc Natl Acad Sci U S A. 2001 Oct 9;98(21):12103-8. doi: 10.1073/pnas.201182798.

DOI:10.1073/pnas.201182798

PMID:11593022

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC59775/

Abstract

Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription-PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning.

摘要

开放阅读框表达序列标签（ORESTES）与传统的EST不同，它提供的是转录本中央蛋白质编码部分的序列数据。我们从24种人体组织中总共生成了696,745条ORESTES序列，并使用了与一组15,095个全长mRNA相对应的数据子集，以此来评估该策略的效率及其对人类转录组定义的潜在贡献。我们估计ORESTES涵盖了超过80%的高表达和中等表达的人类基因，以及40%至50%的低表达人类基因。在我们测序最全面的组织——乳腺中，所生成的130,000条ORESTES来自该组织中估计70%的所有表达基因的转录本，高表达和低表达基因都得到了同样有效的体现。在这方面，我们发现ORESTES策略在基因发现和随机转录本序列生成方面的能力显著超过传统的EST。ORESTES的分布情况使得许多人类转录本现在都由沿着每个基因产物长度分布的部分序列支架所代表。通过逆转录PCR对支架组件进行实验性连接，是转录本完成的直接途径，这可能是全长cDNA克隆的一种有用替代方法。

相似文献

The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome.

Proc Natl Acad Sci U S A. 2001 Oct 9;98(21):12103-8. doi: 10.1073/pnas.201182798.

The use of Open Reading frame ESTs (ORESTES) for analysis of the honey bee transcriptome.

BMC Genomics. 2004 Nov 3;5:84. doi: 10.1186/1471-2164-5-84.

Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags.

Proc Natl Acad Sci U S A. 2000 Nov 7;97(23):12690-3. doi: 10.1073/pnas.97.23.12690.

Characterization of open reading frame-expressed sequence tags generated from Bos indicus and B. taurus mammary gland cDNA libraries.

Anim Genet. 2004 Jun;35(3):213-9. doi: 10.1111/j.1365-2052.2004.01139.x.

Mining ORESTES no-match database: can we still contribute to cancer transcriptome?

Genet Mol Res. 2006 Mar 31;5(1):24-32.

Shotgun sequencing of the human transcriptome with ORF expressed sequence tags.

Proc Natl Acad Sci U S A. 2000 Mar 28;97(7):3491-6. doi: 10.1073/pnas.97.7.3491.

Identification of unannotated exons of low abundance transcripts in Drosophila melanogaster and cloning of a new serine protease gene upregulated upon injury.

BMC Genomics. 2007 Jul 24;8:249. doi: 10.1186/1471-2164-8-249.

Identification of genes encoding hypothetical proteins in open-reading frame expressed sequence tags from mammalian stages of Trypanosoma cruzi.

Genet Mol Res. 2011;10(3):1589-630. doi: 10.4238/vol10-3gmr1140.

Biomphalaria glabrata transcriptome: identification of cell-signalling, transcriptional control and immune-related genes from open reading frame expressed sequence tags (ORESTES).

Dev Comp Immunol. 2007;31(8):763-82. doi: 10.1016/j.dci.2006.11.004. Epub 2006 Dec 14.

A transcript finishing initiative for closing gaps in the human transcriptome.

Genome Res. 2004 Jul;14(7):1413-23. doi: 10.1101/gr.2111304. Epub 2004 Jun 14.

引用本文的文献

In vitro and in silico validation of CA3 and FHL1 downregulation in oral cancer.

BMC Cancer. 2018 Feb 17;18(1):193. doi: 10.1186/s12885-018-4077-3.

Expression of human protein S100A7 (psoriasin), preparation of antibody and application to human larynx squamous cell carcinoma.

BMC Res Notes. 2011 Nov 14;4:494. doi: 10.1186/1756-0500-4-494.

Long noncoding intronic RNAs are differentially expressed in primary and metastatic pancreatic cancer.

Mol Cancer. 2011 Nov 13;10:141. doi: 10.1186/1476-4598-10-141.

Temporal blastemal cell gene expression analysis in the kidney reveals new Wnt and related signaling pathway genes to be essential for Wilms' tumor onset.

Cell Death Dis. 2011 Nov 3;2(11):e224. doi: 10.1038/cddis.2011.105.

Gene network analyses point to the importance of human tissue kallikreins in melanoma progression.

BMC Med Genomics. 2011 Oct 27;4:76. doi: 10.1186/1755-8794-4-76.

Functional microarray analysis suggests repressed cell-cell signaling and cell survival-related modules inhibit progression of head and neck squamous cell carcinoma.

BMC Med Genomics. 2011 Apr 13;4:33. doi: 10.1186/1755-8794-4-33.

Transcriptome analysis of Taenia solium cysticerci using Open Reading Frame ESTs (ORESTES).

Parasit Vectors. 2009 Jul 31;2(1):35. doi: 10.1186/1756-3305-2-35.

A global meta-analysis of microarray expression data to predict unknown gene functions and estimate the literature-data divide.

Bioinformatics. 2009 Jul 1;25(13):1694-701. doi: 10.1093/bioinformatics/btp290. Epub 2009 May 15.

No-match ORESTES explored as tumor markers.

Nucleic Acids Res. 2009 May;37(8):2607-17. doi: 10.1093/nar/gkp074. Epub 2009 Mar 6.

Transcriptome-guided characterization of genomic rearrangements in a breast cancer cell line.

Proc Natl Acad Sci U S A. 2009 Feb 10;106(6):1886-91. doi: 10.1073/pnas.0812945106. Epub 2009 Jan 30.

本文引用的文献

Initial sequencing and analysis of the human genome.

Nature. 2001 Feb 15;409(6822):860-921. doi: 10.1038/35057062.

Functional annotation of a full-length mouse cDNA collection.

Nature. 2001 Feb 8;409(6821):685-90. doi: 10.1038/35055500.

The sequence of the human genome.

Science. 2001 Feb 16;291(5507):1304-51. doi: 10.1126/science.1058040.

Protein diversity from alternative splicing: a challenge for bioinformatics and post-genome biology.

Cell. 2000 Oct 27;103(3):367-70. doi: 10.1016/s0092-8674(00)00128-8.

Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags.

Proc Natl Acad Sci U S A. 2000 Nov 7;97(23):12690-3. doi: 10.1073/pnas.97.23.12690.

An assessment of gene prediction accuracy in large DNA sequences.

Genome Res. 2000 Oct;10(10):1631-42. doi: 10.1101/gr.122800.

Human and mouse gene structure: comparative analysis and application to exon prediction.

Genome Res. 2000 Jul;10(7):950-8. doi: 10.1101/gr.10.7.950.

Estimate of human gene number provided by genome-wide analysis using Tetraodon nigroviridis DNA sequence.

Nat Genet. 2000 Jun;25(2):235-8. doi: 10.1038/76118.

Shotgun sequencing of the human transcriptome with ORF expressed sequence tags.

Proc Natl Acad Sci U S A. 2000 Mar 28;97(7):3491-6. doi: 10.1073/pnas.97.7.3491.

The human adult skeletal muscle transcriptional profile reconstructed by a novel computational approach.

Genome Res. 2000 Mar;10(3):344-9. doi: 10.1101/gr.10.3.344.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

70万个开放阅读框序列标签对人类转录组定义的贡献。

The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome.

作者信息

机构信息

Ludwig Institute for Cancer Research, 01509-010, São Paulo, Brazil.

出版信息

Proc Natl Acad Sci U S A. 2001 Oct 9;98(21):12103-8. doi: 10.1073/pnas.201182798.

DOI:10.1073/pnas.201182798

PMID:11593022

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC59775/

Abstract

摘要

70万个开放阅读框序列标签对人类转录组定义的贡献。

The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

70万个开放阅读框序列标签对人类转录组定义的贡献。

The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome.

作者信息

机构信息

出版信息