用于小鼠全长cDNA文库的基于计算机的方法：构建非冗余cDNA文库的实时序列聚类

Computer-based methods for the mouse full-length cDNA encyclopedia: real-time sequence clustering for construction of a nonredundant cDNA library.

作者信息

Konno H, Fukunishi Y, Shibata K, Itoh M, Carninci P, Sugahara Y, Hayashizaki Y

机构信息

Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center, Yokohama 230-0045, Japan.

出版信息

Genome Res. 2001 Feb;11(2):281-9. doi: 10.1101/gr.gr-1457r.

DOI:10.1101/gr.gr-1457r

PMID:11157791

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC311029/

Abstract

We developed computer-based methods for constructing a nonredundant mouse full-length cDNA library. Our cDNA library construction process comprises assessment of library quality, sequencing the 3' ends of inserts and clustering, and completing a re-array to generate a nonredundant library from a redundant one. After the cDNA libraries are generated, we sequence the 5' ends of the inserts to check the quality of the library; then we determine the sequencing priority of each library. Selected libraries undergo large-scale sequencing of the 3' ends of the inserts and clustering of the tag sequences. After clustering, the nonredundant library is constructed from the original libraries, which have redundant clones. All libraries, plates, clones, sequences, and clusters are uniquely identified, and all information is saved in the database according to this identifier. At press time, our system has been in place for the past two years; we have clustered 939,725 3' end sequences into 127,385 groups from 227 cDNA libraries/sublibraries (see http://genome.gse.riken.go.jp/).

摘要

我们开发了基于计算机的方法来构建非冗余小鼠全长cDNA文库。我们的cDNA文库构建过程包括评估文库质量、对插入片段的3'末端进行测序和聚类，以及完成重新排列以从冗余文库生成非冗余文库。在生成cDNA文库后，我们对插入片段的5'末端进行测序以检查文库质量；然后我们确定每个文库的测序优先级。选定的文库进行插入片段3'末端的大规模测序和标签序列的聚类。聚类后，从具有冗余克隆的原始文库构建非冗余文库。所有文库、平板、克隆、序列和聚类都有唯一标识，并且所有信息都根据此标识符保存在数据库中。截至发稿时，我们的系统已经运行了两年；我们已将来自227个cDNA文库/子文库的939,725个3'末端序列聚类为127,385组（见http://genome.gse.riken.go.jp/）。

相似文献

Computer-based methods for the mouse full-length cDNA encyclopedia: real-time sequence clustering for construction of a nonredundant cDNA library.

Genome Res. 2001 Feb;11(2):281-9. doi: 10.1101/gr.gr-1457r.

A computer-based method of selecting clones for a full-length cDNA project: simultaneous collection of negligibly redundant and variant cDNAs.

Genome Res. 2002 Jul;12(7):1127-34. doi: 10.1101/gr.75202.

Identification of unique transcripts from a mouse full-length, subtracted inner ear cDNA library.

Genomics. 2004 Jun;83(6):1012-23. doi: 10.1016/j.ygeno.2004.01.006.

Construction and characterization of a full length-enriched and a 5'-end-enriched cDNA library.

Gene. 1997 Oct 24;200(1-2):149-56. doi: 10.1016/s0378-1119(97)00411-3.

Full-length-enriched cDNA libraries from Echinococcus granulosus contain separate populations of oligo-capped and trans-spliced transcripts and a high level of predicted signal peptide sequences.

Mol Biochem Parasitol. 2002 Jul;122(2):171-80. doi: 10.1016/s0166-6851(02)00098-1.

RIKEN mouse genome encyclopedia.

Mech Ageing Dev. 2003 Jan;124(1):93-102. doi: 10.1016/s0047-6374(02)00173-2.

Characterization of cDNA clones selected by the GeneMark analysis from size-fractionated cDNA libraries from human brain.

DNA Res. 1999 Oct 29;6(5):329-36. doi: 10.1093/dnares/6.5.329.

Targeting a complex transcriptome: the construction of the mouse full-length cDNA encyclopedia.

Genome Res. 2003 Jun;13(6B):1273-89. doi: 10.1101/gr.1119703.

FANTOM DB: database of Functional Annotation of RIKEN Mouse cDNA Clones.

Nucleic Acids Res. 2002 Jan 1;30(1):116-8. doi: 10.1093/nar/30.1.116.

Mouse BAC ends quality assessment and sequence analyses.

Genome Res. 2001 Oct;11(10):1736-45. doi: 10.1101/gr.179201.

引用本文的文献

The juxtaparanodal proteins CNTNAP2 and TAG1 regulate diet-induced obesity.

Mamm Genome. 2012 Aug;23(7-8):431-42. doi: 10.1007/s00335-012-9400-8. Epub 2012 Jul 1.

Construction and characterization of a goat mammary gland cDNA library.

Mol Biotechnol. 2008 Mar;38(3):187-93. doi: 10.1007/s12033-007-9020-9. Epub 2007 Nov 30.

Functional annotation of 19,841 Populus nigra full-length enriched cDNA clones.

BMC Genomics. 2007 Dec 3;8:448. doi: 10.1186/1471-2164-8-448.

A collection of 10,096 indica rice full-length cDNAs reveals highly expressed sequence divergence between Oryza sativa indica and japonica subspecies.

Plant Mol Biol. 2007 Nov;65(4):403-15. doi: 10.1007/s11103-007-9174-7. Epub 2007 May 24.

CAFTAN: a tool for fast mapping, and quality assessment of cDNAs.

BMC Bioinformatics. 2006 Oct 25;7:473. doi: 10.1186/1471-2105-7-473.

EGassembler: online bioinformatics service for large-scale processing, clustering and assembling ESTs and genomic DNA fragments.

Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W459-62. doi: 10.1093/nar/gkl066.

Cytoskeletal rearrangements in synovial fibroblasts as a novel pathophysiological determinant of modeled rheumatoid arthritis.

PLoS Genet. 2005 Oct;1(4):e48. doi: 10.1371/journal.pgen.0010048. Epub 2005 Oct 28.

Mistaken identifiers: gene name errors can be introduced inadvertently when using Excel in bioinformatics.

BMC Bioinformatics. 2004 Jun 23;5:80. doi: 10.1186/1471-2105-5-80.

Annotation and analysis of 10,000 expressed sequence tags from developing mouse eye and adult retina.

Genome Biol. 2003;4(10):R65. doi: 10.1186/gb-2003-4-10-r65. Epub 2003 Sep 22.

Mapping of 19032 mouse cDNAs on mouse chromosomes.

J Struct Funct Genomics. 2002;2(1):23-8. doi: 10.1023/a:1013203019444.

本文引用的文献

Comparative evaluation of 5'-end-sequence quality of clones in CAP trapper and other full-length-cDNA libraries.

Gene. 2001 Jan 24;263(1-2):93-102. doi: 10.1016/s0378-1119(00)00557-6.

Normalization and subtraction of cap-trapper-selected cDNAs to prepare full-length cDNA libraries for rapid discovery of new genes.

Genome Res. 2000 Oct;10(10):1617-30. doi: 10.1101/gr.145100.

Frequent alternative splicing of human genes.

Genome Res. 1999 Dec;9(12):1288-93. doi: 10.1101/gr.9.12.1288.

A comprehensive approach to clustering of expressed human gene sequence: the sequence tag alignment and consensus knowledge base.

Genome Res. 1999 Nov;9(11):1143-55. doi: 10.1101/gr.9.11.1143.

d2_cluster: a validated method for clustering EST and full-length cDNAsequences.

Genome Res. 1999 Nov;9(11):1135-42. doi: 10.1101/gr.9.11.1135.

CAP3: A DNA sequence assembly program.

Genome Res. 1999 Sep;9(9):868-77. doi: 10.1101/gr.9.9.868.

High-efficiency full-length cDNA cloning.

Methods Enzymol. 1999;303:19-44. doi: 10.1016/s0076-6879(99)03004-9.

Automated filtration-based high-throughput plasmid preparation system.

Genome Res. 1999 May;9(5):463-70.

Increased specificity of reverse transcription priming by trehalose and oligo-blockers allows high-efficiency window separation of mRNA display.

Nucleic Acids Res. 1999 Mar 1;27(5):1345-9. doi: 10.1093/nar/27.5.1345.

High-efficiency cloning of Arabidopsis full-length cDNA by biotinylated CAP trapper.

Plant J. 1998 Sep;15(5):707-20. doi: 10.1046/j.1365-313x.1998.00237.x.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于小鼠全长cDNA文库的基于计算机的方法：构建非冗余cDNA文库的实时序列聚类

Computer-based methods for the mouse full-length cDNA encyclopedia: real-time sequence clustering for construction of a nonredundant cDNA library.

作者信息

Konno H, Fukunishi Y, Shibata K, Itoh M, Carninci P, Sugahara Y, Hayashizaki Y

机构信息

Laboratory for Genome Exploration Research Group, RIKEN Genomic Sciences Center, Yokohama 230-0045, Japan.

出版信息

Genome Res. 2001 Feb;11(2):281-9. doi: 10.1101/gr.gr-1457r.

DOI:10.1101/gr.gr-1457r

PMID:11157791

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC311029/

Abstract

摘要

用于小鼠全长cDNA文库的基于计算机的方法：构建非冗余cDNA文库的实时序列聚类

Computer-based methods for the mouse full-length cDNA encyclopedia: real-time sequence clustering for construction of a nonredundant cDNA library.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

用于小鼠全长cDNA文库的基于计算机的方法：构建非冗余cDNA文库的实时序列聚类

Computer-based methods for the mouse full-length cDNA encyclopedia: real-time sequence clustering for construction of a nonredundant cDNA library.

作者信息

机构信息

出版信息