Structural analysis of Arabidopsis thaliana chromosome 5. VI. Sequence features of the regions of 1,367,185 bp covered by 19 physically assigned P1 and TAC clones.

Author information

Kotani H, Nakamura Y, Sato S, Asamizu E, Kaneko T, Miyajima N, Tabata S

Affiliation

Kazusa DNA Research Institute, Kisarazu, Chiba, Japan.

Publication information

DNA Res. 1998 Jun 30;5(3):203-16. doi: 10.1093/dnares/5.3.203.

Abstract

Nineteen P1 and TAC clones, which have been mapped on the fine physical map of Arabidopsis thaliana chromosome 5, were sequenced using a shotgun-based strategy, and their structural features were analysed. The total length of the regions sequenced in this study was 1,367,185 bp. Combined with the regions covered by the 90 previously reported P1 and TAC clones, this brings the total length of chromosome 5 sequenced to date to 8,058,855 bp. On the basis of similarity searches against protein and EST databases and gene modeling with computer programs, a total of 330 potential protein-coding regions were identified, giving an average gene density of approximately one gene per 4.1 kb. Introns were identified in 81.0% of the potential protein-coding genes for which the entire gene structure was predicted, with an average of 4.2 introns per gene and an average intron length of 180 bp. The RNA-coding genes identified were 9 tRNA genes corresponding to 8 amino acid species and 2 genes for U2 nuclear RNA. These sequence features are essentially identical to those of the previously reported sequences. The sequence data and gene information are available on the World Wide Web database KAOS (Kazusa Arabidopsis data Opening Site) at http://www.kazusa.or.jp/arabi/.
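
The gene density quoted in the abstract can be checked directly from the reported figures. The short Python sketch below is illustrative only and is not part of the original work: the function and constant names are hypothetical, and the length attributed to the 90 earlier clones is inferred by subtraction, which assumes the cumulative total simply adds the new regions to the earlier ones.

```python
# Figures quoted in the abstract.
REGION_BP = 1_367_185       # length sequenced in this study (19 clones)
CUMULATIVE_BP = 8_058_855   # chromosome 5 length sequenced to date
PROTEIN_GENES = 330         # potential protein-coding regions identified


def gene_density_kb(region_bp: int, gene_count: int) -> float:
    """Average length of sequence per gene, in kilobases (1 kb = 1000 bp)."""
    return region_bp / gene_count / 1000


def previously_sequenced_bp(cumulative_bp: int, new_bp: int) -> int:
    """Length covered by earlier reports, ASSUMING the total is a plain sum."""
    return cumulative_bp - new_bp


density = gene_density_kb(REGION_BP, PROTEIN_GENES)
earlier = previously_sequenced_bp(CUMULATIVE_BP, REGION_BP)

print(f"Average density: one gene per {density:.1f} kb")
print(f"Inferred length from the 90 earlier clones: {earlier:,} bp")
```

Dividing 1,367,185 bp by 330 genes gives roughly 4.1 kb per gene, which matches the density stated in the abstract.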
