Suppr超能文献

特征表达式:创建和处理序列数据集。

Feature expressions: creating and manipulating sequence datasets.

作者信息

Fristensky B

机构信息

Department of Plant Science, University of Manitoba, Winnipeg, Canada.

出版信息

Nucleic Acids Res. 1993 Dec 25;21(25):5997-6003. doi: 10.1093/nar/21.25.5997.

Abstract

Annotation of features, such as introns, exons and protein coding regions in GenBank/EMBL/DDBJ entries is now standardized through use of the Features Table (FT) language. The essence of the FT language is described by the relation 'expression-->sequence', meaning that each FT expression evaluates to a sequence. For example, the expression M74750:1..50 evaluates to the first 50 bases of the sequence with accession number M74750. Because FT is intrinsic to the database definition, it can serve as a software- and platform-independent lingua franca for sequence manipulation. The XYLEM package makes it possible to create and manipulate sequence datasets using FT expressions. FEATURES is a program that resolves FT expressions into their corresponding sequences. Annotated features can be retrieved either by feature key or by expression. Even unannotated portions of a sequence can be retrieved by user-generated FT expressions. Applications of the FT language include retrieval of subsequences from large sequence entries, generation of chromosome models or artificial DNA constructs, and representation of restriction maps or mutants.

摘要

GenBank/EMBL/DDBJ条目中的特征注释,如内含子、外显子和蛋白质编码区域,现在通过使用特征表(FT)语言实现了标准化。FT语言的核心由“表达式-->序列”关系描述,这意味着每个FT表达式都对应一个序列。例如,表达式M74750:1..50对应于登录号为M74750的序列的前50个碱基。由于FT是数据库定义所固有的,它可以作为一种独立于软件和平台的序列操作通用语言。XYLEM软件包使得使用FT表达式创建和操作序列数据集成为可能。FEATURES是一个将FT表达式解析为相应序列的程序。带注释的特征可以通过特征键或表达式进行检索。甚至序列的未注释部分也可以通过用户生成的FT表达式进行检索。FT语言的应用包括从大型序列条目中检索子序列、生成染色体模型或人工DNA构建体,以及表示限制图谱或突变体。

相似文献

1

本文引用的文献

3
Characterization of translational initiation sites in E. coli.大肠杆菌中转录起始位点的表征
Nucleic Acids Res. 1982 May 11;10(9):2971-96. doi: 10.1093/nar/10.9.2971.
5
The protein identification resource (PIR).蛋白质鉴定资源(PIR)。
Nucleic Acids Res. 1988 Mar 11;16(5):1869-71. doi: 10.1093/nar/16.5.1869.
6
The EMBL data library.欧洲分子生物学实验室数据库。
Nucleic Acids Res. 1988 Mar 11;16(5):1865-7. doi: 10.1093/nar/16.5.1865.
10
GenBank.基因银行
Nucleic Acids Res. 1991 Apr 25;19 Suppl(Suppl):2221-5. doi: 10.1093/nar/19.suppl.2221.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验