Fristensky B
Department of Plant Science, University of Manitoba, Winnipeg, Canada.
Nucleic Acids Res. 1993 Dec 25;21(25):5997-6003. doi: 10.1093/nar/21.25.5997.
Annotation of features, such as introns, exons and protein coding regions in GenBank/EMBL/DDBJ entries is now standardized through use of the Features Table (FT) language. The essence of the FT language is described by the relation 'expression-->sequence', meaning that each FT expression evaluates to a sequence. For example, the expression M74750:1..50 evaluates to the first 50 bases of the sequence with accession number M74750. Because FT is intrinsic to the database definition, it can serve as a software- and platform-independent lingua franca for sequence manipulation. The XYLEM package makes it possible to create and manipulate sequence datasets using FT expressions. FEATURES is a program that resolves FT expressions into their corresponding sequences. Annotated features can be retrieved either by feature key or by expression. Even unannotated portions of a sequence can be retrieved by user-generated FT expressions. Applications of the FT language include retrieval of subsequences from large sequence entries, generation of chromosome models or artificial DNA constructs, and representation of restriction maps or mutants.
GenBank/EMBL/DDBJ条目中的特征注释,如内含子、外显子和蛋白质编码区域,现在通过使用特征表(FT)语言实现了标准化。FT语言的核心由“表达式-->序列”关系描述,这意味着每个FT表达式都对应一个序列。例如,表达式M74750:1..50对应于登录号为M74750的序列的前50个碱基。由于FT是数据库定义所固有的,它可以作为一种独立于软件和平台的序列操作通用语言。XYLEM软件包使得使用FT表达式创建和操作序列数据集成为可能。FEATURES是一个将FT表达式解析为相应序列的程序。带注释的特征可以通过特征键或表达式进行检索。甚至序列的未注释部分也可以通过用户生成的FT表达式进行检索。FT语言的应用包括从大型序列条目中检索子序列、生成染色体模型或人工DNA构建体,以及表示限制图谱或突变体。