Billoud B, Kontic M, Viari A
Atelier de Bio-informatique URA CNRS 448, Institut Curie, Paris, France.
Nucleic Acids Res. 1996 Apr 15;24(8):1395-403. doi: 10.1093/nar/24.8.1395.
At the DNA/RNA level, biological signals are defined by a combination of spatial structures and sequence motifs. Until now, few attempts had been made in writing general purpose search programs that take into account both sequence and structure criteria. Indeed, the most successful structure scanning programs are usually dedicated to particular structures and are written using general purpose programming languages through a complex and time consuming process where the biological problem of defining the structure and the computer engineering problem of looking for it are intimately intertwined. In this paper, we describe a general representation of structures, suitable for database scanning, together with a programming language, Palingol, designed to manipulate it. Palingol has specific data types, corresponding to structural elements-basically helices-that can be arranged in any way to form a complex structure. As a consequence of the declarative approach used in Palingol, the user should only focus on 'what to search for' while the language engine takes care of 'how to look for it'. Therefore, it becomes simpler to write a scanning program and the structural constraints that define the required structure are more clearly identified.
在DNA/RNA层面,生物信号由空间结构和序列基序的组合来定义。到目前为止,很少有人尝试编写兼顾序列和结构标准的通用搜索程序。实际上,最成功的结构扫描程序通常是针对特定结构的,并且是通过复杂且耗时的过程,使用通用编程语言编写的,在这个过程中,定义结构的生物学问题和寻找结构的计算机工程问题紧密交织在一起。在本文中,我们描述了一种适用于数据库扫描的结构通用表示法,以及一种为操作该表示法而设计的编程语言Palingol。Palingol具有特定的数据类型,对应于结构元素——基本上是螺旋——这些结构元素可以以任何方式排列以形成复杂结构。由于Palingol中使用的声明式方法,用户只需专注于“搜索什么”,而语言引擎会处理“如何搜索”。因此,编写扫描程序变得更简单,并且定义所需结构的结构约束也能更清晰地确定。