Brown M P
HNC, San Diego, CA 92121-3278, USA.
Proc Int Conf Intell Syst Mol Biol. 2000;8:57-66.
We introduce a model based on stochastic context-free grammars (SCFGs) that can construct small subunit ribosomal RNA (SSU rRNA) multiple alignments. The method takes into account both primary sequence and secondary structure basepairing interactions. We show that this method produces multiple alignments of quality close to hand edited ones and outperforms several other methods. We also introduce a method of SCFG constraints that dramatically reduces the required computer resources needed to effectively use SCFGs on large problems such as SSU rRNA. Without such constraints, the required computer resources are infeasible for most computers. This work has applications to fields such as phylogenetic tree construction.
我们介绍了一种基于随机上下文无关文法(SCFG)的模型,该模型可以构建小亚基核糖体RNA(SSU rRNA)的多序列比对。该方法同时考虑了一级序列和二级结构碱基配对相互作用。我们表明,该方法产生的多序列比对质量与人工编辑的相近,且优于其他几种方法。我们还引入了一种SCFG约束方法,该方法大大减少了在诸如SSU rRNA等大问题上有效使用SCFG所需的计算机资源。没有这种约束,对于大多数计算机来说所需的计算机资源是不可行的。这项工作在系统发育树构建等领域有应用。