Boguski M S, Freeman M, Elshourbagy N A, Taylor J M, Gordon J I
J Lipid Res. 1986 Oct;27(10):1011-34.
During the past several years, the use of computer programs in the analysis of protein and DNA sequences has become commonplace. In all but the simplest procedures, the ability to critically review the results obtained with computer methods requires a basic knowledge of the algorithms employed (and the assumptions upon which they are based), an awareness of the capabilities and limitations of the particular program that implements an algorithm, and some familiarity with probability and statistics. We describe a number of computer methods that have been applied to the analysis of apolipoprotein sequences. We discuss the suitability of these methods for particular problems, how the choice of initial "parameters" can affect the results, and what the results can tell us about protein or gene sequences. We also identify some outstanding problems of apolipoprotein sequence analysis where further work is needed.
在过去几年中,计算机程序在蛋白质和DNA序列分析中的应用已变得十分普遍。除了最简单的程序外,要严谨地审视通过计算机方法获得的结果,需要具备所采用算法的基本知识(以及算法所基于的假设),了解实现算法的特定程序的功能和局限性,并且对概率和统计学有一定的熟悉程度。我们描述了一些已应用于载脂蛋白序列分析的计算机方法。我们讨论这些方法对特定问题的适用性、初始“参数”的选择如何影响结果,以及这些结果能让我们了解蛋白质或基因序列的哪些信息。我们还指出了载脂蛋白序列分析中一些尚待解决的突出问题,这些问题需要进一步开展研究。