Jones Craig E, Baumann Ute, Brown Alfred L
Australian Centre for Plant Functional Genomics, University of Adelaide, South Australia, 5064, Australia.
BMC Bioinformatics. 2005 Nov 15;6:272. doi: 10.1186/1471-2105-6-272.
With the exponential increase in genomic sequence data there is a need to develop automated approaches to deducing the biological functions of novel sequences with high accuracy. Our aim is to demonstrate how accuracy benchmarking can be used in a decision-making process evaluating competing designs of biological function predictors. We utilise the Gene Ontology, GO, a directed acyclic graph of functional terms, to annotate sequences with functional information describing their biological context. Initially we examine the effect on accuracy scores of increasing the allowed distance between predicted and a test set of curator assigned terms. Next we evaluate several annotator methods using accuracy benchmarking. Given an unannotated sequence we use the Basic Local Alignment Search Tool, BLAST, to find similar sequences that have already been assigned GO terms by curators. A number of methods were developed that utilise terms associated with the best five matching sequences. These methods were compared against a benchmark method of simply using terms associated with the best BLAST-matched sequence (best BLAST approach).
The precision and recall of estimates increases rapidly as the amount of distance permitted between a predicted term and a correct term assignment increases. Accuracy benchmarking allows a comparison of annotation methods. A covering graph approach performs poorly, except where the term assignment rate is high. A term distance concordance approach has a similar accuracy to the best BLAST approach, demonstrating lower precision but higher recall. However, a discriminant function method has higher precision and recall than the best BLAST approach and other methods shown here.
Allowing term predictions to be counted correct if closely related to a correct term decreases the reliability of the accuracy score. As such we recommend using accuracy measures that require exact matching of predicted terms with curator assigned terms. Furthermore, we conclude that competing designs of BLAST-based GO term annotators can be effectively compared using an accuracy benchmarking approach. The most accurate annotation method was developed using data mining techniques. As such we recommend that designers of term annotators utilise accuracy benchmarking and data mining to ensure newly developed annotators are of high quality.
随着基因组序列数据呈指数级增长,有必要开发能够高精度自动推导新序列生物学功能的方法。我们的目的是展示如何在评估生物功能预测器的竞争设计的决策过程中使用准确性基准测试。我们利用基因本体(GO),即一个功能术语的有向无环图,用描述其生物学背景的功能信息来注释序列。首先,我们研究增加预测术语与策展人指定术语的测试集之间允许的距离对准确性得分的影响。接下来,我们使用准确性基准测试评估几种注释方法。给定一个未注释的序列,我们使用基本局部比对搜索工具(BLAST)来找到已经被策展人分配了GO术语的相似序列。开发了许多方法,这些方法利用与最佳五个匹配序列相关的术语。将这些方法与简单使用与最佳BLAST匹配序列相关的术语的基准方法(最佳BLAST方法)进行比较。
随着预测术语与正确术语分配之间允许的距离增加,估计的精确率和召回率迅速提高。准确性基准测试允许对注释方法进行比较。覆盖图方法表现不佳,除非术语分配率很高。术语距离一致性方法的准确性与最佳BLAST方法相似,显示出较低的精确率但较高的召回率。然而,判别函数方法的精确率和召回率高于最佳BLAST方法和此处显示的其他方法。
如果预测术语与正确术语密切相关就将其计为正确,会降低准确性得分的可靠性。因此,我们建议使用要求预测术语与策展人分配的术语精确匹配的准确性度量。此外,我们得出结论,使用准确性基准测试方法可以有效地比较基于BLAST的GO术语注释器的竞争设计。最准确的注释方法是使用数据挖掘技术开发的。因此,我们建议术语注释器的设计者利用准确性基准测试和数据挖掘来确保新开发的注释器具有高质量。