Ribeiro Edward de O, Zerlotini Gustavo G, Lopes Irving R M, Ribeiro Victor B R, Melo Alba C M, Walter Maria Emilia M T, Costa Marcos Mota
Departamento de Ciência da Computação, Universidade de Brasília, Brasília, DF, Brazil.
Genet Mol Res. 2005 Sep 30;4(3):590-8.
InterPro is widely used for protein annotation in genome sequencing projects, but it demands a large amount of computation and is a very time-consuming step. We present a strategy for executing the programs that use the InterPro databases Pfam, PROSITE and ProDom in a distributed environment, using a Java-based messaging system. We developed a two-layer scheduling architecture for the distributed infrastructure, then ran experiments and analyzed the results. Our distributed system gave much better results than InterPro's Pfam, PROSITE and ProDom analyses running on a centralized platform. This approach seems appropriate and promising for computationally demanding tools used in biological applications.
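The abstract does not say what the two scheduling layers are, so the following is only a minimal sketch of one plausible reading, not the authors' Java implementation. It assumes a top layer that places batches of sequences on a shared message queue, and a bottom layer in which each node fans its batch out to the three analyses. Every name here (`annotate`, `node`, `run_analysis`, `n_nodes`, `batch_size`) is hypothetical, threads stand in for separate machines, and `run_analysis` is a placeholder rather than a real InterPro call.

```python
import queue
import threading
from concurrent.futures import ThreadPoolExecutor

# The three InterPro member databases named in the abstract.
ANALYSES = ("Pfam", "PROSITE", "ProDom")


def run_analysis(analysis, sequence_id):
    """Placeholder for a real search; returns a label instead of actual hits."""
    return f"{analysis} hit list for {sequence_id}"


def node(task_queue, results, lock):
    """Bottom layer (assumed): one node pulls batches and runs all analyses on them."""
    with ThreadPoolExecutor(max_workers=len(ANALYSES)) as pool:
        while True:
            batch = task_queue.get()
            if batch is None:  # sentinel message: no more work for this node
                break
            jobs = [
                (seq, analysis, pool.submit(run_analysis, analysis, seq))
                for seq in batch
                for analysis in ANALYSES
            ]
            for seq, analysis, future in jobs:
                value = future.result()
                with lock:  # results dict is shared between nodes
                    results[(seq, analysis)] = value


def annotate(sequence_ids, n_nodes=3, batch_size=2):
    """Top layer (assumed): split the input into batches and queue them for the nodes."""
    task_queue = queue.Queue()
    results, lock = {}, threading.Lock()
    nodes = [
        threading.Thread(target=node, args=(task_queue, results, lock))
        for _ in range(n_nodes)
    ]
    for thread in nodes:
        thread.start()
    for start in range(0, len(sequence_ids), batch_size):
        task_queue.put(sequence_ids[start:start + batch_size])
    for _ in nodes:
        task_queue.put(None)  # one sentinel per node
    for thread in nodes:
        thread.join()
    return results
```

Calling `annotate(["p1", "p2", "p3"])` returns a dictionary keyed by `(sequence_id, analysis)` pairs. Because nodes pull work from the queue as they become free, a slow batch on one node does not hold up the others, which is the general motivation for queue-based distribution of this kind.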