Taha Kamal, Elmasri Ramez
Khalifa University of Science, Technology & Research, Abu Dhabi, UAE.
J Integr Bioinform. 2011 Aug 31;8(2):165. doi: 10.2390/biecoll-jib-2011-165.
Bioinformatics applications manage complex biological data stored into distributed and often heterogeneous databases and require large computing power. These databases are too big and complicated to be rapidly queried every time a user submits a query, due to the overhead involved in decomposing the queries, sending the decomposed queries to remote databases, and composing the results. There is also considerable communication costs involved. This study addresses the mentioned problems in Grid-based environment for bioinformatics. We propose a Grid middleware called GMB that alleviates these problems by caching the results of Frequently Used Queries (FUQ). Queries are classified based on their types and frequencies. FUQ are answered from the middleware, which improves their response time. GMB acts as a gateway to TeraGrid Grid: it resides between users’ applications and TeraGrid Grid. We evaluate GMB experimentally.
生物信息学应用程序管理存储在分布式且通常异构的数据库中的复杂生物数据,并且需要强大的计算能力。这些数据库过于庞大和复杂,以至于每次用户提交查询时都无法快速进行查询,这是因为在分解查询、将分解后的查询发送到远程数据库以及组合结果时会产生开销。此外还涉及相当大的通信成本。本研究解决了基于网格的生物信息学环境中的上述问题。我们提出了一种名为GMB的网格中间件,它通过缓存常用查询(FUQ)的结果来缓解这些问题。查询根据其类型和频率进行分类。FUQ由中间件进行回答,这提高了它们的响应时间。GMB充当通向TeraGrid网格的网关:它位于用户应用程序和TeraGrid网格之间。我们通过实验对GMB进行了评估。