Abdul-Wahid Badi', Yu Li, Rajan Dinesh, Feng Haoyun, Darve Eric, Thain Douglas, Izaguirre Jesús A
University of Notre Dame, Notre Dame, IN 46656 ; Department of Computer Science & Engineering ; Interdisciplinary Center for Network Science and Applications.
University of Notre Dame, Notre Dame, IN 46656 ; Department of Computer Science & Engineering.
Proc IEEE Int Conf Escience. 2012 Oct;2012:1-8. doi: 10.1109/eScience.2012.6404429.
Molecular modeling is a field that traditionally has large computational costs. Until recently, most simulation techniques relied on long trajectories, which inherently have poor scalability. A new class of methods is proposed that requires only a large number of short calculations, and for which minimal communication between computer nodes is required. We considered one of the more accurate variants called Accelerated Weighted Ensemble Dynamics (AWE) and for which distributed computing can be made efficient. We implemented AWE using the Work Queue framework for task management and applied it to an all atom protein model (Fip35 WW domain). We can run with excellent scalability by simultaneously utilizing heterogeneous resources from multiple computing platforms such as clouds (Amazon EC2, Microsoft Azure), dedicated clusters, grids, on multiple architectures (CPU/GPU, 32/64bit), and in a dynamic environment in which processes are regularly added or removed from the pool. This has allowed us to achieve an aggregate sampling rate of over 500 ns/hour. As a comparison, a single process typically achieves 0.1 ns/hour.
分子建模是一个传统上计算成本很高的领域。直到最近,大多数模拟技术都依赖于长轨迹,而长轨迹本质上扩展性较差。人们提出了一类新方法,这类方法只需要大量的短计算,并且计算机节点之间的通信需求最小。我们考虑了一种更精确的变体,称为加速加权系综动力学(AWE),并且分布式计算可以高效进行。我们使用工作队列框架来管理任务实现了AWE,并将其应用于一个全原子蛋白质模型(Fip35 WW结构域)。我们可以通过同时利用来自多个计算平台(如云(亚马逊EC2、微软Azure)、专用集群、网格)的异构资源,在多种架构(CPU/GPU、32/64位)上,以及在一个动态环境中(在该环境中进程会定期从池中添加或移除),以出色的扩展性运行。这使我们能够实现超过500纳秒/小时的总采样率。作为对比,单个进程通常实现0.1纳秒/小时。