Computational Science and Engineering Laboratory, ETH Zürich, CH-8092 Zürich, Switzerland.
Computational Science and Engineering Laboratory, ETH Zürich, CH-8092 Zürich, Switzerland
Proc Natl Acad Sci U S A. 2018 Jun 5;115(23):5849-5854. doi: 10.1073/pnas.1800923115. Epub 2018 May 21.
Fish in schooling formations navigate complex flow fields replete with mechanical energy in the vortex wakes of their companions. Their schooling behavior has been associated with evolutionary advantages including energy savings, yet the underlying physical mechanisms remain unknown. We show that fish can improve their sustained propulsive efficiency by placing themselves in appropriate locations in the wake of other swimmers and intercepting judiciously their shed vortices. This swimming strategy leads to collective energy savings and is revealed through a combination of high-fidelity flow simulations with a deep reinforcement learning (RL) algorithm. The RL algorithm relies on a policy defined by deep, recurrent neural nets, with long-short-term memory cells, that are essential for capturing the unsteadiness of the two-way interactions between the fish and the vortical flow field. Surprisingly, we find that swimming in-line with a leader is not associated with energetic benefits for the follower. Instead, "smart swimmer(s)" place themselves at off-center positions, with respect to the axis of the leader(s) and deform their body to synchronize with the momentum of the oncoming vortices, thus enhancing their swimming efficiency at no cost to the leader(s). The results confirm that fish may harvest energy deposited in vortices and support the conjecture that swimming in formation is energetically advantageous. Moreover, this study demonstrates that deep RL can produce navigation algorithms for complex unsteady and vortical flow fields, with promising implications for energy savings in autonomous robotic swarms.
鱼在成群游动时会在同伴的尾涡中穿行于充满机械能的复杂流场。它们的成群游动行为与进化优势相关,包括节省能量,但潜在的物理机制仍不清楚。我们表明,鱼可以通过将自己置于其他游泳者尾流中的适当位置,并明智地拦截它们释放的漩涡,从而提高持续推进效率。这种游动策略导致集体节能,并通过与深度学习强化学习 (RL) 算法相结合的高保真度流模拟来揭示。RL 算法依赖于由深度递归神经网络定义的策略,带有长短时记忆单元,对于捕捉鱼和涡旋流场之间双向相互作用的不稳定性至关重要。令人惊讶的是,我们发现,与领导者成一直线游泳并不与追随者的能量收益相关。相反,“聪明的游泳者”将自己置于相对于领导者轴的非中心位置,并改变身体形状以与迎面而来的漩涡的动量同步,从而在不牺牲领导者的情况下提高游泳效率。结果证实,鱼可以收获漩涡中储存的能量,并支持这样的假设,即编队游泳在能量上是有利的。此外,这项研究表明,深度学习 RL 可以为复杂的不稳定和涡旋流场生成导航算法,这对自主机器人群的节能具有广阔的应用前景。