Zhang Chengwei, Li Xiaohong, Li Shuxin, Feng Zhiyong
School of Computer Science and Technology, Tianjin University, Peiyang Park Campus: No.135 Yaguan Road, Haihe Education Park, Tianjin, 300350, China.
School of Computer Computer Software, Tianjin University, Peiyang Park Campus: No.135 Yaguan Road, Haihe Education Park, Tianjin, 300350, China.
J Biomed Semantics. 2017 Sep 20;8(Suppl 1):31. doi: 10.1186/s13326-017-0142-0.
Biological environment is uncertain and its dynamic is similar to the multiagent environment, thus the research results of the multiagent system area can provide valuable insights to the understanding of biology and are of great significance for the study of biology. Learning in a multiagent environment is highly dynamic since the environment is not stationary anymore and each agent's behavior changes adaptively in response to other coexisting learners, and vice versa. The dynamics becomes more unpredictable when we move from fixed-agent interaction environments to multiagent social learning framework. Analytical understanding of the underlying dynamics is important and challenging.
In this work, we present a social learning framework with homogeneous learners (e.g., Policy Hill Climbing (PHC) learners), and model the behavior of players in the social learning framework as a hybrid dynamical system. By analyzing the dynamical system, we obtain some conditions about convergence or non-convergence. We experimentally verify the predictive power of our model using a number of representative games. Experimental results confirm the theoretical analysis.
Under multiagent social learning framework, we modeled the behavior of agent in biologic environment, and theoretically analyzed the dynamics of the model. We present some sufficient conditions about convergence or non-convergence and prove them theoretically. It can be used to predict the convergence of the system.
生物环境具有不确定性,其动态特性类似于多智能体环境,因此多智能体系统领域的研究成果能够为理解生物学提供有价值的见解,对生物学研究具有重要意义。在多智能体环境中学习具有高度动态性,因为环境不再是静止的,每个智能体的行为会根据其他共存学习者进行自适应变化,反之亦然。当我们从固定智能体交互环境转向多智能体社会学习框架时,动态性变得更加不可预测。对潜在动态性进行分析理解既重要又具有挑战性。
在这项工作中,我们提出了一个具有同质学习者(例如,策略爬山(PHC)学习者)的社会学习框架,并将社会学习框架中参与者的行为建模为一个混合动态系统。通过分析该动态系统,我们获得了一些关于收敛或不收敛的条件。我们使用一些具有代表性的博弈实验验证了我们模型的预测能力。实验结果证实了理论分析。
在多智能体社会学习框架下,我们对生物环境中智能体的行为进行了建模,并对模型的动态性进行了理论分析。我们给出了一些关于收敛或不收敛的充分条件,并从理论上进行了证明。它可用于预测系统的收敛性。