Suppr超能文献

两人博弈中的改进学习

Melioration Learning in Two-Person Games.

作者信息

Zschache Johannes

机构信息

Institute of Sociology, Leipzig University, Leipzig, Germany.

出版信息

PLoS One. 2016 Nov 16;11(11):e0166708. doi: 10.1371/journal.pone.0166708. eCollection 2016.

Abstract

Melioration learning is an empirically well-grounded model of reinforcement learning. By means of computer simulations, this paper derives predictions for several repeatedly played two-person games from this model. The results indicate a likely convergence to a pure Nash equilibrium of the game. If no pure equilibrium exists, the relative frequencies of choice may approach the predictions of the mixed Nash equilibrium. Yet in some games, no stable state is reached.

摘要

改进学习是一种有充分实证依据的强化学习模型。通过计算机模拟,本文从该模型推导出了几个重复进行的两人博弈的预测结果。结果表明,博弈很可能会收敛到纯纳什均衡。如果不存在纯均衡,选择的相对频率可能会接近混合纳什均衡的预测结果。然而,在某些博弈中,无法达到稳定状态。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cbea/5112854/f31693a95d3f/pone.0166708.g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验