哈尔滨工业大学计算机科学与技术学院,黑龙江,哈尔滨,150001
纸质出版:2006
移动端阅览
周浦城, 洪炳镕, 黄庆成. 一种新颖的多agent强化学习方法[J]. 电子学报, 2006,34(8):1488-1491.
ZHOU Pu-cheng, HONG Bing-rong, HUANG Qing-cheng. A Novel Multi-Agent Reinforcement Learning Approach[J]. Acta Electronica Sinica, 2006, 34(8): 1488-1491.
提出了一种综合了模块化结构、利益分配学习以及对手建模技术的多agent强化学习方法
利用模块化学习结构来克服状态空间的维数灾问题
将Q-学习与利益分配学习相结合以加快学习速度
采用基于观察的对手建模来预测其他agent的动作分布.追捕问题的仿真结果验证了所提方法的有效性.
A novel multi-agent reinforcement learning approach is proposed to learn the coordinated behaviors among cooperative agents team.The proposed approach combines advantages of the modular architecture
profit-sharing learning and opponent modeling technique in a single multi-agent framework.Simulation results on the pursuit problem show that the proposed learning approach has faster convergence speed and more optimal policy over conventional modular Q-learning algorithms.
0
浏览量
2228
下载量
5
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621