一种新颖的多agent强化学习方法

周浦城; 洪炳镕; 黄庆成

您当前的位置：

首页 >

文章列表页 >

一种新颖的多agent强化学习方法

论文 | 更新时间：2025-07-16

- 一种新颖的多agent强化学习方法
- A Novel Multi-Agent Reinforcement Learning Approach
- 电子学报 2006年34卷第8期页码：1488-1491
- 作者机构：
  
  哈尔滨工业大学计算机科学与技术学院,黑龙江,哈尔滨,150001
- 作者简介：
- 基金信息：
  
  国家自然科学基金 (No.69985002)
- DOI：
  中图分类号： TP18
- 纸质出版：2006
- 稿件说明：
移动端阅览
周浦城, 洪炳镕, 黄庆成. 一种新颖的多agent强化学习方法[J]. 电子学报, 2006,34(8):1488-1491.

ZHOU Pu-cheng, HONG Bing-rong, HUANG Qing-cheng. A Novel Multi-Agent Reinforcement Learning Approach[J]. Acta Electronica Sinica, 2006, 34(8): 1488-1491.
周浦城, 洪炳镕, 黄庆成. 一种新颖的多agent强化学习方法[J]. 电子学报, 2006,34(8):1488-1491. DOI：

ZHOU Pu-cheng, HONG Bing-rong, HUANG Qing-cheng. A Novel Multi-Agent Reinforcement Learning Approach[J]. Acta Electronica Sinica, 2006, 34(8): 1488-1491. DOI：

摘要

提出了一种综合了模块化结构、利益分配学习以及对手建模技术的多agent强化学习方法

利用模块化学习结构来克服状态空间的维数灾问题

将Q-学习与利益分配学习相结合以加快学习速度

采用基于观察的对手建模来预测其他agent的动作分布.追捕问题的仿真结果验证了所提方法的有效性.

Abstract

A novel multi-agent reinforcement learning approach is proposed to learn the coordinated behaviors among cooperative agents team.The proposed approach combines advantages of the modular architecture

profit-sharing learning and opponent modeling technique in a single multi-agent framework.Simulation results on the pursuit problem show that the proposed learning approach has faster convergence speed and more optimal policy over conventional modular Q-learning algorithms.

关键词

Keywords

references

浏览量

2228

下载量

CSCD

文章被引用时，请邮件提醒。

提交

工具集

关联资源

暂无数据