基于CMAC网络强化学习的电梯群控调度

高 阳; 胡景凯; 王本年; 王冬黎

您当前的位置：

首页 >

文章列表页 >

基于CMAC网络强化学习的电梯群控调度

论文 | 更新时间：2025-07-16

- 基于CMAC网络强化学习的电梯群控调度
- Elevator Group Control Using Reinforcement Learning with CMAC
- 电子学报 2007年35卷第2期页码：362-365
- 作者机构：
  
  南京大学软件新技术国家重点实验室,江苏,南京,210093
- 作者简介：
- 基金信息：
  
  国家自然科学基金 (No.60475026);国家杰出青年科学基金 (No.60325207);国家973重点基础研究发展规划 (No.2002CB312002)
- DOI：
  中图分类号： TP18
- 纸质出版：2007
- 稿件说明：
移动端阅览
高阳, 胡景凯, 王本年, 等. 基于CMAC网络强化学习的电梯群控调度[J]. 电子学报, 2007,35(2):362-365.

GAO Yang, HU Jing-kai, WANG Ben-nian, et al. Elevator Group Control Using Reinforcement Learning with CMAC[J]. Acta Electronica Sinica, 2007, 35(2): 362-365.
高阳, 胡景凯, 王本年, 等. 基于CMAC网络强化学习的电梯群控调度[J]. 电子学报, 2007,35(2):362-365. DOI：

GAO Yang, HU Jing-kai, WANG Ben-nian, et al. Elevator Group Control Using Reinforcement Learning with CMAC[J]. Acta Electronica Sinica, 2007, 35(2): 362-365. DOI：

摘要

电梯群控调度是一类开放、动态、复杂系统的多目标优化问题.目前应用于群控电梯调度的算法主要有分区算法、基于搜索的算法、基于规则的算法和其他一些自适应的学习算法.但已有方法在顾客平均等待时间等目标上并不能够达到较好的优化性能.本文采用强化学习技术应用到电梯群控调度系统中

使用CMAC神经网络函数估计模块逼近强化学习的值函数

通过Q-学习算法来优化值函数

从而获得优化的电梯群控调度策略.通过仿真实验表明在下行高峰模式下

本文所提出的基于CMAC网络强化学习的群控电梯调度算法

能够有效地减少平均等待时间

提高电梯运行效率.

Abstract

Elevator group control is a multi-objective optimization problem in an open

complicated and dynamical system.Currently

many algorithms have been applied in elevator group control

such as zoning approaches

search-based approaches

rule-based approaches and other adaptive approaches.However these methods fail of achieving the optimal performance in the average wait time.In this paper

the reinforcement learning technology is applied in the elevator group control system.The CMAC neural network is used to approx the value function of reinforcement learning and Q-learning algorithm is used to optimize the value function

thereby the optimal control policy of the elevator group control is achieved.The simulation experiment shows that the elevator group control using reinforcement learning with CMAC can reduce the average wait time efficiently in the down peak traffic.

关键词

Keywords

references

浏览量

3001

下载量

CSCD

文章被引用时，请邮件提醒。

提交

工具集

关联资源

基于图组合优化的高效社区搜索

知识数据协同的多对手智能空中博弈策略设计

基于强化学习的免调参即插即用单光子图像重建方法

基于强化学习的离散事件系统最优定向监控

基于强化学习的自免疫动态攻击生成方法