南京大学软件新技术国家重点实验室,江苏,南京,210093
纸质出版:2007
移动端阅览
高 阳, 胡景凯, 王本年, 等. 基于CMAC网络强化学习的电梯群控调度[J]. 电子学报, 2007,35(2):362-365.
GAO Yang, HU Jing-kai, WANG Ben-nian, et al. Elevator Group Control Using Reinforcement Learning with CMAC[J]. Acta Electronica Sinica, 2007, 35(2): 362-365.
电梯群控调度是一类开放、动态、复杂系统的多目标优化问题.目前应用于群控电梯调度的算法主要有分区算法、基于搜索的算法、基于规则的算法和其他一些自适应的学习算法.但已有方法在顾客平均等待时间等目标上并不能够达到较好的优化性能.本文采用强化学习技术应用到电梯群控调度系统中
使用CMAC神经网络函数估计模块逼近强化学习的值函数
通过Q-学习算法来优化值函数
从而获得优化的电梯群控调度策略.通过仿真实验表明在下行高峰模式下
本文所提出的基于CMAC网络强化学习的群控电梯调度算法
能够有效地减少平均等待时间
提高电梯运行效率.
Elevator group control is a multi-objective optimization problem in an open
complicated and dynamical system.Currently
many algorithms have been applied in elevator group control
such as zoning approaches
search-based approaches
rule-based approaches and other adaptive approaches.However these methods fail of achieving the optimal performance in the average wait time.In this paper
the reinforcement learning technology is applied in the elevator group control system.The CMAC neural network is used to approx the value function of reinforcement learning and Q-learning algorithm is used to optimize the value function
thereby the optimal control policy of the elevator group control is achieved.The simulation experiment shows that the elevator group control using reinforcement learning with CMAC can reduce the average wait time efficiently in the down peak traffic.
0
浏览量
3001
下载量
7
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621