

浏览全部资源
扫码关注微信
1. 深圳职业技术学院教育技术与信息中心,广东,深圳,518055
2. 中南大学信息科学与工程学院,湖南,长沙,410083
3. 先进控制与智能自动化湖南省工程实验室,湖南,长沙,410083
4. 深圳职业技术学院教育技术与信息中心,广东,深圳,518055
5. 中南大学信息科学与工程学院,湖南,长沙,410083
6. 先进控制与智能自动化湖南省工程实验室,湖南,长沙,410083
Published:2014
移动端阅览
WU Bo, ZHENG Hong-yan, FENG Yan-peng, et al. Model-Based Factored Bayesian Online Reinforcement Learning[J]. Acta Electronica Sinica, 2014, 42(7): 1429-1434.
WU Bo, ZHENG Hong-yan, FENG Yan-peng, et al. Model-Based Factored Bayesian Online Reinforcement Learning[J]. Acta Electronica Sinica, 2014, 42(7): 1429-1434. DOI: 10.3969/j.issn.0372-2112.2014.07.029.
针对贝叶斯强化学习中参数个数巨大,收敛速度慢,无法实现在线学习的问题,提出一种基于模型的可分解贝叶斯强化学习方法.首先,将学习参数进行可分解表示,降低学习参数的个数;然后,根据先验知识和观察数据采用贝叶斯方法来学习,最优化探索和利用二者之间的平衡关系;最后,采用基于点的贝叶斯强化学习方法实现学习过程的快速收敛,从而达到在线学习的目的.仿真结果表明该算法能够满足实时系统性能的要求.
Due to the enormous number of parameters and slow convergence which are the major obstacles for online learning in model-based Bayesian reinforcement learning
the paper presents a model-based factored Bayesian reinforcement learning approach.Firstly
factored representations are made to represent the dynamics with fewer parameters.Then
according to prior knowledge and observable data
this paper exploits model-based reinforcement learning to provide an elegant solution to the optimal exploration-exploitation tradeoff.Finally
a pointed-based Bayesian reinforcement learning approach is proposed to speed up the convergence to achieve online learning.The experimental results show that the proposed approach can approximate the underlying Bayesian reinforcement learning task well with guaranteed real-time performance.
0
Views
2
下载量
2
CSCD
Publicity Resources
Related Articles
Related Author
Related Institution
京公网安备11010802024621