一种最大集合期望损失的多目标Sarsa(λ)算法
刘全, 李瑾, 傅启明, 崔志明, 伏玉琛
A Multiple-Goal Sarsa(λ) Algorithm Based on Lost Reward of Greatest Mass
LIU Quan, LI Jin, FU Qi-ming, CUI Zhi-ming, FU Yu-chen
电子学报 . 2013, (8): 1469 -1473 .  DOI: 10.3969/j.issn.0372-2112.2013.08.003