an algorithm which implements mechanism of reinforcement learning under the framework of genetic algorithm is described.by using gene space division the algorithm maps the gene space of genetic algorithm into the strategy spcaces of multi-agent.The convergence theorems for the algorithm are presented
and the time and the space efficiency of the algorithm as well as the relation between them and the division granularity are discussed.The experimental results show that RLGA has well global convergence performance
and the further experiments provide the guide range of the size of gene space division in RLGA.