Authors
John J Grefenstette, David E Moriarty, Alan C Schultz
Publication date
2011/6
Journal
arXiv e-prints
Pages
arXiv: 1106.0221
Description
There are two distinct approaches to solving reinforcement learning problems, namely, searching in value function space and searching in policy space. Temporal difference methods and evolutionary algorithms are well-known examples of these approaches. Kaelbling, Littman and Moore recently provided an informative survey of temporal difference methods. This article focuses on the application of evolutionary algorithms to the reinforcement learning problem, emphasizing alternative policy representations, credit assignment methods, and problem-specific genetic operators. Strengths and weaknesses of the evolutionary approach to reinforcement learning are presented, along with a survey of representative applications.
Total citations
201220132014201520162017201820192020202115612112
Scholar articles
JJ Grefenstette, DE Moriarty, AC Schultz - arXiv e-prints, 2011