Generating Macro-actions through Sequence Finding in Partially Observable Environments

D. Aliaga (Japan)

Keywords

RL Learning POMDP, MacroActions, Sequence Finding

Abstract

This is the first of a series of papers dealing with the prob lems of a Reinforcement Learner Robot in a Partially Ob servable Environment and how Macro-Actions are a good approach to deal with those problems. In this paper we are going to introduce the general problem, and then we focus on the first part of the solution: dealing with the au tomatic generation of sensation-action sequences that will be transformed then into Macro-Actions. The paper ex plains the method the robot uses to autonomously generate useful sequences that then are going to be transformed into appropriate Macro-Actions to help it overcome the partial observability problem.

Important Links:



Go Back