Nonparametric Problem-Space Clustering: Learning Efficient Codes for Cognitive Control Tasks
Academic Article
Publication Date:
2016
abstract:
We present an information-theoretic method permitting one to find structure in a problem space (here, in a spatial navigation domain) and cluster it in ways that are convenient to solve different classes of control problems, which include planning a path to a goal from a known or an unknown location, achieving multiple goals and exploring a novel environment. Our generative nonparametric approach, called the generative embedded Chinese restaurant process (geCRP), extends the family of Chinese restaurant process (CRP) models by introducing a parameterizable notion of distance (or kernel) between the states to be clustered together. By using different kernels, such as the the conditional probability or joint probability of two states, the same geCRP method clusters the environment in ways that are more sensitive to different control-related information, such as goal, sub-goal and path information. We perform a series of simulations in three scenariosan open space, a grid world with four rooms and a maze having the same structure as the Hanoi Towerin order to illustrate the characteristics of the different clusters (obtained using different kernels) and their relative benefits for solving planning and control problems.
Iris type:
01.01 Articolo in rivista
Keywords:
clustering; information theory; generative model; structure learning; goal; sub-goal; planning
List of contributors:
Pezzulo, Giovanni; Maisto, Domenico; Donnarumma, Francesco
Published in: