Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills
  1. Outputs

Q-Learning: computation of optimal Q-values for evaluating the learning level in robotic tasks

Academic Article
Publication Date:
2001
abstract:
A problem related to the use of Reinforcement Learning algorithms on real robot applications is the difficulty of measuring the learning level reached after some experience. Among the different RL algorithms, the Q-learning is the most widely used in accomplishing robotic tasks. The aim of this work is to a-priori evaluate the optimal Q-values for problems where it is possible to compute the distance between the current state and the goal state of the system. Starting from the Q-learning updating formula the equations for the maximum Q-weights, for optimal and non-optimal actions, have been computed considering delayed and immediate rewards. Deterministic and non deterministic grid-world environments have been also considered to test in simulations the obtained equations. Besides the convergence rates of the Q-learning algorithm have been compared using different learning rate parameters.
Iris type:
01.01 Articolo in rivista
Keywords:
Q-learning; convergence rate; learning parameters; optimal Q-values
List of contributors:
D'Orazio, TIZIANA RITA; Cicirelli, Grazia
Authors of the University:
CICIRELLI GRAZIA
D'ORAZIO TIZIANA RITA
Handle:
https://iris.cnr.it/handle/20.500.14243/23640
Published in:
JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE
Journal
  • Use of cookies

Powered by VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)