Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

F-Discrepancy for Efficient Sampling in Approximate Dynamic Programming

Articolo
Data di Pubblicazione:
2016
Abstract:
In this paper, we address the problem of generating efficient state sample points for the solution of continuous-state finite-horizon Markovian decision problems through approximate dynamic programming. It is known that the selection of sam- pling points at which the value function is observed is a key factor when such function is approximated by a model based on a finite number of evaluations. A standard approach consists in generating these points through a random or deterministic procedure, aiming at a balanced covering of the state space. Yet, this solution may not be efficient if the state trajectories are not uniformly distributed. Here, we propose to exploit F-discrepancy, a quantity that measures how closely a set of random points rep- resents a probability distribution, and introduce an example of an algorithm based on such concept to automatically select point sets that are efficient with respect to the underlying Markovian process. An error analysis of the approximate solution is pro- vided, showing how the proposed algorithm enables convergence under suitable regularity hypotheses. Then, simulation results are provided concerning an inventory forecasting test problem. The tests confirm in general the important role of F-discrepancy, and show how the proposed algorithm is able to yield bet- ter results than uniform sampling, using sets even 50 times smaller.
Tipologia CRIS:
01.01 Articolo in rivista
Keywords:
Approximate dynamic pro; F-discrepancy; Markovian decision problem; state sam- pling; value function approximation
Elenco autori:
Cervellera, Cristiano; Maccio', Danilo
Autori di Ateneo:
CERVELLERA CRISTIANO
MACCIO' DANILO
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/316812
Pubblicato in:
IEEE TRANSACTIONS ON CYBERNETICS
Journal
  • Dati Generali

Dati Generali

URL

https://ieeexplore.ieee.org/document/7172513
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)