F-Discrepancy for Efficient Sampling in Approximate Dynamic Programming

Academic Article
Publication Date:
2016
Abstract:
In this paper, we address the problem of generating efficient state sample points for the solution of continuous-state finite-horizon Markovian decision problems through approximate dynamic programming. It is known that the selection of sampling points at which the value function is observed is a key factor when such function is approximated by a model based on a finite number of evaluations. A standard approach consists in generating these points through a random or deterministic procedure, aiming at a balanced covering of the state space. Yet, this solution may not be efficient if the state trajectories are not uniformly distributed. Here, we propose to exploit F-discrepancy, a quantity that measures how closely a set of random points represents a probability distribution, and introduce an example of an algorithm based on such concept to automatically select point sets that are efficient with respect to the underlying Markovian process. An error analysis of the approximate solution is provided, showing how the proposed algorithm enables convergence under suitable regularity hypotheses. Then, simulation results are provided concerning an inventory forecasting test problem. The tests confirm in general the important role of F-discrepancy, and show how the proposed algorithm is able to yield better results than uniform sampling, using sets even 50 times smaller.
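As a rough illustration of the quantity named in the abstract, the sketch below computes F-discrepancy in the one-dimensional case only, where it is commonly defined as the largest gap between the empirical distribution function of the points and the target distribution function F. This is not the algorithm from the article, whose state spaces are multidimensional. The function names (`f_discrepancy_1d`, `exponential_cdf`), the exponential target, and the two point sets are assumptions chosen purely for illustration.

```python
import math

def f_discrepancy_1d(points, cdf):
    """Largest gap between the empirical CDF of `points` and `cdf` (1-D only)."""
    xs = sorted(points)
    n = len(xs)
    gap = 0.0
    for i, x in enumerate(xs):
        f = cdf(x)
        # The empirical CDF jumps from i/n to (i+1)/n at x, so check both sides.
        gap = max(gap, abs((i + 1) / n - f), abs(f - i / n))
    return gap

def exponential_cdf(x, rate=1.0):
    """Hypothetical target distribution: exponential with the given rate."""
    return 1.0 - math.exp(-rate * x) if x > 0 else 0.0

n = 8
# Points placed at the (i + 0.5)/n quantiles of the target distribution.
quantile_points = [-math.log(1.0 - (i + 0.5) / n) for i in range(n)]
# Evenly spaced points on [0, 3], chosen without regard to the target.
uniform_points = [3.0 * (i + 0.5) / n for i in range(n)]

d_quantile = f_discrepancy_1d(quantile_points, exponential_cdf)
d_uniform = f_discrepancy_1d(uniform_points, exponential_cdf)
print(d_quantile, d_uniform)
```

Under these assumptions, the points that follow the target distribution give a smaller discrepancy than the evenly spaced ones, which is the intuition behind preferring distribution-aware sampling when state trajectories are not uniformly distributed.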
Iris type:
01.01 Journal article
Keywords:
Approximate dynamic programming; F-discrepancy; Markovian decision problem; state sampling; value function approximation
List of contributors:
Cervellera, Cristiano; Maccio', Danilo
Authors of the University:
CERVELLERA CRISTIANO
MACCIO' DANILO
Handle:
https://iris.cnr.it/handle/20.500.14243/316812
Published in:
IEEE TRANSACTIONS ON CYBERNETICS
Journal
URL

https://ieeexplore.ieee.org/document/7172513