Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Efficient sampling in approximate dynamic programming algorithms

Articolo
Data di Pubblicazione:
2007
Abstract:
Dynamic Programming (DP) is known to be a standard optimization tool for solving Stochastic Optimal Control (SOC) problems, either over a finite or an infinite horizon of stages. Under very general assumptions, commonly employed numerical algorithms are based on approximations of the cost-to-go functions, by means of suitable parametric models built from a set of sampling points in the d-dimensional state space. Here the problem of sample complexity, i.e., how "fast" the number of points must grow with the input dimension in order to have an accurate estimate of the cost-to-go functions in typical DP approaches such as value iteration and policy iteration, is discussed. It is shown that a choice of the sampling based on low-discrepancy sequences, commonly used for efficient numerical integration, permits to achieve, under suitable hypotheses, an almost linear sample complexity, thus contributing to mitigate the curse of dimensionality of the approximate DP procedure.
Tipologia CRIS:
01.01 Articolo in rivista
Keywords:
Stochastic optimal control problem; Dynamic programming; Sample complexity; Deterministic learning; Low-discrepancy sequences
Elenco autori:
Cervellera, Cristiano; Muselli, Marco
Autori di Ateneo:
CERVELLERA CRISTIANO
MUSELLI MARCO
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/144628
Pubblicato in:
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS
Journal
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)