Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Value iteration for continuous-time linear time-invariant systems

Articolo
Data di Pubblicazione:
2022
Abstract:
Two data-driven strategies for value iteration in linear quadratic optimal control problems over an infinite horizon are proposed. The two architectures share common features, since they both consist of a purely continuous-time control architecture and are based on the forward integration of the Differential Riccati Equation (DRE). They profoundly differ, instead, in the estimation mechanism of the vector field of the underlying DRE from collected data: the first relies on a characterization of properties of the advantage function associated to the problem, whereas the second is inspired by tools from adaptive control theory and ensures semi-global exponential convergence to the optimal solution. Advantages and drawbacks of the architectures are discussed, while the performance is validated via a benchmark numerical example.
Tipologia CRIS:
01.01 Articolo in rivista
Keywords:
Adaptive control; Adaptive control; Convergence; Costs; Linear systems; Optimal control; Optimal control; Reinforcement learning; Reinforcement learning; Riccati equations; Trajectory; Value iteration; learning
Elenco autori:
Possieri, Corrado
Autori di Ateneo:
POSSIERI CORRADO
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/413740
Pubblicato in:
IEEE TRANSACTIONS ON AUTOMATIC CONTROL (PRINT)
Journal
  • Dati Generali

Dati Generali

URL

http://www.scopus.com/record/display.url?eid=2-s2.0-85129614109&origin=inward
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)