Efficient model-free Q-factor approximation in value space via log-sum-exp neural networks

Conference Paper

Publication Date:

2020

Abstract:

We propose an efficient technique for performing data-driven optimal control of discrete-time systems. In particular, we show that log-sum-exp (LSE) neural networks, which are smooth and convex universal approximators of convex functions, can be efficiently used to approximate Q-factors arising from finite-horizon optimal control problems with continuous state space. The key advantage of these networks over classical approximation techniques is that they are convex and hence readily amenable to efficient optimization.
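As an illustration of the function class named in the abstract, the following is a minimal sketch of a log-sum-exp evaluation, assuming the common parameterization f(x) = T log(sum_k exp((w_k . x + b_k) / T)). The function name `lse_network`, the `temperature` parameter, and the example weights are hypothetical and are not taken from the paper. Convexity follows because the log-sum-exp of affine functions is convex.

```python
import math

def lse_network(x, weights, biases, temperature=1.0):
    """Evaluate T * log(sum_k exp((w_k . x + b_k) / T)).

    Assumed parameterization for illustration only; the paper's exact
    network definition may differ.
    """
    # One affine "hidden unit" per (w, b) pair, scaled by the temperature.
    scores = [
        (sum(w_i * x_i for w_i, x_i in zip(w, x)) + b) / temperature
        for w, b in zip(weights, biases)
    ]
    # Subtract the maximum before exponentiating to avoid overflow.
    m = max(scores)
    return temperature * (m + math.log(sum(math.exp(s - m) for s in scores)))

# Hypothetical parameters: three affine pieces over a 2-D input.
weights = [[1.0, 0.0], [-1.0, 2.0], [0.0, -1.0]]
biases = [0.0, 0.5, -0.2]

x = [0.3, -0.7]
y = [-1.2, 0.4]
mid = [(a + b) / 2 for a, b in zip(x, y)]

# Midpoint convexity: f((x + y) / 2) <= (f(x) + f(y)) / 2.
f_mid = lse_network(mid, weights, biases)
f_avg = 0.5 * (lse_network(x, weights, biases) + lse_network(y, weights, biases))
```

In this sketch, smaller values of `temperature` bring the output closer to the maximum of the affine pieces, while larger values give a smoother surface.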

Iris type:

04.01 Contributo in Atti di convegno

Keywords:

Q-learning; adaptive control; optimal control

List of contributors:

Possieri, Corrado

Authors of the University:

POSSIERI CORRADO

Handle:

https://iris.cnr.it/handle/20.500.14243/409018

Overview

URL

http://www.scopus.com/record/display.url?eid=2-s2.0-85090160376&origin=inward
