Taming Lagrangian chaos with multi-objective reinforcement learning

Article
Publication date:
2023
Abstract:
We consider the problem of two active particles in 2D complex flows with the multi-objective goals of minimizing both the dispersion rate and the control activation cost of the pair. We approach the problem by means of multi-objective reinforcement learning (MORL), combining scalarization techniques with a Q-learning algorithm, for Lagrangian drifters that have variable swimming velocity. We show that MORL is able to find a set of trade-off solutions forming an optimal Pareto frontier. As a benchmark, we show that a set of heuristic strategies is dominated by the MORL solutions. We consider the situation in which the agents cannot update their control variables continuously, but only after a discrete (decision) time, τ. We show that there is a range of decision times, between the Lyapunov time and the continuous updating limit, where reinforcement learning finds strategies that significantly improve over heuristics. In particular, we discuss how large decision times require enhanced knowledge of the flow, whereas for smaller τ all a priori heuristic strategies become Pareto optimal.
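As an illustration of the ingredients named in the abstract, the following is a minimal Python sketch, not the authors' implementation. It assumes linear scalarization (one of several possible scalarization techniques), a tabular Q-function stored as a list of lists, and two costs that are both minimized. The function names `scalarize`, `q_update`, `dominates` and `pareto_front`, the weight parameter, and the default learning parameters are all hypothetical; the record does not specify the states, actions, or rewards used in the paper.

```python
def scalarize(rewards, weight):
    """Collapse two objective rewards into one scalar (assumed linear form).

    rewards: (dispersion_reward, control_reward); weight lies in [0, 1].
    """
    r_dispersion, r_control = rewards
    return weight * r_dispersion + (1.0 - weight) * r_control


def q_update(q_table, state, action, reward, next_state, alpha=0.1, gamma=0.99):
    """Apply one standard tabular Q-learning step in place.

    q_table[s][a] holds the value estimate for taking action a in state s.
    """
    best_next = max(q_table[next_state])
    td_error = reward + gamma * best_next - q_table[state][action]
    q_table[state][action] += alpha * td_error


def dominates(a, b):
    """True if cost vector a is no worse than b everywhere and better somewhere."""
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def pareto_front(costs):
    """Return indices of the non-dominated cost vectors (all costs minimized)."""
    return [
        i for i, c in enumerate(costs)
        if not any(dominates(other, c) for other in costs)
    ]
```

Under these assumptions, one policy would be trained per value of the weight, each policy would be evaluated to obtain a pair (dispersion rate, control cost), and `pareto_front` would then pick out the trade-off solutions that no other policy or heuristic dominates.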
CRIS type:
01.01 Journal article
Keywords:
Chaos; reinforcement learning; Lagrangian chaos
Author list:
Cencini, Massimo
Institutional authors:
CENCINI MASSIMO
Link to the full record:
https://iris.cnr.it/handle/20.500.14243/432262
Published in:
THE EUROPEAN PHYSICAL JOURNAL. E, SOFT MATTER
Journal
General data

URL

https://link.springer.com/article/10.1140/epje/s10189-023-00271-0