Investigating the Physics of Tokamak Global Stability with Interpretable Machine Learning Tools
Articolo
Data di Pubblicazione:
2020
Abstract:
The inadequacies of basic physics models for disruption prediction have induced the
community to increasingly rely on data mining tools. In the last decade, it has been shown how
machine learning predictors can achieve a much better performance than those obtained with manually
identified thresholds or empirical descriptions of the plasma stability limits. The main criticisms
of these techniques focus therefore on two different but interrelated issues: poor "physics fidelity"
and limited interpretability. Insuffcient "physics fidelity" refers to the fact that the mathematical
models of most data mining tools do not reflect the physics of the underlying phenomena. Moreover,
they implement a black box approach to learning, which results in very poor interpretability of
their outputs. To overcome or at least mitigate these limitations, a general methodology has been
devised and tested, with the objective of combining the predictive capability of machine learning
tools with the expression of the operational boundary in terms of traditional equations more suited to
understanding the underlying physics. The proposed approach relies on the application of machine
learning classifiers (such as Support Vector Machines or Classification Trees) and Symbolic Regression
via Genetic Programming directly to experimental databases. The results are very encouraging.
The obtained equations of the boundary between the safe and disruptive regions of the operational
space present almost the same performance as the machine learning classifiers, based on completely
independent learning techniques. Moreover, these models possess significantly better predictive
power than traditional representations, such as the Hugill or the beta limit. More importantly,
they are realistic and intuitive mathematical formulas, which are well suited to supporting theoretical
understanding and to benchmarking empirical models. They can also be deployed easily and effciently
in real-time feedback systems.
Tipologia CRIS:
01.01 Articolo in rivista
Keywords:
disruptions; prediction; support vector machines; classification and regression trees (CART); ensemble of classifiers; symbolic regression; data-driven theory; knowledge discovery
Elenco autori:
Murari, Andrea
Link alla scheda completa:
Pubblicato in: