Class imbalance should not throw you off balance: Choosing the right classifiers and performance metrics for brain decoding with imbalanced data
Altro Prodotto di Ricerca
Data di Pubblicazione:
2022
Abstract:
Machine learning (ML) is increasingly used in cognitive, computational
and clinical neuroscience. The reliable and efficient application of ML re-
quires a sound understanding of its subtleties and limitations. Training ML
models on datasets with imbalanced classes is a particularly common prob-
lem, and it can have severe consequences if not adequately addressed. With
the neuroscience ML user in mind, this paper provides a didactic assessment
of the class imbalance problem and illustrates its impact through systematic
manipulation of data imbalance ratios in (i) simulated data and (ii) brain
data recorded with electroencephalography (EEG) and magnetoencephalog-
raphy (MEG). Our results illustrate how the widely-used Accuracy (Acc)
metric, which measures the overall proportion of successful predictions, yields
misleadingly high performances, as class imbalance increases. Because Acc
weights the per-class ratios of correct predictions proportionally to class size,
it largely disregards the performance on the minority class. A binary classi-
fication model that learns to systematically vote for the majority class will
yield an artificially high decoding accuracy that directly reflects the imbal-
ance between the two classes, rather than any genuine generalizable ability
to discriminate between them. We show that other evaluation metrics such
as the Area Under the Curve (AUC) of the Receiver Operating Charac-
teristic (ROC), and the less common Balanced Accuracy (BAcc) metric -
defined as the arithmetic mean between sensitivity and specificity, provide
more reliable performance evaluations for imbalanced data. Our findings also
highlight the robustness of Random Forest (RF), and the benefits of using
stratified cross-validation and hyperprameter optimization to tackle data im-
balance. Critically, for neuroscience ML applications that seek to minimize
overall classification error, we recommend the routine use of BAcc, which in
the specific case of balanced data is equivalent to using standard Acc, and
readily extends to multi-class settings. Importantly, we present a list of rec-
ommendations for dealing with imbalanced data, as well as open-source code
to allow the neuroscience community to replicate and extend our observations
and explore alternative approaches to coping with imbalanced data.
Tipologia CRIS:
05.12 Altro
Keywords:
Class imbalance; Machine learning; Classification; Performance metrics; Electroencephalography; Magnetoencephalography; Brain decoding; Balanced accuracy
Elenco autori:
Pascarella, Annalisa
Link alla scheda completa: