Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills
  1. Outputs

An empirical comparison of classification algorithms for imbalanced credit scoring datasets

Conference Paper
Publication Date:
2019
abstract:
The profitability of banks is highly dependent on credit scoring models, which support decision making to approve a loan to a customer. State-of-the-art credit scoring models are based on learning methods. These methods need to cope with the problem of imbalanced classes since credit scoring datasets usually contain mainly paid loans and few defaults (unpaid ones). Recently, new imbalanced learning techniques have been proposed in the literature, and they can improve the credit scoring results. Motivated by this scenario, we evaluate several classification approaches to credit scoring. Besides, we also assess some preprocessing methods to overcome skewed datasets. To achieve it, we use three public real-world credit scoring datasets. In our experiments, we progressively increase the class imbalance in each of these datasets by randomly undersampling the minority class of defaulters to identify how the predictive power is affected. The results indicate that random forest, extreme gradient boosting perform very well in all imbalance levels. We also find that a complete grid search step can increase the prediction power of classification approaches in high imbalanced datasets.
Iris type:
04.01 Contributo in Atti di convegno
Keywords:
Benchmarking; Classification; Credit scoring; Immbalanced datasets
List of contributors:
Renso, Chiara; Nardini, FRANCO MARIA
Authors of the University:
NARDINI FRANCO MARIA
RENSO CHIARA
Handle:
https://iris.cnr.it/handle/20.500.14243/380869
  • Overview

Overview

URL

https://ieeexplore.ieee.org/document/8999279
  • Use of cookies

Powered by VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)