Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Competenze
  1. Pubblicazioni

Exposing racial dialect bias in abusive language detection: can explainability play a role?

Contributo in Atti di convegno
Data di Pubblicazione:
2023
Abstract:
Biases can arise and be introduced during each phase of a supervised learning pipeline, eventually leading to harm. Within the task of automatic abusive language detection, this matter becomes particularly severe since unintended bias towards sensitive topics such as gender, sexual orientation, or ethnicity can harm underrepresented groups. The role of the datasets used to train these models is crucial to address these challenges. In this contribution, we investigate whether explainability methods can expose racial dialect bias attested within a popular dataset for abusive language detection. Through preliminary experiments, we found that pure explainability techniques cannot effectively uncover biases within the dataset under analysis: the rooted stereotypes are often more implicit and complex to retrieve.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
ML; NLP; Explainability; Interpretability; ML Evaluation; Fairness in ML; Algorithmic bias; Bias discovery; Algorithmic auditing; Data awareness; Discrimination
Elenco autori:
Morini, Virginia
Link alla scheda completa:
https://iris.cnr.it/handle/20.500.14243/439220
Link al Full Text:
https://iris.cnr.it//retrieve/handle/20.500.14243/439220/120493/prod_479349-doc_196638.pdf
Titolo del libro:
Machine Learning and Principles and Practice of Knowledge Discovery in Databases
Pubblicato in:
COMMUNICATIONS IN COMPUTER AND INFORMATION SCIENCE (PRINT)
Series
  • Dati Generali

Dati Generali

URL

https://link.springer.com/chapter/10.1007/978-3-031-23618-1_32
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)