Publication Date:
2023
abstract:
It has been shown that many Authorship Identification systems are vulnerable to adversarial attacks, where an author actively tries to fool the classifier. We propose to tackle the adversarial Authorship Verification task by augmenting the training set with synthetic textual examples. In this ongoing study, we present preliminary results using two learning algorithms (SVM and Neural Network), and two generation strategies (based on language modeling and GAN training) for two generator models, on three datasets. We empirically show that data augmentation may help improve the performance of the classifier in an adversarial setup.
Iris type:
04.01 Contributo in Atti di convegno
Keywords:
Authorship verification; Data augmentation; Text classification
List of contributors:
Corbara, Silvia; MOREO FERNANDEZ, ALEJANDRO DAVID
Full Text:
Book title:
Proceedings of the 13th Italian Information Retrieval Workshop (IIR 2023)
Published in: