Automatically building datasets of labeled IP traffic traces: A self-training approach
Academic Article
Publication Date:
2012
abstract:
Many approaches have been proposed so far to tackle computer network security. Among them, several systems exploit Machine Learning and Pattern Recognition techniques, by regarding malicious behavior detection as a classification problem. Supervised and unsupervised algorithms have been used in this context, each one with its own benefits and shortcomings. When using supervised techniques, a representative training set is required, which reliably indicates what a human expert wants the system to learn and recognize, by means of suitably labeled samples. In real environments there is a significant difficulty in collecting a representative dataset of correctly labeled traffic traces. In adversarial environments such a task is made even harder by malicious attackers, trying to make their actions' evidences stealthy.
Iris type:
01.01 Articolo in rivista
Keywords:
Soft label; Network security; IDS
List of contributors:
Gargiulo, Francesco
Published in: