Generating Synthetic Discrete Datasets with Machine Learning

Abstract

Data di Pubblicazione:

2022

Abstract:

The real data are not always available/accessible/sufficient or in many cases they are incomplete and lacking in semantic content necessary to the definition of optimization processes. In this paper we discuss about the synthetic data generation under two different perspectives. The core common idea is to analyze a limited set of real data to learn the main patterns that characterize them and exploit this knowledge to generate brand new data. The first perspective is constraint-based generation and consists in generating a synthetic dataset satisfying given support constraints on the real frequent patterns. The second one is based on probabilistic generative modeling and considers the synthetic generation as a sampling process from a parametric distribution learned on the real data, typically encoded as a neural network (e.g. Variational Autoencoders, Generative Adversarial Networks).

Tipologia CRIS:

04.02 Abstract in Atti di convegno

Keywords:

Synthetic dataset; Data generation; Inverse Frequent Itemset Mining; Constraints-based models; Variational Autoencoder; Generative Adversarial Networks; Generative models

Elenco autori:

Manco, Giuseppe; Ritacco, Ettore

Autori di Ateneo:

MANCO GIUSEPPE

RITACCO ETTORE

Link alla scheda completa:

https://iris.cnr.it/handle/20.500.14243/414898

Pubblicato in:

CEUR WORKSHOP PROCEEDINGS

Series

Dati Generali

URL

http://www.scopus.com/record/display.url?eid=2-s2.0-85137471641&origin=inward

Generating Synthetic Discrete Datasets with Machine Learning

Manco, Giuseppe; Ritacco, Ettore

CEUR WORKSHOP PROCEEDINGS

Dati Generali

URL