Variable ranking feature selection for the identification of nucleosome related sequences
Contributo in Atti di convegno
Data di Pubblicazione:
2018
Abstract:
Several recent works have shown that K-mer sequence representation of a DNA sequence can be used for classification or identification of nucleosome positioning related sequences. This representation can be computationally expensive when k grows, making the complexity in spaces of exponential dimension. This issue affects significantly the classification task computed by a general machine learning algorithm used for the purpose of sequence classification. In this paper, we investigate the advantage offered by the so-called Variable Ranking Feature Selection method to select the most informative K- mers associated to a set of DNA sequences, for the final purpose of nucleosome/linker classification by a deep learning network. Results computed on three public datasets show the effectiveness of the adopted feature selection method.
Tipologia CRIS:
04.01 Contributo in Atti di convegno
Keywords:
Deep learning models; Feature selection; DNA sequences; Epigenomic; Nucleosomes
Elenco autori:
Rizzo, Riccardo; Urso, Alfonso; Fiannaca, Antonino; LA ROSA, Massimo
Link alla scheda completa:
Titolo del libro:
New Trends in Databases and Information Systems
Pubblicato in: