Skip to Main Content (Press Enter)

Logo CNR
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills

UNI-FIND
Logo CNR

|

UNI-FIND

cnr.it
  • ×
  • Home
  • People
  • Outputs
  • Organizations
  • Expertise & Skills
  1. Outputs

LAF Barcoding: classifying DNA Barcode multi-locus sequences with feature vectors and supervised approaches

Chapter
Publication Date:
2015
abstract:
DNA barcodes - one or multiple very short gene sequences - have been proven effective to classify a specimen to species. To handle this task in the plant and fungus kingdoms, multi-locus DNA barcode data as well as sequence analysis techniques are demanded, posing new challenges. In this work, we describe LAF-BARCODING, a Logic Alignment Free technique that counts the number of fixed-length substrings (k-mers) of the input sequences, represents them in feature vectors, and classifies them through a rule-based approach in order to specifically assign multi-locus DNA barcode sequences to their corresponding species. We use LAF to classify several sets of DNA barcode sequences, belonging to the plant and fungus life kingdoms, obtaining compact and meaningful classification models (if-then rules) with high accuracy rates. Conversely to the widespread alignmentbased (e.g., character, tree, and similarity) methods, we highlight that LAF can be successfully applied to multi-locus DNA barcode sequences.
Iris type:
02.01 Contributo in volume (Capitolo o Saggio)
Keywords:
DNA Barcoding; alignment-free; classification; supervised machine learning.
List of contributors:
Weitschek, Emanuel; Fiscon, Giulia; Bertolazzi, Paola; Cestarelli, Valerio; Felici, Giovanni
Handle:
https://iris.cnr.it/handle/20.500.14243/290386
Book title:
12th International Meeting on Computational Intelligence Methods for Bioinformatics and Biostatistics
  • Overview

Overview

URL

http://bioinfo.na.iac.cnr.it/cibb2015/
  • Use of cookies

Powered by VIVO | Designed by Cineca | 26.5.0.0 | Sorgente dati: PREPROD (Ribaltamento disabilitato)