Compressed spaced suffix arrays

Conference Paper

Publication Date:

2014

abstract:

Spaced seeds are important tools for similarity search in bioinformatics, and using several seeds together often significantly improves their performance. With existing approaches, however, for each seed we keep a separate linear-size data structure, either a hash table or a spaced suffix array (SSA). In this paper we show how to compress SSAs relative to normal suffix arrays (SAs) and still support fast random access to them. We first prove a theoretical upper bound on the space needed to store an SSA when we already have the SA. We then present experiments indicating that our approach works even better in practice.

Iris type:

04.01 Contributo in Atti di convegno

Keywords:

suffix array; spaced seeds

List of contributors:

Manzini, Giovanni

Handle:

https://iris.cnr.it/handle/20.500.14243/318198

Published in:

CEUR WORKSHOP PROCEEDINGS

Series

Overview

URL

http://www.scopus.com/record/display.url?eid=2-s2.0-84908295208&origin=inward

Compressed spaced suffix arrays

Manzini, Giovanni

CEUR WORKSHOP PROCEEDINGS

Overview

URL