Suffix Arrays: A competitive choice for fast Lempel-Ziv Compression

J. Ferreira, Artur; Oliveira, Arlindo L.; Figueiredo, Mario

http://hdl.handle.net/10400.21/17902

Utilize este identificador para referenciar este registo.

Nome:	Descrição:	Tamanho:	Formato:
Suffix Arrays_AJFerreira.pdf		534.66 KB	Adobe PDF	Ver/Abrir

Contacte-nos

Autores

J. Ferreira, Artur

Oliveira, Arlindo L.

Figueiredo, Mario

Resumo(s)

Lossless compression algorithms of the Lempel-Ziv (LZ) family are widely used in a variety of applications. The LZ encoder and decoder exhibit a high asymmetry, regarding time and memory requirements, with the former being much more demanding. Several techniques have been used to speed up the encoding process; among them is the use of suffix trees. In this paper, we explore the use of a simple data structure, named suffix array, to hold the dictionary of the LZ encoder, and propose an algorithm to search the dictionary. A comparison with the suffix tree based LZ encoder is carried out, showing that the compression ratios are roughly the same. The ammount of memory required by the suffix array is fixed, being much lower than the variable memory requirements of the suffix tree encoder, which depends on the text to encode. We conclude that suffix arrays are a very interesting option regarding the tradeoff between time, memory, and compression ratio, when compared with suffix trees, that make them preferable in some compression scenarios.

Palavras-chave

Lempel-Ziv Lossless Data Compression Suffix Arrays Suffix Trees String Matching

URI

http://hdl.handle.net/10400.21/17902

Citação

J. Ferreira, A.; L. Oliveira, A. and A. T. Figueiredo, M. (2008). SUFFIX ARRAYS - A Competitive Choice for Fast Lempel-Ziv Compressions. In Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2008) - SIGMAP; ISBN 978-989-8111-60-9, SciTePress, pages 5-12. DOI: 10.5220/0001935200050012

Editora

SciTePress

DOI

10.5220/0001935200050012

Coleções

ISEL - Eng. Elect. Tel. Comp. - Comunicações

Métricas Alternativas

Ver registo completo