On the use of suffix arrays for memory-efficient Lempel-Ziv data compression

J. Ferreira, Artur; Oliveira, Arlindo L.; Figueiredo, Mario

http://hdl.handle.net/10400.21/17906

Utilize este identificador para referenciar este registo.

Nome:	Descrição:	Tamanho:	Formato:
On the use_AJFerreira.pdf		116.36 KB	Adobe PDF	Ver/Abrir

Contacte-nos

Autores

J. Ferreira, Artur

Oliveira, Arlindo L.

Figueiredo, Mario

Resumo(s)

The Lempel-Ziv 77 (LZ77) and LZ-Storer-Szymanski (LZSS) text compression algorithms use a sliding window over the sequence of symbols, with two sub-windows: the dictionary (symbols already encoded) and the look-ahead-buffer (LAB) (symbols not yet encoded). Binary search trees and suffix trees (ST) have been used to speedup the search of the LAB over the dictionary, at the expense of high memory usage [1]. A suffix array (SA) is a simpler, more compact data structure which uses (much) less memory [2,3] to hold the same information. The SA for a length m string is an array of integers ([1], ...[k], ...a[m]) that stores the lexicographic order of suffix k of the string; sub-string searching, as used in LZ77/LZSS, is done by searching the SA.

Palavras-chave

Lempel-Ziv 77 (LZ77) LZ-Storer-Szymanski (LZSS)

URI

http://hdl.handle.net/10400.21/17906

Citação

Ferreira A., Oliveira, A., Figueiredo, M. – On the Use of Suffix Arrays for Memory-Efficient Lempel-Ziv Data Compression. In 2009 Data Compression Conference. Snowbird, UT, USA: IEEE, 2009. ISBN 978-0-7695-3592-0. Pp. 1-1. Doi: 10.1109/DCC.2009.50

Editora

IEEE

DOI

10.1109/DCC.2009.50

Coleções

ISEL - Eng. Elect. Tel. Comp. - Comunicações

Métricas Alternativas

Ver registo completo