Repository logo
 
No Thumbnail Available
Publication

On the suitability of suffix arrays for lempel-ziv data compression

Use this identifier to reference this record.
Name:Description:Size:Format: 
On the suitability_AJFerreira.pdf672.89 KBAdobe PDF Download

Advisor(s)

Abstract(s)

Lossless compression algorithms of the Lempel-Ziv (LZ) family are widely used nowadays. Regarding time and memory requirements, LZ encoding is much more demanding than decoding. In order to speed up the encoding process, efficient data structures, like suffix trees, have been used. In this paper, we explore the use of suffix arrays to hold the dictionary of the LZ encoder, and propose an algorithm to search over it. We show that the resulting encoder attains roughly the same compression ratios as those based on suffix trees. However, the amount of memory required by the suffix array is fixed, and much lower than the variable amount of memory used by encoders based on suffix trees (which depends on the text to encode). We conclude that suffix arrays, when compared to suffix trees in terms of the trade-off among time, memory, and compression ratio, may be preferable in scenarios (e.g., embedded systems) where memory is at a premium and high speed is not critical.

Description

Keywords

Construction Algorithm

Citation

FERREIRA, Artur J.; OLIVEIRA, Arlindo L., FIGUEIREDO, Mário A. T. – On the suitability of suffix arrays for lempel-ziv data compression. In E-Business and Telecommunications. Springer, 2009. ISBN 978-3-642-05196-8. Vol. 48. Pp. 267-280.

Research Projects

Organizational Units

Journal Issue

Publisher

Springer-Verlag Berlin

CC License