Embedded Zerotrees of Wavelet transforms

Last updated

Embedded Zerotrees of Wavelet transforms (EZW) is a lossy image compression algorithm. At low bit rates, i.e. high compression ratios, most of the coefficients produced by a subband transform (such as the wavelet transform) will be zero, or very close to zero. This occurs because "real world" images tend to contain mostly low frequency information (highly correlated). However where high frequency information does occur (such as edges in the image) this is particularly important in terms of human perception of the image quality, and thus must be represented accurately in any high quality coding scheme.

Contents

By considering the transformed coefficients as a tree (or trees) with the lowest frequency coefficients at the root node and with the children of each tree node being the spatially related coefficients in the next higher frequency subband, there is a high probability that one or more subtrees will consist entirely of coefficients which are zero or nearly zero, such subtrees are called zerotrees. Due to this, we use the terms node and coefficient interchangeably, and when we refer to the children of a coefficient, we mean the child coefficients of the node in the tree where that coefficient is located. We use children to refer to directly connected nodes lower in the tree and descendants to refer to all nodes which are below a particular node in the tree, even if not directly connected.

In zerotree based image compression scheme such as EZW and SPIHT, the intent is to use the statistical properties of the trees in order to efficiently code the locations of the significant coefficients. Since most of the coefficients will be zero or close to zero, the spatial locations of the significant coefficients make up a large portion of the total size of a typical compressed image. A coefficient (likewise a tree) is considered significant if its magnitude (or magnitudes of a node and all its descendants in the case of a tree) is above a particular threshold. By starting with a threshold which is close to the maximum coefficient magnitudes and iteratively decreasing the threshold, it is possible to create a compressed representation of an image which progressively adds finer detail. Due to the structure of the trees, it is very likely that if a coefficient in a particular frequency band is insignificant, then all its descendants (the spatially related higher frequency band coefficients) will also be insignificant.

EZW uses four symbols to represent (a) a zerotree root, (b) an isolated zero (a coefficient which is insignificant, but which has significant descendants), (c) a significant positive coefficient and (d) a significant negative coefficient. The symbols may be thus represented by two binary bits. The compression algorithm consists of a number of iterations through a dominant pass and a subordinate pass, the threshold is updated (reduced by a factor of two) after each iteration. The dominant pass encodes the significance of the coefficients which have not yet been found significant in earlier iterations, by scanning the trees and emitting one of the four symbols. The children of a coefficient are only scanned if the coefficient was found to be significant, or if the coefficient was an isolated zero. The subordinate pass emits one bit (the most significant bit of each coefficient not so far emitted) for each coefficient which has been found significant in the previous significance passes. The subordinate pass is therefore similar to bit-plane coding.

There are several important features to note. Firstly, it is possible to stop the compression algorithm at any time and obtain an approximation of the original image, the greater the number of bits received, the better the image. Secondly, due to the way in which the compression algorithm is structured as a series of decisions, the same algorithm can be run at the decoder to reconstruct the coefficients, but with the decisions being taken according to the incoming bit stream. In practical implementations, it would be usual to use an entropy code such as arithmetic code to further improve the performance of the dominant pass. Bits from the subordinate pass are usually random enough that entropy coding provides no further coding gain.

The coding performance of EZW has since been exceeded by SPIHT and its many derivatives.

Introduction

Embedded zerotree wavelet algorithm (EZW) as developed by J. Shapiro in 1993, enables scalable image transmission and decoding. It is based on four key concepts: first, it should be a discrete wavelet transform or hierarchical subband decomposition; second, it should predict the absence of significant information when exploring the self-similarity inherent in images; third, it has entropy-coded successive-approximation quantization, and fourth, it is enabled to achieve universal lossless data compression via adaptive arithmetic coding.

Besides, the EZW algorithm also contains the following features:

(1) A discrete wavelet transform which can use a compact multiresolution representation in the image.

(2) Zerotree coding which provides a compact multiresolution representation of significance maps.

(3) Successive approximation for a compact multiprecision representation of the significant coefficients.

(4) A prioritization protocol which the importance is determined by the precision, magnitude, scale, and spatial location of the wavelet coefficients in order.

(5) Adaptive multilevel arithmetic coding which is a fast and efficient method for entropy coding strings of symbols.

Embedded Zerotree Wavelet Coding

A. Encoding a coefficient of the significance map

In a significance map, the coefficients can be representing by the following four different symbols. With using these symbols to represent the image information, the coding will be less complication.

1. Zerotree root

If the magnitude of a coefficient is less than a threshold T, and all its descendants are less than T, then this coefficient is called zerotree root. And if a coefficient has been labeled as zerotree root, it means that all of its descendants are insignificant, so there is no need to label its descendants.

2. Isolated zero

If the magnitude of a coefficient that is less than a threshold T, but it still has some significant descendants, then this coefficient is called isolated zero.

3. Positive significant coefficient

If the magnitude of a coefficient is greater than a threshold T at level T, and also is positive, than it is a positive significant coefficient.

4. Negative significant coefficient

If the magnitude of a coefficient is greater than a threshold T at level T, and also is negative, than it is a negative significant coefficient.

B. Defining threshold

The threshold using above can be defined as the type below.

1. Initial threshold T0: (Assume Cmax is the largest coefficient.)

Threshold-0119.png

2. Threshold Ti is reduced to half of the value of the previous threshold.

Threshold-01192.png

C. Scanning order for coefficients

Raster scanning is the rectangular pattern of image capture and reconstruction. Using this scanning on EZW transform is to perform scanning the coefficients in such way that no child node is scanned before its parent node. Also, all positions in a given subband are scanned before it moves to the next subband.

D. Two-pass bitplane coding

(1) Refinement pass (or subordinate pass)

This determine that if the coefficient is in the interval [Ti, 2Ti). And a refinement bit is coded for each significant coefficient.

In this method, it will visit the significant coefficients according to the magnitude and raster order within subbands.

(2) Significant pass (or dominant pass)

This method will code a bit for each coefficient that is not yet be seen as significant. Once a determination of significance has been made, the significant coefficient is included in a list for further refinement in the refinement pass. And if any coefficient already known to be zero, it will not be coded again.

Example

DCT data                          ZeroTree scan order (EZW)  63 -34  49  10   7  13 -12   7    A  B BE BF E1 E2 F1 F2 -31  23  14 -13   3   4   6  -1    C  D BG BH E3 E4 F3 F4  15  14   3 -12   5  -7   3   9   CI CJ DM DN G1 G2 H1 H2  -9  -7 -14   8   4  -2   3   2   CK CL DO DP G3 G4 H3 H4  -5   9  -1  47   4   6  -2   2   I1 I2 J1 J2 M1 M2 N1 N2   3   0  -3   2   3  -2   0   4   I3 I4 J3 J4 M3 M4 N3 N4   2  -3   6  -4   3   6   3   6   K1 K2 L1 L2 O1 O2 P1 P2   5  11   5   6   0   3  -4   4   K3 K4 L3 L4 O3 O4 P3 P4  D1: pnzt p    ttt  tztt tttttptt (20 codes)     PNZT P(t) TTT  TZTT     TPTT   (D1 by M-EZW, 16 codes)     PNZT P(t) Z(t) TZ(p)    TPZ(p) (D1 by NM-EZW, 11 codes)       P N (t), P or N above zerotree scan       P N Z(t p), p=pair T, t=triple T, P/N + TT/TTT in D1 code S1: 1010 D2: ztnp tttttttt S2: 1001 10 (Shapiro PDF end here) D3: zzzz zppnppnttnnp tpttnttttttttptttptttttttttptttttttttttt S3: 1001 11 01111011011000 D4: zzzzzzztztznzzzzpttptpptpnptntttttptpnpppptttttptptttpnp S4: 1101 11 11011001000001 110110100010010101100 D5: zzzzztzzzzztpzzzttpttttnptppttptttnppnttttpnnpttpttppttt S5: 1011 11 00110100010111 110101101100100000000 110110110011000111 D6: zzzttztttztttttnnttt ( http://www.polyvalens.com/wavelets/ezw/ )  Detailed: (new S is first, other computed by before cycles) s-step      1                21               321    val   D1 S1       R1   D2 S2       R2   D3 S3. ...   R3 ... D4,S4... A   63   P  1  >=48  56   Z  .1 >=56  60   Z  ..1 >=60  62 B  -34   N  0   <48 -40   T  .0  <40 -36   Z  ..0  <36 -36 C  -31   IZ     <32   0   N  1. >=24 -28   Z  .1. >=28 -30 D   23   T      <32   0   P  0. <24   20   Z  .1. >=20  22  BE  49   P  1  >=48  56      .0  <56  52   Z  ..0  <52  50 BF  10   T      <32   0                    P  0    <12  10 BG  14   T      <32   0                    P  1   >=12  14 BH -13   T      <32   0                    N  1   >=12 -14 CI  15   T      <32   0   T      <16   0   P  1   >=12  14 CJ  14   IZ     <32   0   T      <16   0   P  1   >=12  14 CK  -9   T      <32   0   T      <16   0   N  0    <12 -10 CL  -7   T      <32   0   T      <16   0   T        <8   0 DM   3     T      <16   0   T        <8   0 DN -12     T      <16   0   N  1   >=12 -14 DO -14     T      <16   0   N  1   >=12 -14 DP   8     T      <16   0   P       <12  10  E1   7   T      <32   0                    .E,F,G,H(1,2,3,4) E2  13   T      <32   0                    .I,J,K(1,2,3,4) E3   3   T      <32   0                    .N,O,P(1,2,3,4) E4   4   T      <32   0                    . J1  -1   T      <32   0                    . J2  47   P  0  >48   40       1 >=40  44   . J3  -3   T      <32   0 J4   2   T      <32   0  D = dominant pass (P=positive, N=negative, T=ZeroTree, IZ=Izolated zero) S = subordinate pass; (R = back reconstructed value) 

See also

Related Research Articles

Digital signal processing (DSP) is the use of digital processing, such as by computers or more specialized digital signal processors, to perform a wide variety of signal processing operations. The digital signals processed in this manner are a sequence of numbers that represent samples of a continuous variable in a domain such as time, space, or frequency. In digital electronics, a digital signal is represented as a pulse train, which is typically generated by the switching of a transistor.

<span class="mw-page-title-main">JPEG</span> Lossy compression method for reducing the size of digital images

JPEG is a commonly used method of lossy compression for digital images, particularly for those images produced by digital photography. The degree of compression can be adjusted, allowing a selectable tradeoff between storage size and image quality. JPEG typically achieves 10:1 compression with little perceptible loss in image quality. Since its introduction in 1992, JPEG has been the most widely used image compression standard in the world, and the most widely used digital image format, with several billion JPEG images produced every day as of 2015.

<span class="mw-page-title-main">Wavelet</span> Function for integral Fourier-like transform

A wavelet is a wave-like oscillation with an amplitude that begins at zero, increases or decreases, and then returns to zero one or more times. Wavelets are termed a "brief oscillation". A taxonomy of wavelets has been established, based on the number and direction of its pulses. Wavelets are imbued with specific properties that make them useful for signal processing.

A video codec is software or hardware that compresses and decompresses digital video. In the context of video compression, codec is a portmanteau of encoder and decoder, while a device that only compresses is typically called an encoder, and one that only decompresses is a decoder.

<span class="mw-page-title-main">JPEG 2000</span> Image compression standard and coding system

JPEG 2000 (JP2) is an image compression standard and coding system. It was developed from 1997 to 2000 by a Joint Photographic Experts Group committee chaired by Touradj Ebrahimi, with the intention of superseding their original JPEG standard, which is based on a discrete cosine transform (DCT), with a newly designed, wavelet-based method. The standardized filename extension is .jp2 for ISO/IEC 15444-1 conforming files and .jpx for the extended part-2 specifications, published as ISO/IEC 15444-2. The registered MIME types are defined in RFC 3745. For ISO/IEC 15444-1 it is image/jp2.

ICER is a wavelet-based image compression file format used by the NASA Mars rovers. ICER has both lossy and lossless compression modes.

<span class="mw-page-title-main">Daubechies wavelet</span> Orthogonal wavelets

The Daubechies wavelets, based on the work of Ingrid Daubechies, are a family of orthogonal wavelets defining a discrete wavelet transform and characterized by a maximal number of vanishing moments for some given support. With each wavelet type of this class, there is a scaling function which generates an orthogonal multiresolution analysis.

<span class="mw-page-title-main">Discrete wavelet transform</span> Transform in numerical harmonic analysis

In numerical analysis and functional analysis, a discrete wavelet transform (DWT) is any wavelet transform for which the wavelets are discretely sampled. As with other wavelet transforms, a key advantage it has over Fourier transforms is temporal resolution: it captures both frequency and location information.

Set partitioning in hierarchical trees (SPIHT) is an image compression algorithm that exploits the inherent similarities across the subbands in a wavelet decomposition of an image. The algorithm was developed by Brazilian engineer Amir Said with William A. Pearlman in 1996.

Originally known as optimal subband tree structuring (SB-TS), also called wavelet packet decomposition (WPD) (sometimes known as just wavelet packets or subband tree), is a wavelet transform where the discrete-time (sampled) signal is passed through more filters than the discrete wavelet transform (DWT).


The Stationary wavelet transform (SWT) is a wavelet transform algorithm designed to overcome the lack of translation-invariance of the discrete wavelet transform (DWT). Translation-invariance is achieved by removing the downsamplers and upsamplers in the DWT and upsampling the filter coefficients by a factor of in the th level of the algorithm. The SWT is an inherently redundant scheme as the output of each level of SWT contains the same number of samples as the input – so for a decomposition of N levels there is a redundancy of N in the wavelet coefficients. This algorithm is more famously known as "algorithme à trous" in French which refers to inserting zeros in the filters. It was introduced by Holschneider et al.

<span class="mw-page-title-main">Cohen–Daubechies–Feauveau wavelet</span>

Cohen–Daubechies–Feauveau wavelets are a family of biorthogonal wavelets that was made popular by Ingrid Daubechies. These are not the same as the orthogonal Daubechies wavelets, and also not very similar in shape and properties. However, their construction idea is the same.

<span class="mw-page-title-main">Wavelet transform</span> Mathematical technique used in data compression and analysis

In mathematics, a wavelet series is a representation of a square-integrable function by a certain orthonormal series generated by a wavelet. This article provides a formal, mathematical definition of an orthonormal wavelet and of the integral wavelet transform.

<span class="mw-page-title-main">Lifting scheme</span> Technique for wavelet analysis

The lifting scheme is a technique for both designing wavelets and performing the discrete wavelet transform (DWT). In an implementation, it is often worthwhile to merge these steps and design the wavelet filters while performing the wavelet transform. This is then called the second-generation wavelet transform. The technique was introduced by Wim Sweldens.

<span class="mw-page-title-main">Progressive Graphics File</span> File format

PGF is a wavelet-based bitmapped image format that employs lossless and lossy data compression. PGF was created to improve upon and replace the JPEG format. It was developed at the same time as JPEG 2000 but with a focus on speed over compression ratio.

Ali Naci Akansu is a Turkish-American Professor of electrical & computer engineering and scientist in applied mathematics.

Wavelets are often used to analyse piece-wise smooth signals. Wavelet coefficients can efficiently represent a signal which has led to data compression algorithms using wavelets. Wavelet analysis is extended for multidimensional signal processing as well. This article introduces a few methods for wavelet synthesis and analysis for multidimensional signals. There also occur challenges such as directivity in multidimensional case.

JPEG XS is an interoperable, visually lossless, low-latency and lightweight image and video coding system used in professional applications. Applications of the standard include streaming high quality content for virtual reality, drones, autonomous vehicles using cameras, gaming, and broadcasting. In this respect, JPEG XS is unique, being the first ISO codec ever designed for this specific purpose. JPEG XS, built on core technology from both intoPIX and Fraunhofer IIS, is formally standardized as ISO/IEC 21122 by the Joint Photographic Experts Group with the first edition published in 2019. Although not official, the XS acronym was chosen to highlight the eXtra Small and eXtra Speed characteristics of the codec. Today, the JPEG committee is still actively working on further improvements to XS, with the second edition scheduled for publication and initial efforts being launched towards a third edition.

In applied mathematics, biorthogonal nearly coiflet bases are wavelet bases proposed by Lowell L. Winger. The wavelet is based on biorthogonal coiflet wavelet bases, but sacrifices its regularity to increase the filter's bandwidth, which might lead to better image compression performance.

References