REGULAR PAPER A predictive algorithm for multimedia data compression Reza Moradi Rad • Abdolrahman Attar • Asadollah Shahbahrami Published online: 24 July 2012 Ó Springer-Verlag 2012 Abstract In lossless image compression, many prediction methods are proposed so far to achieve better compression performance/complexity trade off. In this paper, we con- centrate on some well-known and widely used low-com- plexity algorithms exploited in many modern compression systems, including MED, GAP, Graham, Ljpeg, DARC, and GBSW. This paper proposes a new gradient-based tracking and adapting technique that outperforms some existing methods. This paper aims to design an efficient highly adaptive predictor that can be incorporated in modeling step of image compression systems. This claim is proved by testing the proposed method upon a wide variety of images with different characteristics. Six special sets of images including face, sport, texture, sea, text, and medical constitute our dataset. Keywords Image compression Predictive coding Multimedia Lossless compression 1 Introduction Nowadays, libraries, museums, and films are converting more and more data into digital form especially into image format. This needs large digital devices to store and a large bandwidth to transmit through the networks. In order to alleviate these requirements, compression techniques are used. Image compression emerges to answer this essential question: can we represent multimedia data with fewer amounts of bits than the original data? Multimedia data compression can be defined as ‘‘reducing the amount of data required to represent the multimedia data’’. Many compression techniques have been proposed in the litera- ture, which can be classified into two groups: lossless and lossy [24, 26]. In lossless compression, the quality of the decoded multimedia data is the same as the original data. The compression ratio of this category is generally limited. In lossy compression, unnecessary data are deleted from the original data. Therefore, the quality of the decoded multimedia data is almost the same as the original data and the compression ratio of this group is higher than the previous one [29, 30, 31]. Predictive data coding is the most well-known technique in lossless data compression for its efficiency, simplicity, and robustness [1–3]. Prediction is a process to estimate the unknown (future) values using the available and known (past and present) values. In predictive coding, the essential function is to extract the relationship and similarity of neighboring values; for example, similarity of neighboring pixels in an image; and reduce numbers of values that should be known to represent a data. Most predictive data compression techniques include two major steps [14]: Modeling (prediction): In the first step, essential func- tion is prediction of unknown values. An efficient mecha- nism and predictor function are extracted to produce the predicted data. Coding: In the second step, essential function represents errors (produced by subtracting original and predicted data) in minimal form. Researchers usually use Hoffman, Arithmetic, and some other entropy coding schemas for the second step. While R. M. Rad A. Attar A. Shahbahrami (&) Department of Computer Engineering, Faculty of Engineering, University of Guilan, P.O. Box: 3756-41635, Rasht, Iran e-mail: [email protected]R. M. Rad e-mail: [email protected]A. Attar e-mail: [email protected]123 Multimedia Systems (2013) 19:103–115 DOI 10.1007/s00530-012-0282-0
13
Embed
A predictive algorithm for multimedia data … PAPER A predictive algorithm for multimedia data compression Reza Moradi Rad • Abdolrahman Attar • Asadollah Shahbahrami Published
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
REGULAR PAPER
A predictive algorithm for multimedia data compression
Reza Moradi Rad • Abdolrahman Attar •
Asadollah Shahbahrami
Published online: 24 July 2012
� Springer-Verlag 2012
Abstract In lossless image compression, many prediction
methods are proposed so far to achieve better compression
performance/complexity trade off. In this paper, we con-
centrate on some well-known and widely used low-com-
plexity algorithms exploited in many modern compression
systems, including MED, GAP, Graham, Ljpeg, DARC,
and GBSW. This paper proposes a new gradient-based
tracking and adapting technique that outperforms some
existing methods. This paper aims to design an efficient
highly adaptive predictor that can be incorporated in
modeling step of image compression systems. This claim is
proved by testing the proposed method upon a wide variety
of images with different characteristics. Six special sets of
images including face, sport, texture, sea, text, and medical
parameters in four directions: 45�, 90�, 135�, and 180�.
dNE ¼d1
6þ 0:5
� �
dN ¼d2
10þ 0:5
� �
dNW ¼d3
6þ 0:5
� �
dNE ¼d4
10þ 0:5
� �
After gradient estimation, GBSW produces the prediction
in a way that the neighboring pixel in the direction with the
minimal estimated gradient has the biggest contribution to
the formation of final prediction [4]. GBSW uses just two of
four neighboring pixels (N, W, NW, and NE) for final
predictor.
For example, if dW and dN (dW \ dN) are two minimum
estimated gradients around the current pixel X, then the
prediction for that pixel is computed as:
X ¼ dw � Nð Þ þ dN �Wð Þdw þ dN
GBSW’s complexity is not much but a bit higher than
MED. Please note that, as pointed before, all predictors
mentioned in related works section classify as low-
complexity predictors.
Table 1 provides summarizes information of all pre-
dictors of this section.
In Table 1 some information is provided as a single view
to compare predictors. In this table NANP means number of
all pixels needed for making decision and estimation by a
specific predictor. NPFP is the number of neighboring
pixels in final the predictor. NAV and NC are number of
auxiliary variables and clauses, respectively. In the last row,
the level of adaptivity for various predictors are given.
MED, PRV [14], and Graham classify as low level of
adaptivity because they use some specific configuration in
some conditions, for example three constant configuration
in MED. Note that the mentioned predictors switch between
linear predictors and try to simulate non-linear performance.
If the predictor is able to weigh the coefficients of final
predictors dynamically, it is considered as moderate
adaptivity level in this work, like DARC. A predictor
considered as high adaptivity level when it is able to make
decisions on which and how neighboring pixels contribute
in final predictor dynamically and regarding local infor-
mation, like NEW.
4 Proposed predictive data compression
The human visual system is more sensitive to important
features in an image. Edges are one of the most important
features that play a critical role in the presentation of an
image. Edges are revealing with jump in intensity, for
instance, Fig. 7 depicts the occurrences of edges for image
pixels. In this case, maximum intensity changes level
occurred in 45� direction. As can be seen, the image edge
exists in 135� angle.
A predictive algorithm for multimedia data compression 107
123
Some predictors such as MED, DARC, and Graham
track edges in vertical and horizontal directions while
GBSW tracks and analyzes edges in four directions. GAP
also tracks edges in vertical and horizontal directions but
additionally classify edges to three classes namely, weak,
normal, and sharp. This paper proposes a new gradient-
based tracking and adapting method (GBTA) that tracks
edges in more directions precisely. Our goal is to design an
efficient and highly adaptive predictor that can be incor-
porated in the modeling step of image compression system.
By examining surrounding pixels of an unknown pixel
we try to find the maximum intensity change level to
determine edges. The new proposed algorithm uses 11
directions to predict the pixel value, as depicted in Fig. 8.
For the determination of more exact maximum-change
direction three simple steps are considered here. In the first
step, the main direction around each pixel among four main
directions (45�, 90�, 135�, and 180�) is selected as R. In the
second step, the sub-direction among R�-45� or R� or
R� ? 45� is selected, sub-directions are produced to be
more exact in determining the edges. In the third step, the
exact maximum-change direction calculated by the fol-
lowing equation is depicted in Fig. 8:
Direction ¼ 2�main directionð Þ þ sub-directionð Þ½ �=3
e:g: 2� 45�ð Þ þ 0�ð Þ½ �=3 ¼ 30�
To clarify the proposed algorithm, we give an example
here. Suppose that the main direction is 45� and the sub-
direction is 0�, then final direction (exact maximum-change
direction) is 30� as depicted in Fig. 9.
As shown in Fig. 9, two directions in terms of maximum
change level, main direction and sub-direction are con-
sidered. In other words, final direction not only relies on
the main direction but also relies on sub-direction. Sub-
Table 1 Summarized information about predictors
Ljpeg MED GAP PRV [14] DARC Graham GBSW NEW
NANP – 3 7 3 3 3 10 19
NPFP – 3 4 3 2 2 2 2
NAV – 0 2 0 2 2 4 0
NC 1 3 6 2 0 2 4 11
L-NL L NL NL NL NL NL NL NL
Switching No Yes Yes Yes No Yes No No
FW-BW BW FW FW FW FW FW FW FW
Adaptivity None Low Moderate–high Low Moderate Low Moderate–high High
NANP number of all needed pixels, NPFP number of pixels in final predictor, NAV number of auxiliary variables, NC number of clauses, L–NLlinear or non-linear, FW–BW forward or backward
Fig. 7 Details of edge occurrence
Fig. 8 Classification of edge directions
Fig. 9 Main direction = 45� and sub-direction = 0� so
direction = 30�
Table 2 Predictors which used when both main and sub-direction are
same
Main direction = sub-direction Direction Predictor
45 45 NW
90 90 W
135 135 NE
180 180 N
108 R. M. Rad et al.
123
direction also affects final direction with less power than
main-direction.
Now it is time to introduce predictors regarding the
obtained final direction. For each pixel prediction just one
or two neighbor pixels participate in final predictor.
Regarding the final local gradient estimation, the predictor
performs dynamically. When the main direction and sub-
direction are the same, the predictor just uses one neighbor
pixel for prediction. When they are different, it uses two
neighbor pixels.
When the main direction and sub-direction are same,
then the predictor estimates the unknown pixel value to be
same as the pixel which keeps the intensity change level.
The whole predictors are used in a situation that both main
and sub-direction are the same as shown in Table 2.
Now, suppose that the main and sub-direction are dif-
ferent. In these circumstances predictor formation is a little
more complex than previous. In this situation, two pixels
participate to form the predictor. According to our algo-
rithm premises, the pixel which participates in order to
show the effect of main direction in keeping maximum
change level weighted two and the other one which par-
ticipates for effect of sub-direction weighted one and
divided by three to form a final predictor.
All of the predictors with details about direction are
shown in Table 3.
To clarify the mentioned explanations, an example is
presented here: the maximum change level occurred in 45�and it is the main direction. The predictor looks around it,
the maximum change level is selected among 45�, 90�, and
135� sub-direction. Sub-direction 90� is selected, while the
main direction is 45�. The NW value participates in
Table 3 Predictors which used when both main and sub-direction are
not same
Main direction = sub-direction Direction Predictor
45 0 30 (2NW ? N)/3
45 90 60 (2NW ? W)/3
90 45 75 (2W ? NW)/3
90 135 105 (2W ? NE)/3
135 90 120 (2NE ? W)/3
135 180 150 (2NE ? N)/3
180 135 165 (2N ? NE)/3
3/))*2((ˆ WNWX +=
Fig. 10 Formation of predictor for final gradient 60�
NW N NE
W X
75°60°
45°
30°
90°105°
135°
120°
150°
165°
180°
Angle selection
(According to local gradients)
Predictor Formation
(According to angle)
Computing of predicted pixel value
(According to predictors)
e=difference
Fig. 11 Snapshot of proposed
predictor
A predictive algorithm for multimedia data compression 109
123
prediction that its weighted value is two. In addition, the
predictor uses the W value with weighted value of one. So
the predictor uses the following equation to estimate the
unknown pixel value (Fig. 10).
Complete steps in the proposed algorithm are shown in
Fig. 11.
5 Benchmarks
Now it is time to test the proposed method upon credible
datasets and analyze its performance in comparison with
related works. We believed that poor datasets cannot
reflect the real world of images. Proper datasets should
be able to show some important aspects of image world,
not just one ad hoc aspect of images. We select six
Fig. 12 Examples of image set:
face, sea, medical, texture,
sport, text
Table 4 Characteristics of images
Image properties Ranges
Width of images (pixel) 168–752
Height of image (pixel) 72–800
Resolution of image (dpi) 72–350
110 R. M. Rad et al.
123
different groups of images which have a significant
contribution in file storage and transmission. Face, sea,
medical, texture, sport, and text constitute our six unique
datasets. Figure 12 shows some examples of each set.
These six groups of images inherently are different, and
each group has their own characteristics and features.
Table 5 Mean absolute error (MAE) upon face image set, including ten different face images
Image Dimension MED Graham Previous [13] DARC GAP GBSW GBTA