Top Banner

of 17

14-Bayesian Wavelet-Based Image Denoising Using the Gauss–Hermite Expansion.pdf

Jun 02, 2018

Download

Documents

senersin
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    1/17

    IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 17, NO. 10, OCTOBER 2008 1755

    Bayesian Wavelet-Based Image Denoising Using theGaussHermite Expansion

    S. M. Mahbubur Rahman, M. Omair Ahmad, Fellow, IEEE, and M. N. S. Swamy, Fellow, IEEE

    AbstractThe probability density functions (PDFs) of thewavelet coefficients play a key role in many wavelet-based imageprocessing algorithms, such as denoising. The conventional PDFsusually have a limited number of parameters that are calculatedfrom the first few moments only. Consequently, such PDFs cannotbe made to fit very well with the empirical PDF of the waveletcoefficients of an image. As a result, the shrinkage functionutilizing any of these density functions provides a substandarddenoising performance. In order for the probabilistic model of theimage wavelet coefficients to be able to incorporate an appropriatenumber of parameters that are dependent on the higher ordermoments, a PDF using a series expansion in terms of the Hermite

    polynomials that are orthogonal with respect to the standardGaussian weight function, is introduced. A modification in theseries function is introduced so that only a finite number of termscan be used to model the image wavelet coefficients, ensuring atthe same time the resulting PDF to be non-negative. It is shownthat the proposed PDF matches the empirical one better thansome of the standard ones, such as the generalized Gaussian orBessel K-form PDF. A Bayesian image denoising technique is thenproposed, wherein the new PDF is exploited to statistically modelthe subband as well as the local neighboring image wavelet coef-ficients. Experimental results on several test images demonstratethat the proposed denoising method, both in the subband-adaptiveand locally adaptive conditions, provides a performance betterthan that of most of the methods that use PDFs with limitednumber of parameters.

    Index TermsBayesian image denoising, GaussHermite,higher order moments, orthogonal series expansion, probabilitydensity function, wavelet coefficient.

    I. INTRODUCTION

    DENOISING of images corrupted by additive white

    Gaussian noise (AWGN) is a classical problem in

    signal-processing. The distortion of images by noise is

    common during its acquisition, processing, compression,

    storage, transmission, and reproduction. The aim of denoising

    is to remove the noise while keeping the signal features as muchas possible. Traditional algorithms perform image denoising

    Manuscript receivedNovember 9, 2007; revisedMay 5, 2008. Current versionpublished September 10, 2008. This work was supported in part by the NaturalSciences and Engineering Research Council (NSERC) of Canada and in partby the Regroupement Stratgique en Microlectronique du Qubec (ReSMiQ).The associate editor coordinating the review of this manuscript and approvingit for publication was Dr. Michael Elad.

    The authors are with the Center for Signal Processing and Communications,Department of Electrical and Computer Engineering, Concordia University,Montral, QC, H3G 1M8 Canada (e-mail: [email protected];[email protected]; [email protected]).

    Color versions of one or more of the figures in this paper are available onlineat http://ieeexplore.ieee.org.

    Digital Object Identifier 10.1109/TIP.2008.2002163

    in the pixel domain. However, the application of the wavelet

    transform in image denoising, compression, etc. has shown

    remarkable success over the last decade. Wavelet shrinkage

    method proposed by Donoho [1][3] is the pioneering work

    for signal denoising using the wavelet transform. The method

    described in [3] provides a mini-max optimal solution. Another

    criterion commonly used is the mean squared error (MSE),

    even though it does not match well with the characteristics of

    the human visual system [4]. However, since it is a measure of

    the noise power [5], and is mathematically convenient in the

    context of optimization, it is employed as a denoising criterionin many image denoising applications. The discrete wavelet

    transform (DWT) used in image denoising can be of many

    types, such as orthogonal/biorthogonal, real/complex-valued,

    separable/nonseparable, or decimated/nondecimated. Due

    to the shift-invariance property, the nondecimated separable

    DWT [6] improves the image denoising performance in MSE

    approximately by 1 dB as compared to that of the decimated

    representation [7][10]. To reduce the computational load of

    the nondecimated wavelet transform as well as the Gibbs arti-

    facts, cycle-spinning has been proposed [11]. Image denoising

    by using transforms with improved directional selectivity,

    such as the dual-tree complex wavelet transform (DT-CWT)[12], ridgelet [13], curvelet [14], contourlet [15], and steerable

    pyramid [16], [17] have also been introduced.

    There are two major approaches for statistical wavelet-based

    denoising [18], [19]. The first approach is to design a sta-

    tistically optimal threshold parameter for some nonlinear

    thresholding or shrinkage functions. Image denoising methods

    using such an approach include the VisuShrink [2], SureShrink

    [3], BayesShrink [20], amplitude-scale-invariant Bayes esti-

    mator [21], NeighCoeff [22], Spatial-correlation thresholding

    [23], and empirical Bayes thresholding (EBayesThresh) [24].

    Standard choices for nonlinear thresholding or shrinkage

    functions are soft- and hard-thresholding [1], firm-shrinkage

    [25], and non-negative garrote shrinkage [26]. Methods usingsuch shrinkage functions are computationally simple, but have

    certain drawbacks in view of the fact that the functions repre-

    senting the nonlinearity are arbitrarily chosen. For example,

    the soft-thresholding technique yields a biased estimate with

    a moderate variance, whereas the hard-thresholding technique

    yields a less biased estimate but with a higher variance [26]. In

    the second approach, instead of using an arbitrary function to

    represent the nonlinearity, the shrinkage function is designed

    by minimizing a Bayesian risk, typically under the minimum

    mean squared error (MMSE) criterion [27], minimum mean

    absolute error criterion [28] or maximum a posteriori (MAP)

    criterion [29], [30]. The development of an optimal threshold

    1057-7149/$25.00 2008 IEEE

  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    2/17

    1756 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 17, NO. 10, OCTOBER 2008

    or shrinkage function is usually based on the probability den-

    sity function (PDF) or simply the prior function of the image

    wavelet coefficients. By exploiting the nature of the histogram

    of the wavelet coefficients, different noise reduction algorithms

    try to incorporate the prior model that best matches the true

    image coefficients. The marginal PDF of a true image and its

    self-similar subband wavelet coefficients are unimodal andsharply peaked in nature [31], [32]. Therefore, higher order

    statistics are necessary to describe the sharp central cusp of

    the density function of the image coefficients. The parameters

    of a PDF and their estimation play a significant role in the

    Bayesian denoising technique, since the computational com-

    plexity of a subband-adaptive Bayesian denoising is ,

    being the number of parameters of the PDF and

    the size of the image. Subsequent to the work of Mallat [33],

    the generalized Gaussian (GG) distribution with the shape

    parameter has been commonly used to model the

    image wavelet coefficients [8][10], [20], [27], [34][36]. For

    example, the threshold factor in the BayesShrink method [20]

    and the parametersignal of interestin the ProbShrink method[10] assume that the wavelet coefficients in each subband are

    independent and identically distributed (i.i.d.) random variables

    having a GG distribution. Under the simplest prior GG with

    , and the i.i.d. assumption, both the MMSE and MAP

    estimators provide the same closed-form solution, the well

    known Wiener filter. A general class of prior functions used

    for the purpose of denoising is the Gaussian scale mixture

    (GSM), wherein the wavelet coefficients are modelled as a

    zero-mean Gaussian function multiplied by random positive

    state variables [7], [16], [37][42]. For a simple two-state GSM

    prior, an expectation maximization (EM) algorithm can be used

    to estimate the multipliers [7], [41]. In [43], the EM algorithmhas been extended for a finite number of state variables and

    it has been shown that four to eight state variables of the

    GSM density function are sufficient for a good approximation

    of the prior function of the subband wavelet coefficients of

    images. In general, the estimation of the GSM prior requires

    another PDF, such as the log-normal density function [40]

    or Jeffreys noninformative prior [16], to estimate the set of

    multipliers. In order to obtain a Bayes estimator for image

    denoising, Figueiredo and Nowak [21] have used Jeffreys

    prior itself as a statistical model for the wavelet coefficients,

    since this prior is free of any parameter. In [44] and [45], a

    class of heavy-tailed densities, namely, the -stable family with

    characteristic exponent has been proposed as a

    prior of the image coefficients. This PDF has no closed-form

    expression even though its characteristic function has one, and

    its parameter estimation is poor, specially in the presence of

    noise [43]. Another family of sharply peaked PDFs, called the

    Bessel K-form (BK), introduced to model the filtered image in

    [32], has been applied in the wavelet-domain MMSE estimator

    by Fadili and Boubchir [46]. The wavelet coefficients of a

    log-transformed image (for example, synthetic aperture radar

    or medical ultrasound image) are modelled by the normal

    inverse Gaussian PDF for removing the multiplicative noise

    using a MAP estimator in the homomorphic approach [47],

    [48]. The parameters of a PDF can be estimated using severalstrategies, such as the method of moments [49], maximum

    likelihood method [50], and entropy-matching method [51].

    Since the image wavelet coefficients of a subband can be treated

    as a data set having a large sample size, the moment-based

    parameter estimation provides a consistent result [52]. In order

    to capture the higher order statistics, most of the PDFs used in

    modeling the image wavelet coefficients use afinite number of

    parameters that are usually derived from the second and fourthorder moments of the image coefficients. As a result, in most

    of the cases these density functions become much more lep-

    tokurtic than the empirical one. Since the wavelet coefficients

    are locally stationary [53], the parameters of some of the PDFs

    are refined with respect to the local spatial context within a sub-

    band [8], [16], [20], [22], [40], [54][57]. In such a case, when

    the denoising technique is locally adaptive within a subband,

    the computational complexity increases to ,

    being the size of a squared-shape local window. A few of the

    PDFs have also been extended to the bivariate [29], [58] [60]

    and multivariate cases [36], [42] or refined in the hidden

    Markov tree framework [7], [41] to take into consideration the

    intersubband dependency. In such a case, the computationalload of a denoising technique increases exponentially with the

    number of subbands, and this is substantially higher than the

    corresponding locally adaptive method.

    In this paper, a novel marginal statistical model is proposed

    for the wavelet coefficients of images by using the GaussHer-

    mite (GH) expansion. This expansion is used in view of the fact

    that it allows higher order moments to be incorporated in the

    probabilistic modeling of the wavelet coefficients. A method of

    choosing a finite number of terms in the GH series such that

    the resulting PDF matches well with the empirical PDF of the

    wavelet coefficients of the images and at the same time remains

    non-negative, is proposed. A closed-from expression is derivedfor the parameters of the modified PDF in terms of the higher

    order sample-moments. The proposed PDF is then applied as

    a prior function in the subband-adaptive and locally adaptive

    Bayesian MMSE-based noise reduction algorithms, in view of

    the fact that they offer lower computational burden as com-

    pared to the method using intersubband dependency. It is shown

    that the introduction of the more accurate prior in the proposed

    noise reduction technique results in a performance better than

    that of the subband-adaptive and locally adaptive wavelet-based

    image denoising algorithms that use standard PDFs having lim-

    ited number of parameters, in terms of both the visual perception

    and standard indices.

    The paper is organized as follows. In Section II, the modifica-

    tion of the GH expansion in the context of statistical modeling

    of the wavelet coefficients of images is presented. A comparison

    with some of the standard PDFs on a number of test images from

    a database is also given. A Bayesian image denoising method

    that exploits the proposed PDF is described in Section III. In

    Section IV, the performance of the method is evaluated and

    compared with some of the existing denoising methods. Finally,

    Section V gives some conclusions.

    II. GAUSSHERMITE EXPANSION FOR PDF OF IMAGE

    WAVELET COEFFICIENTS

    This section introduces a probabilistic model for the crit-ically sampled real DWT coefficients as well as for the real

  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    3/17

    RAHMANet al.: BAYESIAN WAVELET-BASED IMAGE DENOISING USING THE GAUSSHERMITE EXPANSION 1757

    TABLE I

    WEIGHTFUNCTIONS ANDREGIONS OFSUPPORT FOR THE SERIESEXPANSIONSOFSOME ORTHOGONAL POLYNOMIALS

    and imaginary components of the DT-CWT coefficients of im-

    ages. To derive the PDF of the image wavelet coefficients, we

    use a series expansion so that the derived PDF enables one to

    use an arbitrary number of parameters that are dependent on

    higher order moments. In practice, the weight functions for the

    various expansions are represented in terms of standard PDFs.

    Table I lists a few orthogonal polynomials along with their cor-

    responding weight functions and regions of support. Among

    the various possible series expansions, we choose the one that

    uses the Hermite polynomials, which are orthogonal with re-

    spect to the standard Gaussian function. There are two major

    reasons for such a choice. First, this expansion is valid for the

    image wavelet coefficients that do not have any compact sup-

    port. Second, the Gaussian weight function resembles the em-

    pirical PDF, and hence there is a high likelihood that the ex-

    panded PDF would have a better match with the empirical one

    [61]. There are some other advantages in using the Hermite se-

    ries expansion: it has a uniform mean square convergence [62],

    the parameters can be expressed in terms of higher order mo-

    ments in closed-form (as seen later in this section), and the poly-

    nomials can be estimated recursively.

    Let represent a wavelet coefficient at spatial locationof a subband in the decomposition level with a transfor-

    mation matrix applied on the true image. Let represent the

    vector containing all the coefficients of a subband, wherein each

    coefficient is a sample of that subband. Let represent

    the PDF of the random variable . This PDF may be expressed

    in terms of the Hermite polynomials, which are orthogonal with

    respect to a Gaussian weight function, in the form [61]

    (1)

    where is the standard deviation of and is the Her-mite polynomial of order that satisfies the recurrance relation

    [63]

    (2)

    The resulting PDF will be called as the GH PDF. A closed-form

    expression of the series coefficients can be obtained in terms

    of the th order moment as (see Appendix A)

    (3)

    where denotes the largest integer contained in . The se-

    ries coefficients are determined only by the even-order mo-

    ments for even , and the odd-order moments for odd . Hence,

    the parameter estimation of the GH PDF becomes relatively

    simple and straight forward. Although the PDF of the image

    coefficients described in (1) is an infinite series expression, in

    practice, only thefirst few terms are used for a reasonably goodapproximation. For example, if the highest value of order in

    (1) is chosen as 4, the resulting density function reduces to the

    GramCharlier (GC) PDF [57], [64], [65]

    (4)

    where and , respectively, are the

    skewness and kurtosis of . However, the approximation of the

    right side of (1) by only three terms may introduce some er-rors in , and the error can be reduced by including more

    number of terms in the series. In order to include a larger number

    of terms in the series, higher order moments are necessary for

    estimating the series coefficients . But, the deviation between

    the estimated sample-moments and the true-moments is not in-

    significant when the order becomes high [64]. Therefore, the

    error introduced by the estimation of is not negligible when

    a large number of terms of the series is included. In addition,

    the decay rate of the density function is proportional

    to or simply for suf-

    ficiently large . The larger the order , the slower the decay of, thus making it a challenge to fit the tail part of

    with that of the empirical PDF of the image coefficients. As-

    suming the largest value of in (1) to be , the right side of (1)

    may introduce real zeros in the expansion of , the max-

    imum number of such zeros being . In essence, the PDF defined

    by the truncated GH series may not be always non-negative. For

    example, the symmetric (i.e., the skewness is zero) GC PDF is

    non-negative only if [66].

    Table II shows the average values of the magnitude of the

    skewness and the kurtosis of the DWT coefficients in thefirst

    three levels for some test images. It can be observed that the

    skewness of the image wavelet coefficients is insignificant. It

    has also been observed that other odd-order moments of interest

    are also insignificant. Hence, the PDF of the image coefficients

    can be treated as symmetric. As a result, all in the infinite

    series of (1) for odd disappear and the truncated GH PDF for

    the image wavelet coefficients can be written as

    (5)

    The kurtosis of the subband wavelet coefficients of images isusually more than 7 (see Table II); therefore, cannot

  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    4/17

    1758 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 17, NO. 10, OCTOBER 2008

    model these coefficients sufficiently well. In addition, each sub-

    band of the image wavelet coefficients usually has a sufficiently

    large number of data samples, and hence, the higher order mo-

    ments would be useful for modeling. Recall that the truncated

    GH PDF with a sufficiently large value of is not ensured to be

    non-negative. However, the model of the subband coefficients

    defined by the truncated GH PDF in (5) is an even function,and therefore, the real zeros are symmetrically situated around

    the mode. Fig. 1 shows the statistical modeling of the HL1band

    DWT coefficients by the empirical PDF and by the truncated GH

    PDF with varying from 4 to 14 for the imagesLenaand Bar-

    bara. The empirical PDF is obtained from the histogram using

    a nonparametric density estimation technique, wherein the bin

    width of the histogram plays a significant role. If the bin width is

    too small, then the estimated PDF suffers from undersmoothing,

    i.e., the density function has a small bias but a large variability.

    On the other hand, if it is too large, then the estimated PDF suf-

    fers from oversmoothing, i.e., the density function has a small

    variability but a large bias. In order to obtain a tradeoff be-

    tween the bias and variance, an appropriate choice of the binwidth is necessary. The bin width can be fixed or variable in

    size. Variable width may be better, when the density function

    is asymmetric and multimodal in nature. Since the PDF of the

    image wavelet coefficients is unimodal and symmetric in na-

    ture, afixed bin width is chosen for the histogram. The value of

    this bin width is chosen to be , being the

    interquartile range1 and the length of the data, since such a

    choice asymptotically minimizes the mean integrated squared

    error of the empirical PDF that is non-Gaussian [67]. It can be

    observed from Fig. 1 that, as increases, the kurtotic behavior

    of the truncated GH PDF approaches that of the empirical one,

    even though it results in zeros in the tail part of the PDF. Letbe the two zeros closest to the mode for a particular value

    of . The value of may be found by using any rootfinding

    routine, but we calculate it as

    (6)

    where is a unit step function and is logicalequal

    operation. It is evident from Fig. 1 that when the coefficients lie

    in the interval , then t hese c oef ficients can be modelled

    well by the proposed GH PDF with an appropriate choice of .

    Outside this range, i.e., in the tail part, such a truncated GH PDF

    introduces negative probability, which is not possible. Hence, inthe tail part of the truncated GH PDF, a modification is necessary

    1The IQR is the difference between the third and first quartiles of the data.

    so that the probability becomes non-negative and at the same

    time the modified GH PDF fits well with the empirical PDF.

    For such a modification, the use of a GH PDF is preferred to

    that of using a different PDF, since the estimated parameters

    and Hermite polynomials can also be used for the tail part of the

    PDF. Examples of the GH PDFs with non-negative probability

    are the Gaussian PDF and the symmetric GC PDF having. Since the latter has a longer tail than the former and

    the PDFs of the image wavelet coefficients are known to have

    long-tails, we choose the GC PDF as the modified tail. In order

    to ensure that the total area under the curve of the modified GH

    PDF is unity, we multiply the GC PDF by a weight parameter

    . Henceforth, this density function will be referred to as the

    modified GH (MGH) PDF. It can be mathematically formulated

    as (7), shown at the bottom of the page, where is the

    weight parameter, and . Since the

    total area under the MGH PDF is unity, the parameter satisfies

    (8)

    After some manipulations, the value of can be obtained as (see

    Appendix B)

    (9)

    where

    (10)

    and represents the error function. In summary, the MGH

    PDF for a particular subband is estimated using the following

    steps.1) Select as the highest-order even moment. Estimate the

    moments .

    if

    if

    (7)

  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    5/17

    RAHMANet al.: BAYESIAN WAVELET-BASED IMAGE DENOISING USING THE GAUSSHERMITE EXPANSION 1759

    Fig. 1. Probability density function of theHL1 band DWT coefficients in terms of the GH series expansion with increasing value of . Test images are (a)Lenaand (b) Barbara.

    TABLE IIAVERAGEVALUES OF THE MAGNITUDE OF THESKEWNESS

    AND THE

    KURTOSIS

    OF THEDWT COEFFICIENTS FORSOMETESTIMAGES

    2) Estimate the series parameters by

    using (3).

    3) Select the bin width . Set the dis-

    crete points , where and

    , respectively, are the minimum and maximum values

    of .

    4) Estimate the Hermite polynomials

    by using (2) for each .

    5) Estimate using (5) and (6).

    6) Estimate using (9) and (10).7) Obtain the MGH PDF using (7).

    The proposed MGH PDF requires the estimation of ,

    and , and the computational complexities of these are

    , and , respectively, where is

    total number of discrete points resulting from the selected bin.

    Since and , the computational complexity

    of the proposed MGH PDF is . Fig. 2 shows the

    modeling of theHL1band DWT coefficients by the MGH PDF

    in the linear-scale using different values of for theLena and

    Barbara images. It can be observed from this figure that the

    MGH PDF is ensured to be non-negative and for a sufficiently

    large value of , the tail part of the MGH PDF matches very

    well with that of the empirical PDF. Therefore, in the proposedMGH PDF the choice of is of importance, since it determines

    the highest-order of the Hermite polynomials and of the mo-

    ments used in the parameter estimation. In order to make a good

    choice of , the MSE between the empirical and proposed MGH

    PDFs is plotted as a function of the highest-order moments used

    in (7) for theHL1andHL2 band DWT coefficients for some test

    images, and shown in Fig. 3. It can be observed from thisfigure

    that the inclusion of an increased number of higher order mo-

    ments in the parameter estimation of the proposed MGH PDF

    lowers the MSE, and a value of 12, 14, or 16 is a good choice

    for . In the log-scale, Fig. 4 shows the modeling performanceof three PDFs, namely, the GG, the BK and the MGH that

    uses for theHL1 band DWT coefficients of the Lena

    and Barbara images. The closeness of the fit of the proposed

    PDF with the empirical one, both in its peak and tail parts, is

    obvious as compared to the others. Moreover, to quantify the

    performance of the PDFs, we use, the Kolmogorov-Smirnov

    distance (KSD), , and the Kull-

    back-Leibler divergence (KLD), ,

    as the two metrics. Table III shows the results concerning

    the metrics KSD and KLD for the GG, BK, and MGH PDFs

    of the first and second level image DWT coefficients. The

    metrics are calculated by averaging the results obtained using96 images, each of size 512 512, from the database given in

    [68]. Standard deviations of the KLD metric are also included

    in the table. For the results shown here, the MGH PDF uses

    the even-order moments from 2 up to 14, whereas the GG and

    BK PDFs use only the second and fourth order moments in

    view of their limited number of parameters. From the table, it

    is evident that in most of the cases the MGH PDF, compared to

    the GG and BK PDFs, has the lowest distance and divergence

    from the empirical one. It should be mentioned that similar

    kind of results have been obtained from the prior modeling of

    the DT-CWT coefficients and are not included here for lack

    of space. This high degree of matching between the empirical

    and MGH PDFs is mainly due to the fact that the proposedPDF can incorporate more number of higher order moments

  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    6/17

    1760 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 17, NO. 10, OCTOBER 2008

    Fig. 2. Modified GH probability density function of the HL1 band DWT coefficients with increasing value of . Test images are (a)Lenaand (b)Barbara.

    Fig. 3. Mean squared error between the empirical and the modified GH probability density functions for the DWT coefficients as a function of the highest-orderof the moments that has been used for the parameter estimation. The subbands are (a)HL1 and (b)HL2.

    in its parameter estimation process as compared to the others.This feature, along with the modification proposed through the

    introduction of the parameter , results in the proposed MGH

    PDF to match the empirical PDF very well both in the peak and

    tail segments. In order to demonstrate the effectiveness of the

    proposed PDF, we next consider the case of Bayesian denoising

    with the MGH prior function.

    III. DENOISINGALGORITHM

    In this section, we assume that the image pixels are corrupted

    by AWGN with a known variance . If is unknown, it may

    be estimated by applying the median-absolute-deviation method

    [3] in the highest frequency subband of the noisy wavelet coeffi-cients. Since the noise is uncorrelated with the true image signal,

    the wavelet coefficient of the noisy image at the spatial locationof a subband is given by

    (11)

    where is the wavelet coefficient of the additive noise at

    the reference location. We propose an MMSE-based denoising,

    wherein the Bayes-risk estimator uses the proposed MGH PDF.

    In a Bayesian framework, , , and in (11) are consid-

    ered as samples of the independent random variables , , and

    , respectively. The signal component is modelled as the i.i.d.

    MGH random variable with subband-adaptive parameters. The

    random variable of the noise component is modelled as a zero-

    mean Gaussian random variable with a subband-adaptive vari-ance , which depends on , , and the arithmeticoperations

  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    7/17

    RAHMANet al.: BAYESIAN WAVELET-BASED IMAGE DENOISING USING THE GAUSSHERMITE EXPANSION 1761

    Fig. 4. Modified GH probability density function in log-scale for the HL1band DWT coefficients. Hermite polynomials are up to the fourteenth order. Test imagesare (a)Lena, where

    and (b)Barbara, where

    .

    TABLE IIIVALUES OF THE METRICSKSD AND KLD FOR THE PRIOR FUNCTION

    MODELING OF THE IMAGEDWT COEFFICIENTS. T HE AVERAGEVALUES OFKLD (STANDARDDEVIATIONS INPARENTHESES) AND KSD ARE OBTAINED

    FROM A96 IMAGE DATABASE

    of the corresponding wavelet-domain. In this case, we assume

    that is known; however, if is unknown, it may be esti-

    mated using the method given in [16]. The Bayes-risk estimator

    for the denoised wavelet coefficient under a quadratic cost

    function that minimizes the MSE is the conditional mean given

    by

    (12)

    where is the posterior density function conditioned

    on the subband coefficients under consideration. According to

    the Bayesrule, the prior function of the image wavelet

    coefficients can be used to compute the posterior density func-

    tion as

    (13)

    where is the density function of the wavelet coeffi-

    cients corresponding to the noise component. It can be seen

    from (12) and (13) that the MMSE-based shrinkage functionis independent of the weight parameter that has been

    estimated in Section II. An analytical expression of the pro-

    posed MMSE-based shrinkage function can be obtained as

    in (14), shown at the bottom of the page (see Appendix C

    for proof), where ,

    , and is the parabolic

    cylinder function [69]. It is to be noted that for ,

    and . The shrinkage function

    given by (14) may be directly used for estimating each of

    the noisy wavelet coefficients. The computational complexity

    of this function is, where is the

    computational complexity of . Since the computational

    load increases exponentially with , an implementation of such

    (14)

  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    8/17

    1762 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 17, NO. 10, OCTOBER 2008

    a function may not be practical in view of the excessive com-

    putational effort involved. Hence, we represent the shrinkage

    function in terms of a linear convolution as

    (15)

    where is the convolution operator. Taking the computa-

    tional advantage of the discrete Fourier transform (DFT), the

    shrinkage function can be expressed as

    (16)

    where is the forward DFT and is the inverse DFT. Using

    a fast implementation of the DFT, the complexity of computing

    the shrinkage function can be reduced to .The function given in (16) provides the estimates of the

    noisy wavelet coefficients at the discrete points , where the

    probabilities of the MGH PDF are estimated. Therefore, an

    interpolation function is required to estimate each of the noisy

    wavelet coefficients. In our case, we choose the cubic Hermite

    spline [70] as an interpolation function so that the estimated

    Hermite polynomials for the MGH PDF can also be used

    for interpolation. This spline function requires the estimation

    of four parameters and has a computational complexity of

    . Since and , the compu-

    tational complexity of the shrinkage function given by (16) for

    subband-adaptive denoising is . From simulationresults, we have observed that the denoising performances

    using the two shrinkage functions, given by (14) and (16), are

    essentially the same. Hence, we prefer to use (16) for the pro-

    posed Bayesian denoising in view of its lower computational

    complexity.

    The proposed MMSE estimation requires that the parameters

    of the proposed prior function be estimated as accurately as pos-

    sible from the noisy observation. Since the parameters of the

    MGH prior function are expressed in terms of , the esti-

    mation of the th order moment of the signal coefficients from

    the noisy observation is sufficient for such a parameter esti-

    mation. The noise is independent of the signal and additive in

    nature, since it is AWGN. Therefore, the th order moment of

    the signal coefficients is estimated as by utilizing,

    and , which are, respectively, the th order moments of the

    noisy signal and noise coefficients. Since the noise is zero-mean

    Gaussian distributed, the odd-order moments of the noise coef-

    ficients are zero and the even-order moments can be expressed

    in terms of the noise standard deviation . In general, the th

    order moment of the noise coefficients is [71]

    for odd

    for even. (17)

    Therefore, the th order moment of the signal coefficients is

    evaluated as (18), shown at the bottom of the page. The param-

    eter estimation of the MGH prior function is relatively simple,

    since these can be substituted in (3) to obtain . On the

    other hand, the parameter estimation of traditional prior func-

    tions that have fewer number of parameters are often not so

    simple. For example, the shape and width parameters of the GG

    prior function are estimated numerically that involves iteration.

    In practice, the convergence of the iterative procedure depends

    on the initial choice of the parameter. Fig. 5 shows an example

    of the nonlinear shrinkage function obtained from the pro-

    posed denoising technique that uses three different values of

    in the MGH prior function. It can be observed from thisfigure

    that the amount of shrinkage can be controlled by changing .This feature of being able to vary the amount of shrinkage may

    be useful, in particular, in producing a denoised image that is

    visually acceptable from the users perspective.

    Since the wavelet coefficients of images are locally stationary,

    an improved performance of the Bayesian denoising method can

    be obtained by considering the proposed prior function to be lo-

    cally adaptive during the MMSE estimation. The higher order

    moments of the MGH PDF, however, will not be effective for

    modeling the local neighboring coefficients as compared to that

    of the subband coefficients, since the error introduced by the

    estimated higher order sample-moments in the former is not in-

    significant as in the case of the latter. This is due to the factthat the number of data samples for the local neighboring co-

    efficients is much less than that of the subband coefficients. In

    our earlier work [57], it has been shown that the symmetric GC

    PDF, which is a particular case of the MGH PDF having ,

    performs better than the GG PDF for modeling the local neigh-

    boring wavelet coefficients for images. A better quality MMSE

    estimator for image denoising can, therefore, be obtained by

    using the symmetric GC prior function as the prior function in

    the proposed shrinkage function. It was observed from exten-

    sive simulations that the difference in denoising performance in

    terms of the MSEs of the MMSE estimators designed using the

    MGH PDF with and is n o more than 0.25%. H ence,

    in order to savethe computational effort, MGH PDF with ,

    i.e., the GC PDF, is chosen as the prior function for the case of

    locally adaptive denoising. In such a case, the number of pa-

    rameters of the MGH prior function is the same as that of the

    conventional prior functions, namely, GG and BK. The compu-

    tational complexity of the locally adaptive version of the pro-

    for odd

    for even. (18)

  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    9/17

    RAHMANet al.: BAYESIAN WAVELET-BASED IMAGE DENOISING USING THE GAUSSHERMITE EXPANSION 1763

    Fig. 5. Shrinkage function obtained from the modified GH probability density function for the LH2 subband DWT coefficients as a function of with .The test images are (a) Lena and (b)Boat.

    Fig. 6. Output PSNR as a function of the input PSNR for different subband-adaptive DWT-based Bayesian denoising algorithms using the test image Lena

    ( for the proposed method).

    posed denoising method is , where is the

    average number of discrete points in a window obtained from

    the selected bin.

    IV. SIMULATION RESULTS

    Extensive experimentations have been carried out in order

    to compare the performance of the proposed denoising method

    with that of the other methods. Due to space limitation, how-

    ever, we give in this section results concerning five 512 512

    grayscale images, namely,Lena,Barbara,Boat,Bridge, andBa-

    boon and two types of wavelet representations, namely, the deci-

    mated DWT and redundant DT-CWT. Thefirst three test imagesare obtained from the same sources as mentioned in [16] and

    Fig. 7. Output PSNR as a function of the input PSNR for different subband-adaptive DWT-based Bayesian denoising algorithms using the test image Bar-bara(

    for the proposed method).

    the last two from [68]. The DWT-based experiments use the or-

    thogonal wavelet filter Symmlet-8, whereas the DT-CWT-based

    ones use the -tap biorthogonal filters at level ,

    and 6-tap Q-shift orthogonalfilters at , as have been pro-

    posed in [72]. The proposed algorithm is applied to the 4-level

    decomposition of the DWT and the 3-level decomposition of the

    DT-CWT, since any further decomposition beyond these levels

    does not produce a significant increase in the denoising perfor-

    mance. The performance of the Bayesian denoising method is

    tested using two sets of experiments. In the first set of experi-

    ments, the MGH prior function is used for the subband-adaptive

    denoising. In the second set of experiments, the symmetric GC

    prior function, i.e., MGH prior function with , is used forlocally adaptive denoising.

  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    10/17

    1764 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 17, NO. 10, OCTOBER 2008

    Fig. 8. Output PSNR as a function of the input PSNR for different subband-adaptive DWT-based Bayesian denoising algorithms using the test image Boat

    (

    for the proposed method).

    Fig. 9. Output PSNR as a function of the input PSNR for different subband-adaptive DWT-basedBayesian denoising algorithms usingthe testimageBridge(

    for the proposed method).

    A. Subband-adaptive Bayesian Denoising

    Seven different wavelet-based Bayesian denoising methods,

    which are subband-adaptive in nature, have been considered in

    this set of experiments. These are the classical Wiener filter,

    SureShrink [3], BayesShrink [20], EBayesThresh [24], Prob-

    Shrink [10], BKShrink [46] and proposed method. As pointed

    out in Section I, these methods enjoy a computational com-

    plexity that is lower than that of the corresponding locally adap-

    tive one by a factor of ; and that the computational com-

    plexity of those methods that employ intersubband dependency

    is even higher than those of the corresponding locally adap-

    tive ones. To the best of authorsknowledge the BKShrink and

    ProbShrink methods provide the best results amongst the sub-

    band-adaptive denoising methods. The denoising performanceof the various methods are evaluated using two indices, namely,

    Fig. 10. Output PSNR as a function of the input PSNR for different subband-

    adaptive DWT-based Bayesian denoising algorithms using the test image Ba-

    boon ( for the proposed method).

    Fig. 11. Output PSNR of subband-adaptive Bayesian denoising algorithmsusing redundant DWT, wherein test images are corrupted with ( for the proposed method).

    the peak signal-to-noise ratio (PSNR) and the structural simi-larity (SSIM) [4]. In this context, it is to be noted that the authors

    in [4] have claimed that SSIM can assess the similarity between

    the original and processed images better than the PSNR. De-

    noising performance of the proposed method is tested by using

    the two shrinkage functions, given by (14) and (16) in Sec-

    tion III. The evaluation of the parabolic cylinder function for

    the analytic expression in (14) is performed by the same source

    code as mentioned in [46]. It is to be noted that the processing

    time of (16) is very fast as compared to that of (14). For example,

    in a 3 GHz and 512 MB machine, the processing time of (16)

    using the MATLAB code is only 6.6 seconds for and the

    time is virtually the same for values of . On the other hand, the

    processing time of (14) is 1 hour and 24 minutes for andthe time increases exponentially with . It is also observed that

  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    11/17

    RAHMANet al.: BAYESIAN WAVELET-BASED IMAGE DENOISING USING THE GAUSSHERMITE EXPANSION 1765

    Fig. 12. Comparison of the denoised images obtained from the different subband-adaptive Bayesian DWT-based denoising algorithms on Lenawith :(a) original image, (b) noisy image (

    dB,

    ), and denoised images using (c) BayesShrink method [20] (

    dB,

    ), (d) ProbShrink method [10] ( dB, ), (e) BKShrink method [46] ( dB, ),and (f) proposed method with ( dB, ).

    for a given value of , the denoising performances of both (14)

    and (16) are essentially the same. Hence, we provide the results

    of the proposed method that uses the shrinkage function given

    by (16). From our experiments, we have observed that

    provides the best performance in terms of the PSNR, whereas

    provides the best performance in terms of the SSIM, for

    the proposed method. These values of are consistent with thevalues of mentioned in Section II as providing the best statis-

    tical modeling of the image wavelet coefficients by the MGH

    PDF.

    The output PSNR as a function of the input PSNR obtained

    from various DWT-based denoising methods are shown in

    Figs. 610, respectively, for the test images Lena, Barbara,

    Boat, Bridge, and Baboon. From these figures, it can be

    observed that the proposed denoising method with

    that employs the DWT provides the highest output PSNR

    consistently as compared to the other DWT-based methods.

    According to the output PSNR values, the improvement of the

    denoising performance provided by the proposed method is

    substantial, when the input PSNR is low, i.e., the noise strengthis significant. Fig. 11 shows a comparison between the output

    PSNR values obtained from the SureShrink [3], BayesShrink

    [20], EBayesThresh [24], ProbShrink [10], and proposed

    methods employing the redundant form of the DWT when the

    images are corrupted with . From thisfigure, it can also

    be seen that the output PSNR values of the proposed method

    are consistently higher than that of the other methods. Table IV

    shows the SSIM values obtained from the various DWT-baseddenoising methods for three different noise standard deviations.

    It can be observed from the table that, except for two values of

    in the case of the Barbaraimage, the proposed method with

    provides the best SSIM compared to that provided by

    the other denoising methods. It should be noted that the PSNR

    values for are about the same as those for , the

    difference being not more than 0.1 dB, and that these values are

    at least as good as those provided by the other methods. Fig. 12

    shows the original cropped image Lena, its noisy version with

    , and the DWT-based denoised versions provided

    by the BayesShrink method [20], ProbShrink method [10],

    BKShrink method [46], and proposed method with .

    From this figure, it can be observed that the proposed methodcan reduce the noise significantly with the least amount of

  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    12/17

    1766 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 17, NO. 10, OCTOBER 2008

    TABLE IV

    VALUES OF THE INDEXSSIM FOR VARIOUSSUBBAND-ADAPTIVEBAYESIANDENOISINGMETHODSUSING THEDWT

    artifacts (e.g., in the eye and the edge of the hat) as compared

    to the other competing methods.

    B. Locally Adaptive Bayesian Denoising

    Six different wavelet-based denoising methods, which are lo-

    cally adaptive in nature, have been considered in this set of ex-

    periments. These are the BivariateShrink [58], locally adaptive

    window-based MAP (LAWMAP) [54], NeighShrink [22], Prob-

    Shrink [10], GSM method [16] and proposed method. In the

    first method, both the inter and intrasubband dependencies are

    considered. The second, third, and fourth methods inherently

    use only the intrasubband dependency. In the case of the fifth,

    we provide the results obtained from the intrasubband depen-

    dency only in order to make a fair comparison with our pro-

    posed method. The parameters of the symmetric GC prior func-

    tion for the proposed denoising method is refined locally using

    different sizes of square-shaped neighborhood regions. Fig. 13

    shows the improvement in the PSNR (IPSNR) values2 asa func-

    tion of the window size with a reference of 3 3 window, i.e.,

    as a function of , for de-

    noising different test imagesbeing corrupted with a noise having

    . From this figure, it can be observed that as the window

    size is increased from 3 3, the IPSNR value increasesfirst and

    then drops slightly when the window exceeds a certain size. Thisisdue tothe fact thata sufficient number of samples are required

    for improved parameter estimation of the prior function, but, if

    the neighborhood region becomes relatively large, the estima-

    tion process loses its local contextual information. The experi-

    ments reveal that a 9 9 window provides the highest IPSNR

    values in most cases. Considering this fact, we report the re-

    sults provided by our proposed denoising method using a 9 9

    window.

    Table V shows the output PSNR values obtained from the

    various locally adaptive denoising methods for the test images

    considered here. From the tabular results, it can be observed that

    among all the DWT-based denoising methods, the proposed one2IPSNR is the difference between output PSNR and input PSNR.

    Fig. 13. IPSNR values in dB as a function of the window size relative to theIPSNR values corresponding to a 3

    2

    3 window.

    provides the highest output PSNR values. The PSNR perfor-

    mance of the proposed method improves further, if the DT-CWT

    is employed. Fig. 14 shows the original cropped image Bar-

    bara, its noisy version with , and the DWT-based de-

    noised versions provided by four different denoising methods,

    namely, the BivariateShrink method [58], ProbShrink method

    [10], GSM method [16] and proposed method. From this figure

    also, it can be observed that the proposed method can reduce the

    noise significantly with the least amount of artifacts (e.g., in the

    eyes and scarf ofBarbara) as compared to the other competing

    methods.

    V. CONCLUSION

    The traditional probability density functions (PDFs) of the

    wavelet transform coefficients of images have a limited number

    of parameters, which are calculated by using only thefirst few

    moments. As a result, these PDFs cannot provide a sufficiently

    good statistical match for the empirical PDF of the waveletcoefficients of a true image. In this paper, a new PDF, wherein

  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    13/17

    RAHMANet al.: BAYESIAN WAVELET-BASED IMAGE DENOISING USING THE GAUSSHERMITE EXPANSION 1767

    Fig. 14. Comparison of the denoised images obtained from the different locally adaptive DWT-based denoising algorithms onBarbarawith

    : (a) originalimage, (b) noisy image dB , and denoised images using the (c) BivariateShrink method [58] dB , (d) ProbShrink method [10] dB , (e) GSM method [16] dB , and (f) proposed method dB .

    a more appropriate number of parameters can be incorporated,

    has been proposed in order to overcome this shortfall. This PDF

    uses a series expansion in terms of the Hermite polynomials,

    which are orthogonal with respect to the standard Gaussian

    weight function. Such an orthogonal series expansion has been

    utilized, since the Gaussian density function bears a resem-

    blance with the empirical PDF of the wavelet coefficients of a

    true image. A closed-form expression for the parameters of thePDF (which has been referred to in this paper as the GH PDF)

    has been derived in terms of the higher order sample-moments.

    A modification has been proposed in the GH PDF so that an

    appropriate number of parameters can be used to model the

    true image wavelet coefficients while keeping the resulting

    PDF to be non-negative. It has been shown that this modified

    GH (MGH) PDF matches the empirical ones of the test images

    used, more closely than some of the standard PDFs such as the

    generalized Gaussian and Bessel K-form.

    In order to demonstrate the efficacy of the proposed MGH

    PDF, we have designed and tested the minimum mean squared

    error-based shrinkage function for its use in an image noise re-

    duction technique as a prior function of the subband as well aslocal neighboring wavelet coefficients. The parameters required

    for the prior function have been estimated from the noisy obser-

    vation. The simulation results using a number of standard test

    images have shown that the performance of the proposed de-

    noising method, both in subband-adaptive and locally adaptive

    conditions, is better than that of most of the methods that use

    PDFs with limited number of parameters for statistical modeling

    of the wavelet coefficients.

    APPENDIXA

    PARAMETERS OF PDF IN TERMS OF THE

    GAUSSHERMITEEXPANSION

    Let be the normalized version of with the relation

    and its PDF be . Since the Hermite polynomials are

    orthogonal with respect to the zero-mean unit variance Gaussian

    weight function, can be written as [64]

    (19)

  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    14/17

    1768 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 17, NO. 10, OCTOBER 2008

    TABLE IV

    OUTPUTPSNR, IN DB, WHERE IS THEERROR STANDARD

    DEVIATION, FOR DIFFERENTLOCALLYADAPTIVEDENOISINGMETHODS

    where are the series coefficients. Multiplying both sides of

    (19) by and integrating from to , can be eval-

    uated as

    (20)

    Using the fact that has a zero mean and unit variance, one can

    obtain and . Finally, changing the variable

    of to in (19) yields [61]

    (21)

    The Hermite polynomials can be expressed as

    (22)

    Using the identity and the expression of , the series coefficients in (20) can be obtained in a

    generalized form

    (23)

    where is the th order moment of .

    APPENDIXB

    PARAMETER FOR THEMODIFIEDGAUSS-HERMITEPDF

    Using the normalized variable , we can solve (8) as

    , where

    (24)

    Using the identity and the

    recurrance relation in (2), we obtain

    (25)

    Hence, for

    (26)

    (27)

    Using the above, and , and hence, the parameter can be

    expressed as

    (28)

  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    15/17

  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    16/17

    1770 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 17, NO. 10, OCTOBER 2008

    [11] R. R. Coifman and D. L. Donoho,Translation-invariant de-noising,in Wavelets and Statistics, A. Antoniadis and G. Oppenheim, Eds.Berlin, Germany: Springer-Verlag, 1995.

    [12] N. G. Kingsbury, Image processing with complex wavelets, Phil.Trans. R. Soc. London A, vol. 357, no. 1760, pp. 25432560, 1999.

    [13] G. Y. Chen and B. Kgl, Image denoising with complex ridgelets,Pattern Recognit., vol. 40, pp. 578585, 2007.

    [14] J. L. Starck, E. J. Cands, and D. L. Donoho, The curvelet transform

    for image denoising,IEEE Trans. Image Process., vol. 11, no. 6, pp.670684, Jun. 2002.[15] R. Eslami and H. Radha,Translation-invariant contourlet transform

    and its application to image denoising,IEEE Trans. Image Process.,vol. 15, no. 11, pp. 33623374, Nov. 2006.

    [16] J. Portilla, V. Strela, M. Wainwright, and E. P. Simoncelli, Imagedenoising using scale mixtures of Gaussians in the wavelet domain,

    IEEE Trans. Image Process., vol. 12, no. 11, pp. 13381351, Nov.2003.

    [17] A. A. Bharath and J. Ng,A steerable complex wavelet constructionand its application to image denoising,IEEE Trans. Image Process.,vol. 14, no. 7, pp. 948959, Jul. 2005.

    [18] B. Vidakovic, Statistical Modeling by Wavelets, 1st ed. New York:Wiley, 1999.

    [19] A. Antoniadis and J. Bigot,Wavelet estimators in nonparametric re-gression: A comparative simulation study,J. Statist. Softw., vol. 6, no.6, 2001.

    [20] S. G. Chang, B. Yu, and M. Vetterli,Adaptive wavelet thresholdingfor image denoising and compression, IEEE Trans. Image Process.,vol. 9, no. 9, pp. 15321546, Sep. 2000.

    [21] M. A. T. Figueiredo and R. D. Nowak, Wavelet-based image esti-mation: An empirical Bayes approach using Jeffreysnoninformativeprior,IEEE Trans. Image Process., vol. 10, no. 9, pp. 13221331, Sep.2001.

    [22] T. Cai and B. Silverman,Incorporating information on neighboringcoefficients into wavelet estimation, Sankhya: The Indian J. Statist.,vol. 63, pp. 127148, 2001.

    [23] L. Zhang and P. Bao,Denoising by spatial correlation thresholding,IEEE Trans. Circuits Syst. Video Technol., vol. 13, no. 6, pp. 535538,Jun. 2003.

    [24] I. M. Johnstone and B. W. Silverman,Empirical Bayes selection ofwavelet thresholds,Ann. Statist., vol. 33, no. 4, pp. 17001752, 2005.

    [25] H.-Y. Gao and A. G. Bruce,Waveshrink withfirm shrinkage,Statist.Sinica, vol. 7, no. 4, pp. 855874, 1997.

    [26] H.-Y. Gao,Wavelet shrinkage denoising using the non-negative gar-rote,J. Comput. Graph. Statist., vol. 7, no. 4, pp. 469488, 1998.

    [27] E. P. Simoncelli and E. Adelson,Noise removal via Bayesian waveletcoring,inProc. IEEE Int. Conf. Image Processing, Lusanne, Switzer-land, 1996, vol. 1, pp. 279382.

    [28] M. I. H. Bhuiyan, M. O. Ahmad, and M. N. S.Swamy, Spatially adap-tive wavelet-based method using the Cauchy prior for denoising theSAR images, IEEE Trans. Circuits Syst. Video Technol., vol. 17, no.4, pp. 500507, Apr. 2007.

    [29] L.Sendur and I. W. Selesnick, Bivariate shrinkage functions forwavelet-based denoising exploiting interscale dependency, IEEETrans. Signal Process., vol. 50, no. 11, pp. 27442756, Nov.2002.

    [30] S. M. M. Rahman, M. O. Ahmad, and M. N. S. Swamy,Video de-noising based on inter-frame statistical modeling of wavelet coeffi-cients, IEEE Trans. Circuits Syst. Video Technol., vol. 17, no. 2, pp.

    187198, Feb., 2007.[31] J. Huang,Statistics of Natural Images and Models, Ph.D. disserta-

    tion, Brown Univ., Providence, RI, 2000.[32] A. Srivastava, A. B. Lee,E. P.Simoncelli, and S. C. Zhu, On advances

    in statistical modeling of natural images,J. Math. Imag. Vis., vol. 18,pp. 1733, 2003.

    [33] S. Mallat, A theory for multiresolution signal decomposition: Thewavelet representation, IEEE Trans. Pattern Anal. Machine Intell.,vol. 11, no. 7, pp. 674693, 1989.

    [34] A. D. Stefano, P. R. White, and W. B. Collis, Selection of thresh-olding scheme for image noise reduction on wavelet components usingBayesian estimation,J. Math. Imag. Vis., vol. 21, pp. 225233, 2004.

    [35] P. Moulin and J. Liu,Analysis of multiresolution image denoisingschemes using generalized Gaussian and complexity priors, IEEETrans. Inf. Theory, vol. 55, no. 3, pp. 909919, Apr. 1999.

    [36] D. Cho and T. D. Bui,Multivariate statistical modeling for image de-noising using wavelet transforms, Signal Process.: Image Commun.,vol. 20, pp. 7789, 2005.

    [37] H. A. Chipman, E. D. Kolaczyk, and R. E. McCulloch, AdaptiveBayesian wavelet shrinkage,J. Amer. Statist. Assoc., vol. 92, no. 440,pp. 14131421, 1997.

    [38] F. Abramovich, T. Sapatinas, and B. W. Silverman,Wavelet thresh-olding via a Bayesian approach,J. Roy. Statist. Soc. B, vol. 60, no. 4,pp. 725749, 1998.

    [39] M. Clyde and E. I. George,Flexible empirical Bayes estimation forwavelets,J. Roy. Statist. Soc. B, vol. 62, no. 4, pp. 681698, 2000.

    [40] J. Portilla, V. Strela, M. Wainwright, and E. P. Simoncelli, Adap-tive Wiener denoising using a Gaussian scale mixture model,inProc.IEEE Int. Conf. Image Processing, Thessaloniki, Greece, 2001, vol. 2,pp. 3740.

    [41] M. S. Crouse, R. D. Nowak, and R. G. Baraniuk,Wavelet-based sta-tistical signal processing using hidden Markov models,IEEE Trans.Signal Process., vol. 46, no. 4, pp. 886902, Apr. 1998.

    [42] P. Scheunders and S. D. Backer,Wavelet denoising of multicompo-nent images using Gaussian scale mixture models and a noise-freeimage as priors, IEEE Trans. Image Process., vol. 16, no. 7, pp.18651872, Jun. 2007.

    [43] L. Boubchir and J. M. Fadili,A closed-form nonparametric Bayesianestimator in the wavelet domain of images using an approximate

    -stable prior,Pattern Recognit. Lett., vol. 27, pp. 13701382, 2006.[44] A. Achim, A. Bezerianos, and P. Tsakalides,Novel Bayesian multi-

    scale method for speckle removal in medical ultrasound images,IEEETrans. Med. Imag., vol. 20, no. 8, pp. 772783, Aug. 2001.

    [45] A. Achim, P. Tsakalides, and A. Bezerianos,SAR image denoisingvia Bayesian wavelet shrinkage basedon heavy-tailed modeling,IEEETrans. Geosci. RemoteSens., vol.41,no.8, pp. 17731784, Aug. 2003.

    [46] J. M. Fadili and L. Boubchir,Analytical form for a Bayesian waveletestimator of images using the Bessel K-form densities, IEEE Trans.

    Image Process., vol. 14, no. 2, pp. 231240, Feb. 2005.[47] S. Solbo and T. Eltoft, Homomorphic wavelet-based statistical de-

    speckling of SAR images,IEEE Trans. Geosci. Remote Sens., vol. 42,no. 4, pp. 711721, Apr. 2004.

    [48] M. I. H. Bhuiyan, M. O. Ahmad, and M. N. S. Swamy,Wavelet-baseddespeckling of ultrasound images with the symmetric normal inverse

    Gaussian prior, inProc. IEEE Int. Conf. Acoustics, Speech, and SignalProcess., 2007, vol. 1, pp. 721724.

    [49] R. V. Hogg and A. T. Craig, Introduction to Mathematical Statistics,5th ed. Englewood Cliffs, NJ: Prentice-Hall, 1995.

    [50] M. N. Do and M. Vetterly,Wavelet-based texture retrieval using gen-eralized Gaussian density and Kullback-Leibler distance,IEEE Trans.

    Image Process., vol. 11, no. 2, pp. 146158, Feb. 2002.[51] B. Aiazzi, L. Alparone, and S. Baronti,Estimation based on entropy

    matching for generalized Gaussian PDF modeling, IEEE SignalProcess. Lett., vol. 6, no. 6, pp. 138140, Jun. 1999.

    [52] L. P. Hansen,Large sample properties of generalized method of mo-ments estimator,Econometrica, vol. 50, no. 4, pp. 10291054, 1982.

    [53] J. Liu and P. Moulin,Information-theoretic analysis of interscale andintrascale dependencies between image wavelet coefficients, IEEETrans. Image Process., vol. 10, no. 11, pp. 16471658, Nov. 2001.

    [54] M. K. Mihak, I. Kozintsev, K. Ramchandran, and P. Moulin,Low-complexity image denoising based on statistical modeling of wavelet-coefficients,IEEE Signal Process. Lett., vol. 6, no. 12, pp. 300303,Dec. 1999.

    [55] S. M. M. Rahman and M. K. Hasan,Wavelet-domain iterative centerweighted medianfilter for image denoising,Signal Process., vol. 83,pp. 10011012, 2003.

    [56] M. Kazubek,Wavelet domain image denoising by thresholding andWiener filtering, IEEE Signal Process. Lett., vol. 10, no. 11, pp.324326, Nov. 2003.

    [57] S. M. M. Rahman, M. O. Ahmad, and M. N. S. Swamy, Locallyadaptive wavelet-based imagedenoising using the GramCharlier priorfunction, in Proc. IEEE Int. Conf. Image Processing, San Antonio,TX, 2007, vol. 3, pp. 549552.

    [58] L.Sendur and I. W. Selesnick,Bivariate shrinkage with local varianceestimation, IEEE Signal Process. Lett., vol. 9, no. 12, pp. 438441,Dec. 2002.

    [59] A. Achim and E. E. Kuruoglu, Image denoising using bivariate -stable distributions in the complex wavelet domain,IEEE Signal Pro-cessing Lett., vol. 12, no. 1, pp. 1720, Jan. 2005.

    [60] L. Zhang, P. Bao, and X. Wu,Multiscale LMMSE-based image de-noising with optimal wavelet selection, IEEE Trans. Circuits Syst.Video Technol., vol. 15, no. 4, pp. 469481, Apr. 2005.

    [61] A. Zayezdny, D. Tabak, and D. Wulich, Engineering Applications ofStochastic Processes, 1st ed. New York: Wiley, 1989.

  • 8/10/2019 14-Bayesian Wavelet-Based Image Denoising Using the GaussHermite Expansion.pdf

    17/17

    RAHMANet al.: BAYESIAN WAVELET-BASED IMAGE DENOISING USING THE GAUSSHERMITE EXPANSION 1771

    [62] G. G. Walter,Properties of Hermite series estimation of probabilitydensity,Ann. Stat., vol. 5, no. 6, pp. 12581264, 1977.

    [63] M. Abramowitz and I. E. Stegun, Handbook of Mathematical Func-tions with Formulas, Graphs and Mathematical Tables, 10th ed. NewYork: Dover, 1965.

    [64] M. G. Kendall and A. Stuart, The Advanced Theory of Statistics, 4thed. London, U.K.: Charles Griffin, 1977, vol. 1, Distribution Theory.

    [65] E. Jondeau and M. Rockinger, GramCharlier densities, J. Econ.

    Dyn. C ontrol, vol. 25, pp. 14571483, 2001.[66] D. E. Barton and K. E. Dennis, The conditions under whichGramCharlier and edgeworth curves are positive definite and uni-modal,Biometrika, vol. 39, no. 3/4, pp. 425427, 1952.

    [67] J. S. Simonoff, Smoothing Methods in Statistics, 1st ed. New York:

    Springer-Verlag, 1996.[68] [Online]. Available: http://decsai.ugr.es/cvg/dbimagenes/index.php[69] I. S. Gradshteyn and I. M. Ryzhik, Table of Integrals, Series and Prod-

    ucts, A. Jeffrey, Ed., 5th ed. New York: Academic, 1994.[70] C. D. Boor, A Practical Guide to Splines, F. John, L. Sirovich,

    J. P. Lasalle, and G. B. Whitham, Eds., 1st ed. New York:Springer-Verlag, 1978.

    [71] A. Papoulis, Probability, Random Variables, and Stochastic Processes,3rd ed. New York: McGraw-Hill, 1991.

    [72] N. G. Kingsbury,Complex wavelets for shift invariance analysis andfiltering of signals,Appl. Comput. Harmon. Anal., vol. 10, no. 3, pp.234253, 2001.

    S. M. Mahbubur Rahman received B.Sc. andM.Sc.degrees in electrical and electronic engineering fromthe Bangladesh University of Engineering andTechnology (BUET), Dhaka, Bangladesh, in 1999

    and 2002, respectively. He is currently pursuingthe Ph.D. degree in the Department of Electricaland Computer Engineering, Concordia University,

    Montreal, QC, Canada.He was a Lecturer from 1999 to 2002 and an As-

    sistant Professor from 2002 to 2003 at BUET. Hisresearch interests are in the areas of statistical signal

    processing, digital image/video processing, and waveletfilters.Mr. Rahman hasreceivedseveral awardsduringhis Ph.D. program, including

    the Concordia University Graduate Fellowship, the Faculty of Engineering andComputer Science Graduate Scholarship, the Carolyn and Richard Renaud

    Teaching Assistantship, and the Doctoral Teaching Assistantship.

    M. Omair Ahmad (S69M78SM83F01) re-ceived the B.Eng. degree from Sir George Williams

    University, Montreal, QC, Canada, and the Ph.D.degree from Concordia University, Montreal, both

    in electrical engineering.During 19781979, he was a member of the Fac-

    ulty of the New York University College, Buffalo. InSeptember 1979, he joined the Faculty of ConcordiaUniversity, where he was an Assistant Professor ofcomputer science. Subsequently, he joined the De-partment of Electrical and Computer Engineering of

    thesame university, where he wasthe Chair of thedepartment from June 2002 toMay 2005 and where he is presently a Professor and holds the Tier I Concordia

    University Research Chair in Multimedia Signal Processing. He has publishedextensively in the area of signal processing and holds four patents. His current

    research interests include the areas of multidimensionalfilter design, speech,image and video processing, nonlinear signal processing, communication DSP,artificial neural networks, and VLSI circuits for signal processing. He was afounding member of Micronet from its inception in 1990 as a Canadian Net-work of Centers of Excellence until its expiration in 2004.

    Dr. Ahmad was anAssociate Editor of the IEEET RANSACTIONS ON CIRCUITSANDSYSTEMSPARTI: FUNDAMENTALTHEORY ANDAPPLICATIONS from June1999 to December 2001. He was the Local Arrangements Chairman of the 1984IEEE International Symposium on Circuits and Systems. During 1988, he wasa member of the Admission and Advancement Committee of the IEEE. He hasalso served as the Program Co-Chair for the 1995 IEEE International Con-ference on Neural Networks and Signal Processing and the 2004 IEEE Inter-

    national Midwest Symposium on Circuits and Systems. Presently, he is theChairman of the IEEE Circuits and Systems Chapter (Montreal Section). He

    is a recipient of the Wighton Fellowship from the Sandford Fleming Founda-tion.

    M. N. S. Swamy (S59M62SM74F80) re-ceived the B.Sc. (Hons.) degree in mathematics fromMysore University, India, in 1954, the Diploma inElectrical Communication Engineering from theIndian Institute of Science, Bangalore, in 1957, theM.Sc. and Ph.D. degrees in electrical engineeringfrom the University of Saskatchewan, Saskatoon,SK, Canada, in 1960 and 1963, respectively, and the

    Doctor of Science degree in engineering (HonorisCausa) from Ansted University In recognitionof his exemplary contributions to the research in

    Electrical and Computer Engineering and to Engineering Education, as wellas his dedication to the promotion of Signal Processing and CommunicationsApplications in August 2001. As a recognition of his research contributions,as well as his contributions to Concordia University, the University has recentlyestablished the M. N. S. Swamy Chair in Electrical Engineering.

    He is presently a Research Professor and the Director of the Center for SignalProcessing and Communications, Department of Electrical and Computer En-gineering, Concordia University, Montreal, QC, Canada, where he served asthe Chair of the Department of Electrical Engineering from 1970 to 1977 andDean of Engineering and Computer Science from 1977 to 1993. Since July2001, he has held the Concordia Tier 1 Research Chair in Signal Processing. Hehas also taught in the Electrical Engineering Department, Technical Universityof Nova Scotia, Halifax, NS, Canada, and the University of Calgary, Calgary,AB, Canada, as well as in the Department of Mathematics at the University

    of Saskatchewan. He has published extensively in the areas of number theory,circuits, systems and signal processing, and holds five patents. He is the co-au-thor of two book chapters and four books: Graphs, Networks and Algorithms

    (Wiley, 1981),Graphs: Theory and Algorithms(Wiley, 1992),Switched Capac-itor Filters: Theory, Analysis and Design (Prentice Hall International U.K. Ltd.,1995), andNeural Networks in a Softcomputing Framework(Springer, 2006). ARussian translation of the first book was published by Mir Publishers, Moscow,Russia, in 1984, while a Chinese version was published by the Education Press,Beijing, China, in 1987. He was a founding member of Micronet from its incep-tion in 1990 as a Canadian Network of Centers of Excellence until its expirationin 2004, as well as its Coordinator for Concordia University.

    Dr. Swamy is a Fellow of the Institute of Electrical Engineers (U.K.), the En-gineering Institute of Canada, the Institution of Engineers (India), and the Insti-tution of Electronic and Telecommunication Engineers (India). He has servedthe IEEE in various capacities such as the President-Elect in 2003, President in2004, Past-President in 2005, Vice President (Publications) during 20012002,VicePresident in 1976, Editor-in-Chief of theT RANSACTIONS ON CIRCUITS AND

    SYSTEMSPART I: FUNDAMENTAL THEORY AND APPLICATIONS fromJune 1999to December 2001, Associate Editor of the TRANSACTIONS ON CIRCUITS AND

    SYSTEMSfrom June 1985May 1987, Program Chair for the 1973 IEEE CASSymposium, General Chair for the 1984 IEEE CAS Symposium, Vice-Chair forthe 1999 IEEE CAS Symposium, and a member of the Board of Governors ofthe CAS Society. He is the recipient of many IEEE-CAS Society awards in-cluding the Education Award in 2000, Golden Jubilee Medal in 2000, and the1986 Guillemin-Cauer Best Paper Award.