A wavelet is a wave-like oscillation with an amplitude that begins at zero, increases or decreases, and then returns to zero one or more times. Wavelets are termed a "brief oscillation". A taxonomy of wavelets has been established, based on the number and direction of its pulses. Wavelets are imbued with specific properties that make them useful for signal processing.
For example, a wavelet could be created to have a frequency of middle C and a short duration of roughly one tenth of a second. If this wavelet were to be convolution with a signal created from the recording of a melody, then the resulting signal would be useful for determining when the middle C note appeared in the song. Mathematically, a wavelet correlates with a signal if a portion of the signal is similar. Correlation is at the core of many practical wavelet applications.
As a mathematical tool, wavelets can be used to extract information from many kinds of data, including and images. Sets of wavelets are needed to analyze data fully. "Complementary" wavelets decompose a signal without gaps or overlaps so that the decomposition process is mathematically reversible. Thus, sets of complementary wavelets are useful in wavelet-based compression/decompression algorithms, where it is desirable to recover the original information with minimal loss.
In formal terms, this representation is a wavelet series representation of a square-integrable function with respect to either a complete, orthonormal set of , or an overcomplete set or frame of a vector space, for the Hilbert space of square-integrable functions. This is accomplished through coherent states.
In classical physics, the diffraction phenomenon is described by the Huygens–Fresnel principle that treats each point in a propagating wavefront as a collection of individual spherical wavelets.Wireless Communications: Principles and Practice, Prentice Hall communications engineering and emerging technologies series, T. S. Rappaport, Prentice Hall, 2002, p. 126. The characteristic bending pattern is most pronounced when a wave from a coherent source (such as a laser) encounters a slit/aperture that is comparable in size to its wavelength. This is due to the addition, or interference, of different points on the wavefront (or, equivalently, each wavelet) that travel by paths of different lengths to the registering surface. Multiple, closely spaced openings (e.g., a diffraction grating), can result in a complex pattern of varying intensity.
Wavelet transforms are broadly divided into three classes: continuous, discrete and multiresolution-based.
The frequency bands or subspaces (sub-bands) are scaled versions of a subspace at scale 1. This subspace in turn is in most situations generated by the shifts of one generating function ψ in L2( R), the mother wavelet. For the example of the scale one frequency band 1, this function is with the (normalized) sinc function. That, Meyer's, and two other examples of mother wavelets are:
]] |
]] |
]] |
The subspace of scale a or frequency band 1/ a, is generated by the functions (sometimes called child wavelets) where a is positive and defines the scale and b is any real number and defines the shift. The pair ( a, b) defines a point in the right halfplane R+ × R.
The projection of a function x onto the subspace of scale a then has the form with wavelet coefficients
For the analysis of the signal x, one can assemble the wavelet coefficients into a scaleogram of the signal.
See a list of some Continuous wavelets.
A sufficient condition for the reconstruction of any signal x of finite energy by the formula is that the functions form an orthonormal basis of L2( R).
From the mother and father wavelets one constructs the subspaces The father wavelet keeps the time domain properties, while the mother wavelets keeps the frequency domain properties.
From these it is required that the sequence forms a multiresolution analysis of L2 and that the subspaces are the orthogonal "differences" of the above sequence, that is, Wm is the orthogonal complement of Vm inside the subspace V m−1,
In analogy to the sampling theorem one may conclude that the space Vm with sampling distance 2 m more or less covers the frequency baseband from 0 to 1/2 m-1. As orthogonal complement, Wm roughly covers the band 1/2 m−1,.
From those inclusions and orthogonality relations, especially , follows the existence of sequences and that satisfy the identities so that and so that The second identity of the first pair is a refinement equation for the father wavelet φ. Both pairs of identities form the basis for the algorithm of the fast wavelet transform.
From the multiresolution analysis derives the orthogonal decomposition of the space L2 as For any signal or function this gives a representation in basis functions of the corresponding subspaces as where the coefficients are and
Being in this space ensures that one can formulate the conditions of zero mean and square norm one: is the condition for zero mean, and is the condition for square norm one.
For ψ to be a wavelet for the continuous wavelet transform (see there for exact statement), the mother wavelet must satisfy an admissibility criterion (loosely speaking, a kind of half-differentiability) in order to get a stably invertible transform.
For the discrete wavelet transform, one needs at least the condition that the wavelet series is a representation of the identity in the Lp space L2( R). Most constructions of discrete WT make use of the multiresolution analysis, which defines the wavelet by a scaling function. This scaling function itself is a solution to a functional equation.
In most situations it is useful to restrict ψ to be a continuous function with a higher number M of vanishing moments, i.e. for all integer m < M
The mother wavelet is scaled (or dilated) by a factor of a and translated (or shifted) by a factor of b to give (under Morlet's original formulation):
For the continuous WT, the pair ( a, b) varies over the full half-plane R+ × R; for the discrete WT this pair varies over a discrete subset of it, which is also called affine group.
These functions are often incorrectly referred to as the basis functions of the (continuous) transform. In fact, as in the continuous Fourier transform, there is no basis in the continuous wavelet transform. Time-frequency interpretation uses a subtly different formulation (after Delprat).
Restriction:
In particular, assuming a rectangular window region, one may think of the STFT as a transform with a slightly different kernel where can often be written as , where and u respectively denote the length and temporal offset of the windowing function. Using Parseval's theorem, one may define the wavelet's energy as From this, the square of the temporal support of the window offset by time u is given by
and the square of the spectral support of the window acting on a frequency
Multiplication with a rectangular window in the time domain corresponds to convolution with a function in the frequency domain, resulting in spurious ringing artifacts for short/localized temporal windows. With the continuous-time Fourier transform, and this convolution is with a delta function in Fourier space, resulting in the true Fourier transform of the signal . The window function may be some other Apodizing, such as a Gaussian filter. The choice of windowing function will affect the approximation error relative to the true Fourier transform.
A given resolution cell's time-bandwidth product may not be exceeded with the STFT. All STFT basis elements maintain a uniform spectral and temporal support for all temporal shifts or offsets, thereby attaining an equal resolution in time for lower and higher frequencies. The resolution is purely determined by the sampling width.
In contrast, the wavelet transform's multiresolutional properties enables large temporal supports for lower frequencies while maintaining short temporal widths for higher frequencies by the scaling properties of the wavelet transform. This property extends conventional time-frequency analysis into time-scale analysis.
The discrete wavelet transform is less computationally complexity, taking O( N) time as compared to O( N log N) for the fast Fourier transform (FFT). This computational advantage is not inherent to the transform, but reflects the choice of a logarithmic division of frequency, in contrast to the equally spaced frequency divisions of the FFT which uses the same basis functions as the discrete Fourier transform (DFT).The Scientist and Engineer's Guide to Digital Signal Processing By Steven W. Smith, Ph.D. chapter 8 equation 8-1: http://www.dspguide.com/ch8/4.htm This complexity only applies when the filter size has no relation to the signal size. A wavelet without compact support such as the Shannon wavelet would require O( N2). (For instance, a logarithmic Fourier Transform also exists with O( N) complexity, but the original signal must be sampled logarithmically in time, which is only useful for certain types of signals.)
For analysis with orthogonal wavelets the high pass filter is calculated as the quadrature mirror filter of the low pass, and reconstruction filters are the time reverse of the decomposition filters.
Daubechies and Symlet wavelets can be defined by the scaling filter.
The wavelet function is in effect a band-pass filter and scaling that for each level halves its bandwidth. This creates the problem that in order to cover the entire spectrum, an infinite number of levels would be required. The scaling function filters the lowest level of the transform and ensures all the spectrum is covered. See for a detailed explanation.
For a wavelet with compact support, φ( t) can be considered finite in length and is equivalent to the scaling filter g.
Meyer wavelets can be defined by scaling functions
For instance, Mexican hat wavelets can be defined by a wavelet function. See a list of a few continuous wavelets.
Notable contributions to wavelet theory since then can be attributed to George Zweig’s discovery of the continuous wavelet transform (CWT) in 1975 (originally called the cochlear transform and discovered while studying the reaction of the ear to sound), Pierre Goupillaud, Alex Grossmann and Jean Morlet's formulation of what is now known as the CWT (1982), Jan-Olov Strömberg's early work on discrete wavelets (1983), the Le Gall–Tabatabai (LGT) 5/3-taps non-orthogonal filter bank with linear phase (1988), Ingrid Daubechies' orthogonal wavelets with compact support (1988), Stephane Mallat's non-orthogonal multiresolution framework (1989), Ali Akansu's binomial QMF (1990), Nathalie Delprat's time-frequency interpretation of the CWT (1991), Newland's harmonic wavelet transform (1993), and set partitioning in hierarchical trees (SPIHT) developed by Amir Said with William A. Pearlman in 1996.
The JPEG 2000 standard was developed from 1997 to 2000 by a Joint Photographic Experts Group (JPEG) committee chaired by Touradj Ebrahimi (later the JPEG president). In contrast to the DCT algorithm used by the original JPEG format, JPEG 2000 instead uses discrete wavelet transform (DWT) algorithms. It uses the CDF 9/7 wavelet transform (developed by Ingrid Daubechies in 1992) for its lossy compression algorithm, and the Le Gall–Tabatabai (LGT) 5/3 discrete-time filter bank (developed by Didier Le Gall and Ali J. Tabatabai in 1988) for its lossless compression algorithm. JPEG 2000 technology, which includes the Motion JPEG 2000 extension, was selected as the video coding standard for digital cinema in 2004.
Wavelet transforms are classified into discrete wavelet transforms (DWTs) and continuous wavelet transforms (CWTs). Note that both DWT and CWT are continuous-time (analog) transforms. They can be used to represent continuous-time (analog) signals. CWTs operate over every possible scale and translation whereas DWTs use a specific subset of scale and translation values or representation grid.
There are a large number of wavelet transforms each suitable for different applications. For a full list see list of wavelet-related transforms but the common ones are listed below:
Another example of a generalized transform is the chirplet transform in which the CWT is also a two dimensional slice through the chirplet transform.
An important application area for generalized transforms involves systems in which high frequency resolution is crucial. For example, darkfield electron optical transforms intermediate between direct and reciprocal space have been widely used in the harmonic analysis of atom clustering, i.e. in the study of and .P. Hirsch, A. Howie, R. Nicholson, D. W. Pashley and M. J. Whelan (1965/1977) Electron microscopy of thin crystals (Butterworths, London/Krieger, Malabar FLA) Now that transmission electron microscopes are capable of providing digital images with picometer-scale information on atomic periodicity in nanostructure of all sorts, the range of pattern recognitionP. Fraundorf, J. Wang, E. Mandell and M. Rose (2006) Digital darkfield tableaus, Microscopy and Microanalysis 12:S2, 1010–1011 (cf. arXiv:cond-mat/0403017) and strain/metrologyMartin Rose (2006) Spacing measurements of lattice fringes in HRTEM image using digital darkfield decomposition (M.S. Thesis in Physics, U. Missouri – St. Louis) applications for intermediate transforms with high frequency resolution (like brushletsF. G. Meyer and R. R. Coifman (1997) Applied and Computational Harmonic Analysis 4:147. and ridgeletsA. G. Flesia, Hagit Hel-Or, A. Averbuch, E. J. Candes, R. R. Coifman and David Donoho (2001) Digital implementation of ridgelet packets (Academic Press, New York).) is growing rapidly.
Fractional wavelet transform (FRWT) is a generalization of the classical wavelet transform in the fractional Fourier transform domains. This transform is capable of providing the time- and fractional-domain information simultaneously and representing signals in the time-fractional-frequency plane.
Like some other transforms, wavelet transforms can be used to transform data, then encode the transformed data, resulting in effective compression. For example, JPEG 2000 is an image compression standard that uses biorthogonal wavelets. This means that although the frame is overcomplete, it is a tight frame (see types of frames of a vector space), and the same frame functions (except for conjugation in the case of complex wavelets) are used for both analysis and synthesis, i.e., in both the forward and inverse transform. For details see wavelet compression.
A related use is for smoothing/denoising data based on wavelet coefficient thresholding, also called wavelet shrinkage. By adaptively thresholding the wavelet coefficients that correspond to undesired frequency components smoothing and/or denoising operations can be performed.
Wavelet transforms are also starting to be used for communication applications. Wavelet OFDM is the basic modulation scheme used in HD-PLC (a power line communications technology developed by Panasonic), and in one of the optional modes included in the IEEE 1901 standard. Wavelet OFDM can achieve deeper notches than traditional FFT OFDM, and wavelet OFDM does not require a guard interval (which usually represents significant overhead in FFT OFDM systems). An overview of P1901 PHY/MAC proposal.
This motivates why wavelet transforms are now being adopted for a vast number of applications, often replacing the conventional Fourier transform. Many areas of physics have seen this paradigm shift, including molecular dynamics, chaos theory, ab initio calculations, astrophysics, gravitational wave transient data analysis, Density matrix localisation, seismology, optics, turbulence and quantum mechanics. This change has also occurred in image processing, EEG, Electromyography,J. Rafiee et al. Feature extraction of forearm EMG signals for prosthetics, Expert Systems with Applications 38 (2011) 4058–67. ECG analyses, brain rhythms, DNA analysis, protein analysis, climatology, human sexual response analysis,J. Rafiee et al. Female sexual responses using signal processing techniques, The Journal of Sexual Medicine 6 (2009) 3086–96. (pdf) general signal processing, speech recognition, acoustics, vibration signals, computer graphics, multifractal analysis, and sparse coding. In computer vision and image processing, the notion of scale space representation and Gaussian derivative operators is regarded as a canonical multi-scale representation.
Let the wavelet transform of be , where is the wavelet transform of the signal component and is the wavelet transform of the noise component.
Most elements in are 0 or close to 0, and
Since is orthogonal, the estimation problem amounts to recovery of a signal in iid Gaussian noise. As is sparse, one method is to apply a Gaussian mixture model for .
Assume a prior , where is the variance of "significant" coefficients and is the variance of "insignificant" coefficients.
Then , is called the shrinkage factor, which depends on the prior variances and . By setting coefficients that fall below a shrinkage threshold to zero, once the inverse transform is applied, an expectedly small amount of signal is lost due to the sparsity assumption. The larger coefficients are expected to primarily represent signal due to sparsity, and statistically very little of the signal, albeit the majority of the noise, is expected to be represented in such lower magnitude coefficients... therefore the zeroing-out operation is expected to remove most of the noise and not much signal. Typically, the above-threshold coefficients are not modified during this process. Some algorithms for wavelet-based denoising may attenuate larger coefficients as well, based on a statistical estimate of the amount of noise expected to be removed by such an attenuation.
At last, apply the inverse wavelet transform to obtain
|
|