DOC PREVIEW
UT Arlington EE 5359 - Matching Pursuits

This preview shows page 1-2-3-26-27-28 out of 28 pages.

Save
View full document
Premium Document
Do you want full access? Go Premium and unlock all 28 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

Matching Pursuits Vidhya N S Murthy Roadmap Low Bitrate Video coding Some history about Matching Pursuits What is Matching pursuits Applying this technique to Video Encoder Results Motivation for Low bitrate Video Demand for video telephony video conferencing etc over PSTN networks Limited bandwidth in wireless networks Function at bitrates in the range of 10 24kbps Error resilience over noise prone channels the source encoder has to perform well to reduce error protection overhead Evolution of International Standards All these standards are based on Block Matching techniques and DCT framework The effect of transform and quantization Motion Residue 10 4 43 47 6 8 47 46 7 4 42 43 4 5 Transform Quant 44 42 Reconstructed Data 6 8 0 2 0 0 0 0 0 0 0 0 0 0 ITransform IQuant 0 0 8 4 44 48 8 4 44 48 8 4 44 48 8 4 44 48 Typical encoder and where are we planning to modify Reference frames IDCT Inverse Quantization VLC Frame Predictor Motion Estimation DCT Quantization Some History about Matching Pursuits Introduced by Mallat and Zhang in 1993 Based on Projection pursuits work by Friedman and Tukey in 1978 Used for compressing video in 1994 by Neff Zakhor A comprehensive work carried out by Neff and Zakhor at Berkeley and was a part of proposals to the MPEG4 standards committee Currently work is being done to find What is Matching Pursuits Matching Pursuits is a greedy algorithm which matches signal structures to a large diverse dictionary of functions Expands a signal using an over complete dictionary of functions More number of basis functions implies there are a larger number of available options to approximate structures in pictures better Geometric Analogy z z y y x A three dimensional vector in the space R3 If the vector were 3 2 3 it means we have resolved it along the x y and z axis as 3 2 and 3 respectively The unit vectors along x y and z form the complete basis for R3 span all possible vectors in the 3 dimensional space x Now if we add the vector 3 2 3 to the basis vector set of R3 then we have a redundant basis and vectors like scaled versions of 3 2 3 and its linear combinations with other vectors can have sparser representations in this new space spanned by these 4 basis vectors Fourier Bases Sum of the first 4 harmonics Fundamental 3rd Harmonic 5th Harmonic 7th Harmonic Diagramatically Dictionary gk t Signal h t No restriction on the choice of dictionary Signal can be multidimensional Decompose M t pngn t n 1 Notice similarity to Fourier expansion The Gabor dictionary Modulated Gaussian window 2 D case 2D Gabor basis visualization Algorithm Stages Dictionary design Atom Decomposition or Atom Search or simply Find atoms 2D Dictionaries 400 basis images of Gabor Dictionary 64 basis images of 8x8 DCT All basis images have a fixed size of 8x8 Finding Atoms Find Energy Stage Atom Structure Flowchart explaining the position coding system General Block diagram of DCT based Encoder Reference frames IDCT Inverse Quantization VLC Frame Predictor I P video Motion Estimation DCT Quantization Bitstream The new Encoder block diagram More visible features tend to be coded first Foreman Motion Residue First 5 atoms First 32 atoms First 64 atoms First 5 atoms First 32 atoms First 64 atoms Hall Motion Residue Reconstructed Images First 5 atoms First 32 atoms First 64 atoms First 5 atoms First 32 atoms First 64 atoms MPEG2 at Low Bitrates and Matching Pursuits Foreman Reconstructed image for 64 coded atoms Reconstructed image MPEG2 at 20 kbps Hall Monitor Reconstructed image for 64 coded atoms Reconstructed image MPEG2 at 20 kbps Software Software can be downloaded from http cnx org content expanded browse authors letter M author vmurthy Conclusions This coding paradigm is very effective at low bitrates It is computationally very complex and hence future enhancements will be more towards reducing the number of searches and looking for better dictionaries which will also in turn assist in reducing the number of searches References 1 Z Zhang and S Mallat Matching pursuit with time frequency dictionaries IEEE Transactions on Signal Processing Vol 41 No 12 pp 3397 3415 Dec 1993 2 J H Friedman and W Stuetzle Projection pursuit regression J Amer Stat Assoc vol 76 no 376 pp 817 823 Dec 1981 3 F Bergeaud and S Mallat Matching pursuit of images Image Processing 1995 ICIP 1995 IEEE International Conference on pp 53 56 Sept 1995 4 M Vetterli and T Kalker Matching pursuit for compression and application to motion compensated video coding Image Processing 1994 ICIP 1994 IEEE International Conference on pp 724 729 Nov 1994 5 R Neff and A Zakhor Very Low Bit Rate Video Coding Based on Matching Pursuits IEEE Transactions on circuits and systems for video technology Vol 7 No 1 pp 158 171 Feb 1997 6 J Pearl H C Andrews and W K Pratt Performance measures for transform data coding IEEE Trans Commun vol COM 20 pp 411 415 June1972 7 P Yip and K R Rao Energy packing efficiency for the generalized discrete transforms IEEE Trans Commun vol COM 26 pp 1257 1261 Aug 1978 8 K Imammura et al A fast matching pursuits algorithm based on sub band decomposition of video signals IEEE ICME 2006 pp 729 732 July 2006 9 K Cheung and Y Chan An efficient algorithm for realizing matching pursuits and its applications in MPEG4 coding system Image Processing 2000 ICIP 2000 IEEE International Conference on Vol 2 pp 863 866 Sept 2000 10 A Shoa and S Shirani Tree structure search for matching pursuit Image Processing 2005 ICIP 2005 IEEE International Conference on Vol 3 pp 908 911 Sept 2005 11 R Neff et al Decoder complexity and performance comparison of matching pursuit and DCT based MPEG 4 video codecs Image Processing 1998 ICIP 98 Proceedings 1998 International Conference on Vol 1 pp 783 787 Oct 1998 12 R Neff A Zakhor and M Vetterli Very low bit rate video coding using matching pursuits in Proc SPIE VCIP vol 2308 no 1 pp 47 60 Sept 1994 13 R Neff and A Zakhor Matching pursuit video coding at very low bit rates in IEEE Data Compression Conf Snowbird UT pp 411 420 Mar 1995 Thank You


View Full Document

UT Arlington EE 5359 - Matching Pursuits

Documents in this Course
JPEG 2000

JPEG 2000

27 pages

MPEG-II

MPEG-II

45 pages

MATLAB

MATLAB

22 pages

AVS China

AVS China

22 pages

Load more
Download Matching Pursuits
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Matching Pursuits and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Matching Pursuits and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?