UT Arlington EE 5359 - MULTIMEDIA PROCESSING - D2017358

Home> Schools> University of Texas at Arlington> Electrical Engineering (EE) > EE 5359> MULTIMEDIA PROCESSING

DOC PREVIEW

UT Arlington EE 5359 - MULTIMEDIA PROCESSING

School name University of Texas at Arlington

Course Ee 5359- Topics in Signal Processing

Pages 31

This preview shows page 1-2-14-15-30-31 out of 31 pages.

Save

View full document

Premium Document

Do you want full access? Go Premium and unlock all 31 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 31 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 31 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 31 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 31 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

View full document

Premium Document

Do you want full access? Go Premium and unlock all 31 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Premium Document

Do you want full access? Go Premium and unlock all 31 pages.

Access to all documents

Download any document

Ad free experience

Subscribe for instant access Get instant access

Unformatted text preview:

1 | P a g e EE 5359 FALL 2009 MULTIMEDIA PROCESSING PROJECT REPORT RATE-DISTORTION OPTIMIZATION USING SSIM IN H.264 I-FRAME ENCODER INSTRUCTOR: DR. K. R. RAO Babu Hemanth Kumar Aswathappa Department of Electrical Engineering University of Texas at Arlington Email: [email protected] | P a g e List of acronyms AVC Advanced Video Coding CABAC Context Adaptive Binary Arithmetic Coding CALVC context Adaptive Variable Length Coding D Distortion HDTV High Definition Television HVS Human Visual System I Intra-frame ITU International Telecommunication Union JPEG Joint Photographic Experts Group JVT Joint Video Team MPEG Moving Picture Experts Group MSE Mean Squared Error MSSIM Mean Structural Similarity Index Measurement PSNR Peak to peak signal to noise Ratio QP Quantization Parameter RD Rate Distortion SDTV Standard Definition Television SSD Sum of Squared Differences SSIM Structural Similarity Index Measurement3 | P a g e ABSTRACT In the rate-distortion optimization for H.264 I-frame encoder [1], the distortion (D) is measured as the sum of the squared differences between the reconstructed and the original images, which is same as MSE. Although peak-to-peak signal-to-noise ratio (PSNR) and MSE are currently the most widely used objective metrics due to their low complexity and clear physical meaning, they are also widely criticized for not correlating well with human visual system (HVS) for a long time [2]. During past several decades a great deal of effort has been made to develop new image quality assessment based on error sensitivity theory of HVS, but only limited success has been achieved by the reason that the HVS has not been well comprehended.[2] Recently a new philosophy for image quality measurement was proposed, based on the assumption that the human visual system is highly adapted to extract structural information from the viewing field. It follows that a measure of structural information change can provide a good approximation to perceived image distortion. In this new theory, an item called structural similarity index (SSIM) including three comparisons is introduced to measure the structural information change. Experiments have shown that the SSIM index method is easy to implement and can better correspond with human perceived measurement than PSNR (or MSE)[4][8][9] The main idea of this project is to employ SSIM in the rate-distortion optimizations of H.264 I-frame encoder to choose the best prediction mode(s). The required modifications will be done on the JVT reference software JM92 program [3]. Results in terms of size of the compressed image, SSIM of the whole reconstructed image for H.264-JM92 software and the new method will be compared.4 | P a g e INTRODUCTION MSE- MEAN SQUARED ERROR MSE is a signal fidelity measure. The goal of a signal fidelity measure is to compare two signals by providing a quantitative score that describes the degree of similarity/ fidelity or, conversely, the level of error/distortion between them. Usually, it is assumed that one of the signals is a pristine original, while the other is distorted or contaminated by errors. Suppose that x = { xi |i = 1, 2, · · · , N} and y = { yi |i = 1, 2, · · · , N} are two finite-length, discrete signals (e.g., visual images), where N is the number of signal samples (pixels, if the signals are images) and xi and yi are the values of the i th samples in x and y, respectively. The MSE between the signals x and y is 211( , ) ( )iiNiMSE x yNxy (1) In the MSE, we will often refer to the error signal ei,= xi − yi, which is the difference between the original and distorted signals. If one of the signals is an original signal of acceptable (or perhaps pristine) quality, and the other is a distorted version of it whose quality is being evaluated, then the MSE may also be regarded as a measure of signal quality. A more general form is the lp norm is (2) MSE is often converted into a peak-to-peak signal-to-noise ratio (PSNR) measure 10210logLPSNRMSE (3) where L is the dynamic range of allowable image pixel intensities. For example, for images that have allocations of 8 bits/pixel of gray-scale, L = 82− 1 = 255. The PSNR is useful if images having different dynamic ranges are being compared, but otherwise contains no new information relative to the MSE. WHY MSE [2]? The MSE has many attractive features: 1. It is simple. It is parameter free and inexpensive to compute, with a complexity of only one multiply and two additions per sample. It is also memoryless—the squared error can be evaluated at each sample, independent of other samples. 2. It has a clear physical meaning—it is the natural way to define the energy of the error signal. Such an energy measure is preserved after any orthogonal (or unitary) linear transformation, such as the Fourier transform (Parseval’s theorem). The energy preserving property guarantees that the energy of a signal distortion in the transform domain is the same as in the signal domain.5 | P a g e 3. The MSE is an excellent metric in the context of optimization. Minimum-MSE (MMSE) optimization problems often have closed-form analytical solutions, and when they do not, iterative numerical optimization procedures are often easy to formulate, since the gradient and the Hessian matrix [2] of the MSE are easy to compute. 4. MSE is widely used simply because it is a convention. Historically, it has been employed extensively for optimizing and assessing a wide variety of signal processing applications, including filter design, signal compression, restoration, denoising, reconstruction, and classification. Moreover, throughout the literature, competing algorithms have most often been compared using the MSE/PSNR. It therefore provides a convenient and extensive standard against which the MSE/PSNR results of new algorithms may be compared. This saves time and effort but further propagates the use of the MSE. WHAT IS WRONG WITH MSE [2]? It is apparent that the MSE possesses many favorable properties for application and analysis, but the reader might point out that a more fundamental issue

View Full Document

UT Arlington EE 5359 - MULTIMEDIA PROCESSING

Sign up for free to view:

This document and 3 million+ documents and flashcards
High quality study guides, lecture notes, practice exams
Course Packets handpicked by editors offering a comprehensive review of your courses
Better Grades Guaranteed


School:
Email:
New Password:
Confirm Password:

This preview shows page 1-2-14-15-30-31 out of 31 pages.

UT Arlington EE 5359 - MULTIMEDIA PROCESSING

Sign up for free to view:

Please select your school