Introduction to JPEG and MPEG - PowerPoint PPT Presentation

About This Presentation

Title:

Introduction to JPEG and MPEG

Description:

D. MacKay, 'Information Theory, Inference and learning ... Greyscale. Binary. Nov 27th 2006. Ingemar J. Cox. 21. UCL Adastral Park Postgraduate Campus ... – PowerPoint PPT presentation

Number of Views:508

Avg rating:3.0/5.0

Slides: 69

Provided by: csI88

Category:

more less

Transcript and Presenter's Notes

Title: Introduction to JPEG and MPEG

1
Introduction to JPEG and MPEG

Ingemar J. Cox
University College London

2
Outline

Elementary information theory
Lossless compression
Quantization
Fundamentals of images
Discrete Cosine Transform (DCT)
JPEG
MPEG-1, MPEG-2

3
Bibliography

D. MacKay, Information Theory, Inference and
learning Algorithms, Cambridge University Press,
2003. http//www.inference.phy.cam.ac.uk/itprnn/bo
ok.html
W. B. Pennebaker and J. L. Mitchell, JPEG Still
Image Data Compression Standard, Chapman Hall,
1993 (ISBN 0-442-01272-1).
G. K. Wallace, The JPEG Still-Picture
Compression Standard, IEEE Trans. On Consumer
Electronics, 38, 1, 18-34, 1992.
http//en.wikipedia.org/wiki/JPEG

4
Bibliography

http//en.wikipedia.org/wiki/MPEG-2
T. Sikora, MPEG Digital Video-Coding Standards,
IEEE Signal Processing Magazine, 82-100,
September 1997

5
Elementary Information Theory
6
Elementary Information Theory

How much information does a symbol convey?
Intuitively, the more unpredictable or surprising
it is, the more information is conveyed.
Conversely, if we strongly expected something,
and it occurs, we have not learnt very much

7
Elementary Information Theory

If p is the probability that a symbol will occur
Then the amount of information, I, conveyed
is
The information, I, is measured in bits
It is the optimum code length for the symbol

8
Elementary Information Theory

The entropy, H, is the average information per
symbol
Provides a lower bound on the compression that
can be achieved

9
Elementary Information theory

A simple example. Suppose we need to transmit
four possible weather conditions
Sunny
Cloudy
Rainy
Snowy
If all conditions are equally likely, p(s)0.25,
and H2
i.e. we need a minimum of 2 bits per symbol

10
Elementary information theory

Suppose instead that it is
Sunny 0.5 of the time
Cloudy 0.25 of the time
Rainy 0.125 of the time, and
Snowy 0.125 of the time
Then the entropy is

11
Elementary Information Theory

Variable length codewords
Huffman code integer code lengths
Arithmetic codes non-integer code lengths

12
Elementary Information Theory

Huffman code

Weather Probability Information Integer code
Sunny 0.5 1 0
Cloudy 0.25 2 10
Rainy 0.125 3 110
Snowy 0.125 3 111
13
Elementary Information Theory

Previous illustration is an example of a lossless
code
I.e. we are able to recover the information
exactly

14
Elementary Information Theory

Note that we have assumed that each symbol is
independent of the other symbols
I.e. the current symbol provides no information
regarding the next symbol

15
Quantization

Quantization is the process of approximating a
continuous (or range of values) by a (much)
smaller range of values
Where Round(y) rounds y to the nearest integer
? is the quantization stepsize

16
Quantization

Example ?2

0
1
-3
-2
-1
2
3
4
5
-5
-4
0
-1
1
2
-2
0
-2
2
4
-4
17
Quantization

Quantization plays an important role in lossy
compression
This is where the loss happens

18
Fundamentals of Images
19
Fundamentals of images

An image consists of pixels (picture elements)
Each pixel represents luminance (and colour)
Typically, 8-bits per pixel

20
Fundamentals of images

Colour
Colour spaces (representations)
RGB (red-green-blue)
CMY (cyan-magenta-yellow)
YUV
Y 0.3R0.6G0.1B (luminance)
UR-Y
VB-Y
Greyscale
Binary

21
Fundamentals of images

A TV frame is about 640x480 pixels
If each pixels is represented by 8-bits for each
colour, then the total image size is
6404803921,600 bytes or ?7.4Mbits
At 30 frames per second, this would be
? 220Mbits/second

22
Fundamentals of images

Do we need all these bits?

23
Fundamentals of images

Here is an image represented with 8-bits per pixel

24
Fundamentals of images

Here is the same image at 7-bits per pixel

25
Fundamentals of images

And at 6-bits per pixel

26
Fundamentals of images

And at 5-bits per pixel

27
Fundamentals of images

And at 4-bits per pixel

28
Fundamentals of images

Do we need all these bits?
No!
The previous example illustrated the eyes
sensitivity to luminance
We can build a perceptual model
Only code what is important to the human visual
system (HVS)
Usually a function of spatial frequency

29
Fundamentals of Images

Just as audio has temporal frequencies
Images have spatial frequencies
Transforms
Fourier transform
Discrete cosine transform
Wavelet transform
Hadamard transform

30
Discrete cosine transform

Forward DCT
Inverse DCT

31
Basis functions

DC term

32
Basis functions

First term

33
Basis functions

Second term

34
Basis functions

Third term

35
Basis functions

Fourth term

36
Basis functions

Fifth term

37
Basis functions

Sixth term

38
Basis functions

Seventh term

39
DCT Example
40
Example

Signal

41
Example

DCT coefficients are
4.2426
0
-3.1543
0
0
0
-0.2242
0

42
Example DCT decomposition

DC term

43
Example DCT decomposition

2nd AC term

44
Example DCT decomposition

6th AC term

45
Example summation of DCT terms

First two non-zero coefficients

46
Example summation of DCT terms

All 3 non-zero coefficients

47
Example

What if we quantize DCT coefficients?
?1
Quantized DCT coefficients are
4
0
-3
0
0
0
0
0

48
Example

Approximate reconstruction

49
Example

Exact reconstruction

50
2-D DCT Transform

Let i(x,y) represent an image with N rows and M
columns
Its DCT I(u,v) is given by
where

51
Fundamentals of images

Discrete cosine transform
Coefficients are approximately uncorrelated
Except DC term
C.f. original 88 pixel block
Concentrates more power in the low frequency
coefficients
Computationally efficient
Block-based DCT
Compute DCT on 88 blocks of pixels

52
Fundamentals of images

Basis functions for the 88 DCT (courtesy
Wikipedia)

53
Fundamentals of JPEG
54
Fundamentals of JPEG
Encoder
DCT
Quantizer
Entropy coder
Compressed image data
IDCT
Dequantizer
Entropy decoder
Decoder
55
Fundamentals of JPEG

JPEG works on 88 blocks
Extract 88 block of pixels
Convert to DCT domain
Quantize each coefficient
Different stepsize for each coefficient
Based on sensitivity of human visual system
Order coefficients in zig-zag order
Entropy code the quantized values

56
Fundamentals of JPEG

A common quantization table is

16 11 10 16 24 40 51 61
12 12 14 19 26 58 60 55
14 13 16 24 40 57 69 56
14 17 22 29 51 87 80 62
18 22 37 56 68 109 103 77
24 35 55 64 81 104 113 92
49 64 78 87 103 121 120 101
72 92 95 98 112 100 103 99
57
Fundamentals of JPEG

Zig-zag ordering

0 1 5 6 14 15 27 28
2 4 7 13 16 26 29 42
3 8 12 17 25 30 41 43
9 11 18 24 31 40 44 53
10 19 23 32 39 45 52 54
20 22 33 38 46 51 55 60
21 34 37 47 50 56 59 61
35 36 48 49 57 58 62 63
58
Fundamentals of JPEG