CS631 Paper Response Form

Paper Title: MPEG: A Video compression standard for Multimedia applications


Author(s): Didier LeGall


Main Point(s):

The paper describes the MPEG standard for encoding video and leaves issues of sound and the interleaving of sound and video aside. MPEG-video is using 3 types of frames: reference frames (called intraframes) that are basically moderately compressed image frames that serve as access points for random access of the video stream, and two types of frames that interpolate between 2 intraframes, namely 'P frames' ,which are interpolations that are encoded in reference to the preceding intraframe and bidirectionally predicted pictures (so-called B frames) that are used to interpolate images between I and P frames by encoding the predictions with reference to the preceding and the following frame. These 3 types of frames form sequences of the form IBBPBBP..BBIBB.. A motion compensation scheme is used for predicting P and B frames: for a 16x16 block, one or two motion vectors and an additive correction term are computed.Spatial redundancy is exploited similar to JPEG.

Possible Use(s):

This knowledge is useful for anybody editing or processing MPEGed video data, because one has to understand how all these predictions work if one wants to access a particular frame without decompressing the whole video.

Extensions:

One could try out how Ramin Zabih's and Dan Huttenlocher's motion algorithms influence the speed and quality/compression rate of MPEG. Also, the fact that MPEG has already computed motion (at least crudelly by assuming that the whole 16x16 block moves evenly) could be exploited by higher level processing (for example tracking of objects in movies).One could just take the MPEG computed motion vector(s) as a first approximation and do a fine-grained search exploring only small deviations from this basic motion vector.

CS631 home page