Back to results list
Please use this identifier to cite or link to this item:
|Title:||Enhancing techniques for a standard conforming real-time video codec||Authors:||Lai, Kam-cheong||Keywords:||Hong Kong Polytechnic University -- Dissertations
MPEG (Video coding standard)
|Issue Date:||2002||Publisher:||The Hong Kong Polytechnic University||Abstract:||The ultimate goal of video coding is to achieve a balance between the spatial quality and the temporal quality subject to human judgment with the best coding efficiency and the least computational requirement. Since inter-operability is of paramount importance, several video coding standards have been established such as H.261, H.263, MPEG-1, MPEG-2, and MPEG-4. Among these standards, the most widespread coding structure takes the form of combing a motion-compensation with Discrete Cosine Transform (DCT). Since standards only specify the decoder's syntax, it provides flexibility for improvement. In this thesis, we present new standard- conforming video coding techniques, which can offer better coding efficiency. Efficient motion estimation is essential for achieving good quality video coding. With studying the distribution of the motion vectors obtained by the Full Search algorithm, we optimize the prediction process of the motion estimation by classifying the blocks into four different classes based on a newly proposed group of reference motion vector. Different search patterns and strategies are used for different classes. The proposed algorithm improves the computational requirement and coding efficiency when compared to other fast motion estimation algorithms. Besides, a Weighted Sum of Absolute Difference (WSAD) matching criterion function for motion estimation is proposed. WSAD is designed based on the characteristic of the DCT. WSAD enhances the coding efficiency while the computational overhead is very small. To achieve better video quality in very low bit rate situation, an object-based video coding scheme is proposed. Each frame of a video is segmented into three non-overlapping regions, namely background, non-facial foreground and facial region. Smaller quantization step (better quality) is used for the facial region while a larger quantization step is used for the background to improve the coding efficiency. However, the non-facial foreground is kept in normal coding quality to prevent degradation of important regions other than facial region such as hand movement. The proposed method is real-time ready.
Since the video data encoded in block-based video codec has an inherently variable bit rate, a rate control algorithm is proposed in this thesis. As human judges the quality of the output, we adopt the information of human visual system to our rate control system. The proposed algorithm is able to adjust the operating point of the encoder automatically to a balance region between the spatial and temporal quality. Besides, experimental results show that the proposed rate control algorithm prevents motion jerkiness in condition of bandwidth fluctuation. Finally, we proposed a fast direct spatial filtering algorithm on DCT-based compressed video. As video signals are often compressed for transmission or storage, manipulation of compressed video signals is required. Filtering is a technique often used in these processes. Direct filtering on DCT domain can speedup the filtering process without going through the inverse transform of data from DCT domain to spatial domain. Our proposed method although performs spatial filter and inverse DCT transform at the same time, the computational requirement is less than a fast inverse DCT transform.
|Description:||xiii, 110 leaves : ill. ; 30 cm.
PolyU Library Call No.: [THS] LG51 .H577M EIE 2002 Lai
|URI:||http://hdl.handle.net/10397/3499||Rights:||All rights reserved.|
|Appears in Collections:||Thesis|
Show full item record
Files in This Item:
|b16576688_link.htm||For PolyU Users||162 B||HTML||View/Open|
|b16576688_ir.pdf||For All Users (Non-printable)||3.7 MB||Adobe PDF||View/Open|
Citations as of Mar 19, 2018
Citations as of Mar 19, 2018
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.