DCT-based video downscaling transcoder using split and merge technique

Fung, KT; Siu, WC

doi:10.1109/TIP.2005.863118

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/295

Title:	DCT-based video downscaling transcoder using split and merge technique
Authors:	Fung, KT Siu, WC
Issue Date:	Feb-2006
Source:	IEEE transactions on image processing, Feb. 2006, v. 15, no. 2, p. 394-403
Abstract:	For a conventional downscaling video transcoder, a video server has firstly to decompress the video, perform downscaling operations in the pixel domain, and then recompress it. This is computationally intensive. However, it is difficult to perform video downscaling in the discrete cosine transform (DCT)-domain since the prediction errors of each frame are computed from its immediate past higher resolution frames. Recently, a fast algorithm for DCT domain image downsampling has been proposed to obtain the downsampled version of DCT coefficients with low computational complexity. However, there is a mismatch between the downsampled version of DCT coefficients and the resampled motion vectors. In other words, significant quality degradation is introduced when the derivation of the original motion vectors and the resampled motion vector is large. In this paper, we propose a new architecture to obtain resampled DCT coefficients in the DCT domain by using the split and merge technique. Using our proposed video transcoder architecture, a macroblock is splitted into two regions: dominant region and the boundary region. The dominant region of the macroblock can be transcoded in the DCT domain with low computational complexity and re-encoding error can be avoided. By transcoding the boundary region adaptively, low computational complexity can also be achieved. More importantly, the re-encoding error introduced in the boundary region can be controlled more dynamically. Experimental results show that our proposed video downscaling transcoder can lead to significant computational savings as well as videos with high quality as compared with the conventional approach. The proposed video transcoder is useful for video servers that provide quality service in real-time for heterogeneous clients.
Keywords:	DCT-domain transcoder Downscaling Drift elimination Transcoding Video coding
Publisher:	Institute of Electrical and Electronics Engineers
Journal:	IEEE transactions on image processing
ISSN:	1057-7149
EISSN:	1941-0042
DOI:	10.1109/TIP.2005.863118
Rights:	© 2006 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
Appears in Collections:	Journal/Magazine Article

Files in This Item:

File	Description	Size	Format
split-merge_06.pdf		850.25 kB	Adobe PDF	View/Open

Open Access Information

Status	open access
File Version	Version of Record

Access

View full-text via PolyU eLinks

Show full item record

Page views

244

Last Week
0

Last month

Citations as of Feb 9, 2026

Downloads

161

Citations as of Feb 9, 2026

SCOPUS^TM
Citations

12

Last Week
0

Last month
0

Citations as of May 8, 2026

WEB OF SCIENCE^TM
Citations

7

Last Week
0

Last month
0

Citations as of Apr 23, 2026

Google Scholar^TM

Check

Files in This Item:

Open Access Information

Access

Page views

Downloads

SCOPUSTM Citations

WEB OF SCIENCETM Citations

Google ScholarTM

Altmetric

SCOPUS^TM
Citations

WEB OF SCIENCE^TM
Citations

Google Scholar^TM