Please use this identifier to cite or link to this item:
Title: Learning to evaluate and enhance image quality
Authors: Zeng, Hui
Degree: Ph.D.
Issue Date: 2021
Abstract: The explosive growth of digital images and their ubiquitous usage in our daily life raise two fundamental challenges to the image processing community and digital camera manufacturers. The first one is how to automatically and reliably evaluate the perceptual quality of the huge amounts of images without additional information. The second one is how to design more intelligent and efficient algorithms to further enhance the image quality. In this thesis, we embrace the recent advance of artificial intelligence, especially the powerful deep convolutional neural network (CNN), to tackle these two challenges. Regarding the first challenge, we make in-depth analyses on the limitations of existing blind image quality assessment (BIQA) and image aesthetic assessment (IAA) methods, and propose a new solution to each of the two problems. In the first work, we propose a probabilistic quality representation (PQR) approach to describe the image subjective score distribution, whereby a more robust loss function can be employed to train more effective deep BIQA models. In the second work, we reveal, for the first time to our best knowledge, the inherent relationship of the three IAA tasks, and propose a unified probabilistic formulation to handle all of them. We also propose a method to fit the aesthetic scores to a more stable and discriminative score distribution, which contributes to learning more competitive IAA models on all the three tasks. To address the second challenge, we propose a new formulation to improve the image composition and a novel method to enhance the color and tone of images. Specifically, in the third work, we revisit the image cropping task and propose a more reasonable formulation, namely grid anchor based image cropping (GAIC). Under the new formulation, we construct a new cropping benchmark and define more reliable evaluation metrics. We also design an effective and lightweight cropping model by considering the special properties of image cropping, and achieve robust image cropping performance with high efficiency. In the fourth work, we propose to learn image-adaptive 3D lookup tables (LUTs) to achieve high-performance and real-time photo enhancement. In addition to obtaining clear state-of-the-art performance on two general-purpose photo enhancement benchmarks, we also successfully apply the image-adaptive 3D LUT model to the portrait photo retouching (PPR) task with two dedicated improvements. In summary, in this thesis we introduce human domain knowledge into the formulation and learning of task-specific CNN models, providing new perspectives on designing more effective image processing and computational photography models. The proposed image quality and aesthetic assessment methods can be widely used to monitor the huge amounts of images uploaded to internet and recommend high-quality ones to users, while the proposed enhancement methods can be deployed in digital camera devices and image processing software to automatically generate more visually pleasing images.
Subjects: Image processing -- Digital techniques
Neural networks (Computer science)
Imaging systems -- Image quality
Hong Kong Polytechnic University -- Dissertations
Pages: xx, 171 pages : color illustrations
Appears in Collections:Thesis

Show full item record

Page views

Last Week
Last month
Citations as of May 28, 2023

Google ScholarTM


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.