Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/103132
| DC Field | Value | Language |
|---|---|---|
| dc.contributor | Department of Computing | - |
| dc.creator | Li, Shuai | - |
| dc.identifier.uri | https://theses.lib.polyu.edu.hk/handle/200/12707 | - |
| dc.language.iso | English | - |
| dc.title | Exploring efficient feature extractor and label assigner for object detection | - |
| dc.type | Thesis | - |
| dcterms.abstract | With the rapid development of deep learning techniques, various types of object detectors have been continuously developed to push the boundaries of detection performance. Modern detectors commonly encounter two core issues that significantly affect the final performance: feature extraction and label assignment. Therefore, in this thesis, we aim to design an efficient feature extractor and label assigner for generic object detection. | - |
| dcterms.abstract | In Chapter 1, we provide an overview of the pipeline of widely-used object detectors and discuss the contributions and organization of this thesis. In Chapter 2, we focus on improving the anchor feature extraction process in one-stage detectors. Anchor features are fundamental training units in object detection and they are extracted from the image feature produced by the backbone. We introduce two efficient modules to enhance this process. The first is a bi-directional feature fusion module that combines both low-level detail information and high-level semantic information to enrich the image feature. The second is the dynamic anchor feature selection module, which aligns the receptive field of anchor features with the anchor shape. The anchor features extracted in this way are precise and robust, effectively easing the training burden of the detector. In Chapter 3, we introduce a dual weighting (DW) label assignment scheme for NMS-based one-stage detectors. The primary goal of label assignment is to assign a positive or negative label to each anchor to facilitate the training process. To provide finer supervision signals, we propose a method that assigns each anchor both a soft positive label and a soft negative label, which is achieved through two carefully designed assigners. DW is fully compatible with the detection evaluation metric and can significantly enhance the detector’s performance without introducing any additional parameters. | - |
| dcterms.abstract | In Chapter 4, we introduce a one-to-few label assignment (LA) method for end-to-end (NMS-free) dense detection. Our approach combines the advantages of one-to-one LA and one-to-many LA by gradually reducing the number of positive training samples from ‘many’ to ‘one’ during the training process. By doing so, the detector can learn a robust feature representation and prevent the occurrence of duplicated predictions. In Chapter 5, we delve into the LA problem in the realm of unsupervised domain adaptation (UDA) for object detection. To tackle this issue, we put forward a novel approach called the Category Dictionary Guided (CDG) UDA model, which aims to generate more reliable pseudo labels. In essence, our approach involves learning several category dictionaries from the source domain, and then utilizing them to represent the samples in the target domain. The residual of the representation is used as a metric to select high-quality pseudo labels. | - |
| dcterms.abstract | To summarize, this thesis presents four approaches that aim at enhancing the feature extraction and label assignment processes in object detection, including a bi-directional and dynamic anchor feature extractor, a dual weighting label assigner for NMS-based detector, a one-to-few label assigner for end-to-end dense detection, and a category dictionary guided label assigner for cross-domain detection. The efficiency and effectiveness of these methods have been demonstrated through extensive experiments on several benchmarks. | - |
| dcterms.accessRights | open access | - |
| dcterms.educationLevel | Ph.D. | - |
| dcterms.extent | xviii, 120 pages : color illustrations | - |
| dcterms.issued | 2023 | - |
| dcterms.LCSH | Image processing -- Digital techniques | - |
| dcterms.LCSH | Image analysis -- Data processing | - |
| dcterms.LCSH | Computer vision | - |
| dcterms.LCSH | Hong Kong Polytechnic University -- Dissertations | - |
| Appears in Collections: | Thesis | |
Access
View full-text via https://theses.lib.polyu.edu.hk/handle/200/12707
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.


