Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/105547
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Computingen_US
dc.creatorYang, Gen_US
dc.creatorGeng, Yen_US
dc.creatorLi, Qen_US
dc.creatorYou, Jen_US
dc.creatorCai, Men_US
dc.date.accessioned2024-04-15T07:34:58Z-
dc.date.available2024-04-15T07:34:58Z-
dc.identifier.urihttp://hdl.handle.net/10397/105547-
dc.language.isoenen_US
dc.publisherSociety for Imaging Science and Technologyen_US
dc.rights© 2020, Society for Imaging Science and Technologyen_US
dc.rightsReprinted with permission of IS&T: The Society for Imaging Science and Technology sole copyright owners of Electronic Imaging: Imaging and Multimedia Analytics in a Web and Mobile World 2020en_US
dc.titleA new training model for object detection in aerial imagesen_US
dc.typeConference Paperen_US
dc.identifier.spage84-1en_US
dc.identifier.epage84-5en_US
dc.identifier.doi10.2352/ISSN.2470-1173.2020.8.IMAWM-084en_US
dcterms.abstractThis paper presents a new training model for orientation invariant object detection in aerial images by extending a deep learning based RetinaNet which is a single-stage detector based on feature pyramid networks and focal loss for dense object detection. Unlike R3Det which applies feature refinement to handle rotating objects, we proposed further improvement to cope with the densely arranged and class imbalance problems in aerial imaging, on three aspects: 1) All training images are traversed in each iteration instead of only one image in each iteration in order to cover all possibilities; 2) The learning rate is reduced if losses are not reduced; and 3) The learning rate is reduced if losses are not changed. The proposed method was calibrated and validated by comprehensive for performance evaluation and benchmarking. The experiment results demonstrate the significant improvement in comparison with R3Dec approach on the same data set. In addition to the well-known public data set DOTA for benchmarking, a new data set is also established by considering the balance between the training set and testing set. The map of losses which dropped down smoothly without jitter and overfitting also illustrates the advantages of the proposed newmodel.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationProceedings of IS&T International Symposium on Electronic Imaging: Imaging and Multimedia Analytics in a Web and Mobile World, San Francisco, USA, 26-30 Jan.2020, p. 084-1 - 084-5en_US
dcterms.issued2020-
dc.identifier.scopus2-s2.0-85095111444-
dc.description.validate202402 bcchen_US
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberCOMP-0417-
dc.description.fundingSourceOthersen_US
dc.description.fundingTextGuangdong & Shenzhen grantsen_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS26108888-
dc.description.oaCategoryPublisher permissionen_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
EI_2020_art00002_Geng-YANG.pdf8.66 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

167
Last Week
6
Last month
Citations as of Nov 9, 2025

Downloads

38
Citations as of Nov 9, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.