Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/104374
PIRA download icon_1.1View/Download Full Text
Title: A scene text detector based on deep feature merging
Authors: Zhang, Y
Huang, Y
Zhao, D
Wu, CH
Ip, WH 
Yung, KL 
Issue Date: Aug-2021
Source: Multimedia tools and applications, Aug. 2021, v. 80, no. 19, p. 29005-29016
Abstract: Scene text detection has become an important research topic. It can be broadly applied to much industrial equipment, such as smart phones, intelligent scanners, and IoT devices. Many existing scene text detection methods have achieved advanced performance. However, text in scene images is presented with differing orientations and varying shapes, rendering scene text detection a challenging task. This paper proposes a method for detecting texts in scene images. First, four stages of low-level features is extracted using DenseNet121. Low-level features are then merged by transposed convolution and skip connection. Second, the merged feature map is used to generate a score map, box map, and angle map. Finally, the Locality-Aware Non-Maximum Suppression (LANMS) is applied as post-processing to generate the final bounding box. The proposed method achieves an F-measure of 0.826 on ICDAR 2015 and 0.761 on MSRA-TD500, respectively.
Keywords: Convolutional neural network
Deep feature merging
DenseNet121
Scene text detector
Publisher: Springer New York LLC
Journal: Multimedia tools and applications 
ISSN: 1380-7501
EISSN: 1573-7721
DOI: 10.1007/s11042-021-11101-w
Rights: © Springer Science+Business Media, LLC, part of Springer Nature 2021
This version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use(https://www.springernature.com/gp/open-research/policies/accepted-manuscript-terms), but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1007/s11042-021-11101-w.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Ip_Scene_Text_Detector.pdfPre-Published version1.35 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

111
Last Week
2
Last month
Citations as of Nov 30, 2025

Downloads

94
Citations as of Nov 30, 2025

SCOPUSTM   
Citations

7
Citations as of Dec 19, 2025

WEB OF SCIENCETM
Citations

2
Citations as of Dec 18, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.