Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/108014
PIRA download icon_1.1View/Download Full Text
Title: FireDM : a weakly-supervised approach for massive generation of multi-scale and multi-scene fire segmentation datasets
Authors: Zheng, H 
Wang, M 
Wang, Z 
Huang, X 
Issue Date: 22-Apr-2024
Source: Knowledge-based systems, 22 Apr. 2024, v. 290, 111547
Abstract: Data availability and quality are crucial for the development of semantic segmentation techniques. However, creating high-quality fire scene datasets in a safe and efficient manner remains an unsolved challenge. To fill this gap, we introduce FireDM, the first method to generate unlimited fire segmentation datasets at virtually no cost. FireDM takes full advantage of the combined strengths of a combination of pre-trained diffusion models (Stable Diffusion XL 1.0 and Stable Diffusion 2.1) and text-guided diffusion using ChatGPT4-Fire to generate multi-scale and detail-rich fire images. The innovative fire-decoder module in FireDM then efficiently converts the cross-attention and multi-scale feature maps obtained during diffusion into accurate segmentation masks. This process requires only about 100 images and their corresponding segmentation masks for training. In our experiments, we trained the segmentation algorithms using the large-scale segmentation dataset generated by FireDM and all publicly available fire segmentation datasets respectively, and found that the segmentation algorithms trained with the former dataset outperformed the latter by at least 5% or more in terms of IoU, accuracy, F1-score and AP. This demonstrates the capability of FireDM in expanding a limited fire segmentation dataset. Additionally, the datasets generated by FireDM, with their multiple image resolutions, can adapt to the input sizes of different segmentation algorithms, significantly reducing information loss caused by resizing the image (e.g., cropping and scaling). Finally, we have created the world's first high-quality fire segmentation dataset benchmark using FireDM. The complete code and dataset of FireDM are publicly available at https://github.com/ZhengHongtao2001/FireDM.
Keywords: ChatGPT-Fire
Fire segmentation
FireDM
Multi-scale
Stable Diffusion 2.1
Stable Diffusion XL 1.0
Publisher: Elsevier
Journal: Knowledge-based systems 
ISSN: 0950-7051
DOI: 10.1016/j.knosys.2024.111547
Rights: © 2024 Elsevier B.V. All rights reserved.
© 2024. This manuscript version is made available under the CC-BY-NC-ND 4.0 license https://creativecommons.org/licenses/by-nc-nd/4.0/
The following publication Zheng, H., Wang, M., Wang, Z., & Huang, X. (2024). FireDM: A weakly-supervised approach for massive generation of multi-scale and multi-scene fire segmentation datasets. Knowledge-Based Systems, 290, 111547 is available at https://doi.org/10.1016/j.knosys.2024.111547.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Zheng_Firedm_Weakly-Supervised_Approach.pdfPre-Published version1.95 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

104
Citations as of Feb 9, 2026

SCOPUSTM   
Citations

17
Citations as of May 8, 2026

WEB OF SCIENCETM
Citations

11
Citations as of Apr 23, 2026

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.