Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/108014
| Title: | FireDM : a weakly-supervised approach for massive generation of multi-scale and multi-scene fire segmentation datasets | Authors: | Zheng, H Wang, M Wang, Z Huang, X |
Issue Date: | 22-Apr-2024 | Source: | Knowledge-based systems, 22 Apr. 2024, v. 290, 111547 | Abstract: | Data availability and quality are crucial for the development of semantic segmentation techniques. However, creating high-quality fire scene datasets in a safe and efficient manner remains an unsolved challenge. To fill this gap, we introduce FireDM, the first method to generate unlimited fire segmentation datasets at virtually no cost. FireDM takes full advantage of the combined strengths of a combination of pre-trained diffusion models (Stable Diffusion XL 1.0 and Stable Diffusion 2.1) and text-guided diffusion using ChatGPT4-Fire to generate multi-scale and detail-rich fire images. The innovative fire-decoder module in FireDM then efficiently converts the cross-attention and multi-scale feature maps obtained during diffusion into accurate segmentation masks. This process requires only about 100 images and their corresponding segmentation masks for training. In our experiments, we trained the segmentation algorithms using the large-scale segmentation dataset generated by FireDM and all publicly available fire segmentation datasets respectively, and found that the segmentation algorithms trained with the former dataset outperformed the latter by at least 5% or more in terms of IoU, accuracy, F1-score and AP. This demonstrates the capability of FireDM in expanding a limited fire segmentation dataset. Additionally, the datasets generated by FireDM, with their multiple image resolutions, can adapt to the input sizes of different segmentation algorithms, significantly reducing information loss caused by resizing the image (e.g., cropping and scaling). Finally, we have created the world's first high-quality fire segmentation dataset benchmark using FireDM. The complete code and dataset of FireDM are publicly available at https://github.com/ZhengHongtao2001/FireDM. | Keywords: | ChatGPT-Fire Fire segmentation FireDM Multi-scale Stable Diffusion 2.1 Stable Diffusion XL 1.0 |
Publisher: | Elsevier | Journal: | Knowledge-based systems | ISSN: | 0950-7051 | DOI: | 10.1016/j.knosys.2024.111547 | Rights: | © 2024 Elsevier B.V. All rights reserved. © 2024. This manuscript version is made available under the CC-BY-NC-ND 4.0 license https://creativecommons.org/licenses/by-nc-nd/4.0/ The following publication Zheng, H., Wang, M., Wang, Z., & Huang, X. (2024). FireDM: A weakly-supervised approach for massive generation of multi-scale and multi-scene fire segmentation datasets. Knowledge-Based Systems, 290, 111547 is available at https://doi.org/10.1016/j.knosys.2024.111547. |
| Appears in Collections: | Journal/Magazine Article |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| Zheng_Firedm_Weakly-Supervised_Approach.pdf | Pre-Published version | 1.95 MB | Adobe PDF | View/Open |
Page views
104
Citations as of Feb 9, 2026
SCOPUSTM
Citations
17
Citations as of May 8, 2026
WEB OF SCIENCETM
Citations
11
Citations as of Apr 23, 2026
Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



