Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/114989
| Title: | A SAM-adapted weakly-supervised semantic segmentation method constrained by uncertainty and transformation consistency | Authors: | Cao, YX Huang, X Weng, QH |
Issue Date: | Mar-2025 | Source: | International journal of applied earth observation and geoinformation, Mar. 2025, v. 137, 104440 | Abstract: | Semantic segmentation of remote sensing imagery is a fundamental task to generate pixel-wise category maps. Existing deep learning networks rely heavily on dense pixel-wise labels, incurring high acquisition costs. Given this challenge, this study introduces sparse point labels, a type of cost-effective weak labels, for semantic segmentation. Existing weakly-supervised methods often leverage low-level visual or high-level semantic features from networks to generate supervision information for unlabeled pixels, which can easily lead to the issue of label noises. Furthermore, these methods rarely explore the general-purpose foundation model, segment anything model (SAM), with strong zero-shot generalization capacity in image segmentation. In this paper, we proposed a SAM-adapted weakly-supervised method with three components: 1) an adapted EfficientViT-SAM network (AESAM) for semantic segmentation guided by point labels, 2) an uncertainty-based pseudo-label generation module to select reliable pseudo-labels for supervising unlabeled pixels, and 3) a transformation consistency constraint for enhancing AESAM's robustness to data perturbations. The proposed method was tested on the ISPRS Vaihingen dataset (collected from airplane), the Zurich Summer dataset (satellite), and the UAVid dataset (drone). Results demonstrated a significant improvement in mean F1 (by 5.89 %-10.56 %) and mean IoU (by 5.95 %-11.13 %) compared to the baseline method. Compared to the closest competitors, there was an increase in mean F1 (by 0.83 %-5.29 %) and mean IoU (by 1.04 %-6.54 %). Furthermore, our approach requires only fine-tuning a small number of parameters (0.9 M) using cheap point labels, making it promising for scenarios with limited labeling budgets. The code is available at https://github.com/lauraset/SAM-UTC-WSSS | Keywords: | Segment anything model (SAM) Weakly supervised learning Uncertainty Transformation consistency Semantic segmentation |
Publisher: | Elsevier | Journal: | International journal of applied earth observation and geoinformation | ISSN: | 1569-8432 | EISSN: | 1872-826X | DOI: | 10.1016/j.jag.2025.104440 | Rights: | © 2025 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC license (http://creativecommons.org/licenses/bync/4.0/). The following publication Cao, Y., Huang, X., & Weng, Q. (2025). A SAM-adapted weakly-supervised semantic segmentation method constrained by uncertainty and transformation consistency. International Journal of A is available at https://dx.doi.org/10.1016/j.jag.2025.104440. |
| Appears in Collections: | Journal/Magazine Article |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| 1-s2.0-S1569843225000871-main.pdf | 18.78 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



