Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/116814
PIRA download icon_1.1View/Download Full Text
Title: Generative active learning for image synthesis personalization
Authors: Zhang, X 
Zhang, W 
Wei, X 
Wu, J
Zhang, Z
Lei, Z
Li, Q 
Issue Date: 2024
Source: In MM ’24: Proceedings of the 32nd ACM International Conference on Multimedia, p. 10669-10677. New York, NY: The Association for Computing Machinery, 2024
Abstract: This paper presents a pilot study that explores the application of active learning, traditionally studied in the context of discriminative models, to generative models. We specifically focus on image synthesis personalization tasks. The primary challenge in conducting active learning on generative models lies in the open-ended nature of querying, which differs from the closed form of querying in discriminative models that typically target a single concept. We introduce the concept of anchor directions to transform the querying process into a semi-open problem. We propose a direction-based uncertainty sampling strategy to enable generative active learning and tackle the exploitation-exploration dilemma. Extensive experiments are conducted to validate the effectiveness of our approach, demonstrating that an open-source model can achieve superior performance compared to closed-source models developed by large companies, such as Google's StyleDrop. The source code is available at https://github.com/zhangxulu1996/GAL4Personalization.
Keywords: Generative active learning
Image synthesis
Personalization
Publisher: The Association for Computing Machinery
ISBN: 979-8-4007-0686-8
DOI: 10.1145/3664647.3680773
Description: 32nd ACM International Conference on Multimedia, Melbourne VIC, Australia, 28 October 2024 - 1 November 2024
Rights: This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0).
©2024 Copyright held by the owner/author(s).
The following publication Zhang, X., Zhang, W., Wei, X., Wu, J., Zhang, Z., Lei, Z., & Li, Q. (2024). Generative Active Learning for Image Synthesis Personalization Proceedings of the 32nd ACM International Conference on Multimedia, Melbourne VIC, Australia is available at https://doi.org/10.1145/3664647.3680773.
Appears in Collections:Conference Paper

Files in This Item:
File Description SizeFormat 
3664647.3680773.pdf12.44 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.