Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/116814
| Title: | Generative active learning for image synthesis personalization | Authors: | Zhang, X Zhang, W Wei, X Wu, J Zhang, Z Lei, Z Li, Q |
Issue Date: | 2024 | Source: | In MM ’24: Proceedings of the 32nd ACM International Conference on Multimedia, p. 10669-10677. New York, NY: The Association for Computing Machinery, 2024 | Abstract: | This paper presents a pilot study that explores the application of active learning, traditionally studied in the context of discriminative models, to generative models. We specifically focus on image synthesis personalization tasks. The primary challenge in conducting active learning on generative models lies in the open-ended nature of querying, which differs from the closed form of querying in discriminative models that typically target a single concept. We introduce the concept of anchor directions to transform the querying process into a semi-open problem. We propose a direction-based uncertainty sampling strategy to enable generative active learning and tackle the exploitation-exploration dilemma. Extensive experiments are conducted to validate the effectiveness of our approach, demonstrating that an open-source model can achieve superior performance compared to closed-source models developed by large companies, such as Google's StyleDrop. The source code is available at https://github.com/zhangxulu1996/GAL4Personalization. | Keywords: | Generative active learning Image synthesis Personalization |
Publisher: | The Association for Computing Machinery | ISBN: | 979-8-4007-0686-8 | DOI: | 10.1145/3664647.3680773 | Description: | 32nd ACM International Conference on Multimedia, Melbourne VIC, Australia, 28 October 2024 - 1 November 2024 | Rights: | This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0). ©2024 Copyright held by the owner/author(s). The following publication Zhang, X., Zhang, W., Wei, X., Wu, J., Zhang, Z., Lei, Z., & Li, Q. (2024). Generative Active Learning for Image Synthesis Personalization Proceedings of the 32nd ACM International Conference on Multimedia, Melbourne VIC, Australia is available at https://doi.org/10.1145/3664647.3680773. |
| Appears in Collections: | Conference Paper |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| 3664647.3680773.pdf | 12.44 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



