Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/108348
PIRA download icon_1.1View/Download Full Text
Title: Large language model-enhanced algorithm selection : towards comprehensive algorithm representation
Authors: Wu, X 
Zhong, Y
Wu, J 
Jiang, B
Tan, KC 
Issue Date: 2024
Source: Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence : Jeju, Korea, 3-9 August 2024, p. 5235-5244
Abstract: Algorithm selection, a critical process of automated machine learning, aims to identify the most suitable algorithm for solving a specific problem prior to execution. Mainstream algorithm selection techniques heavily rely on problem features, while the role of algorithm features remains largely unexplored. Due to the intrinsic complexity of algorithms, effective methods for universally extracting algorithm information are lacking. This paper takes a significant step towards bridging this gap by introducing Large Language Models (LLMs) into algorithm selection for the first time. By comprehending the code text, LLM not only captures the structural and semantic aspects of the algorithm, but also demonstrates contextual awareness and library function understanding. The high-dimensional algorithm representation extracted by LLM, after undergoing a feature selection module, is combined with the problem representation and passed to the similarity calculation module. The selected algorithm is determined by the matching degree between a given problem and different algorithms. Extensive experiments validate the performance superiority of the proposed model and the efficacy of each key module. Furthermore, we present a theoretical upper bound on model complexity, showcasing the influence of algorithm representation and feature selection modules. This provides valuable theoretical guidance for the practical implementation of our method.
Publisher: International Joint Conference on Artificial Intelligence
DOI: 10.24963/ijcai.2024/579
Rights: Posted with permission of the publisher.
Copyright © 2024 International Joint Conferences on Artificial Intelligence
All rights reserved. No part of this book may be reproduced in any form by any electronic or mechanical means (including photocopying, recording, or information storage and retrieval) without permission in writing from the publisher.
Appears in Collections:Conference Paper

Files in This Item:
File Description SizeFormat 
Wu_Large_Language_Model-Enhanced.pdf387.48 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

206
Citations as of Jan 5, 2025

Downloads

187
Citations as of Jan 5, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.