Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/118831
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Data Science and Artificial Intelligenceen_US
dc.creatorZhou, Yen_US
dc.creatorHong, Hen_US
dc.creatorCheng, Ren_US
dc.creatorTan, KCen_US
dc.date.accessioned2026-05-20T06:43:24Z-
dc.date.available2026-05-20T06:43:24Z-
dc.identifier.isbn979-8-3315-8768-0 (Compliant PDF Files)en_US
dc.identifier.isbn979-8-3315-8767-3 (Conference USB Version)en_US
dc.identifier.isbn979-8-3315-8769-7 (Print on Demand(PoD))en_US
dc.identifier.urihttp://hdl.handle.net/10397/118831-
dc.description2025 International Conference on Machine Intelligence and Nature-Inspired Computing (MIND), 31 October - 2 November 2025, Xiamen, Chinaen_US
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineers, Inc.en_US
dc.rights© 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication Y. Zhou, H. Hong, R. Cheng and K. C. Tan, "Constrained Human Preference Alignment for Natural Language Planning with LLMs," 2025 International Conference on Machine Intelligence and Nature-Inspired Computing (MIND), Xiamen, China, 2025, pp. 88-89 is available at https://doi.org/10.1109/MIND67540.2025.11351754.en_US
dc.subjectConstrainten_US
dc.subjectLLMen_US
dc.subjectPlanningen_US
dc.subjectPreference alignmenten_US
dc.titleConstrained human preference alignment for natural language planning with LLMsen_US
dc.typeConference Paperen_US
dc.identifier.spage88en_US
dc.identifier.epage89en_US
dc.identifier.doi10.1109/MIND67540.2025.11351754en_US
dcterms.abstractRecent advances in large language models (LLMs) have established them as promising candidates for natural language planning tasks. However, existing approaches often fail to address two critical challenges: 1) the effective alignment of LLM-generated plans with human preferences, and 2) the dynamic enforcement of diverse constraints inherent in planning scenarios. To bridge these gaps, we propose a constraint-aware human-preference alignment framework for natural language planning. Our contributions are threefold. First, we design a process reward model that aligns LLM outputs with human preferences through step-by-step feedback, facilitating efficient and interpretable preference learning. Second, we develop a constraint-aware mechanism integrated into the rewriting strategy, which dynamically penalizes violations of task-specific constraints at each reasoning step. Third, we introduce a unified adaptive metric enabling a multifaceted assessment of planning quality. We validate our framework through experiments on planning benchmarks, demonstrating improvements in success rate with constraints and human preference alignment over baselines.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIn 2025 International Conference on Machine Intelligence and Nature-Inspired Computing (MIND), 31 October - 2 November 2025, Xiamen, China, p. 88-89en_US
dcterms.issued2025-
dc.relation.ispartofbook2025 International Conference on Machine Intelligence and Nature-Inspired Computing (MIND), 31 October - 2 November 2025, Xiamen, Chinaen_US
dc.relation.conferenceMachine Intelligence and Nature-Inspired Computing [MIND]en_US
dc.description.validate202605 bcchen_US
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumbera4427b-
dc.identifier.SubFormID52774-
dc.description.fundingSourceRGCen_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextThis work was supported in part by National Natural Science Foundation of China (Grant No. U21A20512), Research Grants Council of the Hong Kong SAR (Grant No. C5052-23G, PolyU15229824, PolyU15218622, PolyU15215623), and The Hong Kong Polytechnic University (Project IDs: P0053758, P0051130, P0052694).en_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
Zhou_Constrained_Human_Preference.pdfPre-Published version868.76 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.