Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/118831
| Title: | Constrained human preference alignment for natural language planning with LLMs | Authors: | Zhou, Y Hong, H Cheng, R Tan, KC |
Issue Date: | 2025 | Source: | In 2025 International Conference on Machine Intelligence and Nature-Inspired Computing (MIND), 31 October - 2 November 2025, Xiamen, China, p. 88-89 | Abstract: | Recent advances in large language models (LLMs) have established them as promising candidates for natural language planning tasks. However, existing approaches often fail to address two critical challenges: 1) the effective alignment of LLM-generated plans with human preferences, and 2) the dynamic enforcement of diverse constraints inherent in planning scenarios. To bridge these gaps, we propose a constraint-aware human-preference alignment framework for natural language planning. Our contributions are threefold. First, we design a process reward model that aligns LLM outputs with human preferences through step-by-step feedback, facilitating efficient and interpretable preference learning. Second, we develop a constraint-aware mechanism integrated into the rewriting strategy, which dynamically penalizes violations of task-specific constraints at each reasoning step. Third, we introduce a unified adaptive metric enabling a multifaceted assessment of planning quality. We validate our framework through experiments on planning benchmarks, demonstrating improvements in success rate with constraints and human preference alignment over baselines. | Keywords: | Constraint LLM Planning Preference alignment |
Publisher: | Institute of Electrical and Electronics Engineers, Inc. | ISBN: | 979-8-3315-8768-0 (Compliant PDF Files) 979-8-3315-8767-3 (Conference USB Version) 979-8-3315-8769-7 (Print on Demand(PoD)) |
DOI: | 10.1109/MIND67540.2025.11351754 | Description: | 2025 International Conference on Machine Intelligence and Nature-Inspired Computing (MIND), 31 October - 2 November 2025, Xiamen, China | Rights: | © 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The following publication Y. Zhou, H. Hong, R. Cheng and K. C. Tan, "Constrained Human Preference Alignment for Natural Language Planning with LLMs," 2025 International Conference on Machine Intelligence and Nature-Inspired Computing (MIND), Xiamen, China, 2025, pp. 88-89 is available at https://doi.org/10.1109/MIND67540.2025.11351754. |
| Appears in Collections: | Conference Paper |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| Zhou_Constrained_Human_Preference.pdf | Pre-Published version | 868.76 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



