Constrained human preference alignment for natural language planning with LLMs

Zhou, Y; Hong, H; Cheng, R; Tan, KC

doi:10.1109/MIND67540.2025.11351754

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/118831

Title:	Constrained human preference alignment for natural language planning with LLMs
Authors:	Zhou, Y Hong, H Cheng, R Tan, KC
Issue Date:	2025
Source:	In 2025 International Conference on Machine Intelligence and Nature-Inspired Computing (MIND), 31 October - 2 November 2025, Xiamen, China, p. 88-89
Abstract:	Recent advances in large language models (LLMs) have established them as promising candidates for natural language planning tasks. However, existing approaches often fail to address two critical challenges: 1) the effective alignment of LLM-generated plans with human preferences, and 2) the dynamic enforcement of diverse constraints inherent in planning scenarios. To bridge these gaps, we propose a constraint-aware human-preference alignment framework for natural language planning. Our contributions are threefold. First, we design a process reward model that aligns LLM outputs with human preferences through step-by-step feedback, facilitating efficient and interpretable preference learning. Second, we develop a constraint-aware mechanism integrated into the rewriting strategy, which dynamically penalizes violations of task-specific constraints at each reasoning step. Third, we introduce a unified adaptive metric enabling a multifaceted assessment of planning quality. We validate our framework through experiments on planning benchmarks, demonstrating improvements in success rate with constraints and human preference alignment over baselines.
Keywords:	Constraint LLM Planning Preference alignment
Publisher:	Institute of Electrical and Electronics Engineers, Inc.
ISBN:	979-8-3315-8768-0 (Compliant PDF Files) 979-8-3315-8767-3 (Conference USB Version) 979-8-3315-8769-7 (Print on Demand(PoD))
DOI:	10.1109/MIND67540.2025.11351754
Description:	2025 International Conference on Machine Intelligence and Nature-Inspired Computing (MIND), 31 October - 2 November 2025, Xiamen, China
Rights:	© 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The following publication Y. Zhou, H. Hong, R. Cheng and K. C. Tan, "Constrained Human Preference Alignment for Natural Language Planning with LLMs," 2025 International Conference on Machine Intelligence and Nature-Inspired Computing (MIND), Xiamen, China, 2025, pp. 88-89 is available at https://doi.org/10.1109/MIND67540.2025.11351754.
Appears in Collections:	Conference Paper

Files in This Item:

File	Description	Size	Format
Zhou_Constrained_Human_Preference.pdf	Pre-Published version	868.76 kB	Adobe PDF	View/Open

Open Access Information

Status	open access
File Version	Final Accepted Manuscript

Access

View full-text via PolyU eLinks

Show full item record

Google Scholar^TM

Check

Files in This Item:

Open Access Information

Access

Google ScholarTM

Altmetric

Google Scholar^TM