Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/113374
DC Field | Value | Language |
---|---|---|
dc.contributor | Department of Logistics and Maritime Studies | en_US |
dc.contributor | Department of Aeronautical and Aviation Engineering | en_US |
dc.creator | Luo, X | en_US |
dc.creator | Yan, P | en_US |
dc.creator | Yan, R | en_US |
dc.creator | Wang, S | en_US |
dc.date.accessioned | 2025-06-04T01:34:24Z | - |
dc.date.available | 2025-06-04T01:34:24Z | - |
dc.identifier.issn | 0160-5682 | en_US |
dc.identifier.uri | http://hdl.handle.net/10397/113374 | - |
dc.language.iso | en | en_US |
dc.publisher | Taylor & Francis | en_US |
dc.subject | Controlled experiment | en_US |
dc.subject | Covariate balance | en_US |
dc.subject | Experiment design | en_US |
dc.subject | High-dimensional samples | en_US |
dc.subject | Partitioning problem | en_US |
dc.title | Covariate balancing for high-dimensional samples in controlled experiments | en_US |
dc.type | Journal/Magazine Article | en_US |
dc.identifier.doi | 10.1080/01605682.2024.2423362 | en_US |
dcterms.abstract | In controlled experiments, achieving covariate balancing across all groups is crucial as it ensures that the estimated treatment effects are not confounded by the effects of covariates. This study proposes a mixed-integer nonlinear programming model to address the covariate balancing problem. Specifically, we introduce a new covariate imbalance measure, which is the maximum discrepancy in both the first and second central moments between any two groups. The second central moment can effectively capture the correlation of covariates in a physical sense, which is crucial for partitioning high-dimensional samples. A mixed-integer nonlinear programming model is constructed to minimize the proposed measure to obtain the optimal partitioning results. The nonlinear model is then linearized to accelerate the optimization process. We conduct computational experiments based on simulated datasets, including one-dimensional, two-dimensional, and three-dimensional Gaussian distributed samples, and a real clinic trial dataset. Compared to the conventional discrepancy-based method, our method achieves a 54.81% and a 40.6% reduction in the maximum discrepancy of partitioning results in the two-dimensional simulated Gaussian samples and the real clinic trial dataset, respectively. These results demonstrate the superiority of the proposed model in partitioning high-dimensional samples with correlated covariates compared with the conventional discrepancy-based method. | en_US |
dcterms.accessRights | embargoed access | en_US |
dcterms.bibliographicCitation | Journal of the Operational Research Society, Published online: 05 Nov 2024, Latest Articles, https://doi.org/10.1080/01605682.2024.2423362 | en_US |
dcterms.isPartOf | Journal of the Operational Research Society | en_US |
dcterms.issued | 2024 | - |
dc.identifier.scopus | 2-s2.0-85208470061 | - |
dc.identifier.eissn | 1476-9360 | en_US |
dc.description.validate | 202506 bcch | en_US |
dc.description.oa | Not applicable | en_US |
dc.identifier.FolderNumber | a3629a | - |
dc.identifier.SubFormID | 50512 | - |
dc.description.fundingSource | Self-funded | en_US |
dc.description.pubStatus | Early release | en_US |
dc.date.embargo | 2025-11-05 | en_US |
dc.description.oaCategory | Green (AAM) | en_US |
Appears in Collections: | Journal/Magazine Article |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.