Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/118066
PIRA download icon_1.1View/Download Full Text
Title: Computationally efficient likelihood-based estimation and variable selection for the Cox model with incomplete covariates
Authors: Kwok, NS 
Wong, KY 
Issue Date: Jun-2026
Source: Statistics and computing, June 2026, v. 36, no. 3, 98
Abstract: Regression analysis with missing data is a long-standing and challenging problem, particularly when there are many missing variables with arbitrary missing patterns. Likelihood-based methods, although theoretically appealing, are often computationally inefficient or even infeasible when dealing with a large number of missing variables. In this paper, we consider the Cox regression model with incomplete covariates that are missing at random. We develop an expectation-maximization (EM) algorithm for nonparametric maximum likelihood estimation, employing a transformation technique in the E-step so that it involves only a one-dimensional integration. This innovation makes our methods computationally tractable even when the number of missing variables is large. In addition, for variable selection, we extend the proposed EM algorithm to accommodate a Lasso penalty in the likelihood. We demonstrate the feasibility and advantages of the proposed methods by large-scale simulation studies and apply the proposed methods to a cancer genomic study.
Keywords: EM algorithm
Lasso
Missing data
Nonparametric maximum likelihood estimation
Penalized regression
Survival analysis
Publisher: Springer New York LLC
Journal: Statistics and computing 
ISSN: 0960-3174
EISSN: 1573-1375
DOI: 10.1007/s11222-026-10849-1
Rights: © The Author(s) 2026
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
The following publication Kwok, N.S., Wong, K.Y. Computationally efficient likelihood-based estimation and variable selection for the Cox model with incomplete covariates. Stat Comput 36, 98 (2026) is available at https://doi.org/10.1007/s11222-026-10849-1.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
s11222-026-10849-1.pdf764.05 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.