Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/103877
PIRA download icon_1.1View/Download Full Text
Title: A feature engineering and ensemble learning based approach for repeated buyers prediction
Authors: Zhang, M
Lu, J
Ma, N
Cheng, TCE 
Hua, G
Issue Date: Dec-2022
Source: International journal of computers communications & control, Dec. 2022, v. 17, no. 6, 4988
Abstract: The global e-commerce market is growing at a rapid pace, but the percentage of repeat buyers is low. According to Tmall, the repurchase rate is only 6.1%, while research shows that a 5% increase in the repurchase rate can lead to a 25% to 95% increase in profit. To increase the repurchase rate, merchants need to predict potential repeat buyers and convert them into repurchasers. Therefore, it is necessary to predict repeat buyers. In this paper we build a prediction model of repeat purchasers using Tmall's dataset. First, we build high-quality feature engineering for e-commerce scenarios by manual construction and algorithmic selection. We introduce the synthetic minority oversampling technique (SMOTE) algorithm to solve the data imbalance problem and improve prediction performance. Then we train classical classifiers including factorization machine and logistic regression, and ensemble learning classifiers including extreme gradient boosting, and light gradient boosting machine machines. Finally, we construct a two-layer fusion model based on the Stacking algorithm to further enhance prediction performance. The results show that through a series of innovations such as data imbalance processing, feature engineering, and fusion models, the model area under curve (AUC) value is improved by 0.01161. Our findings provide important implications for managing e-commerce platforms and the platform merchants.
Keywords: Feature engineering
Ensemble learning
Fusion model
Repeat buyer prediction
Publisher: Universitatea Agora
Journal: International journal of computers communications & control 
ISSN: 1841-9836
EISSN: 1841-9844
DOI: 10.15837/ijccc.2022.6.4988
Rights: Copyright © 2022 Mingyang Zhang, Jiayue Lu, Ning Ma, T.C. Edwin Cheng, Guowei Hua
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/).
The following publication Zhang, M., Lu, J., Ma, N., Cheng, T. E., & Hua, G. (2022). A Feature Engineering and Ensemble Learning Based Approach for Repeated Buyers Prediction. International journal of computers communications & control, 17(6), 4988 is available at https://doi.org/10.15837/ijccc.2022.6.4988.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Zhang_Feature_Engineering_Ensemble.pdf1.15 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

183
Last Week
2
Last month
Citations as of Nov 9, 2025

Downloads

49
Citations as of Nov 9, 2025

SCOPUSTM   
Citations

6
Citations as of Dec 19, 2025

WEB OF SCIENCETM
Citations

3
Citations as of Dec 18, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.