Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/103877
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Logistics and Maritime Studiesen_US
dc.creatorZhang, Men_US
dc.creatorLu, Jen_US
dc.creatorMa, Nen_US
dc.creatorCheng, TCEen_US
dc.creatorHua, Gen_US
dc.date.accessioned2024-01-10T02:41:09Z-
dc.date.available2024-01-10T02:41:09Z-
dc.identifier.issn1841-9836en_US
dc.identifier.urihttp://hdl.handle.net/10397/103877-
dc.language.isoenen_US
dc.publisherUniversitatea Agoraen_US
dc.rightsCopyright © 2022 Mingyang Zhang, Jiayue Lu, Ning Ma, T.C. Edwin Cheng, Guowei Huaen_US
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/).en_US
dc.rightsThe following publication Zhang, M., Lu, J., Ma, N., Cheng, T. E., & Hua, G. (2022). A Feature Engineering and Ensemble Learning Based Approach for Repeated Buyers Prediction. International journal of computers communications & control, 17(6), 4988 is available at https://doi.org/10.15837/ijccc.2022.6.4988.en_US
dc.subjectFeature engineeringen_US
dc.subjectEnsemble learningen_US
dc.subjectFusion modelen_US
dc.subjectRepeat buyer predictionen_US
dc.titleA feature engineering and ensemble learning based approach for repeated buyers predictionen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.volume17en_US
dc.identifier.issue6en_US
dc.identifier.doi10.15837/ijccc.2022.6.4988en_US
dcterms.abstractThe global e-commerce market is growing at a rapid pace, but the percentage of repeat buyers is low. According to Tmall, the repurchase rate is only 6.1%, while research shows that a 5% increase in the repurchase rate can lead to a 25% to 95% increase in profit. To increase the repurchase rate, merchants need to predict potential repeat buyers and convert them into repurchasers. Therefore, it is necessary to predict repeat buyers. In this paper we build a prediction model of repeat purchasers using Tmall's dataset. First, we build high-quality feature engineering for e-commerce scenarios by manual construction and algorithmic selection. We introduce the synthetic minority oversampling technique (SMOTE) algorithm to solve the data imbalance problem and improve prediction performance. Then we train classical classifiers including factorization machine and logistic regression, and ensemble learning classifiers including extreme gradient boosting, and light gradient boosting machine machines. Finally, we construct a two-layer fusion model based on the Stacking algorithm to further enhance prediction performance. The results show that through a series of innovations such as data imbalance processing, feature engineering, and fusion models, the model area under curve (AUC) value is improved by 0.01161. Our findings provide important implications for managing e-commerce platforms and the platform merchants.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationInternational journal of computers communications & control, Dec. 2022, v. 17, no. 6, 4988en_US
dcterms.isPartOfInternational journal of computers communications & controlen_US
dcterms.issued2022-12-
dc.identifier.isiWOS:000916072300006-
dc.identifier.scopus2-s2.0-85144952380-
dc.identifier.eissn1841-9844en_US
dc.identifier.artn4988en_US
dc.description.validate202401 bcvcen_US
dc.description.oaVersion of Recorden_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextNatural Science Foundation of China; Beijing Forestry University 2021 Course Ideological and Political Teaching, Research and Teaching Reform Project, Management Model and Basic Decision-making projecten_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryCCen_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Zhang_Feature_Engineering_Ensemble.pdf1.15 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

183
Last Week
2
Last month
Citations as of Nov 9, 2025

Downloads

49
Citations as of Nov 9, 2025

SCOPUSTM   
Citations

6
Citations as of Dec 19, 2025

WEB OF SCIENCETM
Citations

3
Citations as of Dec 18, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.