
Feature Selection and Adaptive Synthetic Sampling Approach for Optimizing Online Shopper Purchase Intent Prediction

EasyChair Preprint no. 6624

5 pages • Date: September 16, 2021


This paper proposes a novel approach for optimizing online shopper purchase intent prediction using feature selection combined with Adaptive Synthetic Sampling (ADASYN). A supervised learning technique is applied to predict, based on the features, whether a customer's visit ends with a purchase. However, not all features are important for predicting the classes. In addition, the class imbalance problem may lead to suboptimal performance. Therefore, we propose Information Gain and Correlation feature selection to select the most important features. ADASYN is additionally used to address the class imbalance problem by adaptively generating new synthetic samples of the minority class while taking the density distribution into account. The proposed approach is run using a Random Forest classifier. The results indicate that ADASYN effectively improves classification performance in terms of accuracy, precision, recall, and F1-score. We compared feature selection combined with ADASYN against previous works, and the results indicate that our proposed approach outperforms all of them. We additionally use a statistical test to show that our results are statistically significant. These results suggest that our proposed approach is promising for optimizing classification performance.
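To make the density-aware sampling idea concrete, the following is a minimal pure-Python sketch of an ADASYN-style generator. It is not the authors' implementation: the function name `adasyn_sketch`, the parameters `k`, `beta`, and `seed`, the toy data, and the uniform-weight fallback are all assumptions made for illustration. Minority samples with more majority-class neighbours receive a larger share of the synthetic points.

```python
import math
import random


def adasyn_sketch(X, y, minority=1, k=3, beta=1.0, seed=0):
    """Return synthetic minority samples (illustrative sketch, hypothetical API)."""
    rng = random.Random(seed)
    minority_idx = [i for i, label in enumerate(y) if label == minority]
    majority_count = len(y) - len(minority_idx)
    # Number of synthetic samples needed; beta=1.0 targets a balanced set.
    total = int((majority_count - len(minority_idx)) * beta)
    if total <= 0 or len(minority_idx) < 2:
        return []

    def neighbours(i, candidates):
        # k nearest candidates to sample i (Euclidean distance), excluding i.
        others = [j for j in candidates if j != i]
        others.sort(key=lambda j: math.dist(X[i], X[j]))
        return others[:k]

    # r_i: share of majority-class points among the k nearest neighbours.
    ratios = []
    for i in minority_idx:
        nn = neighbours(i, range(len(X)))
        ratios.append(sum(1 for j in nn if y[j] != minority) / len(nn))
    norm = sum(ratios)
    if norm == 0:
        # Assumption: fall back to uniform weights when no minority sample
        # has majority neighbours.
        ratios = [1.0] * len(minority_idx)
        norm = float(len(minority_idx))

    synthetic = []
    for i, r in zip(minority_idx, ratios):
        g = round(r / norm * total)  # samples allotted to this minority point
        nn_min = neighbours(i, minority_idx)
        for _ in range(g):
            j = rng.choice(nn_min)
            lam = rng.random()
            # Interpolate between x_i and a randomly chosen minority neighbour.
            synthetic.append([a + lam * (b - a) for a, b in zip(X[i], X[j])])
    return synthetic


# Toy data (hypothetical): six majority points, three minority points.
X = [[0, 0], [0, 1], [1, 0], [1, 1], [0.5, 0.5], [0.2, 0.8],
     [3, 3], [3.5, 3], [1.2, 1.2]]
y = [0, 0, 0, 0, 0, 0, 1, 1, 1]
synthetic = adasyn_sketch(X, y)
```

In this toy set the minority point at (1.2, 1.2) sits among majority points, so it is weighted more heavily than the two points near (3, 3). In practice a maintained implementation, followed by a classifier such as Random Forest, would likely be used instead of this sketch. Because each per-sample count is rounded, the number of generated points can differ slightly from the target.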

Keyphrases: Adaptive synthetic sampling, ADASYN, Class Imbalance Problem, feature selection, Filter-based feature selection, Imbalanced dataset, Information Gain, machine learning, online shoppers' purchasing intention, Random Forest, statistical test

BibTeX entry
BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:
@booklet{EasyChair:6624,
  author = {Rizal Dwi Prayogo and Siti Amatullah Karimah},
  title = {Feature Selection and Adaptive Synthetic Sampling Approach for Optimizing Online Shopper Purchase Intent Prediction},
  howpublished = {EasyChair Preprint no. 6624},
  year = {EasyChair, 2021}}