Enhancing E-Commerce Customer Segmentation with Fuzzy C-Means Soft Clustering Probabilities

Authors

  • Muhamad Iqbal Januadi Putra Universitas Siber Asia
  • Vincent Alexander Universitas Tarumanagara
  • Ahmad Chusyairi Universitas Siber Asia
  • Raka Admiral Abdurrahman Politeknik Negeri Malang
  • Alexander Daniel Pratama Institut Teknologi Bandung

DOI:

https://doi.org/10.30871/jaic.v9i5.10652

Keywords:

Clustering, Customer Segmentation, E-Commerce, Fuzzy C-Means, Soft Clustering Probability

Abstract

Customer segmentation is of paramount importance in the e-commerce industry, enabling businesses to improve marketing strategies and customer engagement. This study compares the performance of two clustering algorithms, K-Means and Fuzzy C-Means (FCM), using Walmart’s public e-commerce dataset of 550,068 transactions. After preprocessing and normalization, the elbow method was applied to determine the optimal number of clusters, yielding seven clusters for K-Means and eight for FCM. Experimental evaluation based on the silhouette score shows that FCM achieved 0.48, outperforming K-Means which scored 0.36, indicating that FCM generated clusters with stronger cohesion and separation. However, this improvement comes at a computational cost. K-Means consistently required less than 0.02 seconds per run, while FCM averaged 0.3 seconds and peaked at 1.38 seconds when the number of clusters increased, making it approximately 20–30 times slower. Cluster distribution analysis further revealed that K-Means produced an uneven segmentation dominated by a single large cluster, whereas FCM generated a more balanced distribution across its clusters. This demonstrates the advantage of FCM in capturing overlapping and multidimensional customer behaviors through partial memberships, in contrast to the rigid and oversimplify assignments of K-Means. These findings highlight the benefit of adopting FCM for e-commerce segmentation, as it provides more interpretable and actionable insights for personalized marketing. At the same time, the trade-off between clustering quality and computation time suggests that future research should explore optimization techniques such as parallelization, approximate fuzzy clustering, or hybrid models that combine the efficiency of hard clustering with the interpretability of soft clustering.

Downloads

Download data is not yet available.

References

[1] L. Li, “Analysis of e-commerce customers’ shopping behavior based on data mining and machine learning,” Soft Computing, Jul. 2023, doi: 10.1007/s00500-023-08903-5.

[2] R.-S. Wu and P.-H. Chou, “Customer segmentation of multiple category data in e-commerce using a soft-clustering approach,” Electronic Commerce Research and Applications, vol. 10, no. 3, pp. 331–341, May 2011, doi: 10.1016/j.elerap.2010.11.002.

[3] R. Punhani, V. P. S. Arora, A. S. Sabitha, and V. K. Shukla, “Segmenting e-commerce customer through data mining techniques,” J. Phys.: Conf. Ser., vol. 1714, no. 1, pp. 012026, Jan. 2021, doi: 10.1088/1742-6596/1714/1/012026.

[4] A. S. Paramita and T. Hariguna, “Comparison of K-Means and DBSCAN algorithms for customer segmentation in e-commerce,” Journal of Digital Market and Digital Currency., vol. 1, no. 1, pp. 43–62, Jun. 2024, doi: 10.47738/jdmdc.v1i1.3.

[5] R. Punhani, V. P. S. Arora, S. Sabitha and V. Kumar Shukla, "Application of clustering algorithm for effective customer segmentation in e-commerce," 2021 International Conference on Computational Intelligence and Knowledge Economy (ICCIKE), Dubai, United Arab Emirates, 2021, pp. 149-154, doi: 10.1109/ICCIKE51210.2021.9410713.

[6] R. J. Kuo, M. N. Alfareza, and T. P. Q. Nguyen, “Genetic based density peak possibilistic fuzzy c-means algorithms to cluster analysis- a case study on customer segmentation,” Engineering Science and Technology, an International Journal, vol. 47, pp. 101525, Nov. 2023, doi: 10.1016/j.jestch.2023.101525.

[7] M. Sivaguru, “Dynamic customer segmentation: a case study using the modified dynamic fuzzy c-means clustering algorithm,” Granular Computing, vol. 8, no. 2, pp. 345–360, Mar. 2023, doi: 10.1007/s41066-022-00335-0.

[8] N. P. P. Yuliari, I. K. G. D. Putra, and N. K. D. Rusjayanti, “Customer segmentation through fuzzy c-means and fuzzy RFM method,” Journal of Theoretical and Applied Information Technology, vol. 3, no. 78, pp. 380, 2015.

[9] Y. Li, X. Chu, D. Tian, J. Feng, and W. Mu, “Customer segmentation using K-means clustering and the adaptive particle swarm optimization algorithm,” Applied Soft Computing, vol. 113, pp. 107924, Dec. 2021, doi: 10.1016/j.asoc.2021.107924.

[10] B. Shen, “E-commerce customer segmentation via unsupervised machine learning,” in The 2nd International Conference on Computing and Data Science, Stanford CA USA: ACM, Jan. 2021, pp. 1–7, doi: 10.1145/3448734.3450775.

[11] M. Alves Gomes and T. Meisen, “A review on customer segmentation methods for personalized customer targeting in e-commerce use cases,” Information Systems and e-Business Management, vol. 21, no. 3, pp. 527–570, Sep. 2023, doi: 10.1007/s10257-023-00640-4.

[12] S. Koul and T. M. Philip, “Customer segmentation techniques on e-commerce,” in 2021 International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), Greater Noida, India, 2021, pp. 135-138, doi: 10.1109/ICACITE51222.2021.9404659.

[13] S. Kumar, R. Rani, S. K. Pippal, and R. Agrawal, “Customer segmentation in e-commerce: K-means vs hierarchical clustering,” TELKOMNIKA (Telecommunication Computing Electronics and Control), vol. 23, no. 1, pp. 119, Feb. 2025, doi: 10.12928/telkomnika.v23i1.26384.

[14] K. Tabianan, S. Velu, and V. Ravi, “K-means clustering approach for intelligent customer segmentation using customer purchase behavior data,” Sustainability, vol. 14, no. 12, pp. 7243, Jun. 2022, doi: 10.3390/su14127243.

[15] L. Rajput and S. N. Singh, “Customer segmentation of e-commerce data using K-means clustering algorithm,” in 2023 13th International Conference on Cloud Computing, Data Science & Engineering (Confluence), Noida, India: IEEE, Jan. 2023, pp. 658–664, doi: 10.1109/Confluence56041.2023.10048834.

[16] S. P. Othayoth and R. Muthalagu, “Customer segmentation using various machine learning techniques,” International Journal of Business Intelligence and Data Mining, vol. 20, no. 4, pp. 480, 2022, doi: 10.1504/IJBIDM.2022.123218.

[17] F. Afrin, M. Al-Amin, and M. Tabassum, “Comparative performance of using PCA with K-means and fuzzy c means clustering for customer segmentation,” International Journal of Scientific and Technology Research, vol. 4, no. 8, pp. 70-74, 2015.

[18] S. E. Hashemi, F. Gholian-Jouybari, and M. Hajiaghaei-Keshteli, “A fuzzy c-means algorithm for optimizing data clustering,” Expert Systems with Applications, vol. 227, pp. 120377, Oct. 2023, doi: 10.1016/j.eswa.2023.120377.

[19] O. N. Purba, D. N. Sitompul, T. H. Harahap, S. R. D. Saragih, and R. F. Siregar, “Application of fuzzy c-means algorithm for clustering customers,” Hanif Journal of Information Systems, vol. 1, no. 1, pp. 26–36, Agu. 2023, doi: 10.56211/hanif.v1i1.8.

[20] K. E. Setiawan, A. Kurniawan, A. Chowanda, and D. Suhartono, “Clustering models for hospitals in Jakarta using fuzzy c-means and k-means,” Procedia Computer Science, vol. 216, pp. 356–363, 2023, doi: 10.1016/j.procs.2022.12.146.

[21] A. Chusyairi and P. R. N. Saputra, “Fuzzy c-means clustering algorithm for grouping health care centers on diarrhea disease,” International Journal of Artificial Intelligence Research, vol. 5, no. 1, Jan. 2021, doi: 10.29099/ijair.v5i1.191.

[22] K. Singh, P. M. Booma, and U. Eaganathan, “E-commerce system for sale prediction using machine learning technique,” Journal of Physics Conference Series, vol. 1712, no. 1, pp. 012042, Dec. 2020, doi: 10.1088/1742-6596/1712/1/012042.

[23] A. Chusyairi and P. R. N. Saputra, “Pengelompokan data puskesmas Banyuwangi dalam pemberian imunisasi menggunakan metode K-means clustering,” Telematika, vol. 12, no. 2, pp. 139–148, Agu. 2019, doi: 10.35671/telematika.v12i2.848.

[24] C. Irfiyanda, R. Andreswari and F. Hamami, “Customer segmentation using fuzzy c-means algorithm in telco industry,” in 2022 International Conference of Science and Information Technology in Smart Administration (ICSINTESA), Denpasar, Bali, Indonesia, 2022, pp. 1-4, doi: 10.1109/ICSINTESA56431.2022.10041585.

[25] B. Song, “A path to implementing a fresh produce e-commerce customer segmentation method based on clustering algorithms,” in 2023 IEEE 7th Information Technology and Mechatronics Engineering Conference (ITOEC), Chongqing, China: IEEE, Sep. 2023, pp. 1047–1050, doi: 10.1109/ITOEC57671.2023.10291762.

[26] S. Munusamy and P. Murugesan, “Modified dynamic fuzzy c-means clustering algorithm – Application in dynamic customer segmentation,” Applied Intelligence, vol. 50, no. 6, pp. 1922–1942, Feb. 2020, doi: 10.1007/s10489-019-01626-x.U.

[27] Rusdiana, I. Ernawati, N. Falih, and A. Arista, “Comparison of distance metrics on fuzzy C-Means algorithm through customer segmentation,” 2022 International Conference on Informatics, Multimedia, Cyber and Information System (ICIMCIS), vol. 4, pp. 307–311, Oct. 2021, doi: 10.1109/icimcis53775.2021.9699206.

Downloads

Published

2025-10-08

How to Cite

[1]
M. I. J. Putra, V. Alexander, A. Chusyairi, R. A. Abdurrahman, and A. D. Pratama, “Enhancing E-Commerce Customer Segmentation with Fuzzy C-Means Soft Clustering Probabilities”, JAIC, vol. 9, no. 5, pp. 2418–2425, Oct. 2025.

Similar Articles

1 2 3 4 5 > >> 

You may also start an advanced similarity search for this article.