Enhancing E-Commerce Customer Segmentation with Fuzzy C-Means Soft Clustering Probabilities
DOI:
https://doi.org/10.30871/jaic.v9i5.10652Keywords:
Clustering, Customer Segmentation, E-Commerce, Fuzzy C-Means, Soft Clustering ProbabilityAbstract
Customer segmentation is of paramount importance in the e-commerce industry, enabling businesses to improve marketing strategies and customer engagement. This study compares the performance of two clustering algorithms, K-Means and Fuzzy C-Means (FCM), using Walmart’s public e-commerce dataset of 550,068 transactions. After preprocessing and normalization, the elbow method was applied to determine the optimal number of clusters, yielding seven clusters for K-Means and eight for FCM. Experimental evaluation based on the silhouette score shows that FCM achieved 0.48, outperforming K-Means which scored 0.36, indicating that FCM generated clusters with stronger cohesion and separation. However, this improvement comes at a computational cost. K-Means consistently required less than 0.02 seconds per run, while FCM averaged 0.3 seconds and peaked at 1.38 seconds when the number of clusters increased, making it approximately 20–30 times slower. Cluster distribution analysis further revealed that K-Means produced an uneven segmentation dominated by a single large cluster, whereas FCM generated a more balanced distribution across its clusters. This demonstrates the advantage of FCM in capturing overlapping and multidimensional customer behaviors through partial memberships, in contrast to the rigid and oversimplify assignments of K-Means. These findings highlight the benefit of adopting FCM for e-commerce segmentation, as it provides more interpretable and actionable insights for personalized marketing. At the same time, the trade-off between clustering quality and computation time suggests that future research should explore optimization techniques such as parallelization, approximate fuzzy clustering, or hybrid models that combine the efficiency of hard clustering with the interpretability of soft clustering.
Downloads
References
[1] L. Li, “Analysis of e-commerce customers’ shopping behavior based on data mining and machine learning,” Soft Computing, Jul. 2023, doi: 10.1007/s00500-023-08903-5.
[2] R.-S. Wu and P.-H. Chou, “Customer segmentation of multiple category data in e-commerce using a soft-clustering approach,” Electronic Commerce Research and Applications, vol. 10, no. 3, pp. 331–341, May 2011, doi: 10.1016/j.elerap.2010.11.002.
[3] R. Punhani, V. P. S. Arora, A. S. Sabitha, and V. K. Shukla, “Segmenting e-commerce customer through data mining techniques,” J. Phys.: Conf. Ser., vol. 1714, no. 1, pp. 012026, Jan. 2021, doi: 10.1088/1742-6596/1714/1/012026.
[4] A. S. Paramita and T. Hariguna, “Comparison of K-Means and DBSCAN algorithms for customer segmentation in e-commerce,” Journal of Digital Market and Digital Currency., vol. 1, no. 1, pp. 43–62, Jun. 2024, doi: 10.47738/jdmdc.v1i1.3.
[5] R. Punhani, V. P. S. Arora, S. Sabitha and V. Kumar Shukla, "Application of clustering algorithm for effective customer segmentation in e-commerce," 2021 International Conference on Computational Intelligence and Knowledge Economy (ICCIKE), Dubai, United Arab Emirates, 2021, pp. 149-154, doi: 10.1109/ICCIKE51210.2021.9410713.
[6] R. J. Kuo, M. N. Alfareza, and T. P. Q. Nguyen, “Genetic based density peak possibilistic fuzzy c-means algorithms to cluster analysis- a case study on customer segmentation,” Engineering Science and Technology, an International Journal, vol. 47, pp. 101525, Nov. 2023, doi: 10.1016/j.jestch.2023.101525.
[7] M. Sivaguru, “Dynamic customer segmentation: a case study using the modified dynamic fuzzy c-means clustering algorithm,” Granular Computing, vol. 8, no. 2, pp. 345–360, Mar. 2023, doi: 10.1007/s41066-022-00335-0.
[8] N. P. P. Yuliari, I. K. G. D. Putra, and N. K. D. Rusjayanti, “Customer segmentation through fuzzy c-means and fuzzy RFM method,” Journal of Theoretical and Applied Information Technology, vol. 3, no. 78, pp. 380, 2015.
[9] Y. Li, X. Chu, D. Tian, J. Feng, and W. Mu, “Customer segmentation using K-means clustering and the adaptive particle swarm optimization algorithm,” Applied Soft Computing, vol. 113, pp. 107924, Dec. 2021, doi: 10.1016/j.asoc.2021.107924.
[10] B. Shen, “E-commerce customer segmentation via unsupervised machine learning,” in The 2nd International Conference on Computing and Data Science, Stanford CA USA: ACM, Jan. 2021, pp. 1–7, doi: 10.1145/3448734.3450775.
[11] M. Alves Gomes and T. Meisen, “A review on customer segmentation methods for personalized customer targeting in e-commerce use cases,” Information Systems and e-Business Management, vol. 21, no. 3, pp. 527–570, Sep. 2023, doi: 10.1007/s10257-023-00640-4.
[12] S. Koul and T. M. Philip, “Customer segmentation techniques on e-commerce,” in 2021 International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), Greater Noida, India, 2021, pp. 135-138, doi: 10.1109/ICACITE51222.2021.9404659.
[13] S. Kumar, R. Rani, S. K. Pippal, and R. Agrawal, “Customer segmentation in e-commerce: K-means vs hierarchical clustering,” TELKOMNIKA (Telecommunication Computing Electronics and Control), vol. 23, no. 1, pp. 119, Feb. 2025, doi: 10.12928/telkomnika.v23i1.26384.
[14] K. Tabianan, S. Velu, and V. Ravi, “K-means clustering approach for intelligent customer segmentation using customer purchase behavior data,” Sustainability, vol. 14, no. 12, pp. 7243, Jun. 2022, doi: 10.3390/su14127243.
[15] L. Rajput and S. N. Singh, “Customer segmentation of e-commerce data using K-means clustering algorithm,” in 2023 13th International Conference on Cloud Computing, Data Science & Engineering (Confluence), Noida, India: IEEE, Jan. 2023, pp. 658–664, doi: 10.1109/Confluence56041.2023.10048834.
[16] S. P. Othayoth and R. Muthalagu, “Customer segmentation using various machine learning techniques,” International Journal of Business Intelligence and Data Mining, vol. 20, no. 4, pp. 480, 2022, doi: 10.1504/IJBIDM.2022.123218.
[17] F. Afrin, M. Al-Amin, and M. Tabassum, “Comparative performance of using PCA with K-means and fuzzy c means clustering for customer segmentation,” International Journal of Scientific and Technology Research, vol. 4, no. 8, pp. 70-74, 2015.
[18] S. E. Hashemi, F. Gholian-Jouybari, and M. Hajiaghaei-Keshteli, “A fuzzy c-means algorithm for optimizing data clustering,” Expert Systems with Applications, vol. 227, pp. 120377, Oct. 2023, doi: 10.1016/j.eswa.2023.120377.
[19] O. N. Purba, D. N. Sitompul, T. H. Harahap, S. R. D. Saragih, and R. F. Siregar, “Application of fuzzy c-means algorithm for clustering customers,” Hanif Journal of Information Systems, vol. 1, no. 1, pp. 26–36, Agu. 2023, doi: 10.56211/hanif.v1i1.8.
[20] K. E. Setiawan, A. Kurniawan, A. Chowanda, and D. Suhartono, “Clustering models for hospitals in Jakarta using fuzzy c-means and k-means,” Procedia Computer Science, vol. 216, pp. 356–363, 2023, doi: 10.1016/j.procs.2022.12.146.
[21] A. Chusyairi and P. R. N. Saputra, “Fuzzy c-means clustering algorithm for grouping health care centers on diarrhea disease,” International Journal of Artificial Intelligence Research, vol. 5, no. 1, Jan. 2021, doi: 10.29099/ijair.v5i1.191.
[22] K. Singh, P. M. Booma, and U. Eaganathan, “E-commerce system for sale prediction using machine learning technique,” Journal of Physics Conference Series, vol. 1712, no. 1, pp. 012042, Dec. 2020, doi: 10.1088/1742-6596/1712/1/012042.
[23] A. Chusyairi and P. R. N. Saputra, “Pengelompokan data puskesmas Banyuwangi dalam pemberian imunisasi menggunakan metode K-means clustering,” Telematika, vol. 12, no. 2, pp. 139–148, Agu. 2019, doi: 10.35671/telematika.v12i2.848.
[24] C. Irfiyanda, R. Andreswari and F. Hamami, “Customer segmentation using fuzzy c-means algorithm in telco industry,” in 2022 International Conference of Science and Information Technology in Smart Administration (ICSINTESA), Denpasar, Bali, Indonesia, 2022, pp. 1-4, doi: 10.1109/ICSINTESA56431.2022.10041585.
[25] B. Song, “A path to implementing a fresh produce e-commerce customer segmentation method based on clustering algorithms,” in 2023 IEEE 7th Information Technology and Mechatronics Engineering Conference (ITOEC), Chongqing, China: IEEE, Sep. 2023, pp. 1047–1050, doi: 10.1109/ITOEC57671.2023.10291762.
[26] S. Munusamy and P. Murugesan, “Modified dynamic fuzzy c-means clustering algorithm – Application in dynamic customer segmentation,” Applied Intelligence, vol. 50, no. 6, pp. 1922–1942, Feb. 2020, doi: 10.1007/s10489-019-01626-x.U.
[27] Rusdiana, I. Ernawati, N. Falih, and A. Arista, “Comparison of distance metrics on fuzzy C-Means algorithm through customer segmentation,” 2022 International Conference on Informatics, Multimedia, Cyber and Information System (ICIMCIS), vol. 4, pp. 307–311, Oct. 2021, doi: 10.1109/icimcis53775.2021.9699206.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Muhamad Iqbal Januadi Putra, Vincent Alexander, Ahmad Chusyairi, Raka Admiral Abdurrahman, Alexander Daniel Pratama

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) ) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).








