Segmentation of Generation Z Spending Habits Using the K-Means Clustering Algorithm: An Empirical Study on Financial Behavior Patterns

Authors

  • Gunawan Sylvester Universitas Amikom Yogyakarta
  • Majid Rahardi Universitas Amikom Yogyakarta

DOI:

https://doi.org/10.30871/jaic.v9i6.11506

Keywords:

Financial Decision-Making, Generation Z, K-Means Clustering, Segmentation, Spending Behavior

Abstract

Generation Z, born between 1997 and 2012, exhibits unique consumption behaviors shaped by digital technology, modern lifestyles, and evolving financial decision-making patterns. This study segments their financial behavior using the K-Means clustering algorithm applied to the “Generation Z Money Spending” dataset from Kaggle. In addition to K-Means, alternative clustering algorithms—K-Medoids and Hierarchical Clustering—are evaluated to compare their effectiveness in identifying behavioral patterns. The dataset consists of 1,700 individuals with 15 numerical spending attributes, including rent, food, entertainment, education, savings, and investments. All data were normalized using Min-Max Scaling prior to clustering. The analysis identifies six distinct clusters, ranging from highly consumption-oriented groups (with higher spending on entertainment and online shopping) to financially conscious groups prioritizing savings and investments. A quantitative approach was used, incorporating exploratory data analysis, correlation testing, and the Elbow Method to determine the optimal number of clusters. The optimal cluster count of six is supported by a Davies-Bouldin Index (DBI) score of 2.412, indicating acceptable but improvable cluster separation. Each cluster displays unique characteristics: Cluster 0 (average age 20.6) focuses on savings and investments with moderate essential spending; Cluster 1 (average age 23.6) prioritizes education and higher rent expenses; Cluster 2 (average age 20.3) is digitally oriented, spending more on online shopping and entertainment; Cluster 3 (average age 25.2) demonstrates financial stability with balanced expenditures; Cluster 4 (average age 24.9) emphasizes savings and investments with moderate living costs; and Cluster 5 (average age 24.96) combines strong saving habits with balanced essential and leisure spending. Model performance was assessed using the Davies-Bouldin Index, Silhouette Score, and Calinski-Harabasz Index to ensure comprehensive evaluation of cluster quality. The findings highlight the diverse spending behaviors of Generation Z, offering valuable insights for businesses, policymakers, and financial service providers to develop targeted strategies aligned with each segment’s characteristics.

Downloads

Download data is not yet available.

References

[1] L. Sekar Arum, Amira Zahrani, and N. A. Duha, “Karakteristik Generasi Z dan Kesiapannya dalam Menghadapi Bonus Demografi 2030,” Account. Student Res. J., vol. 2, no. 1, pp. 59–72, Mar. 2023, doi: 10.62108/asrj.v2i1.5812.

[2] Muhammad Adnan Faidh, Muhamad Esa Maulana, Ninda Ela Putri, Siti Indriyani Putri, Thasya Azhari Munir, and April Laksana, “Peran Media Sosial X Dalam Perkembangan Komunikasi Di Era Digital,” Konsensus J. Ilmu Pertahanan, Huk. dan Ilmu Komun., vol. 1, no. 6, pp. 43–51, 2024, doi: 10.62383/konsensus.v1i6.433.

[3] A. Jordan and K. Nuringsih, “Understanding Financial Behavior in Generation Z,” Int. J. Appl. Econ. Bus., vol. 1, no. 4, pp. 2535–2546, 2023, doi: 10.24912/ijaeb.v1i4.2535-2546.

[4] S. Mu and J. Jurana, “Financial Behavior Patterns Of Generation Z : Netnographic Analysis Of The Fear Of Missing Out ( Fomo ) Phenomenon,” pp. 23–34, 2025.

[5] K. Tabianan, S. Velu, and V. Ravi, “K-Means Clustering Approach for Intelligent Customer Segmentation Using Customer Purchase Behavior Data,” Sustainability, vol. 14, no. 12, p. 7243, Jun. 2022, doi: 10.3390/su14127243.

[6] F. Akibun, H. Prayitno, R. A. Z, and N. M. Otto, “Financial Literacy In Gen Z Generation ( Case Study at Bina Taruna University Gorontalo ),” no. 2, pp. 1–8, 2025.

[7] J. M. Rodriguez, “The Mediation of Financial Behavior to Financial Literacy and Spending Habits of Gen Z : An Exploratory Factor Analysis,” vol. 5, no. 2, 2024.

[8] J. Chitra and J. Heikal, “Customer segmentation using the K-Means Clustering algorithm in Foreign Banks in Indonesia,” Indones. Account. Res. J., vol. 11, no. 4, pp. 230–241, 2024.

[9] N. Jain and V. Ahuja, “Segmenting online consumers using K-means cluster analysis,” Int. J. Logist. Econ. Glob., vol. 6, no. 2, p. 161, 2014, doi: 10.1504/IJLEG.2014.068274.

[10] Z. Liu, Y. Li, C. Liu, X. Zhao, and W. Yin, “Application of K-Means Clustering Algorithm in Analyzing College Students’ Mental Health,” in 2024 3rd International Conference on Artificial Intelligence and Autonomous Robot Systems (AIARS), IEEE, Jul. 2024, pp. 175–180. doi: 10.1109/AIARS63200.2024.00038.

[11] E. Omol, D. Onyangor, L. Mburu, and P. Abuonji, “Application Of K-Means Clustering For Customer Segmentation In Grocery Stores In Kenya,” Int. J. Sci. Technol. Manag., vol. 5, no. 1, pp. 192–200, Jan. 2024, doi: 10.46729/ijstm.v5i1.1024.

[12] Z. Zhu and N. Liu, “Early Warning of Financial Risk Based on K-Means Clustering Algorithm,” Complexity, vol. 2021, 2021, doi: 10.1155/2021/5571683.

[13] C. Wongoutong, “The impact of neglecting feature scaling in k-means clustering,” PLoS One, vol. 19, no. 12, p. e0310839, Dec. 2024, doi: 10.1371/journal.pone.0310839.

[14] M. Prasad and S. T, “Clustering Accuracy Improvement Using Modified Min-Max Normalization Technique,” Nov. 07, 2024. doi: 10.20944/preprints202411.0486.v1.

[15] K. D. Tran, H. T. Phan, C. T. K. Nguyen, B. C. Nguyen, H. V. Le, and K. D. Nguyen, “Solvent-Free Synthesis of Co-Based Zeolitic Imidazolate Framework (ZIF-9) for the Removal of Congo Red from Water,” Indones. J. Chem., vol. 25, no. 1, p. 178, Jan. 2025, doi: 10.22146/ijc.99141.

[16] E. Sulaiman, N. Nopriyeni, C. Darwin, and A. Lusianti, “Diversity Of Liana Plants Available In The Konak Protected Forest Area, Kepahiang District, Kepahiang Regency,” J. Pembelajaran dan Biol. Nukl., vol. 8, no. 3, pp. 820–830, Nov. 2022, doi: 10.36987/jpbn.v8i3.3170.

[17] A. R. F. Falih, R. Kurniawan, Y. Arie Wijaya, and S. Anwar, “Algoritma K-Mean Untuk Optimalisasi Model Clustering Data Penjualan Toko Online Di Tiktok Shop Dalam Strategi Pemasaran,” J. Sist. Inf. Kaputama, vol. 9, no. 1, pp. 1–11, 2025, doi: 10.59697/jsik.v9i1.929.

[18] M. Zhulal, S. A. Marits, and S. Herman, “Generation Z Purchasing Behavior Profile in the Digital Economy: Normative Analysis in Online Markets,” J. Ilm. Manaj. Kesatuan, vol. 12, no. 1, pp. 1–8, 2024, doi: 10.37641/jimkes.v12i1.2326.

[19] N. Sureja, B. Chawda, and A. Vasant, “An improved K-medoids clustering approach based on the crow search algorithm,” J. Comput. Math. Data Sci., vol. 3, p. 100034, Jun. 2022, doi: 10.1016/j.jcmds.2022.100034.

[20] A. Gere, “Recommendations for validating hierarchical clustering in consumer sensory projects,” Curr. Res. Food Sci., vol. 6, p. 100522, 2023, doi: 10.1016/j.crfs.2023.100522.

[21] M. Hahsler, M. Piekenbrock, and D. Doran, “dbscan : Fast Density-Based Clustering with R,” J. Stat. Softw., vol. 91, no. 1, 2019, doi: 10.18637/jss.v091.i01.

[22] A. M. Ikotun, A. E. Ezugwu, L. Abualigah, B. Abuhaija, and J. Heming, “K-means clustering algorithms: A comprehensive review, variants analysis, and advances in the era of big data,” Inf. Sci. (Ny)., vol. 622, pp. 178–210, Apr. 2023, doi: 10.1016/j.ins.2022.11.139.

[23] M. Zubair, M. A. Iqbal, A. Shil, M. J. M. Chowdhury, M. A. Moni, and I. H. Sarker, “An Improved K-means Clustering Algorithm Towards an Efficient Data-Driven Modeling,” Ann. Data Sci., vol. 11, no. 5, pp. 1525–1544, Oct. 2024, doi: 10.1007/s40745-022-00428-2.

[24] M. Ahmed, R. Seraj, and S. M. S. Islam, “The k-means Algorithm: A Comprehensive Survey and Performance Evaluation,” Electronics, vol. 9, no. 8, p. 1295, Aug. 2020, doi: 10.3390/electronics9081295.

[25] K. N. Salam, A. W. T. F. Singkeruang, M. F. Husni, B. Baharuddin, and D. P. A.R, “Gen-Z Marketing Strategies: Understanding Consumer Preferences and Building Sustainable Relationships,” Golden Ratio Mapp. Idea Lit. Format, vol. 4, no. 1, pp. 53–77, 2024, doi: 10.52970/grmilf.v4i1.351.

[26] E. U. Oti, M. O. Olusola, F. C. Eze, and S. U. Enogwe, “Comprehensive Review of K-Means Clustering Algorithms,” Int. J. Adv. Sci. Res. Eng., vol. 07, no. 08, pp. 64–69, 2021, doi: 10.31695/IJASRE.2021.34050.

[27] D. A. Tarigan, “Optimization of the K-Means Clustering Algorithm Using Davies Bouldin Index in Iris Data Classification,” Media Online), vol. 4, no. 1, pp. 545–552, 2023, doi: 10.30865/klik.v4i1.964.

[28] P. J. Rousseeuw, “Silhouettes: A graphical aid to the interpretation and validation of cluster analysis,” J. Comput. Appl. Math., vol. 20, pp. 53–65, Nov. 1987, doi: 10.1016/0377-0427(87)90125-7.

[29] H. Hassani, M. Kalantari, and C. Beneki, “Comparative Assessment of Hierarchical Clustering Methods for Grouping in Singular Spectrum Analysis,” AppliedMath, vol. 1, no. 1, pp. 18–36, Dec. 2021, doi: 10.3390/appliedmath1010003.

Downloads

Published

2025-12-06

How to Cite

[1]
G. Sylvester and M. Rahardi, “Segmentation of Generation Z Spending Habits Using the K-Means Clustering Algorithm: An Empirical Study on Financial Behavior Patterns”, JAIC, vol. 9, no. 6, pp. 3244–3258, Dec. 2025.

Similar Articles

1 2 3 4 5 > >> 

You may also start an advanced similarity search for this article.