Comparison of K-Means and K-Medoids for PKH Priority Clustering

Authors

  • M. Ahsanul Fikri Department of Informatics Engineering, Universitas Nahdlatul Ulama Sunan Giri
  • Ifnu Dwi Wisma Prastya Department of Informatics Engineering, Universitas Nahdlatul Ulama Sunan Giri
  • Alif Yuanita Kartini Department of Informatics Engineering, Universitas Nahdlatul Ulama Sunan Giri

DOI:

https://doi.org/10.30871/jaic.v10i2.12267

Keywords:

Clustering, K-Means, K-Medoids, Silhouette Score, Family Hope Programme

Abstract

The Family Hope Program (PKH) is a government policy aimed at poverty alleviation and protecting vulnerable groups. Differences in the number and composition of PKH recipients between sub-districts indicate variations in the level of social vulnerability and potential regional inequality that require objective mapping. This study aims to identify the concentration of socially vulnerable groups and inequality between sub-districts in Bojonegoro Regency through a data-based cluster approach. The data used comes from the Bojonegoro Regency Social Service which includes PKH recipient components, namely early childhood, SD, SMP, SMA, people with severe disabilities, the elderly, and pregnant women. This study compares several clustering scenarios including the K-Means and K-Medoids algorithms, variations in the number of clusters (k = 2 and k = 3), Euclidean and Manhattan distance functions, and data conditions without normalization and with Min–Max normalization. The determination of the number of clusters is done using the Elbow method, while cluster quality is evaluated using the Silhouette Score. The results of the study show that the best scenario was obtained in K-Medoids with Euclidean distance and Min–Max normalization at k = 3 with a Silhouette Score value of 0,8115. The clustering results divided the sub-districts into three categories, namely high clusters (many) with 10 sub-districts, medium with 13 sub-districts, and low (few) with 5 sub-districts. These findings can be used as a basis for determining the priority of policy interventions, especially in high clusters, so that the distribution of PKH becomes more targeted and is able to reduce social inequality between regions.

Downloads

Download data is not yet available.

References

[1] A. Sofianto, “Implementasi Program Keluarga Harapan (PKH) di Provinsi Jawa Tengah,” Sosio Konsepsia J. Penelit. dan Pengemb. Kesejaht. Sos., vol. 10, no. 1 SE-Articles, pp. 14–31, Dec. 2020, doi: 10.33007/ska.v10i1.2091.

[2] N. Hidayatingsih and A. R. Sofa, “Implementasi pendidikan karakter Islami dalam Program Keluarga Harapan (PKH) untuk masyarakat pedesaan: Studi kasus di Desa Dawuhan,” Karakter J. Ris. Ilmu Pendidik. Islam, vol. 2, no. 2, pp. 11–25, 2025.

[3] M. K. Saidi, Z. Abdussamad, and R. Tantu, “Dampak Sosial Masyarakat pada Program PKH di Desa Dulupi Kecamatan Dulupi Kabupaten Boalemo,” Res. Rev. J. Ilm. Multidisiplin, vol. 4, no. 1, pp. 585–591, 2025.

[4] Y. Long et al., “Spatially informed clustering, integration, and deconvolution of spatial transcriptomics with GraphST,” Nat. Commun., vol. 14, 2023, [Online]. Available: https://api.semanticscholar.org/CorpusID:255478092

[5] J. Macqueen, “Some Methods For Classification And Analysis Of Multivariate Observations,” vol. 233, no. 233, pp. 281–297.

[6] Z. Huang, H. Zheng, C. Li, and C. Che, “Application of Machine Learning-Based K-means Clustering for Financial Fraud Detection,” Acad. J. Sci. Technol., 2024, [Online]. Available: https://api.semanticscholar.org/CorpusID:269451472

[7] A. Y. Kartini, “Algoritma K-Means Dan K- Medoids Untuk Pengelompokan Kecamatan Penerima Bantuan Sosial di Kabupaten Bojonegoro,” 2023.

[8] “Comparison of K-Means and K-Medoids Clustering for Grouping The Sub-Districts In Bojonegoro Regency Based On Educational Supporting Factors ,” J Stat. J. Ilm. Teor. dan Apl. Stat., vol. 16, no. 2, pp. 514–523, doi: 10.36456/jstat.vol16.no2.a8079.

[9] A. S. C. Martins, L. R. de Araújo, and D. R. R. Penido, “K-Medoids clustering applications for high-dimensionality multiphase probabilistic power flow,” Int. J. Electr. Power & Energy Syst., 2024, [Online]. Available: https://api.semanticscholar.org/CorpusID:267708052

[10] D. E. Waluyo, H. W. Kinasih, C. Paramita, D. Pergiwati, R. Nohan, and F. A. Rafrastara, “Komparasi dan Implementasi Algoritma Regresi Machine Learning untuk Prediksi Indeks Harga Saham Gabungan,” J. Inform. J. Pengemb. IT, 2024, [Online]. Available: https://api.semanticscholar.org/CorpusID:268991837

[11] R. Afifa, M. I. Mazdadi, T. H. Saragih, F. Indriani, and M. Muliadi, “Implementasi Principal Component Analysis (PCA) dan Gap Statistic untuk Clustering Kanker Payudara pada Algoritma K-Means,” Sist. J. Sist. Inf., vol. 13, no. 5, pp. 1852–1864, 2024.

[12] E. Setiawati, U. D. Fernanda, S. Agesti, M. Iqbal, and M. O. A. Herjho, “Implementation of K-Means, K-Medoid and DBSCAN Algorithms In Obesity Data Clustering,” IJATIS Indones. J. Appl. Technol. Innov. Sci., 2024, [Online]. Available: https://api.semanticscholar.org/CorpusID:270060204

[13] J. Heidari, N. Daneshpour, and A. Zangeneh, “A novel K-means and K-medoids algorithms for clustering non-spherical-shape clusters non-sensitive to outliers,” Pattern Recognit., vol. 155, p. 110639, 2024, [Online]. Available: https://api.semanticscholar.org/CorpusID:270171548

[14] S. Fahad, F. Su, S. U. Khan, M. R. Naeem, and K. Wei, “Implementing a novel deep learning technique for rainfall forecasting via climatic variables: An approach via hierarchical clustering analysis,” Sci. Total Environ., vol. 854, p. 158760, 2023, doi: https://doi.org/10.1016/j.scitotenv.2022.158760.

[15] A. Gupta, H. Sharma, and A. Akhtar, “A comparative analysis of k-means and hierarchical clustering,” EPRA Int. J. Multidiscip. Res., vol. 7, no. 8, 2021.

[16] K. P. Sinaga and M.-S. Yang, “Unsupervised K-Means Clustering Algorithm,” IEEE Access, vol. 8, pp. 80716–80727, 2020, [Online]. Available: https://api.semanticscholar.org/CorpusID:218597833

[17] W. Yang, “Analysis and Application of Big Data Feature Extraction Based on Improved k-means Algorithm,” Scalable Comput. Pr. Exp., vol. 25, pp. 137–145, 2024, [Online]. Available: https://api.semanticscholar.org/CorpusID:266781771

[18] M. Mursalim, P. Purwanto, and M. A. Soeleman, “Penentuan Centroid Awal Pada Algoritma K-Means Dengan Dynamic Artificial Chromosomes Genetic Algorithm Untuk Tuberculosis Dataset,” Techno. Com, vol. 20, no. 1, pp. 97–108, 2021.

[19] D. Anggraini and M. S. Hasibuan, “Initial centroid pada algoritma K-means dan K-medoids,” J. Multimed. dan Teknol. Inf., vol. 7, no. 03, pp. 487–500, 2025.

[20] A. Meiriza and E. Ali, “Perbandingan Algoritma K-Means dan K-Medoids untuk Pengelompokan Program BPJS Ketenagakerjaan,” Indones. J. Comput. Sci., vol. 12, no. 2, 2023.

[21] R. H. Maharrani, P. D. Abda’u, and M. N. Faiz, “Clustering method for criminal crime acts using K-means and principal component analysis,” Indones. J. Electr. Eng. Comput. Sci., 2024, [Online]. Available: https://api.semanticscholar.org/CorpusID:268217000

[22] H. Wahyono, H. Setiaji, T. Hartati, and N. Wiliani, “K-Means Clustering for Identifying Traffic Accident Hotspots in Depok City,” J. Appl. Res. Comput. Sci. Inf. Syst., 2024, [Online]. Available: https://api.semanticscholar.org/CorpusID:278552068

[23] Y. Candra, R. Goejantoro, and A. T. R. Dani, “Pengelompokan Provinsi Berdasarkan Indikator Ekonomi, Pendidikan, Kesehatan, dan Kriminalitas di Indonesia Menggunakan Algoritma Centroid Linkage,” Euler J. Ilm. Mat. Sains dan Teknol., vol. 12, no. 1, pp. 9–15, 2024.

[24] R. Khairuna, Nurdin, and Ar Razi, “Analisis Perbandingan Algoritma K-Means Dan K-Medoids Untuk Klusterisasi Teks Ulasan Pada Aplikasi Cookpad: Comparative Analysis Of K-Means And K-Medoids Algorithms For Clusterization Of Review Texts On The Cookpad Application,” Rabit J. Teknol. dan Sist. Inf. Univrab, vol. 10, no. 2 SE-Articles, pp. 445–458, Jul. 2025, doi: 10.36341/rabit.v10i2.6131.

[25] S. Amaliyah and S. Rianti, “Penerapan Data Mining Untuk Menentukan Kelompok Prioritas Penerima Bantuan PKH Menggunakan Metode Clustering K-Means Pada Desa Kuala Dendang,” J. Inform. Dan Rekayasa Komput., vol. 3, no. 1, pp. 453–458, 2023.

[26] B. Deepa and K. Ramesh, “Epileptic Seizure Detection Using Deep Learning Through Min Max Scaler Normalization,” Int. J. Health Sci. (Qassim)., no. I, pp. 10981–10996, 2022, doi: 10.53730/ijhs.v6nS1.7801.

[27] M. Al Ghifari and W. T. Harsanti Putri, “Clustering Courses Based On Student Grades Using K-Means Algorithm With Elbow Method For Centroid Determination,” Inf. J. Ilm. Bid. Teknol. Inf. dan Komun., vol. 8, no. 1 SE-Articles, pp. 42–46, Jan. 2023, doi: 10.25139/inform.v8i1.4519.

[28] D. Anggreani, N. Nurmisba, D. Setiawan, and L. Lukman, “Optimization of K-Means Clustering Method by Using Elbow Method in Predicting Blood Requirement of Pelamonia Hospital Makassar,” Internet Things Artif. Intell. J., 2024, [Online]. Available: https://api.semanticscholar.org/CorpusID:272982268

[29] D. T. Yuliana, M. I. A. Fathoni, and N. Kurniawati, “Penentuan Penerima Kartu Indonesia Pintar KIP Kuliah Dengan Menggunakan Metode K-Means Clustering,” J. Focus Action Res. Math. (Factor M), vol. 5, no. 1 SE-Articles, pp. 127–141, Dec. 2022, doi: 10.30762/f_m.v5i1.570.

[30] N. Puspitasari, G. Lempas, H. Hamdani, H. Haviuddin, and A. Septiarini, “Perbandingan Algoritma K-Means dan Algoritma K-Medoids Pada Kasus Covid-19 di Indonesia,” Build. Informatics, Technol. Sci., vol. 4, no. 4, p. 2015−2027, 2023, doi: 10.47065/bits.v4i4.2994.

[31] N. Rohman and A. Wibowo, “Perbandingan Metode K-Medoids dan Metode K-Means Dalam Analisis Segmentasi Pelanggan Mall,” SINTECH (Science Inf. Technol. J., vol. 7, no. 1, pp. 49–58, 2024, doi: 10.31598/sintechjournal.v7i1.1507.

[32] I. M. Nur, U. G. Mada, U. M. Semarang, A. Info, and S. Vulnerability, “Analysis of Social Vulnerability in Java Island using K-Medoids Algorithm with Variation of Distance Measurements ( Euclidean , Manhattan , Minkowski ),” vol. 7, no. 2, pp. 467–475, 2024.

[33] N. A. Siagian, A. Rikki, and P. B. N. Simangunsong, “Clustering Menggunakan Metode K-Medoids dengan pendekatan Manhattan Distance,” KAKIFIKOM (Kumpulan Artik. Karya Ilm. Fak. Ilmu Komputer), vol. 6, no. 2, pp. 169–175, 2024, doi: 10.54367/kakifikom.v6i2.4608.

[34] F. R. Senduk, I. Indwiarti, and F. Nhita, “Clustering of Earthquake Prone Areas in Indonesia Using K-Medoids Algorithm,” Indones. J. Comput., vol. 4, no. 3, pp. 65–76, 2020, doi: 10.34818/INDOJC.2019.4.3.359.

Downloads

Published

2026-04-16

How to Cite

[1]
M. A. Fikri, I. D. W. Prastya, and A. Y. Kartini, “Comparison of K-Means and K-Medoids for PKH Priority Clustering”, JAIC, vol. 10, no. 2, pp. 1347–1358, Apr. 2026.

Most read articles by the same author(s)

Similar Articles

1 2 3 4 5 > >> 

You may also start an advanced similarity search for this article.