Cluster Analysis for Grouping Districts in Sidoarjo Regency Based on Education Indicators


The purpose of education is to develop self-ability, knowledge, skills, and habits that are passed down from one generation to the next and from educators to students through a teaching process to shape a personality physically and spiritually. It also serves as a benchmark of success for a region. Education indicators can be used as a measuring tool to analyze the quality of education in an area. The current study aimed to determine the level of education in the districts of the Sidoarjo Regency, Indonesia. In this study, the sub-districts of the Sidoarjo Regency were grouped based on the education indicators using cluster analysis. Cluster analysis is a multivariate analysis that groups objects into different categories. Based on the results, two clusters were formed. Of the 18 districts in Sidoarjo Regency, the first cluster comprised of 14 districts (Prambon, Tulangan, Krembung, Tarik, Wonoayu, Gedangan, Porong, Buduran, Candi, Sukodono, Tanggulangin, Sedati, Jabon, Balongbendo), while the second included 4 (Sidoarjo, Waru, Taman, Krian). The results showed higher education indicators in the second cluster. Therefore, the researchers recommend using the results of this study as a reference for developing an equal distribution of education in the Sidoarjo Regency.

Keywords: education indicators, cluster analysis, Sidoarjo districts

[1] Kemendikbud RI. Indikator pendidikan di Indonesia. Jakarta: Kemendikbud RI; 2007.

[2] BPS Kabupaten Sidoarjo. Kabupaten sidoarjo dalam angka 2021. Sidoarjo: CV. Insert Coin; 2021.

[3] Oktavianty E, et al. District/city clustering in sulawesi based on education indicators using average linkage cluster and median linkage analysis. Natural Science.2019;8(3):191–197

[4] Sidik FF. Evaluasi Kinerja pembangunan bidang pendidikan di kabupaten ngawi tahun 2020. FOUNDASIA. 11(1) 2020..24-34 Available from:

[5] Yim O, Ramdeen KT. Hierarchical cluster analysis: Comparison of three linkage measures and application to psychological data. The Quantitative Methods for Psychology. 2015;11(1):8–21.

[6] Gülağz FK, Şahin S. Comparison of hierarchical and non-hierarchical clustering algorithms. International Journal of Computer Engineering and Information Technology. 9 ( 1)2017, 6–14 2017. Available from:

[7] Dini SK, Fauzan A. Clustering provinces in Indonesia based on community welfare indicators. EKSAKTA: Journal of Sciences and Data Analysis.1(1) 2020:56–63.

[8] Adha R, Nurhaliza N, Soleha U. Perbandingan algoritma DBSCAN dan k-means clustering untuk pengelompokan kasus Covid-19 di dunia. Jurnal Sains, Teknologi dan Industri. 2021;18(2):206–211.

[9] Sari IP, Batubara IH. Cluster analysis using k-means algorithm and fuzzy c-means clustering for grouping students’ abilities in online learning process. Journal of Computer Science, Information Technology and Telecommunication Engineering ( JCoSITTE). 2021;2(1):139–144.

[10] Nagari SS, Inayati L. Implementation of clustering using k-means method to determine nutritional status. Jurnal Biometrika dan Kependudukan. 2020;9(1):62-68