Optimasi Klasterisasi Data Peserta Didik SDN 2 Sarajaya Menggunakan Algoritma K-Means Dan Grid Search

Edi  Wahyudin; Muhammad  Syahruf Efendi

Authors

Edi Wahyudin STMIK IKMI Cirebon
Muhammad Syahruf Efendi STMIK IKMI Cirebon

Keywords:

Data, Clustering, DBI, Types Parameter

Abstract

Problem Data analysis is the process of processing data with the aim of finding useful information that can be used as the basis for the analysis process. The data used to analyze a data is using SDN 2 Sarajaya Learner data, the author has a problem that there is no clustering of learner data using K-means at SDN 2 Sarajaya from this problem the author will group the data with the DBI value besides that it will look for what type of parameter is the best from the learner data at SDN 2 Sarajaya, namely with learner data using K-Means algorithm clustering with optimize parameters.Clustering is an algorithm for grouping some data into certain data groups (clusters). Student data has 150 data with 45 attributes including No, Name, NIPD, Jk, NISN, Birthplace, Birthdate, NIK, Religion, Address, Rt, Rw, Hamlet, Village, Subdistrict, Postal Code, Type of Residence, Transportation Equipment, KPS Acceptance, KPS No, Father's Name, Father's Year of Birth, Father's Last Education, Father's Occupation, Father's Income,Mother's Name,Mother's Birth Year,Mother's Last Education,Mother's Occupation,Mother's Income,Rombel,KIP Receipt,KIP Number,Birth Certificate Registration Number,Bank,Bank Account Number,Account in Name,KIP Eligibility (school proposal),Reason for KIP Eligibility,Special Needs,School of Origin,How Many Children,Latitude,Longitude. The tools used by researchers are rapidminer version 9.10.000. The K-Means algorithm is one of the algorithms used for partitioning because K-Means is based on determining the initial number of clusters by definition to find the initial value of the cetroid. The purpose of the research conducted by the author is to find information from the data. The test method used is the KnowlledgeDiscovry in Database (KDD) method. In the clustering results obtained by the learner data cluster using the Devies Bouldin Index value closest to 0 with cluster 2 to cluster 10 experiments resulting in the best K value in cluster 4 den DBI value 0.12 and the number of members L0 : 79 clusters L1: 1 cluster L2: 1 cluster L3: 69 clusters with parameter types BrigmanDivergences.

References

Aditya, A., Jovian, I. and Sari, B.N. (2020) ‘Implementasi K-Means Clustering Ujian Nasional Sekolah Menengah Pertama di Indonesia Tahun 2018/2019’, Jurnal Media Informatika Budidarma, 4(1), p. 51. Available at: https://doi.org/10.30865/mib.v4i1.1784.

Badruttamam, A., Sudarno, S. and Maruddani, D.A.I. (2020) ‘PENERAPAN ANALISIS KLASTER K-MODES DENGAN VALIDASI DAVIES BOULDIN INDEX DALAM MENENTUKAN KARAKTERISTIK KANAL YOUTUBE DI INDONESIA (Studi Kasus: 250 Kanal YouTube Indonesia Teratas Menurut Socialblade)’, Jurnal Gaussian, 9(3), pp. 263–272. Available at: https://doi.org/10.14710/j.gauss.v9i3.28907.

Iin Parlina, Agus Perdana Windarto, Anjar Wanto, M.R.L. (2018) ‘Memanfaatkan Algoritma K-Means Dalam Menentukan Pegawai Yang Layak Mengikuti Asessment Center’, Memanfaatkan Algoritma K-Means Dalam Menentukan Pegawai Yang Layak Mengikuti Asessment Center Untuk Clustering Program Sdp, 3(1), pp. 87–93.

ndraputra, R.A. and Fitriana, R. (2020) ‘K-Means Clustering Data COVID-19’, Jurnal Teknik Industri, 10(3), pp. 275–282. Available at: https://doi.org/10.25105/jti.v10i3.8428.

K-mK-means, M. A., Wahyuni, W. and Fahmi, H. (2020) ‘Penerapan Data Mining Clustering Pada Siswa-Siswi SMK Swasta Jaya Krama Beringin Dalam Menerima Potongan Biaya Administrasi Sekolah Dengan’, 3(2), pp. 1–7.eans, M.A., Wahyuni, W. and Fahmi, H. (2020) ‘Penerapan Data Mining Clustering Pada Siswa-Siswi SMK Swasta Jaya Krama Beringin Dalam Menerima Potongan Biaya Administrasi Sekolah Dengan’, 3(2), pp. 1–7.

Kasus, S., Sma, S. and Tamba, M. (2018) ‘ANALISA CLUSTERING LAPORAN PRESTASI BELAJAR SISWA / SISWI MENGGUNAKAN K-MEANS’, 7, pp. 92–96.

Pada Siswa Baru Sekolahmenengah Kejuruan Untuk Clustering Jurusan’, InfoTekJar (Jurnal Nasional Informatika dan Teknologi Jaringan), 1(2), pp. 100–105. Available at: https://doi.org/10.30743/infotekjar.v1i2.70.

randi rian putra, C. wadisman (2018) ‘No Title’, 1, pp. 72–77.

Sulistiyawati, A. and Supriyanto, E. (2020) ‘Implementasi Algoritma K-means Clustring dalam Penetuan Siswa Kelas Unggulan’, 15(2), pp. 25–36.

Wijaya, Y.A. et al. (2021) ‘Davies Bouldin Index Algorithm for Optimizing Clustering Case Studies Mapping School Facilities’, TEM Journal, 10(3), pp. 1099–1103. Available at: https://doi.org/10.18421/TEM103-13.

Wijayanto, A.D., Fajriah, S.N. and Anita, I.W. (2018) ‘Analisis Kemampuan Komunikasi Matematis Siswa Smp Pada Materi Segitiga Dan Segiempat’, Jurnal Cendekia : Jurnal Pendidikan Matematika, 2(1), pp. 97–104. Available at: https://doi.org/10.31004/cendekia.v2i1.36.