Penerapan Model Support Vector Machine Pada Kasus Klasifikasi Teks Berdasarkan Tujuan SDGS Ke Tiga, Empat, Dan Enam
Abstract
Text classification is a branch of Natural Language Processing (NLP) that enables computers to understand, interpret, and respond to text in a comprehensible language. Classifying texts based on the Sustainable Development Goals (SDGs) is crucial because monitoring the progress of SDGs remains a challenge. Previous studies have shown that text classification techniques using the BERT model have proven effective in classifying texts based on SDG goals. This research utilizes data sourced from the OSDG community website. The method employed is the Support Vector Machine Multiclass (SVM) model and TF-IDF word representation. This research aims to classify texts based on the Sustainable Development Goals (SDGs), specifically focusing on goals three, four, and six., evaluate the model's performance based on the F1-Score metric, and determine the optimal values for the hyperparameters regularized constant and gamma in the RBF kernel. The results of this research yielded a default F1-Score of 97.95% and a post-tuning F1-Score of 97.95%, with the optimal values of C=1, gamma=1, and kernel=rbf.
Copyright (c) 2024 Saprilian Hidayat, Herlina Napitupulu, Nurul Gusriani
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish articles in SisInfo : Jurnal Sistem Informasi dan Informatika agree to the following terms:
- Authors retain copyright of the article and grant the journal right of first publication with the work simultaneously licensed under a CC-BY-SA or The Creative Commons Attribution–ShareAlike License.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).