Analisis Sentimen Ulasan Google Play Store: Studi Komparatif Algoritma SVM, Naïve Bayes, dan Logistic Regression

Authors

  • Singgih Jatmiko Universitas Gunadarma
  • Charles Dometian Univeritas Gunadarma

DOI:

https://doi.org/10.37859/jf.v15i3.10016
Keywords: SVM, naïve bayes, logistic regression, sentiment analysis, google play store

Abstract

This research aims to compare Support Vector Machine (SVM), Naïve Bayes, and Logistic Regression methods in sentiment analysis of app reviews on Google Play Store to identify the best method based on accuracy, precision, recall, and F1-Score using 2000 GoPay and LinkAja reviews from Google Play Store respectively. The methodology consists of six stages, namely, data collection, labeling method evaluation, preprocessing evaluation, SMOTE testing to overcome imbalanced data, hyperparameter tuning optimization, and consistency validation with a combination of TF-IDF and three classification methods. The data were split using an 80:20 ratio, with 80% of the data used for training and 20% for testing. Experimental results show SVM gives the best performance with 93% accuracy, 92% precision, 93% recall, and 92% F1-Score on the GoPay dataset due to its ability to find the optimal hyperplane, followed by Logistic Regression with 92% accuracy and the third Naïve Bayes despite identical accuracy but showing greater bias towards the majority class. Validation using the LinkAja dataset proves SVM still maintains the best performance with 95% accuracy, so the research concludes SVM is the best method for sentiment analysis of app reviews on the Google Play Store which is proven to provide optimal and consistent performance

Downloads

Download data is not yet available.

References

A. Rakhmanita and D. T. Anggarini, “DAMPAK TRANSAKSI PEMBAYARAN GO-PAY BAGI PENINGKATAN PENJUALAN PEDAGANG KECIL MENENGAH DI PASAR MODERN BSD,” Widya Cipta: Jurnal Sekretari dan Manajemen, vol. 4, no. 2, 2020, doi: 10.31294/widyacipta.v4i2.8416.

R. M. Turjaman and I. Budi, “Analisis Sentimen Berbasis Aspek Marketing Mix Terhadap Ulasan Aplikasi Dompet Digital (Studi Kasus: Aplikasi Linkaja Pada Twitter),” Jurnal Darma Agung, vol. 30, no. 2, p. 266, 2022, doi: 10.46930/ojsuda.v30i2.1672.

Y. yuli Astari, A. Afiyati, and S. W. Rozaqi, “Analisis Sentimen Multi-Class Pada Sosial Media Menggunakan Metode Long Short-Term Memory (LSTM),” Jurnal Linguistik Komputasional, vol. 4, no. 1, pp. 8–12, 2021, [Online]. Available: http://inacl.id/journal/index.php/jlk/article/view/43

A. Hendra and F. Fitriyani, “Analisis Sentimen Review Halodoc Menggunakan Naive Bayes Classifier,” JISKA (Jurnal Informatika Sunan Kalijaga), vol. 6, no. 2, pp. 78–89, 2021, doi: 10.14421/jiska.2021.6.2.78-89.

K. Perdana, “Komparasi Metode Naive Bayes, Support Vector Machine, dan Logistic Regression pada Analisis Sentimen Pengguna Aplikasi Transportasi Online,” Kumpulan jurnal Ilmu Komputer (KLIK), vol. 10, no. 01, pp. 27–38, 2023, doi: 10.20527/klik.v10i1.616.

K. A. Baihaqi, “A Comparison Support Vector Machine, Logistic Regression And Naive Bayes For Classification Sentimen Analisys user Mobile App,” International Journal of Artificial Intelligence Research, vol. 7, no. 1, p. 64, 2023, doi: 10.29099/ijair.v7i1.962.

H. Jauhary, “Perbandingan Metode Analisis Sentimen Support Vector Machine, Naive Bayes, dan Logistic Regression (Studi Kasus: Ulasan Google Playstore Aplikasi Linkedin),” AT-TAWASSUTH: Jurnal Ekonomi Islam, 2023, [Online]. Available: https://repository.uinjkt.ac.id/dspace/handle/123456789/76808

O. S. D. Fadhillah, J. H. Jaman, and Carudin, “Perbandingan Naive Bayes, Support Vector Machine, Logistic Regression dan Random Forest dalam Menganalisis Sentimen Mengenai Tiktokshop,” Jurnal Informatika dan Teknik Elektro Terapan, vol. 6, no. 1, pp. 840–847, 2021, doi: 10.23960/jitet.v13i1.5746.

I. R. Ainunnisa and S. Sulastri, “Analisis Sentimen Aplikasi Tiktok dengan Metode Support Vector Machine (SVM), Logistic Regression dan Naive Bayes,” Jurnal Teknologi Sistem Informasi dan Aplikasi, vol. 6, no. 3, pp. 423–430, 2023, doi: 10.32493/jtsi.v6i3.31076.

D. Surya Sayogo, B. Irawan, and A. Bahtiar, “Analisis Sentimen Ulasan Aplikasi DANA di Google Play Store Menggunakan Metode Naive Bayes,” JATI (Jurnal Mahasiswa Teknik Informatika), vol. 7, no. 6, pp. 3314–3319, 2024, doi: 10.36040/jati.v7i6.8178.

A. Fathin, “Analisis Sentimen Terhadap Ulasan Aplikasi Mobile Menggunakan Metode Support Vector Machine (Svm) Dan Pendekatan Lexicon Based,” 2022. [Online]. Available: https://repository.uinjkt.ac.id/dspace/handle/123456789/65009

R. A. Afif, “Analisis Sentimen Aplikasi Adiraku di Google Play Store Menggunakan Metode Support Vectore Machine,” Jurnal Fasilkom, vol. 15, no. 1, pp. 163–171, 2025, doi: 10.37859/jf.v15i1.8510.

D. Septiani and I. Isabela, “Analisis Term Frequency Inverse Document Frequency (TF-IDF) Dalam Temu Kembali Informasi Pada Dokumen Teks,” SINTESIA: Jurnal Sistem dan Teknologi Informasi Indonesia, vol. 1, no. 2, pp. 81–88, 2023.

A. Alwasi’a, “Analisis Sentimen Pada Review Aplikasi Berita Online Menggunakan Metode Maximum Entropy (Studi Kasus: Review Detikcom pada Google Play 2019),” 2020. [Online]. Available: http://dspace.uii.ac.id/123456789/24006

S. Shevira, I. M. A. D. Suarjaya, and P. W. Buana, “Pengaruh Kombinasi dan Urutan Pre-Processing pada Tweets Bahasa Indonesia,” JITTER: Jurnal Ilmiah Teknologi dan Komputer, vol. 3, no. 2, p. 1074, 2022, doi: 10.24843/jtrti.2022.v03.i02.p06.

T. A. Pamungkas and A. Salam, “Optimalisasi Model SciBERT dengan Attention-BiLSTM-CRF untuk Pengenalan Entitas Penyakit dalam Teks Biomedis,” Building of Informatics, Technology and Science (BITS), vol. 7, no. 1, pp. 147–156, 2025, doi: 10.47065/bits.v7i1.7263.

K. Akbar and M. Hayaty, “Data Balancing untuk Mengatasi Imbalance Dataset pada Prediksi Produksi Padi,” Jurnal Ilmiah Intech: Information Technology Journal of UMUS, vol. 2, no. 02, pp. 1–14, 2020, doi: 10.46772/intech.v2i02.283.

K. Fujiwara, “Over- and Under-sampling Approach for Extremely Imbalanced and Small Minority Data Problem in Health Record Analysis,” Front Public Health, vol. 8, pp. 1–15, 2020, doi: 10.3389/fpubh.2020.00178.

M. Aminullah, “Perbandingan Kinerja Klasifikasi Machine Learning dengan Teknik Resampling pada Dataset Tidak Seimbang,” 2021. [Online]. Available: https://repository.uinjkt.ac.id/dspace/bitstream/123456789/57648/1/MUHAMMAD AMINULLAH-FST.pdf

C. B. Handoko and C. S. K. Aditya, “Penerapan Teknik SMOTE dalam Mengatasi Imbalance Data Penyakit Diabetes Menggunakan Metode ANN,” Smart Comp: Jurnalnya Orang Pintar Komputer, vol. 14, no. 105, pp. 13–20, 2025.

A. Demircioglu, “Applying oversampling before cross-validation will lead to high bias in radiomics,” Sci Rep, vol. 14, no. 1, pp. 1–11, 2024, doi: 10.1038/s41598-024-62585-z.

O. H. Rahman, G. Abdillah, and A. Komarudin, “Klasifikasi Ujaran Kebencian pada Media Sosial Twitter Menggunakan Support Vector Machine,” Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 5, no. 1, pp. 17–23, 2021, doi: 10.29207/resti.v5i1.2700.

H. Hendiana, A. Purnamasari, and I. Ali, “Analisis Sentimen Komentar Berita Detik.com Menggunakan Metode Suport Vektor Machine (SVM),” JATI (Jurnal Mahasiswa Teknik Informatika), vol. 8, no. 3, pp. 3175–3181, 2024, doi: 10.36040/jati.v8i3.8421.

Y. Yuliyana and A. S. R. M. Sinaga, “Sistem Pakar Diagnosa Penyakit Gigi Menggunakan Metode Naive Bayes,” Fountain of Informatics Journal, vol. 4, no. 1, p. 19, 2019, doi: 10.21111/fij.v4i1.3019.

G. P. Kawani, “Implementasi Metode Klasifikasi Naive Bayes Dalam Memprediksi Besarnya Penggunaan Listrik Rumah Tangga,” Journal of Informatics, Information System, Software Engineering and Applications (INISTA), vol. 1, no. 2, pp. 73–81, 2019, doi: 10.20895/inista.v1i2.73.

T. Taslim, S. Handayani, and F. Fajrizal, “Kinerja Komparatif Optimasi Metode Naive Bayes dalam Klasifikasi Teks untuk Uji Klinis Kanker,” Jurnal Eksplora Informatika, vol. 13, no. 1, pp. 113–123, 2023, doi: 10.30864/eksplora.v13i1.994.

K. Kelvin, “Analisis Perbandingan Sentimen Corona Virus Disease-2019 (Covid19) pada Twitter Menggunakan Metode Logistic Regression Dan Support Vector Machine (SVM),” Jurnal Sistem Informasi dan Ilmu Komputer Prima (JUSIKOM PRIMA), vol. 5, no. 2, pp. 47–52, 2022, doi: 10.34012/jurnalsisteminformasidanilmukomputerprima.v5i2.2365.

N. K. C. Pratiwi, N. Ibrahim, and S. Saidah, “Prediksi Kanker Paru menggunakan Grid search untuk Optimasi Hyperparameter pada Metode MLP dan Logistic Regression,” ELKOMIKA: Jurnal Teknik Energi Elektrik, Teknik Telekomunikasi, & Teknik Elektronika, vol. 12, no. 3, pp. 556–568, 2024, doi: 10.26760/elkomika.v12i3.556.

Downloads

Published

2026-01-10