Performance Comparison of IndoBERT and Bi-LSTM Models for Sentiment Analysis of Shopee App Users in Indonesia

Authors

  • Taufik Ramadhan Universitas Amikom Yogyakarta
  • Kusnawi Kusnawi Universitas Amikom Yogyakarta

DOI:

https://doi.org/10.30871/jaic.v10i2.10996

Keywords:

BERT, IndoBERT, Sentiment Analysis, Natural Language Processing (NLP), E-commerce Reviews

Abstract

This study compares the performance of IndoBERT and Bidirectional Long Short-Term Memory (Bi-LSTM) models for sentiment classification of Indonesian-language product reviews from the Shopee e-commerce platform. The original dataset employed a 1–5 rating scale, which was reduced to two sentiment categories: negative (ratings 1–2) and positive (ratings 4–5), while reviews with a rating of 3 were excluded due to their ambiguous nature. IndoBERT (indobenchmark/indobert-base-p1) was applied through a fine-tuning process, whereas the Bi-LSTM model was trained from scratch using comprehensive text preprocessing, including case folding, stopword removal, stemming, tokenization, and padding. The dataset consisted of 7,225 reviews, divided into 5,780 training samples and 1,445 testing samples. Model performance was evaluated using accuracy, precision, recall, and F1-score metrics. Experimental results demonstrate that IndoBERT outperforms Bi-LSTM, achieving an accuracy and F1-score of 85.61%, compared to 78.27% obtained by the Bi-LSTM model. These findings indicate that transformer-based models are more effective in capturing contextual semantics in Indonesian text than recurrent neural network-based approaches.
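The labeling rule, split sizes, and evaluation metrics described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' code: the function names, the toy reviews, and the toy predictions are hypothetical, and the metrics are computed for the positive class only, whereas the paper may report weighted or macro averages.

```python
from typing import List, Optional, Tuple


def rating_to_label(rating: int) -> Optional[str]:
    """Map a 1-5 star rating to a binary label; None means the review is dropped."""
    if rating in (1, 2):
        return "negative"
    if rating in (4, 5):
        return "positive"
    return None  # rating 3 is treated as ambiguous and excluded


def label_reviews(reviews: List[Tuple[str, int]]) -> List[Tuple[str, str]]:
    """Keep only the (text, label) pairs whose rating maps to a label."""
    labeled = []
    for text, rating in reviews:
        label = rating_to_label(rating)
        if label is not None:
            labeled.append((text, label))
    return labeled


def binary_metrics(y_true: List[str], y_pred: List[str], positive: str = "positive") -> dict:
    """Accuracy, plus precision, recall, and F1 for the chosen positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    accuracy = sum(1 for t, p in pairs if t == p) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Hypothetical toy reviews (not taken from the study's dataset).
sample = [
    ("barang bagus, pengiriman cepat", 5),
    ("biasa saja", 3),
    ("aplikasi sering error", 1),
    ("lumayan puas", 4),
    ("kecewa, barang rusak", 2),
]
labeled = label_reviews(sample)

# Split sizes as reported in the abstract.
n_total, n_train, n_test = 7225, 5780, 1445
test_fraction = n_test / n_total

# Hypothetical predictions, used only to exercise the metric function.
y_true = ["positive", "negative", "positive", "negative"]
y_pred = ["positive", "negative", "negative", "negative"]
scores = binary_metrics(y_true, y_pred)
```

The reported sample counts correspond to an 80/20 train/test split, since 1,445 is exactly one fifth of 7,225.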

Published

2026-04-29

How to Cite

[1] T. Ramadhan and K. Kusnawi, “Performance Comparison of IndoBERT and Bi-LSTM Models for Sentiment Analysis of Shopee App Users in Indonesia”, JAIC, vol. 10, no. 2, pp. 2076–2085, Apr. 2026.
