An Integrated Topic–Sentiment Analysis of User Reviews in Vidio Application Using BERTopic and IndoRoBERTa
DOI:
https://doi.org/10.30871/jaic.v10i3.12952Keywords:
Topic Modeling, Sentiment Analysis, BERTopic, IndoRoBERTa, User ReviewsAbstract
User reviews on digital platforms provide valuable insights into user experience; however, the large volume and unstructured nature of such data make systematic analysis challenging. In the case of the Vidio application, user feedback frequently reflects concerns related to advertisements, subscription systems, and technical performance. Nevertheless, existing researches often apply sentiment analysis and topic modeling separately, limiting the ability to understand how specific discussion themes are associated with user sentiment. To address this limitation, this research proposes an integrated topic–sentiment analysis approach for analyzing user reviews of the Vidio application collected from the Google Play Store. After filtering and quality control, 8,854 reviews were retained for further analysis using BERTopic for topic modeling and IndoRoBERTa for sentiment classification. The topic modeling process was optimized through parameter tuning, resulting in an improvement of the coherence score from 0.4076 to 0.6878, indicating better semantic consistency among the identified topics. Meanwhile, the sentiment classification model achieved an accuracy of 72%, although its performance was affected by class imbalance, particularly in identifying neutral sentiment. The analysis identified seven primary topics, where advertising-related issues emerged as the dominant topic and were strongly associated with negative sentiment, followed by concerns regarding subscription mechanisms and login accessibility. In contrast, content-related topics, particularly sports broadcasts, were consistently associated with positive sentiment. Furthermore, statistical evaluation confirmed a significant relationship between topic categories and sentiment distribution. Overall, the findings demonstrate that integrating topic modeling and sentiment analysis provides a more comprehensive understanding of user opinions and can support improvements in application quality and user experience.
Downloads
References
[1] G. Rosalinda, R. Santoso, and P. Kartikasari, “Pemodelan Topik Ulasan Aplikasi Netflix pada Google Play Store Menggunakan Latent Dirichlet Allocation,” J. Gaussian, vol. 11, no. 4, pp. 554–561, Feb. 2023, doi: 10.14710/j.gauss.11.4.554-561.
[2] R. P. Setiawan, B. Irawan, and W. P. Prihartono, “Analisis Sentimen Ulasan Growtopia di Google Play Store Menggunakan Naïve Bayes Classifier untuk Identifikasi Kebutuhan Pengguna,” J. Inform. Dan Tek. Elektro Terap., vol. 13, no. 2, Apr. 2025, doi: 10.23960/jitet.v13i2.6415.
[3] R. A. K. N. Bintang and N. T. Romadloni, “Perbandingan Kinerja Algoritma Klasifikasi pada Review Pengguna Aplikasi Netflix,” J. Inform. Dan Tek. Elektro Terap., vol. 13, no. 2, Apr. 2025, doi: 10.23960/jitet.v13i2.6303.
[4] N. A. Adrielvino and A. T. Ayunda, “Penerapan Indobert dan Bertopic Dalam Absa untuk Evaluasi Kualitas Aplikasi E-Government Indonesia | Rabit : Jurnal Teknologi dan Sistem Informasi Univrab,” Jan. 2026, Accessed: Mar. 16, 2026. [Online]. Available: https://jurnal.univrab.ac.id/index.php/rabit/article/view/7143
[5] A. Wirayudha, M. Murniyati, and R. Rosdiana, “Analisis Sentimen Terhadap Ulasan Access By KAI Pada Google Play Store Menggunakan Metode Indobert,” Portal Ris. Dan Inov. Sist. Perangkat Lunak, vol. 3, no. 1, pp. 9–20, Jan. 2025, doi: 10.59696/prinsip.v3i1.69.
[6] E. Nurmawati and A. Amanda, “Analisis Sentimen dan Pemodelan Topik pada Tweet Terkait Data Badan Pusat Statistik,” J. Sist. Inf. Dan Inform., vol. 6, no. 2, pp. 165–176, Aug. 2023, doi: 10.47080/simika.v6i2.2789.
[7] M. U. Yanuar and W. Wibowo, “Pemodelan Topik dan Analisis Sentimen pada Ulasan Pengguna Aplikasi Trans Jatim: Topic Modeling and Sentiment Analysis on Trans Jatim Application User Reviews,” MALCOM Indones. J. Mach. Learn. Comput. Sci., vol. 6, no. 1, pp. 156–166, Jan. 2026, doi: 10.57152/malcom.v6i1.2410.
[8] Tarwoto, R. Nugroho, N. Azka, and W. S. R. Graha, “Analisis Sentimen Ulasan Aplikasi Mobile JKN di Google PlayStore Menggunakan IndoBERT,” J. JTIK J. Teknol. Inf. Dan Komun., vol. 9, no. 2, pp. 495–505, Apr. 2025, doi: 10.35870/jtik.v9i2.3340.
[9] K. C. Pradhisa and R. Fajriyah, “Analisis Sentimen Ulasan Pengguna E-commerce di Google Play Store Menggunakan Metode IndoBERT,” Build. Inform. Technol. Sci. BITS, vol. 6, no. 1, pp. 92-104–92−104, Jun. 2024, doi: 10.47065/bits.v6i1.5247.
[10] A. F. Anugrah and R. D. Agatha, “Utilizing IndoBERT and BERTopic to Explore Public Opinion on BPS Instagram Posts,” J. Appl. Inform. Comput., vol. 9, no. 5, pp. 2836–2844, Oct. 2025, doi: 10.30871/jaic.v9i5.10327.
[11] D. Aryani, I. L. Kharisma, A. Sujjada, and K. Kamdan, “Topic Modeling of the 2024 Election Using the BERTopic Method on Detik.com News Articles,” Inf. J. Ilm. Bid. Teknol. Inf. Dan Komun., vol. 9, no. 2, pp. 171–180, Aug. 2024, doi: 10.25139/inform.v9i2.8429.
[12] F. M. Apriansyah, T. I. Ramadhan, C. R. Hidayat, and A. K. Wijaya, “Perbandingan IndoBERT dan IndoRoBERTa Untuk Analisis Sentimen Pada Film Dokumenter Dirty Vote,” J. Inform. J. Pengemb. IT, vol. 10, no. 3, pp. 593–605, Jul. 2025, doi: 10.30591/jpit.v10i3.8607.
[13] Y. Asri, D. Kuswardani, W. N. Suliyanti, Y. O. Manullang, and A. R. Ansyari, “Sentiment analysis based on Indonesian language lexicon and IndoBERT on user reviews PLN mobile application,” Indones. J. Electr. Eng. Comput. Sci., vol. 38, no. 1, pp. 677–688, Apr. 2025, doi: 10.11591/ijeecs.v38.i1.pp677-688.
[14] I. Mursidah, R. Sanjaya, B. Yulianto, D. Sweetania, and P. Sularsih, “Klasifikasi Sentimen Google Play Store Aplikasi ChatGPT Berbahasa Indonesia Berbasis IndoBERT,” J. Minfo Polgan, vol. 14, no. 2, pp. 3349–3359, Dec. 2025, doi: 10.33395/jmp.v14i2.15751.
[15] H. P. Doloksaribu and Y. T. Samuel, “Komparasi Algoritma Data Mining untuk Analisis Sentimen Aplikasi Pedulilindungi,” J. Teknol. Inf. J. Keilmuan Dan Apl. Bid. Tek. Inform., vol. 16, no. 1, pp. 1–11, Jan. 2022, doi: 10.47111/jti.v16i1.3747.
[16] A. A. Permana, M. W. Prayuda, R. Taufiq, and D. A. Kristiyanti, “Analisis Sentimen Terhadap Vaksin Covid-19 Menggunakan Algoritma Naïve Bayes Classifier,” J. Minfo Polgan, vol. 11, no. 2, pp. 129–137, Sep. 2022, doi: 10.33395/jmp.v11i2.12346.
[17] R. Hans, “Mengenal Lemmatization dalam Machine Learning NLP.” Accessed: Mar. 10, 2026. [Online]. Available: https://dqlab.id/mengenal-lemmatization-dalam-machine-learning-nlp
[18] C. Brando, S. Anggai, and T. Tukiyat, “Analisis Sentimen Ulasan Pengguna Aplikasi Info BMKG pada Google Play Store Menggunakan Model Transformer BERT dan RoBERTa,” J. SISKOM-KB Sist. Komput. Dan Kecerdasan Buatan, vol. 9, no. 1, pp. 40–49, Sep. 2025, doi: 10.47970/siskom-kb.v9i1.872.
[19] W. Wahyudin, “Aplikasi Topic Modeling pada Pemberitaan Portal Berita Online Selama Masa Psbb Pertama,” Semin. Nas. Off. Stat., vol. 2020, no. 1, pp. 309–318, 2020, doi: 10.34123/semnasoffstat.v2020i1.579.
[20] J. Luo et al., “Analyzing sentiments in peer review reports: Evidence from two science funding agencies,” Quant. Sci. Stud., vol. 2, no. 4, pp. 1271–1295, Dec. 2021, doi: 10.1162/qss_a_00156.
[21] W. Wahyuni, T. P. Lestari, M. Apriliana, and R. Gumelta, “Implementation of BERTopic for Topic Modeling Analysis of the Free Nutritious Meal Program Based on YouTube Comments,” J. Appl. Inform. Comput., vol. 9, no. 4, pp. 1964–1971, Aug. 2025, doi: 10.30871/jaic.v9i4.9754.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Nalendra Whisnu Pinilih , Ika Novita Dewi, Farrikh Alzami

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) ) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).








