Aspect-Based Sentiment Analysis for Enhanced Understanding of 'Kemenkeu' Tweets
Abstract
Public perceptions and expressions shared on social media play a crucial role in shaping the reputation of government institutions. Indonesia's Ministry of Finance (MoF, known locally as Kemenkeu) has faced increased scrutiny, particularly on Twitter. This study analyzes public sentiment towards the MoF through Aspect-Based Sentiment Analysis (ABSA) on Twitter data. Using a dataset of 10,099 tweets from January to July 2024, it combines IndoBERT for sentiment classification with Latent Dirichlet Allocation (LDA) for topic modeling. LDA was tested across four scenarios covering different combinations of stopword removal and stemming, yielding coherence scores of 0.314256, 0.369636, 0.350285, and 0.541752. The best result was achieved with stopword removal without stemming (coherence score 0.541752, the highest of the four). The main results are: 1) identification of four main topics related to the MoF: Economy, Budget, Employees, and Tax; 2) the dominance of negative sentiment (6,837 tweets) over positive sentiment (198 tweets) across all topics; 3) the effectiveness of IndoBERT in handling the complexity of the Indonesian language, especially in interpreting context and linguistic nuance; 4) the importance of proper preprocessing, with stopword removal without stemming producing the most relevant topics. This study provides insights that can help the MoF understand public perception and identify areas that require special attention in public communication and policy.
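The four LDA preprocessing scenarios mentioned in the abstract are the combinations of two on/off switches: stopword removal and stemming. The following is a minimal sketch in Python, not the authors' pipeline. The stopword set, the suffix-stripping `toy_stem` helper, and the sample text are hypothetical stand-ins for illustration; a real study would likely use a full Indonesian stopword list and a proper stemmer, and the LDA training and coherence scoring steps are left out here.

```python
from itertools import product

# Hypothetical stand-ins for illustration only; not taken from the study.
TOY_STOPWORDS = {"yang", "dan", "di", "ke", "dari"}
TOY_SUFFIXES = ("nya", "kan", "an")


def toy_stem(token: str) -> str:
    """Strip one known suffix; a crude placeholder for a real Indonesian stemmer."""
    for suffix in TOY_SUFFIXES:
        if token.endswith(suffix) and len(token) > len(suffix) + 2:
            return token[: -len(suffix)]
    return token


def preprocess(text: str, remove_stopwords: bool, stem: bool) -> list[str]:
    """Tokenize on whitespace, then apply the two optional steps."""
    tokens = text.lower().split()
    if remove_stopwords:
        tokens = [t for t in tokens if t not in TOY_STOPWORDS]
    if stem:
        tokens = [toy_stem(t) for t in tokens]
    return tokens


# Two boolean switches give the four scenarios.
scenarios = list(product([False, True], repeat=2))

sample = "pajak dan anggaran dari kemenkeu"  # hypothetical sample text
for remove_stopwords, stem in scenarios:
    print(remove_stopwords, stem, preprocess(sample, remove_stopwords, stem))
```

In a full experiment, each scenario's token lists would be fed to an LDA model and scored for topic coherence, and the scenario with the highest score would be kept.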
Copyright (c) 2024 Priska Trisna Sejati, Farrikh Alzami, Aris Marjuni, Heni Indrayani, Ika Dewi Puspitarini
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.