Detection of Political Hoax News Using Fine-Tuning IndoBERT
DOI:
https://doi.org/10.30871/jaic.v9i2.8989Keywords:
Hoax News Detection, IndoBERT, Political Hoax News, Machine Learning, Indonesian NewsAbstract
Indonesia has experienced a surge in the spread of political hoax news, posing a potential threat to democratic and social stability. This study aims to develop a model for detecting political hoax news in the Indonesian language using IndoBERT, a language model optimized for Indonesian text. The dataset was sourced from Kaggle and comprises 20,928 factual news articles and 2,251 hoax news articles from major Indonesian media outlets, including CNN, Kompas, Tempo, and Turnbackhoax. The imbalance between factual and hoax news articles was addressed through undersampling, resulting in 1,302 samples for each class. The research stages include data collection, preprocessing, IndoBERT model training, and model evaluation. Results indicate that fine-tuning IndoBERT can detect political hoax news with an accuracy of 94.1% and an ROC AUC of 0.991, demonstrating high performance in accuracy and generalization capability. This research is expected to contribute to minimizing the spread of political hoax news in Indonesia and enhance media literacy among the public.
Downloads
References
[1] A. Lee, “Online Hoaxes, Existential Threat, and Internet Shutdown: A Case Study of Securitization Dynamics in Indonesia,” Journal of Indonesian Social Sciences and Humanities, vol. 10, no. 1, pp. 17–34, Jun. 2020, doi: 10.14203/jissh.v10i1.156.
[2] D. R. Fatmala, A. Amelia, and F. A. Trianingsih, “The Use of Social Media Bot Accounts on Influencing Public Opinion: A Legal Review in Indonesia,” Legality: Jurnal Ilmiah Hukum, vol. 28, no. 2, pp. 169–182, Sep. 2020, doi: 10.22219/ljih.v28i2.12148.
[3] I. B. K. A. Dwipayana, M. I. S. Abenk, and N. H. Lukman, “Public Awareness in Efforts to Defend The Country in The Digital Era in Order to Fight Hoaxes to Maintain State Resilience,” Journal of Digital Law and Policy, vol. 2, no. 1, pp. 1–10, Sep. 2022, doi: 10.58982/jdlp.v2i1.197.
[4] X. Zhou and R. Zafarani, “A Survey of Fake News: Fundamental Theories, Detection Methods, and Opportunities,” ACM Comput Surv, vol. 53, no. 5, Sep. 2020, doi: 10.1145/3395046.
[5] A. Orhan, “Fake news detection on social media: the predictive role of university students’ critical thinking dispositions and new media literacy,” Smart Learning Environments, vol. 10, no. 1, Dec. 2023, doi: 10.1186/s40561-023-00248-8.
[6] N. Agustina, A. Adrian, and M. Hermawati, “Implementasi Algoritma Naïve Bayes Classifier untuk Mendeteksi Berita Palsu pada Sosial Media,” Faktor Exacta, vol. 14, no. 4, p. 206, Jan. 2022, doi: 10.30998/faktorexacta.v14i4.11259.
[7] N. G. Ramadhan, F. D. Adhinata, A. J. T. Segara, and D. P. Rakhmadani, “Deteksi Berita Palsu Menggunakan Metode Random Forest dan Logistic Regression,” JURIKOM (Jurnal Riset Komputer), vol. 9, no. 2, p. 251, Apr. 2022, doi: 10.30865/jurikom.v9i2.3979.
[8] D. F. N. Anisa, I. Mukhlash, and M. Iqbal, “Deteksi Berita Online Hoax Covid-19 Di Indonesia Menggunakan Metode Hybrid Long Short Term Memory dan Support Vector Machine,” Jurnal Sains dan Seni ITS, vol. 11, no. 3, Mar. 2023, doi: 10.12962/j23373520.v11i3.83227.
[9] A. Hanifa, S. A. Fauzan, M. Hikal, and M. B. Ashfiya, “Perbandingan Metode LSTM dan GRU (RNN) untuk Klasifikasi Berita Palsu Berbahasa Indonesia,” Dinamika Rekayasa, vol. 17, no. 1, p. 33, Jan. 2021, doi: 10.20884/1.dr.2021.17.1.436.
[10] A. Agarwal, M. Mittal, A. Pathak, and L. M. Goyal, “Fake News Detection Using a Blend of Neural Networks: An Application of Deep Learning,” SN Comput Sci, vol. 1, no. 3, May 2020, doi: 10.1007/s42979-020-00165-4.
[11] A. K. Yadav et al., “Fake News Detection Using Hybrid Deep Learning Method,” SN Comput Sci, vol. 4, no. 6, Nov. 2023, doi: 10.1007/s42979-023-02296-w.
[12] A. Aggarwal, A. Chauhan, D. Kumar, M. Mittal, and S. Verma, “Classification of Fake News by Fine-tuning Deep Bidirectional Transformers based Language Model,” EAI Endorsed Transactions on Scalable Information Systems, vol. 7, no. 27, pp. 1–12, 2020, doi: 10.4108/eai.13-7-2018.163973.
[13] S. M. Sr and S. Ahmad, “BERT based Blended approach for Fake News Detection,” Journal of Big Data and Artificial Intelligence, vol. 2, no. 1, Jan. 2024, doi: 10.54116/jbdai.v2i1.27.
[14] R. K. Kaliyar, A. Goswami, and P. Narang, “FakeBERT: Fake news detection in social media with a BERT-based deep learning approach,” Multimed Tools Appl, vol. 80, no. 8, pp. 11765–11788, Mar. 2021, doi: 10.1007/s11042-020-10183-2.
[15] P. Dhiman, A. Kaur, D. Gupta, S. Juneja, A. Nauman, and G. Muhammad, “GBERT: A hybrid deep learning model based on GPT-BERT for fake news detection,” Heliyon, vol. 10, no. 16, Aug. 2024, doi: 10.1016/j.heliyon.2024.e35865.
[16] F. Koto, A. Rahimi, J. H. Lau, and T. Baldwin, “IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP,” Nov. 2020, doi: https://doi.org/10.48550/arXiv.2011.00677.
[17] Y. Liu, S. Agarwal, and S. Venkataraman, “AutoFreeze: Automatically Freezing Model Blocks to Accelerate Fine-tuning,” Feb. 2021, doi: https://doi.org/10.48550/arXiv.2102.01386.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Charlotte Jocelynne, IGN Lanang Wijayakusuma, Luh Putu Ida Harini

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) ) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).