Detection of Political Hoax News Using Fine-Tuning IndoBERT

Charlotte Jocelynne; IGN Lanang Wijayakusuma; Luh Putu Ida Harini

doi:10.30871/jaic.v9i2.8989

Authors

Charlotte Jocelynne Universitas Udayana
IGN Lanang Wijayakusuma Universitas Udayana
Luh Putu Ida Harini Universitas Udayana

DOI:

https://doi.org/10.30871/jaic.v9i2.8989

Keywords:

Hoax News Detection, IndoBERT, Political Hoax News, Machine Learning, Indonesian News

Abstract

Indonesia has experienced a surge in the spread of political hoax news, posing a potential threat to democratic and social stability. This study aims to develop a model for detecting political hoax news in the Indonesian language using IndoBERT, a language model optimized for Indonesian text. The dataset was sourced from Kaggle and comprises 20,928 factual news articles and 2,251 hoax news articles from major Indonesian media outlets, including CNN, Kompas, Tempo, and Turnbackhoax. The imbalance between factual and hoax news articles was addressed through undersampling, resulting in 1,302 samples for each class. The research stages include data collection, preprocessing, IndoBERT model training, and model evaluation. Results indicate that fine-tuning IndoBERT can detect political hoax news with an accuracy of 94.1% and an ROC AUC of 0.991, demonstrating high performance in accuracy and generalization capability. This research is expected to contribute to minimizing the spread of political hoax news in Indonesia and enhance media literacy among the public.

Downloads

Download data is not yet available.

References

[1] A. Lee, “Online Hoaxes, Existential Threat, and Internet Shutdown: A Case Study of Securitization Dynamics in Indonesia,” Journal of Indonesian Social Sciences and Humanities, vol. 10, no. 1, pp. 17–34, Jun. 2020, doi: 10.14203/jissh.v10i1.156.

[2] D. R. Fatmala, A. Amelia, and F. A. Trianingsih, “The Use of Social Media Bot Accounts on Influencing Public Opinion: A Legal Review in Indonesia,” Legality: Jurnal Ilmiah Hukum, vol. 28, no. 2, pp. 169–182, Sep. 2020, doi: 10.22219/ljih.v28i2.12148.

[3] I. B. K. A. Dwipayana, M. I. S. Abenk, and N. H. Lukman, “Public Awareness in Efforts to Defend The Country in The Digital Era in Order to Fight Hoaxes to Maintain State Resilience,” Journal of Digital Law and Policy, vol. 2, no. 1, pp. 1–10, Sep. 2022, doi: 10.58982/jdlp.v2i1.197.

[4] X. Zhou and R. Zafarani, “A Survey of Fake News: Fundamental Theories, Detection Methods, and Opportunities,” ACM Comput Surv, vol. 53, no. 5, Sep. 2020, doi: 10.1145/3395046.

[5] A. Orhan, “Fake news detection on social media: the predictive role of university students’ critical thinking dispositions and new media literacy,” Smart Learning Environments, vol. 10, no. 1, Dec. 2023, doi: 10.1186/s40561-023-00248-8.

[6] N. Agustina, A. Adrian, and M. Hermawati, “Implementasi Algoritma Naïve Bayes Classifier untuk Mendeteksi Berita Palsu pada Sosial Media,” Faktor Exacta, vol. 14, no. 4, p. 206, Jan. 2022, doi: 10.30998/faktorexacta.v14i4.11259.

[7] N. G. Ramadhan, F. D. Adhinata, A. J. T. Segara, and D. P. Rakhmadani, “Deteksi Berita Palsu Menggunakan Metode Random Forest dan Logistic Regression,” JURIKOM (Jurnal Riset Komputer), vol. 9, no. 2, p. 251, Apr. 2022, doi: 10.30865/jurikom.v9i2.3979.

[8] D. F. N. Anisa, I. Mukhlash, and M. Iqbal, “Deteksi Berita Online Hoax Covid-19 Di Indonesia Menggunakan Metode Hybrid Long Short Term Memory dan Support Vector Machine,” Jurnal Sains dan Seni ITS, vol. 11, no. 3, Mar. 2023, doi: 10.12962/j23373520.v11i3.83227.

[9] A. Hanifa, S. A. Fauzan, M. Hikal, and M. B. Ashfiya, “Perbandingan Metode LSTM dan GRU (RNN) untuk Klasifikasi Berita Palsu Berbahasa Indonesia,” Dinamika Rekayasa, vol. 17, no. 1, p. 33, Jan. 2021, doi: 10.20884/1.dr.2021.17.1.436.

[10] A. Agarwal, M. Mittal, A. Pathak, and L. M. Goyal, “Fake News Detection Using a Blend of Neural Networks: An Application of Deep Learning,” SN Comput Sci, vol. 1, no. 3, May 2020, doi: 10.1007/s42979-020-00165-4.

[11] A. K. Yadav et al., “Fake News Detection Using Hybrid Deep Learning Method,” SN Comput Sci, vol. 4, no. 6, Nov. 2023, doi: 10.1007/s42979-023-02296-w.

[12] A. Aggarwal, A. Chauhan, D. Kumar, M. Mittal, and S. Verma, “Classification of Fake News by Fine-tuning Deep Bidirectional Transformers based Language Model,” EAI Endorsed Transactions on Scalable Information Systems, vol. 7, no. 27, pp. 1–12, 2020, doi: 10.4108/eai.13-7-2018.163973.

[13] S. M. Sr and S. Ahmad, “BERT based Blended approach for Fake News Detection,” Journal of Big Data and Artificial Intelligence, vol. 2, no. 1, Jan. 2024, doi: 10.54116/jbdai.v2i1.27.

[14] R. K. Kaliyar, A. Goswami, and P. Narang, “FakeBERT: Fake news detection in social media with a BERT-based deep learning approach,” Multimed Tools Appl, vol. 80, no. 8, pp. 11765–11788, Mar. 2021, doi: 10.1007/s11042-020-10183-2.

[15] P. Dhiman, A. Kaur, D. Gupta, S. Juneja, A. Nauman, and G. Muhammad, “GBERT: A hybrid deep learning model based on GPT-BERT for fake news detection,” Heliyon, vol. 10, no. 16, Aug. 2024, doi: 10.1016/j.heliyon.2024.e35865.

[16] F. Koto, A. Rahimi, J. H. Lau, and T. Baldwin, “IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP,” Nov. 2020, doi: https://doi.org/10.48550/arXiv.2011.00677.

[17] Y. Liu, S. Agarwal, and S. Venkataraman, “AutoFreeze: Automatically Freezing Model Blocks to Accelerate Fine-tuning,” Feb. 2021, doi: https://doi.org/10.48550/arXiv.2102.01386.

Detection of Political Hoax News Using Fine-Tuning IndoBERT

Authors

DOI:

Keywords:

Abstract

Downloads

References

Downloads

Published

How to Cite

Issue

Section

License

Most read articles by the same author(s)

Similar Articles

submit

tools

issn