Lightweight BiLSTM-Attention Model Using GloVe for Multi-Class Mental Health Classification on Reddit

Authors

  • Devin Branwen Universitas Amikom Yogyakarta
  • Emigawaty Emigawaty Universitas Amikom Yogyakarta

DOI:

https://doi.org/10.30871/jaic.v9i5.10157

Keywords:

Bi-Directional LSTM, Mental Health, Natural Language processing, Social Media, Text Classification

Abstract

Mental health issues such as depression, stress, anxiety, and personality disorders are increasingly prevalent, particularly within online communities. This study proposes a lightweight and efficient multi-class classification framework to identify five mental health conditions using Reddit user-generated posts. While previous studies predominantly rely on conventional CNNs or standard machine learning techniques for binary classification, our work introduces a novel Bidirectional Long Short-Term Memory (BiLSTM) model integrated with an attention mechanism. The architecture is further enhanced by synonym-based data augmentation using the WordNet lexical database, which improves semantic diversity and enhances model robustness, particularly for underrepresented classes. Unlike prior works that focus narrowly on binary classification or employ transformer-based models with high computational demands, our model offers a lightweight, high-performance architecture optimized for multi-class detection and real-world deployment. Experimental results demonstrate that the proposed model achieves a peak validation accuracy of 95.02%, along with precision 95.08%, recall 95.02%, and F1-scores of 95.03%. These findings support the advancement of efficient AI-driven diagnostic systems in mental health analytics and lay the groundwork for future integration into mobile or resource-constrained platforms.

Downloads

Download data is not yet available.

References

[1] Ahadi, S. A., Jazayeri, K., & Tebyani, S. (2024). Detecting Suicidality from Reddit Posts Using a Hybrid CNN - LSTM Model. JUCS - Journal of Universal Computer Science, 30(13), 1872–1904. https://doi.org/10.3897/jucs.119828

[2] Ameer, I., Arif, M., Sidorov, G., Gòmez-Adorno, H., & Gelbukh, A. (2022). Mental Illness Classification on Social Media Texts using Deep Learning and Transfer Learning. http://arxiv.org/abs/2207.01012

[3] Chen, H., Dan, L., Lu, Y., Chen, M., & Zhang, J. (2024). An improved data augmentation approach and its application in medical named entity recognition. BMC Medical Informatics and Decision Making, 24(1). https://doi.org/10.1186/s12911-024-02624-x

[4] Chen, Z., Yang, R., Fu, S., Zong, N., Liu, H., & Huang, M. (n.d.). Detecting Reddit Users with Depression Using a Hybrid Neural Network SBERT-CNN.

[5] Dash, R., Udgata, S., Mohapatra, R. K., Dash, V., & Das, A. (2025). A Deep Learning Approach to Unveil Types of Mental Illness by Analyzing Social Media Posts. Mathematical and Computational Applications, 30(3). https://doi.org/10.3390/mca30030049

[6] García-Noguez, L. R., Salazar-Colores, S., Mondragón-Rodríguez, S., & Tovar-Arriaga, S. (2025). A Novel Methodology for Data Augmentation in Cognitive Impairment Subjects Using Semantic and Pragmatic Features Through Large Language Models. Technologies, 13(8). https://doi.org/10.3390/technologies13080344

[7] Guo, Y., Zhang, Z., & Xu, X. (2023). Research on the detection model of mental illness of online forum users based on convolutional network. BMC Psychology, 11(1). https://doi.org/10.1186/s40359-023-01460-4

[8] Hasan, K., Saquer, J., & Ghosh, M. (2025). Advancing Mental Disorder Detection: A Comparative Evaluation of Transformer and LSTM Architectures on Social Media. http://arxiv.org/abs/2507.19511

[9] Inamdar, S., Chapekar, R., Gite, S., & Pradhan, B. (2023). Machine Learning Driven Mental Stress Detection on Reddit Posts Using Natural Language Processing. Human-Centric Intelligent Systems, 3(2), 80–91. https://doi.org/10.1007/s44230-023-00020-8

[10] Ishikawa, T., Yakoh, T., & Urushihara, H. (2022). An NLP-Inspired Data Augmentation Method for Adverse Event Prediction Using an Imbalanced Healthcare Dataset. IEEE Access, 10, 81166–81176. https://doi.org/10.1109/ACCESS.2022.3195212

[11] Lewy, D., & Mańdziuk, J. (2023). AttentionMix: Data augmentation method that relies on BERT attention mechanism. http://arxiv.org/abs/2309.11104

[12] Montejo-Ráez, A., Molina-González, M. D., Jiménez-Zafra, S. M., García-Cumbreras, M. Á., & García-López, L. J. (2024). A survey on detecting mental disorders with natural language processing: Literature review, trends and challenges. In Computer Science Review (Vol. 53). Elsevier Ireland Ltd. https://doi.org/10.1016/j.cosrev.2024.100654

[13] Odja, K. D., Widiarta, J., Purwanto, E. S., & Ario, M. K. (2024). Mental illness detection using sentiment analysis in social media. Procedia Computer Science, 245(C), 971–978. https://doi.org/10.1016/j.procs.2024.10.325

[14] Oryngozha, N., Shamoi, P., & Igali, A. (2024). Detection and Analysis of Stress-Related Posts in Reddit’s Acamedic Communities. IEEE Access, 12, 14932–14948. https://doi.org/10.1109/ACCESS.2024.3357662

[15] Ren, L., Lin, H., Xu, B., Zhang, S., Yang, L., & Sun, S. (2021). Depression detection on reddit with an emotion-based attention network: Algorithm development and validation. JMIR Medical Informatics, 9(7). https://doi.org/10.2196/28754

[16] Saeed, Q. bin, & Ahmed, I. (n.d.). Early Detection of Mental Health Issues Using Social Media Posts.

[17] Sutranggono, A. N., Sarno, R., & Ghozali, I. (2024). Multi-Class Multi-Level Classification of Mental Health Disorders Based on Textual Data from Social Media. Journal of Information and Communication Technology, 23(1), 77–104. https://doi.org/10.32890/jict2024.23.1.4

[18] Thorstad, R., & Wolff, P. (2019). Predicting future mental illness from social media: A big-data approach. Behavior Research Methods, 51(4), 1586–1600. https://doi.org/10.3758/s13428-019-01235-z

[19] N. Ghoshal, “Reddit Mental Health Dataset,” Kaggle, 2022. [Online]. Available: https://www.kaggle.com/datasets/neelghoshal/reddit-mental-health-data

[20] Zhang, T., Schoene, A. M., Ji, S., & Ananiadou, S. (2022). Natural language processing applied to mental illness detection: a narrative review. In npj Digital Medicine (Vol. 5, Issue 1). Nature Research. https://doi.org/10.1038/s41746-022-00589-7

[21] Sanh, V., Debut, L., Chaumond, J., & Wolf, T. (2020). DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. http://arxiv.org/abs/1910.01108

Downloads

Published

2025-10-21

How to Cite

[1]
D. Branwen and E. Emigawaty, “Lightweight BiLSTM-Attention Model Using GloVe for Multi-Class Mental Health Classification on Reddit”, JAIC, vol. 9, no. 5, pp. 2899–2911, Oct. 2025.

Similar Articles

<< < 2 3 4 5 6 > >> 

You may also start an advanced similarity search for this article.