Multiclass Classification of Tomato Leaf Diseases Using GLCM, Color, and Shape Feature Extraction with Optimized XGBoost
DOI:
https://doi.org/10.30871/jaic.v9i6.11273Keywords:
Feature Extraction, Gray-Level Co-occurrence Matrix (GLCM), Principal Component Analysis (PCA), Smote, XGBoostAbstract
Automatic classification of tomato leaf diseases is an essential component in advancing precision agriculture based on artificial intelligence. This study aims to develop a multiclass classification model for tomato leaf diseases by utilizing texture, color, and shape features, and employing an optimized XGBoost algorithm. The public PlantVillage dataset was used, with preprocessing stages including feature extraction, normalization, dimensionality reduction using PCA, and class balancing using SMOTE. The experimental results showed that the model successfully classified ten disease classes with a high accuracy of 97.63%, and both macro and weighted f1-scores of 0.98. These findings indicate that the combination of handcrafted features and XGBoost offers an effective, efficient, and applicable solution for plant disease diagnostic systems.
Downloads
References
[1] P. S. Kanda, K. Xia, A. Kyslytysna, and E. O. Owoola, “Tomato Leaf Disease Recognition on Leaf Images Based on Fine-Tuned Residual Neural Networks,” Plants, vol. 11, no. 21, Nov. 2022, doi: 10.3390/plants11212935.
[2] S. Panno et al., “A review of the most common and economically important diseases that undermine the cultivation of tomato crop in the mediterranean basin,” Agronomy, vol. 11, no. 11, pp. 1–45, 2021, doi: 10.3390/agronomy11112188.
[3] M. T. Rahman, S. D. Dipto, I. J. June, A. Momin, and M. R. Al Mamun, “Machine Learning-based Disease Classification in Tomato (Solanum lycopersicum) Plants,” J. Keteknikan Pertan. Trop. dan Biosist., vol. 12, no. 3, pp. 151–160, Dec. 2024, doi: 10.21776/ub.jkptb.2024.012.03.01.
[4] C. Nyasulu et al., “A comparative study of Machine Learning-based classification of Tomato fungal diseases: Application of GLCM texture features,” Heliyon, vol. 9, no. 11, Nov. 2023, doi: 10.1016/j.heliyon.2023.e21697.
[5] M. Altalak, M. A. Uddin, A. Alajmi, and A. Rizg, “A Hybrid Approach for the Detection and Classification of Tomato Leaf Diseases,” Appl. Sci., vol. 12, no. 16, Aug. 2022, doi: 10.3390/app12168182.
[6] A. Khan, U. Nawaz, L. Kshetrimayum, L. Seneviratne, and I. Hussain, “Early and Accurate Detection of Tomato Leaf Diseases Using TomFormer,” ArXiv, Dec. 2023, [Online]. Available: http://arxiv.org/abs/2312.16331
[7] A. Chelladurai, D. P. Manoj Kumar, S. S. Askar, and M. Abouhawwash, “Classification of tomato leaf disease using Transductive Long Short-Term Memory with an attention mechanism,” Front. Plant Sci., vol. 15, 2024, doi: 10.3389/fpls.2024.1467811.
[8] O. Attallah, “Tomato Leaf Disease Classification via Compact Convolutional Neural Networks with Transfer Learning and Feature Selection,” Horticulturae, vol. 9, no. 2, Feb. 2023, doi: 10.3390/horticulturae9020149.
[9] Y. Borhani, J. Khoramdel, and E. Najafi, “A deep learning based approach for automated plant disease classification using vision transformer,” Sci. Rep., vol. 12, no. 1, pp. 1–10, 2022, doi: 10.1038/s41598-022-15163-0.
[10] H. Ghosh et al., “Advanced neural network architectures for tomato leaf disease diagnosis in precision agriculture,” Discov. Sustain., vol. 6, no. 1, 2025, doi: 10.1007/s43621-025-01149-1.
[11] E. Zhang, N. Zhang, F. Li, and C. Lv, “A lightweight dual-attention network for tomato leaf disease identification,” Front. Plant Sci., vol. 15, no. August, pp. 1–18, 2024, doi: 10.3389/fpls.2024.1420584.
[12] M. S. A. M. Al-gaashani, F. Shang, M. Khayyat, M. S. A. Muthanna, and A. A. Abd El-Latif, “IET Image Processing - 2022 - Al‐gaashani - Tomato leaf disease classification by exploiting transfer learning and feature.pdf,” 2021.
[13] L. Tan, J. Lu, and H. Jiang, “Tomato Leaf Diseases Classification Based on Leaf Images: A Comparison between Classical Machine Learning and Deep Learning Methods,” AgriEngineering, vol. 3, no. 3, pp. 542–558, 2021, doi: 10.3390/agriengineering3030035.
[14] A. Tabbakh and S. S. Barpanda, “Evaluation of Machine Learning Models for Plant Disease Classification Using Modified GLCM and Wavelet Based Statistical Features,” Trait. du Signal, vol. 39, no. 6, pp. 1893–1905, 2022, doi: 10.18280/ts.390602.
[15] K. Kishore Kumar and E. Kannan, “An Efficient Deep Neural Network for Disease Detection in Rice Plant Using XGBOOST Ensemble Learning Framework,” Int. J. Intell. Syst. Appl. Eng., vol. 10, no. 3, pp. 116–128, 2022.
[16] S. U. Khan, A. Alsuhaibani, A. Alabduljabbar, F. Almarshad, Y. N. Altherwy, and T. Akram, A review on automated plant disease detection: motivation, limitations, challenges, and recent advancements for future research, vol. 37, no. 3. Springer International Publishing, 2025. doi: 10.1007/s44443-025-00040-3.
[17] R. Khan, N. Ud Din, A. Zaman, and B. Huang, “Automated Tomato Leaf Disease Detection Using Image Processing: An SVM-Based Approach with GLCM and SIFT Features,” J. Eng. (United Kingdom), vol. 2024, 2024, doi: 10.1155/2024/9918296.
[18] C. Cuenca-Romero, O. E. Apolo-Apolo, J. N. Rodríguez Vázquez, G. Egea, and M. Pérez-Ruiz, “Tackling unbalanced datasets for yellow and brown rust detection in wheat,” Front. Plant Sci., vol. 15, no. May, pp. 1–11, 2024, doi: 10.3389/fpls.2024.1392409.
[19] W. Shafik, A. Tufail, C. Liyanage De Silva, and R. A. Awg Haji Mohd Apong, “A novel hybrid inception-xception convolutional neural network for efficient plant disease classification and detection,” Sci. Rep., vol. 15, no. 1, p. 3936, 2025, doi: 10.1038/s41598-024-82857-y.
[20] T. Chen and C. Guestrin, “XGBoost: A scalable tree boosting system,” Proc. ACM SIGKDD Int. Conf. Knowl. Discov. Data Min., vol. 13-17-Augu, pp. 785–794, 2016, doi: 10.1145/2939672.2939785.
[21] Q. H. Cap, H. Uga, S. Kagiwada, and H. Iyatomi, “LeafGAN: An Effective Data Augmentation Method for Practical Plant Disease Diagnosis,” IEEE Trans. Autom. Sci. Eng., vol. 19, no. 2, pp. 1258–1267, 2022, doi: 10.1109/TASE.2020.3041499.
[22] M. Bhandari, T. B. Shahi, A. Neupane, and K. B. Walsh, “BotanicX-AI: Identification of Tomato Leaf Diseases Using an Explanation-Driven Deep-Learning Model,” J. Imaging, vol. 9, no. 2, 2023, doi: 10.3390/jimaging9020053.
[23] M. H. Saleem, S. Khanchi, J. Potgieter, and K. M. Arif, “Image-Based Plant Disease Identification by Deep,” Plants, vol. 9, no. 1451, pp. 1–23, 2020.
[24] N. Ahmad, H. M. S. Asif, G. Saleem, M. U. Younus, S. Anwar, and M. R. Anjum, “Leaf Image-Based Plant Disease Identification Using Color and Texture Features,” Wirel. Pers. Commun., vol. 121, no. 2, pp. 1139–1168, 2021, doi: 10.1007/s11277-021-09054-2.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Fransisko Andrade Laiskodat, Majid Rahardi

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) ) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).








