Boosting CNN Accuracy for Sundanese Script Recognition through Feature Extraction Techniques

Authors

  • Musthofa Galih Pradana Universitas Pembangunan Nasional Veteran Jakarta
  • Hilda Khoirunnisa Politeknik Manufaktur Bandung

DOI:

https://doi.org/10.30871/jaic.v9i6.11332

Keywords:

Convolution Neural Network, Sundanese script, Image Processing

Abstract

Sundanese script is included in the cultural heritage in Indonesia, especially the culture in West Java. As a society that appreciates and preserves Indonesian culture and art, active participation can be realized through efforts to strengthen and preserve this script, one of which is by utilizing digital media. One of the technology-based digital media that can be used to preserve culture is image detection to make it easier to recognize Sundanese script. One of the models that can be used is the Convolutional Neural Network (CNN) with the MobileNetV2 architecture, with limited resources this architecture is able to produce good detection. This study applies the Convolutional Neural Network (CNN) algorithm with the MobileNetV2 architecture which will be tested with two main test scenarios, namely by applying feature extraction and without using feature extraction. The focus of this study will explore the influence and significance of the influence of feature extraction on the final results of image detection using the Convolutional Neural Network (CNN). The two feature extraction models used are Local Binary Pattern and Gray-Level Co-occurrence Matrix. These two feature extraction models will be tested with Sundanese script image data with data of 2,300 Sundanese script images. The results of this study show that the best results were obtained in the Convolutional Neural Network (CNN) with Gray-Level Co-occurrence Matrix (GLCM) with the best accuracy results at 93.8%. This is because the addition of the Gray-Level Co-occurrence Matrix (GLCM) is able to capture spatial texture statistics such as contrast, homogeneity, entropy, and correlation between pixel pairs. With these results, it can be concluded that in this study feature extraction has an effect and is able to increase the detection accuracy of the Convolutional Neural Network (CNN) model with the MobileNetV2 architecture in Sundanese script image data.

Downloads

Download data is not yet available.

References

[1] M. G. Pradana, I. W. Rangga Pinastawa, N. Maulana, and W. D. Prastowo, “Performance Analysis of Tree-Based Algorithms in Predicting Employee Attrition,” CCIT Journal, vol. 16, no. 2, pp. 220–232, Jul. 2023, doi: 10.33050/ccit.v16i2.2580.

[2] M. G. Pradana, P. H. Saputro, and D. L. Tyas, “Unveiling Gender From Indonesian Names Using Random Forest And Logistic Regression Algorithms,” Jurnal Techno Nusa Mandiri, vol. 21, no. 2, pp. 144–150, Sep. 2024, doi: 10.33480/techno.v21i2.5537.

[3] N. A. Arifuddin, I. W. R. Pinastawa, N. Anugraha, and M. G. Pradana, “Classification of Stroke Opportunities with Neural Network and K-Nearest Neighbor Approaches,” SinkrOn, vol. 8, no. 2, pp. 688–693, Apr. 2023, doi: 10.33395/sinkron.v8i2.12228.

[4] M. Galih Pradana, N. Irzavika, N. Maulana, J. Mu, and K. Wari, “Performance Improvement of Cosine Similarity Algorithm with Bidirectional Encoder Representations from Transformers on Abstract Document Similarity Detection,” JOIV : International Journal on Informatics Visualization, vol. 9, no. 2, pp. 824–830, Mar. 2025, doi: http://dx.doi.org/10.62527/joiv.9.2.2853.

[5] Y. A. Wahdah, M. Muhajir, and A. W. Abdullah, “Kamus Online Sebagai Media Penerjemahan Teks Bagi Calon Guru Bahasa Arab,” Edukasiana: Jurnal Inovasi Pendidikan, vol. 2, no. 3, pp. 138–150, 2023, doi: 10.56916/ejip.v2i3.368.

[6] Y. Han, L. Li, and J. Zhang, “A coordinated representation learning enhanced multimodal machine translation approach with multi-attention,” ICMR 2020 - Proceedings of the 2020 International Conference on Multimedia Retrieval, pp. 571–577, 2020, doi: 10.1145/3372278.3390717.

[7] A. Rusdi, M. G. Pradana, and N. A. Arifuddin, “Advancing Realistic Non-Playable Characters Conversations in Juragan Fauna with GPT-3.5,” in 2024 International Conference on Informatics, Multimedia, Cyber and Information System (ICIMCIS), IEEE, Nov. 2024, pp. 996–1001. doi: 10.1109/ICIMCIS63449.2024.10956580.

[8] M. T. R. Laskar, X. Huang, and E. Hoque, “Contextualized embeddings based transformer encoder for sentence similarity modeling in answer selection task,” Proceedings of the Twelfth …, 2020, [Online]. Available: https://aclanthology.org/2020.lrec-1.676/

[9] A. Mulyanto, E. Susanti, F. Rossi, W. Wajiran, and R. I. Borman, “Penerapan Convolutional Neural Network (CNN) pada Pengenalan Aksara Lampung Berbasis Optical Character Recognition (OCR),” Jurnal Edukasi dan Penelitian Informatika (JEPIN), vol. 7, no. 1, p. 52, Apr. 2021, doi: 10.26418/jp.v7i1.44133.

[10] F. T. Anggraeny, E. P. Mandyartha, and D. S. Y. Kartika, “Texture Feature Local Binary Pattern for Handwritten Character Recognition,” in 2020 6th Information Technology International Seminar (ITIS), IEEE, Oct. 2020, pp. 125–129. doi: 10.1109/ITIS50118.2020.9320980.

[11] G. N. Adli Kesaulya, A. Fariza, and T. Karlita, “Javanese Script Text Image Recognition Using Convolutional Neural Networks,” in 2022 International Electronics Symposium (IES), IEEE, Aug. 2022, pp. 534–539. doi: 10.1109/IES55876.2022.9888527.

[12] F. T. Anggraeny, Y. V. Via, and R. Mumpuni, “Image preprocessing analysis in handwritten Javanese character recognition,” Bulletin of Electrical Engineering and Informatics, vol. 12, no. 2, pp. 860–867, Apr. 2023, doi: 10.11591/eei.v12i2.4172.

[13] A. Jindal and R. Ghosh, “An optimized CNN system to recognize handwritten characters in ancient documents in Grantha script,” International Journal of Information Technology, vol. 15, no. 4, pp. 1975–1983, Apr. 2023, doi: 10.1007/s41870-023-01247-1.

[14] I. W. R. Pinastawa, M. G. Pradana, and K. Khoironi, “Edge Detection Model Performance Using Canny, Prewitt and Sobel in Face Detection,” Sinkron, vol. 8, no. 2, pp. 623–631, Mar. 2024, doi: 10.33395/sinkron.v8i2.13497.

[15] M. G. Pradana, H. Khoirunnisa, and I. W. R. Pinastawa, “Evaluation of Convolutional Neural Network Model Architecture Performance,” pp. 628–632, 2023, doi: 10.1109/icimcis60089.2023.10349075.

[16] D. Bhatt et al., “CNN Variants for Computer Vision: History, Architecture, Application, Challenges and Future Scope,” Electronics (Basel), vol. 10, no. 20, p. 2470, Oct. 2021, doi: 10.3390/electronics10202470.

[17] L. Alzubaidi et al., “Review of deep learning: concepts, CNN architectures, challenges, applications, future directions,” J Big Data, vol. 8, no. 1, p. 53, Mar. 2021, doi: 10.1186/s40537-021-00444-8.

[18] P. Setiaji, K. Adi, and B. Surarso, “Development of Classification Method for Determining Chicken Egg Quality Using GLCM-CNN Method,” Ingénierie des systèmes d information, vol. 29, no. 2, pp. 397–407, Apr. 2024, doi: 10.18280/isi.290201.

[19] P. H. Saputro, D. P. Wijaya, M. G. Pradana, D. L. Tyas, and W. F. Zalmi, “Comparison ADAM-optimizer and SGDM for Classification Images of Rice Leaf Disease,” Proceedings - 4th International Conference on Informatics, Multimedia, Cyber and Information System, ICIMCIS 2022, pp. 348–353, 2022, doi: 10.1109/ICIMCIS56303.2022.10017644.

[20] R. O. Ogundokun, R. Maskeliunas, S. Misra, and R. Damaševičius, “Improved CNN Based on Batch Normalization and Adam Optimizer,” 2022, pp. 593–604. doi: 10.1007/978-3-031-10548-7_43.

[21] Y. S. Chowdhury, R. Dasgupta, and S. Nanda, “Analysis of Various Optimizer on CNN model in the Application of Pneumonia Detection,” in 2021 3rd International Conference on Signal Processing and Communication (ICPSC), IEEE, May 2021, pp. 417–421. doi: 10.1109/ICSPC51351.2021.9451768.

[22] A. A. Hidayat, K. Purwandari, T. W. Cenggoro, and B. Pardamean, “A Convolutional Neural Network-based Ancient Sundanese Character Classifier with Data Augmentation,” Procedia Comput Sci, vol. 179, pp. 195–201, 2021, doi: 10.1016/j.procs.2020.12.025.

[23] D. R. Maulana, M. G. Pradana, and M. P. Muslim, “Handwriting Classification of Sundanese Script Using LBP Feature Extraction and CNN,” in 2024 International Conference on Informatics, Multimedia, Cyber and Information System (ICIMCIS), IEEE, Nov. 2024, pp. 1079–1084. doi: 10.1109/ICIMCIS63449.2024.10956206.

[24] S. D. Pande et al., “Digitization of handwritten Devanagari text using CNN transfer learning – A better customer service support,” Neuroscience Informatics, vol. 2, no. 3, p. 100016, Sep. 2022, doi: 10.1016/j.neuri.2021.100016.

[25] N. Saxena and S. Chauhan, “Transformation of handwritten Devnagari script into word editable form using CNN,” in 2020 2nd International Conference on Advances in Computing, Communication Control and Networking (ICACCCN), IEEE, Dec. 2020, pp. 734–738. doi: 10.1109/ICACCCN51052.2020.9362824.

[26] P. Vakharwala, R. Chhabda, V. Painter, U. Pawar, and S. Dastoor, “Performance Analysis of Various Trained CNN Models on Gujarati Script,” 2021, pp. 483–492. doi: 10.1007/978-981-15-7062-9_48.

[27] D. Zhang, Y. Liu, Z. Wang, and D. Wang, “OCR with the Deep CNN Model for Ligature Script-Based Languages like Manchu,” Sci Program, vol. 2021, pp. 1–9, Jun. 2021, doi: 10.1155/2021/5520338.

[28] J. Bati and P. Raj Dawadi, “Ranjana Script Handwritten Character Recognition using CNN,” JOIV : International Journal on Informatics Visualization, vol. 7, no. 3, pp. 984–990, Sep. 2023, doi: 10.30630/joiv.7.3.1725.

[29] A. Alsalihi, H. K. Aljobouri, and E. A. K. ALTameemi, “GLCM and CNN Deep Learning Model for Improved MRI Breast Tumors Detection,” International Journal of Online and Biomedical Engineering (iJOE), vol. 18, no. 12, pp. 123–137, Sep. 2022, doi: 10.3991/ijoe.v18i12.31897.

[30] A. Gurunathan and B. Krishnan, “A Hybrid CNN-GLCM Classifier For Detection And Grade Classification Of Brain Tumor,” Brain Imaging Behav, vol. 16, no. 3, pp. 1410–1427, Jun. 2022, doi: 10.1007/s11682-021-00598-2.

[31] A. Naik and D. R. Edla, “Lung Tumor Classification Using CNN- and GLCM-Based Features,” 2021, pp. 157–163. doi: 10.1007/978-981-15-8289-9_15.

[32] P. Purnawansyah et al., “Comparative Study of Herbal Leaves Classification using Hybrid of GLCM-SVM and GLCM-CNN,” ILKOM Jurnal Ilmiah, vol. 15, no. 2, pp. 382–389, Aug. 2023, doi: 10.33096/ilkom.v15i2.1759.382-389.

[33] M. Yogeshwari and G. Thailambal, “Automatic feature extraction and detection of plant leaf disease using GLCM features and convolutional neural networks,” Mater Today Proc, vol. 81, pp. 530–536, 2023, doi: 10.1016/j.matpr.2021.03.700.

[34] H. N. Carlo, K. Andreas, Meiliana, and A. Y. Zakiyyah, “Comparative Analysis of Fungal Infections Classification in Apple Leaves Using CNN and CNN with GLCM Features,” in 2024 6th International Conference on Cybernetics and Intelligent System (ICORIS), IEEE, Nov. 2024, pp. 1–5. doi: 10.1109/ICORIS63540.2024.10903820.

[35] L. R and L. S, “Enhanced AMD detection in OCT images using GLCM texture features with Machine Learning and CNN methods,” Biomed Phys Eng Express, vol. 11, no. 2, p. 025006, Mar. 2025, doi: 10.1088/2057-1976/ada6bc.

Downloads

Published

2025-12-08

How to Cite

[1]
M. G. Pradana and H. Khoirunnisa, “Boosting CNN Accuracy for Sundanese Script Recognition through Feature Extraction Techniques”, JAIC, vol. 9, no. 6, pp. 3553–3561, Dec. 2025.

Similar Articles

1 2 3 4 5 > >> 

You may also start an advanced similarity search for this article.