Utilization of ResNet Architecture and Transfer Learning Method in the Classification of Faces of Individuals with Down Syndrome
Abstract
Classifying the faces of individuals with Down Syndrome poses a significant challenge in image processing and genetic anomaly detection. This study leverages the ResNet34 architecture and transfer learning methods to improve classification accuracy for Down Syndrome facial recognition. Three experiments were conducted, varying the batch size, learning rate, and number of epochs. In the first experiment, the model achieved an accuracy of 82.83%, precision of 0.8362, recall of 0.8350, and an F1 score of 0.8348, showing promising performance but falling short of the target accuracy of 85%. The second experiment yielded the best results, with an accuracy of 87.88%, precision of 0.8956, recall of 0.8956, and an F1 score of 0.8956, indicating an optimal balance between correct predictions and errors. The third experiment resulted in the lowest accuracy, at 80.47%, with a precision of 0.8272, recall of 0.8249, and an F1 score of 0.8247, signifying a decline in performance compared to the other trials. Among the three experiments, the best configuration was achieved in the second trial, as the high recall value is crucial in medical contexts to ensure that as many individuals with Down Syndrome are correctly detected as possible, minimizing the risk of serious consequences due to false negatives.
Downloads
References
K. He, X. Zhang, S. Ren, dan J. Sun, “Deep residual learning for image recognition,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2016, pp. 770-778.
J. Yosinski, J. Clune, Y. Bengio, dan H. Lipson, “How transferable are features in deep neural networks?,” Advances in neural information processing systems, vol. 27, pp. 3320-3328, 2014.
G. Huang, Z. Liu, L. van der Maaten, dan K. Q. Weinberger, “Densely connected convolutional networks,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2017, pp. 2261-2269.
Z. Zhu, X. Huang, dan D. Du, “ResNet-based deep learning model for medical image analysis,” J. Med. Imag. Health Inform., vol. 9, no. 5, pp. 1023-1030, 2019.
S. J. Pan, dan Q. Yang, “A survey on transfer learning,” IEEE Trans. Knowl. Data Eng., vol. 22, no. 10, pp. 1345-1359, 2019.
G. Voulodimos, N. Doulamis, A. Doulamis, dan E. Protopapadakis, “Deep learning for computer vision: A brief review,” Comput. Intell. Neurosci., vol. 2018, pp. 1-13, 2018.
J. Smith, “A disciplined approach to neural network hyper-parameters: Part 1—learning rate, batch size, momentum, and weight decay,” arXiv preprint arXiv:1803.09820, 2018.
F. Rozaan, "Identifikasi wajah pengidap Down Syndrome menggunakan model CNN," Makalah IF4073 Interpretasi dan Pengolahan Citra, Institut Teknologi Bandung, 2023(Makalah-IF4073-Citra-20…).
M. Cayli, "Detection of Down Syndrome in Children," Kaggle, 2023. [Online]. Available: https://www.kaggle.com/datasets/mervecayli/detection-of-down-syndrome-in-children/data. [Accessed: 01-Aug-2024].
I. Loshchilov dan F. Hutter, "Decoupled weight decay regularization," arXiv preprint arXiv:1711.05101, 2019.
S. B. Kotsiantis, “Supervised Machine Learning: A Review of Classification Techniques,” Informatica, vol. 31, no. 3, pp. 249–268, 2007.
D. M. W. Powers, “Evaluation: From precision, recall and F-measure to ROC, informedness, markedness and correlation,” J. Mach. Learn. Technol., vol. 2, no. 1, pp. 37-63, 2011.
M. Sokolova and G. Lapalme, “A systematic analysis of performance measures for classification tasks,” Inf. Process. Manage., vol. 45, no. 4, pp. 427-437, Jul. 2009.
E. W. Steyerberg, Clinical Prediction Models: A Practical Approach to Development, Validation, and Updating. New York, NY, USA: Springer Science & Business Media, 2010
B. Jin, L. Cruz and N. Gonçalves, "Deep Facial Diagnosis: Deep Transfer Learning From Face Recognition to Facial Diagnosis," in IEEE Access, vol. 8, pp. 123649-123661, 2020, doi: 10.1109/ACCESS.2020.3005687.
Ş. Kiliç, İ. Askerzade and Y. Kaya, "Using ResNet Transfer Deep Learning Methods in Person Identification According to Physical Actions," in IEEE Access, vol. 8, pp. 220364-220373, 2020, doi: 10.1109/ACCESS.2020.3040649.
J. Wang et al., "Multiple Genetic Syndromes Recognition Based on a Deep Learning Framework and Cross-Loss Training," in IEEE Access, vol. 10, pp. 117084-117092, 2022, doi: 10.1109/ACCESS.2022.3218160.
Copyright (c) 2024 Made Doddy Adi Pranatha, Gede Herdian Setiawan, M Azman Maricar
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) ) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).