Skincare Product Recommendation System Using Hybrid Ensemble Learning
DOI:
https://doi.org/10.30871/jaic.v10i2.10461Keywords:
Ensemble Learning, Hybrid recommender system, Random Forest, TF-IDF, SVD Matrix FactorizationAbstract
The global beauty and personal care products market reached USD 557.24 billion in 2023 and is projected to reach USD 937.13 billion by 2030, growing at a CAGR of 7.7%. This growth is driven by increasing consumer awareness of personal appearance and the proliferation of e-commerce platforms, particularly in the Asia-Pacific region. However, the wide variety of skincare products creates information overload, making it difficult for consumers to identify products that suit their specific needs. Traditional recommendation systems generally rely on a single approach, either content-based filtering, which struggles to capture user preferences, or collaborative filtering, which faces cold-start and data sparsity issues. This study proposes a hybrid ensemble learning approach that integrates three complementary techniques: (1) TF-IDF content-based filtering to analyze product similarities based on brand, category, and product attributes; (2) SVD matrix factorization collaborative filtering to capture latent patterns of user-product interactions through synthetic user data generation; and (3) Random Forest as a meta-learner to intelligently combine the outputs of the two methods. The proposed system was evaluated using a dataset of more than 7,500 skincare products from Sociolla via the Kaggle repository, covering more than 300 brands with detailed product attributes including ratings, reviews, prices, and customer engagement metrics. The hybrid ensemble approach showed strong predictive performance with an R² score of 0.830, explaining 83.0% of the variance in product ratings. The system successfully recommended five products from five different brands (Biyu, True-to-skin, Jacquelle, The-aubree, and Biore) with ensemble scores ranging from 0.872 to 0.996, demonstrating cross-brand recommendation capabilities. Feature importance analysis shows relatively equal contributions from log_wishlist (29.7%), brand_encoded (25.6%), log_reviews (24.1%), and category_encoded (20.6%). These findings indicate that this hybrid approach effectively overcomes the limitations of single-method recommendation systems and can be adapted to various e-commerce product categories, providing a more relevant and personalized shopping experience for consumers.
Downloads
References
[1] Grand View Research, “Beauty and Personal Care Products Market Report, 2030,” Grand View Research. Accessed: Jul. 10, 2025. [Online]. Available: https://www.grandviewresearch.com/industry-analysis/beauty-personal-care-products-market
[2] N. Yu and others, “Digital transformation and consumer engagement in the beauty industry,” Journal of Retailing and Consumer Services, vol. 69, 2023, doi: 10.1016/j.jretconser.2022.103142.
[3] A. Syamsudin, S. Sabirin, and E. Elliyana, “Generational Differences in Online Shopping: Millennials VS. Generation Z,” Journal of Production, Operations Management and Economics, no. 51, pp. 51–62, Jan. 2025, doi: 10.55529/jpome.51.51.62.
[4] T. Regina, P. Teknik, E. Fakultas, T. Itb, and S. Jakarta, “Dampak Media Sosial Terhadap Keputusan Pembelian Konsumen Gen Z.”
[5] A. Sulami, V. Atina, and N. Nurmalitasari, “Penerapan Metode Content Based Filtering dalam Sistem Rekomendasi Pemilihan Produk Skincare,” STRING (Satuan Tulisan Riset dan Inovasi Teknologi), vol. 9, no. 2, pp. 172–181, 2024.
[6] V. P. Ardiana, V. Atina, and N. Nurmalitasari, “Sistem Rekomendasi Pemilihan Basic Skincare Menggunakan Metode Content Based Filtering,” G-Tech: Jurnal Teknologi Terapan, vol. 8, no. 4, pp. 2154–2167, 2024.
[7] A. Wildana Al Kautsar, “Sistem Rekomendasi Pemilihan Produk Skincare,” 2025. [Online]. Available: www.femaledaily.com,
[8] Z. Jin, F. Ye, N. Nedjah, and X. Zhang, “A comparative study of various recommendation algorithms based on E-commerce big data,” Electron. Commer. Res. Appl., vol. 68, p. 101461, Nov. 2024, doi: 10.1016/j.elerap.2024.101461.
[9] C. Pardede, P. R. Togatorop, A. Aritonang, and R. Lumbantoruan, “Diversity Focused Content-Based Filtering Recommendation System,” in 2024 2nd International Conference on Technology Innovation and Its Applications (ICTIIA), IEEE, Sep. 2024, pp. 1–6. doi: 10.1109/ICTIIA61827.2024.10761479.
[10] E. Erlangga and H. Sutrisno, “Sistem rekomendasi beauty shop berbasis collaborative filtering,” EXPERT: Jurnal Manajemen Sistem Informasi Dan Teknologi, vol. 10, no. 2, pp. 47–52, 2020.
[11] A. A. Hidayat, Joni Maulindar, and R. P. Indah, “Application of Item-Based Collaborative Filtering Method for Skincare Recommendation System,” Journal of Advances in Information and Industrial Technology, vol. 7, no. 1, pp. 13–20, May 2025, doi: 10.52435/jaiit.v7i1.588.
[12] S. A. Pratama, “Pengembangan Sistem Rekomendasi Buku Menggunakan Collaborative Filtering,” Jurnal Komputer, vol. 2, no. 2, Jun. 2024, doi: 10.70963/jk.v2i2.112.
[13] K. S. Y. Putri, I. M. A. D. Suarjaya, and W. O. Vihikan, “Sistem Rekomendasi Skincare Menggunakan Metode Content Based Filtering dan Collaborative Filtering,” Decode: Jurnal Pendidikan Teknologi Informasi, vol. 4, no. 3, pp. 764–774, 2024.
[14] P. Patil, S. U. Kadam, E. R. Aruna, A. More, R. M. Balajee, and B. N. K. Rao, “Recommendation System for E-Commerce Using Collaborative Filtering,” Journal Europeen des Systemes Automatises, vol. 57, no. 4, pp. 1145–1153, Aug. 2024, doi: 10.18280/jesa.570421.
[15] A. Chaudhari, A. A. Hitham Seddig, A. Sarlan, and R. Raut, “A Hybrid Recommendation System: A Review,” IEEE Access, vol. 12, pp. 157107–157126, 2024, doi: 10.1109/ACCESS.2024.3480693.
[16] R. Widayanti, “Improving Recommender Systems using Hybrid Techniques of Collaborative Filtering and Content-Based Filtering,” Journal of Applied Data Sciences, vol. 4, no. 3, pp. 289–302, Sep. 2023, doi: 10.47738/jads.v4i3.115.
[17] Z. Y. Khan, Z. Niu, A. S. Nyamawe, and I. ul Haq, “A Deep Hybrid Model for Recommendation by jointly leveraging ratings, reviews and metadata information,” Eng. Appl. Artif. Intell., vol. 97, p. 104066, Jan. 2021, doi: 10.1016/j.engappai.2020.104066.
[18] V. G. Morales Murillo, D. E. Pinto Avendaño, F. Rojas Lopez, and J. M. Gonzales Calleros, “A Systematic Literature Review on the Hybrid Approaches for Recommender Systems,” Computación y Sistemas, vol. 26, no. 1, Mar. 2022, doi: 10.13053/cys-26-1-4180.
[19] F. Fkih, “Similarity measures for Collaborative Filtering-based Recommender Systems: Review and experimental comparison,” Journal of King Saud University - Computer and Information Sciences, vol. 34, no. 9, pp. 7645–7669, Oct. 2022, doi: 10.1016/j.jksuci.2021.09.014.
[20] L. Q. Rosydah and T. Widiyaningtyas, “Perbandingan Cosine Similarity dan Mean Squared Difference dalam Rekomendasi Buku Fiksi berbasis Item,” Edumatic: Jurnal Pendidikan Informatika, vol. 8, no. 2, pp. 565–574, Dec. 2024, doi: 10.29408/edumatic.v8i2.27783.
[21] K. Sonali, S. V. G. Reddy, K. T. Reddy, and V. V. Kumari, “Analysing the performance of Recommendation System using different similarity metrics,” International Research Journal of Engineering and Technology, 2023, [Online]. Available: www.irjet.net
[22] Statista.com, “Beauty & Personal Care - Indonesia.”
[23] A. Nalluri and Y. Zhang, “A Smart Hybrid Enhanced Recommendation and Personalization Algorithm Using Machine Learning,” in Proceedings of the 16th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, SCITEPRESS - Science and Technology Publications, 2024, pp. 465–472. doi: 10.5220/0013064100003838.
[24] S. S. Roy, A. Kumar, and R. Suresh Kumar, “Metadata and Review-Based Hybrid Apparel Recommendation System Using Cascaded Large Language Models,” IEEE Access, vol. 12, pp. 140053–140071, 2024, doi: 10.1109/ACCESS.2024.3462793.
[25] I. Sanjaya, A. Sujjada, and Y. Pratama, “Implementation of Content-Based Filtering in a Novel Recommendation System to Enhance User Experience,” bit-Tech, vol. 8, no. 1, Aug. 2025, doi: 10.32877/bt.v8i1.2833.
[26] R. Widayanti, M. Heru, R. Chakim, C. Lukita, U. Rahardja, and N. Lutfiani, “Improving Recommender Systems using Hybrid Techniques of Collaborative Filtering and Content-Based Filtering,” Journal of Applied Data Sciences, vol. 4, no. 3, pp. 289–302, 2023.
[27] S. G. Aranzamendez, J. C. Bolito, A. C. Rafe, J. Guialil, D. M. Cortez, and R. Dioses, “An Enhanced Content-based Filtering Using Maximal Marginal Relevance,” International Journal of Computing Sciences Research, vol. 8, pp. 3070–3087, Jan. 2024, doi: 10.25147/ijcsr.2017.001.1.204.
[28] A. H. J. P. Juni Permana and Agung Toto Wibowo, “Movie Recommendation System Based on Synopsis Using Content-Based Filtering with TF-IDF and Cosine Similarity,” International Journal on Information and Communication Technology (IJoICT), vol. 9, no. 2, pp. 1–14, Dec. 2023, doi: 10.21108/ijoict.v9i2.747.
[29] C. Kanduri et al., “Profiling the baseline performance and limits of machine learning models for adaptive immune receptor repertoire classification,” Gigascience, vol. 11, May 2022, doi: 10.1093/gigascience/giac046.
[30] M. Oetarjo, D. A. G. Rohim, V. Firdaus, and S. S. Togayev, “Online Reviews and Ratings Shape Purchasing Decisions in Indonesian E-Commerce,” 2023, pp. 557–566. doi: 10.2991/978-2-38476-052-7_61.
[31] A. Sovia Pramudita, “How Reviews and Ratings Influence Consumer Purchase Intentions in Indonesian E-Commerce,” Jurnal Logistik Bisnis, vol. 14, no. 2, 2024, [Online]. Available: https://ejurnal.poltekpos.ac.id/index.php/logistik/
[32] J. Yang, R. Sarathy, and J. Lee, “The effect of product review balance and volume on online Shoppers’ risk perception and purchase intention,” Decis. Support Syst., vol. 89, pp. 66–76, Sep. 2016, doi: 10.1016/j.dss.2016.06.009.
[33] M. L. Wallace et al., “Use and misuse of random forest variable importance metrics in medicine: demonstrations through incident stroke prediction,” BMC Med. Res. Methodol., vol. 23, no. 1, p. 144, Jun. 2023, doi: 10.1186/s12874-023-01965-x.
[34] C. Wies, R. Miltenberger, G. Grieser, and A. Jahn-Eimermacher, “Exploring the variable importance in random forests under correlations: a general concept applied to donor organ quality in post-transplant survival,” BMC Med. Res. Methodol., vol. 23, no. 1, p. 209, Sep. 2023, doi: 10.1186/s12874-023-02023-2.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Yulviani Puteri Puspita Sari, Anggit Dwi Hartanto, Ika Nur Fajri

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) ) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).








