Content-Based Filtering Recommendation System for E-Commerce Products Using Sentence-BERT and Cosine Similarity

Authors

  • Muhammad Abbad Raid Alfath Universitas Amikom Yogyakarta
  • Arif Nur Rohman Universitas Amikom Yogyakarta

DOI:

https://doi.org/10.30871/jaic.v10i3.12920

Keywords:

Recommender System, Content-Based Filtering, Watches, Sentence-BERT, Cosine Similarity

Abstract

The rapid growth of e-commerce catalogs complicates product discovery, particularly for items with complex technical specifications like luxury watches. Conventional keyword searches and TF-IDF methods often fail to capture underlying semantic relationships. The primary objective of this study is to propose a robust e-commerce recommendation system utilizing Content-Based Filtering enhanced with Sentence-BERT (SBERT) semantic embeddings. This study employs an experimental comparative research design. The methodology involves aggregating product attributes into descriptive sentences and pre-processing them to minimize representation bias. These sentences are transformed into high-dimensional embeddings using the lightweight all-MiniLM-L6-v2 SBERT model, with similarities calculated via the Cosine Similarity algorithm. The system's performance is comparatively evaluated against a baseline TF-IDF method. Main outcomes and experimental results across 20 testing scenarios demonstrate that SBERT significantly outperformed the baseline, achieving an average Precision@5 of 93.00%, a Recall@5 of 2.71%, and a highly efficient latency of 0.38 ms. In conclusion, SBERT provides a superior, scalable solution for recommending complex products by accurately capturing the semantic similarity of technical specifications and textual representations of visual characteristics. The approach inherently resolves the item cold-start problem, and its successful integration into a web application confirms its feasibility for real-time similarity computation in modern e-commerce platforms.

Downloads

Download data is not yet available.

References

[1] S. I. Adam dan W. Grivin, “Implementasi Sistem Rekomendasi Produk E-Commerce Menggunakan Content-Based Filtering Berbasis Cosine Similarity,” Jurnal Sistem Informasi dan Teknik Komputer, vol. 10, no. 2, 2025.

[2] S. Rahmadhani dkk., “Sistem Rekomendasi Penelusuran Buku Berbasis Content-Based Filtering dengan Pembobotan TF-RF,” JIP (Jurnal Informatika Polinema), 2024.

[3] A. Nurcahya dan S. Supriyanto, “Content-based recommender system architecture for similar e-commerce products,” Jurnal Informatika, vol. 14, no. 3, hlm. 90, Sep 2020, doi: 10.26555/jifo.v14i3.a18511.

[4] M. P. Syah, A. P. Wardani, M. Idhom, dan Trimono, “Perbandingan Representasi Teks TF-IDF Dan BERT Terhadap Akurasi Cosine Similarity Dalam Penilaian Otomatis Jawaban Berbasis Teks,” Data Sciences Indonesia (DSI), vol. 5, no. 1, hlm. 47–59, Jul 2025, doi: 10.47709/dsi.v5i1.6021.

[5] M. D. S. Antariksa, A. Sugiharto, dan B. Surarso, “BERT Model Fine-tuned for Scientific Document Classification and Recommendation,” Jurnal RESTI, vol. 9, no. 4, hlm. 754–764, Agu 2025, doi: 10.29207/resti.v9i4.6789.

[6] R. M. Holis, P. E. P. Utomo, dan B. F. Hutabarat, “Semantic FAQ Chatbot Using SBERT (Sentence-BERT) and Cosine Similarity for Academic Services,” Brilliance: Research of Artificial Intelligence, vol. 5, no. 2, hlm. 915–922, Okt 2025, doi: 10.47709/brilliance.v5i2.7027.

[7] F. T. Sabilillah, S. Winarno, dan R. B. Abiyyi, “Implementasi BERT dan Cosine Similarity untuk Rekomendasi Dosen Pembimbing berdasarkan Judul Tugas Akhir,” Edumatic: Jurnal Pendidikan Informatika, vol. 8, no. 2, hlm. 585–594, Des 2024, doi: 10.29408/edumatic.v8i2.27791.

[8] A. P. Putra, D. P. S. Putri, dan A. C. Wiranatha, “Scientific Paper Recommendation System: Application of Sentence Transformers and Cosine Similarity Using arXiv Data,” Journal of Applied Informatics and Computing (JAIC), vol. 9, no. 4, 2025.

[9] M. A. Palomino dan F. Aider, “Evaluating the Effectiveness of Text Pre-Processing in Sentiment Analysis,” Applied Sciences (Switzerland), vol. 12, no. 17, Sep 2022, doi: 10.3390/app12178765.

[10] C. Y. Ariyanto dan A. S. Aji, “Pengembangan Sistem Deteksi Plagiarisme Dokumen Jurnal Berbasis Bidirectional Encoder Representations from Transformers Dan Cosine Similarity,” Jurnal Riset Komputer, vol. 12, no. 6, hlm. 942–948, 2025, doi: 10.30865/jurikom.v12i6.9325.

[11] M. Z. F. Johari dan A. D. Laksito, “The Hybrid Recommender System of the Indonesian Online Market Products using IMDb weight rating and TF-IDF,” Jurnal RESTI, vol. 5, no. 5, hlm. 977–983, Okt 2021, doi: 10.29207/resti.v5i5.3486.

[12] R. Alana dan A. Hartanto, “Implementasi Algoritma Content Based Filtering dalam Sistem Rekomendasi Komik,” Sistemasi: Jurnal Sistem Informasi, vol. 13, no. 4, hlm. 1344–1355, 2024, [Daring]. Tersedia pada: http://sistemasi.ftik.unisi.ac.id

[13] M. W. Putri, A. Muchayan, dan M. Kamisutara, “Sistem Rekomendasi Produk Pena Eksklusif Menggunakan Metode Content-Based Filtering dan TF-IDF,” JOINTECS (Journal of Information Technology and Computer Science), vol. 3, no. 1, hlm. 229–236, 2018.

[14] A. Setiawan, Z. Abidin, dan M. Imamudin, “Impact of Preprocessing on Indonesian Extractive Summarization Using LexRank, TextRank, DivRank, and Cosine Similarity,” G-Tech: Jurnal Teknologi Terapan, vol. 9, no. 4, hlm. 2311–2321, Okt 2025, doi: 10.70609/g-tech.v9i4.8306.

[15] H. Hartatik dan A. Syafrianto, “Penerapan Model Sentence-BERT Untuk Sistem Rekomendasi Buku Berbasis Konten Di Perpustakaan Digital,” Jurnal Dialektika Informatika (Detika), vol. 6, no. 1, hlm. 12–19, Nov 2025, doi: 10.24176/detika.v6i1.15916.

[16] M. F. Abdurrafi, D. Handayani, dan U. Ningsih, “Content-based filtering using cosine similarity algorithm for alternative selection on training programs,” Journal of Soft Computing Exploration, 2023, doi: 10.52465/joscex.v4i3.232.

[17] X. Yin, W. Zhang, W. Zhu, S. Liu, dan T. Yao, “Improving sentence representations via component focusing,” Applied Sciences (Switzerland), vol. 10, no. 3, Feb 2020, doi: 10.3390/app10030958.

[18] N. Azizah dan A. F. Rozi, “Sistem Rekomendasi Produk Somethinc Menggunakan Metode Content-based Filtering,” Jurnal Teknologi Dan Sistem Informasi Bisnis, vol. 6, no. 3, hlm. 461–468, Jul 2024, doi: 10.47233/jteksis.v6i3.1411.

[19] M. P. Nabila, J. D. Irawan, dan A. Faisol, “Sistem Rekomendasi Judul Skripsi Menggunakan Cosine Similarity Pada JATI ITN Malang,” JUSIFOR : Jurnal Sistem Informasi dan Informatika, vol. 4, no. 2, hlm. 243–252, Des 2025, doi: 10.70609/jusifor.v4i2.8340.

[20] R. Astri, L. P. Hung, S. Binti Sura, dan A. Kamal, “Improving the Accuracy of Tourism Recommendation System Based on Neural Collaborative Filtering,” Jurnal RESTI, vol. 9, no. 4, hlm. 886–893, 2025, doi: 10.29207/resti.v9i4.6516.

Downloads

Published

2026-06-17

How to Cite

[1]
M. A. Raid Alfath and A. N. Rohman, “Content-Based Filtering Recommendation System for E-Commerce Products Using Sentence-BERT and Cosine Similarity”, JAIC, vol. 10, no. 3, pp. 2930–2937, Jun. 2026.

Similar Articles

<< < 3 4 5 6 7 > >> 

You may also start an advanced similarity search for this article.