Improving Retrieval-Augmented Generation Performance Using the MAF-RAG Architecture, EVR–VOR Vector Retrieval, and Multi-Agent Fallback Reasoning
DOI:
https://doi.org/10.30871/jaic.v10i1.11738Keywords:
LLM, Multi-Agent Reasoning, Multi-Phase Retrieval, RAG Architecture, Vector SearchAbstract
Retrieval-Augmented Generation (RAG) AI chatbots have gained popularity for their effectiveness in producing accurate, fast, and reliable responses; however, they have faced critical challenges stemming from limited datasets, outdated documents, and noisy, unfiltered data. This study proposes a Multi-Agent Fallback in Retrieval Augmented Generation (MAF-RAG). This robust RAG system testing pipeline integrates three-phase retrieval, filtering, and re-ranking data, along with a multi-agent debating process to address these challenges. This study demonstrates MAF-RAG's ability to perform under a constrained dataset, using a near-deployment dataset of 1,100 real-world documents. The pipeline utilizes 150 testing queries, carefully selected to reflect real-world RAG-based chatbot scenarios. A sentence-transformers/all-MiniLM-L6-v encoder encodes various chunks of documents into a 384-dimensional query vector embedding, ensuring an accurate relationship between testing queries and vectorized documents. The results show that the proposed MAF-RAG significantly outperforms the baseline system, achieving a mean F1-score of 0.556, an improvement of 18.8% over the Enhanced Baseline (mean F1-score = 0.469) and a 70.0% improvement over the Legacy Baseline (mean F1-score = 0.327). MAF-RAG also achieves the highest success rate, with 78% of the queries, while other baseline systems manage only 34% and 62%, respectively. MAF-RAG also reduces the failure rate by 42.1%, significantly increasing system reliability. Although MAF-RAG exhibits an increase in latency of 4.9%, these trade-offs are outweighed by the significant improvements in system reliability and performance. These findings highlight the contribution of this study: by implementing a robust retrieval testing pipeline, system accuracy can be improved, reducing the presence of noisy and unfiltered documents, and increasing system performance even when faced with challenging and varied datasets, making it a suitable solution for a RAG-based chatbot system that faces dataset challenges.
Downloads
References
[1] C. Chang et al., “MAIN-RAG : Multi-Agent Filtering Retrieval-Augmented Generation,” 2024.
[2] R. Bommasani et al., “On the Opportunities and Risks of Foundation Models,” pp. 1–214, 2022, [Online]. Available: http://arxiv.org/abs/2108.07258
[3] OpenAI, “ChatGPT,” 2025, GPT-5. [Online]. Available: https://openai.com/index/introducing-gpt-5/
[4] Anthropic, “Claude 4,” 2025, Claude 4.5 Sonnet. [Online]. Available: https://www.anthropic.com/news/claude-4
[5] Google., “Gemini,” 2025, Gemini 2.5. [Online]. Available: https://blog.google/technology/google-deepmind/gemini-model-thinking-updates-march-2025/
[6] N. Alharbi, F. Ud Din, D. Paul, and E. Sadgrove, “Driving AI chatbot adoption: A systematic review of factors, barriers, and future research directions,” Journal of Open Innovation: Technology, Market, and Complexity, vol. 11, no. 3, p. 100590, 2025, doi: 10.1016/j.joitmc.2025.100590.
[7] R. Akkiraju et al., “FACTS About Building Retrieval Augmented Generation-based Chatbots,” 2024, [Online]. Available: http://arxiv.org/abs/2407.07858
[8] H. Wang and E. Stengel-eskin, “Retrieval-Augmented Generation with Conflicting Evidence,” pp. 1–22, 2025.
[9] I. Augenstein et al., “Factuality Challenges in the Era of Large Language Models,” pp. 1–13, 2023.
[10] W. Chen, Y. Pan, and L. Pan, “On the Risk of Misinformation Pollution with Large Language Models,” no. 2, pp. 1389–1403, 2023.
[11] J. Zhou and A. G. Parker, “Synthetic Lies : Understanding AI-Generated Misinformation and Evaluating Algorithmic and Human Solutions,” 2023, doi: 10.1145/3544548.3581318.
[12] A. T. Kalai, O. Nachum, S. S. Vempala, and E. Zhang, “Why Language Models Hallucinate,” vol. 3, no. May, pp. 1–36, 2025, [Online]. Available: http://arxiv.org/abs/2509.04664
[13] Pemerintah Provinsi Jawa Tengah, “Central Java Investment Platform.” [Online]. Available: https://cjip.jatengprov.go.id/
[14] Pemerintah Indonesia, “Online Single Submission (OSS-RBA).” [Online]. Available: https://oss.go.id/
[15] D. J. Tengah, “Dinas Penanaman Modal dan Pelayanan Terpadu Satu Pintu Provinsi Jawa Tengah.” [Online]. Available: https://dpmptsp.jatengprov.go.id/
[16] N. Reimers and I. Gurevych, “Sentence-BERT: Sentence embeddings using siamese BERT-networks,” EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference, pp. 3982–3992, 2019, doi: 10.18653/v1/D19-1410.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Erlanda Prasetio, L. Budi Handoko Handoko, Khafiiz Hastuti

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) ) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).








