Anti-Data Leakage Pipeline for Differentiated Thyroid Cancer Recurrence Prediction: Integrating SMOTE, Optuna-based Optimization, and Bootstrap BCa Validation
DOI:
https://doi.org/10.30871/jaic.v10i3.12922Keywords:
Anti-data leakage, Bootstrap BCa, Thyroid cancer recurrence, Isotonic regression, Machine learning, Calibrated pipeline, SMOTE, OPTUNAAbstract
Thyroid cancer recurrence prediction remains a critical clinical challenge, as early identification of high-risk patients enables targeted monitoring and intervention. This study presents a comparative evaluation of six machine learning classifiers (XGBoost, LightGBM, CatBoost, Logistic Regression, Random Forest, and Decision Tree) using the UCI Differentiated Thyroid Cancer Recurrence dataset which consists of 383 patient records and 16 clinical features. To prevent performance overestimation, a rigorous anti-data leakage pipeline was implemented, encapsulating SMOTE, Optuna-based hyperparameter optimization, and Isotonic Calibration within the cross-validation process. Furthermore, model stability was assessed using Bias-Corrected and accelerated (BCa) Bootstrap validation with 2,000 iterations. Experimental results demonstrate that XGBoost achieved the best overall performance with an F1-score of 0.9545, an AUC-ROC of 0.9967, and the lowest Brier Score of 0.0183. Bootstrap BCa analysis confirmed XGBoost as the most stable model, with a 95% CI F1-score width of 0.1429 and unbiased estimation. These findings suggest that XGBoost, integrated within a zero-leakage pipeline and validated through Bootstrap BCa, is a promising candidate for post-treatment clinical decision support in differentiated thyroid cancer management.
Downloads
References
[1] J. Ferlay et al., “Cancer site ranking,” Lyon, France, 2024. Accessed: Apr. 25, 2026. [Online]. Available: https://gco.iarc.who.int/today
[2] L. Davies and H. G. Welch, “Current thyroid cancer trends in the United States,” JAMA Otolaryngol. Head Neck Surg., vol. 140, no. 4, pp. 317–322, 2014, doi: 10.1001/jamaoto.2014.1.
[3] R. M. Tuttle et al., “Estimating risk of recurrence in differentiated thyroid cancer after total thyroidectomy and radioactive iodine remnant ablation: Using response to therapy variables to modify the initial risk estimates predicted by the new American thyroid association staging system,” Thyroid, vol. 20, no. 12, pp. 1341–1349, Dec. 2010, doi: 10.1089/thy.2010.0178.
[4] Y. Li et al., “A machine learning-based model for predicting recurrence in intermediate- and high-risk differentiated thyroid cancer: insights from a retrospective single-center study of 2388 patients,” Front. Endocrinol. (Lausanne)., vol. 16, 2025, doi: 10.3389/fendo.2025.1552479.
[5] E. Clark, S. Price, T. Lucena, B. Haberlein, A. Wahbeh, and R. Seetan, “Predictive Analytics for Thyroid Cancer Recurrence: A Machine Learning Approach,” Knowledge, vol. 4, no. 4, pp. 557–570, Nov. 2024, doi: 10.3390/knowledge4040029.
[6] D. Thakur, T. Gera, V. Bhardwaj, R. Mazen, A. Lasisi, and T. Engida, “A comparative study on advanced predictive modeling of thyroid cancer recurrence using multi algorithmic machine learning frameworks,” Sci. Rep., Dec. 2025, doi: 10.1038/s41598-025-33396-7.
[7] G. Hu et al., “Machine learning prediction of thyroid cancer recurrence for early screening and clinical decision pathways: a retrospective cohort study,” Discover Oncology, Jan. 2026, doi: 10.1007/s12672-025-04293-2.
[8] A. A. Hanani, T. B. Donmez, M. Kutlu, and M. Mansour, “Predicting thyroid cancer recurrence using supervised CatBoost A SHAP-based explainable AI approach,” Medicine (United States), vol. 104, no. 22, May 2025, doi: 10.1097/MD.0000000000042667.
[9] H. Wen, X. Li, and X. Zhao, “TC check: a web app for thyroid cancer recurrence prediction using explainable machine learning,” J. Cancer Res. Clin. Oncol., vol. 152, no. 1, Jan. 2026, doi: 10.1007/s00432-025-06377-6.
[10] A. Woźniacki, W. Książek, and P. Mrowczyk, “A Novel Approach for Predicting the Survival of Colorectal Cancer Patients Using Machine Learning Techniques and Advanced Parameter Optimization Methods,” Cancers (Basel)., vol. 16, no. 18, Sep. 2024, doi: 10.3390/cancers16183205.
[11] D. A. Murtiningsih, W. Sari, and I. N. Fajri, “Comparison of Light Gradient Boosting Machine, eXtreme Gradient Boosting, and CatBoost with Balancing and Hyperparameter Tuning for Hypertension Risk Prediction on Clinical Dataset,” 2025. [Online]. Available: http://jurnal.polibatam.ac.id/index.php/JAIC
[12] A. Ichwani, R. Indra Kesuma, A. Setiawan, E. Wicaksono, and R. Hanifah, “Preventing Data Leakage in Classification via Integrated Machine Learning Pipelines: Preprocessing, Feature Transformation, and Hyperparameter Tuning,” Jurnal Teknik Informatika (JUTIF), vol. 7, no. 1, 2026, doi: 10.52436/1.jutif.2026.7.2.5490.
[13] P. Netayawijit, W. Chansanam, and K. Sorn-In, “Interpretable Machine Learning Framework for Diabetes Prediction: Integrating SMOTE Balancing with SHAP Explainability for Clinical Decision Support,” Healthcare (Switzerland), vol. 13, no. 20, Oct. 2025, doi: 10.3390/healthcare13202588.
[14] G. Varoquaux and O. Colliot, “Evaluating Machine Learning Models and Their Diagnostic Value,” in Neuromethods, vol. 197, Humana Press Inc., 2023, pp. 601–630. doi: 10.1007/978-1-0716-3195-9_20.
[15] S. Borzooei and A. Tarokhian, “Differentiated Thyroid Cancer Recurrence,” UCI Machine Learning Repository. Accessed: May 02, 2026. [Online]. Available: https://doi.org/10.24432/C5632J
[16] T. Akiba, S. Sano, T. Yanase, T. Ohta, and M. Koyama, “Optuna: A Next-generation Hyperparameter Optimization Framework,” in Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Association for Computing Machinery, Jul. 2019, pp. 2623–2631. doi: 10.1145/3292500.3330701.
[17] A. Niculescu-Mizil and R. Caruana, “Predicting good probabilities with supervised learning,” in Proceedings of the 22nd International Conference on Machine Learning, in ICML ’05. New York, NY, USA: Association for Computing Machinery, 2005, pp. 625–632. doi: 10.1145/1102351.1102430.
[18] Bradley. Efron and Robert. Tibshirani, An introduction to the bootstrap. Chapman & Hall/CRC, 1998.
[19] B. Efron and B. Narasimhan, “The Automatic Construction of Bootstrap Confidence Intervals,” Journal of Computational and Graphical Statistics, vol. 29, no. 3, pp. 608–619, Jul. 2020, doi: 10.1080/10618600.2020.1714633.
[20] S. M. Lundberg and S.-I. Lee, “A Unified Approach to Interpreting Model Predictions,” in Advances in Neural Information Processing Systems, I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, Eds., Curran Associates, Inc., 2017. [Online]. Available: https://proceedings.neurips.cc/paper_files/paper/2017/file/8a20a8621978632d76c43dfd28b67767-Paper.pdf
[21] S. M. Lundberg et al., “From local explanations to global understanding with explainable AI for trees,” Nat. Mach. Intell., vol. 2, no. 1, pp. 56–67, 2020, doi: 10.1038/s42256-019-0138-9.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Deri Rosadi, Sindhu Rakasiwi

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) ) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).








