Anti-Data Leakage Pipeline for Differentiated Thyroid Cancer Recurrence Prediction: Integrating SMOTE, Optuna-based Optimization, and Bootstrap BCa Validation

Authors

  • Deri Rosadi Universitas Dian Nuswantoro
  • Sindhu Rakasiwi Universitas Dian Nuswantoro

DOI:

https://doi.org/10.30871/jaic.v10i3.12922

Keywords:

Anti-data leakage, Bootstrap BCa, Thyroid cancer recurrence, Isotonic regression, Machine learning, Calibrated pipeline, SMOTE, OPTUNA

Abstract

Thyroid cancer recurrence prediction remains a critical clinical challenge, as early identification of high-risk patients enables targeted monitoring and intervention. This study presents a comparative evaluation of six machine learning classifiers (XGBoost, LightGBM, CatBoost, Logistic Regression, Random Forest, and Decision Tree) using the UCI Differentiated Thyroid Cancer Recurrence dataset which consists of 383 patient records and 16 clinical features. To prevent performance overestimation, a rigorous anti-data leakage pipeline was implemented, encapsulating SMOTE, Optuna-based hyperparameter optimization, and Isotonic Calibration within the cross-validation process. Furthermore, model stability was assessed using Bias-Corrected and accelerated (BCa) Bootstrap validation with 2,000 iterations. Experimental results demonstrate that XGBoost achieved the best overall performance with an F1-score of 0.9545, an AUC-ROC of 0.9967, and the lowest Brier Score of 0.0183. Bootstrap BCa analysis confirmed XGBoost as the most stable model, with a 95% CI F1-score width of 0.1429 and unbiased estimation. These findings suggest that XGBoost, integrated within a zero-leakage pipeline and validated through Bootstrap BCa, is a promising candidate for post-treatment clinical decision support in differentiated thyroid cancer management.

Downloads

Download data is not yet available.

Author Biographies

Deri Rosadi, Universitas Dian Nuswantoro

Informatics Engineering, Faculty of Computer Science, Universitas Dian Nuswantoro, Semarang, Indonesia

Sindhu Rakasiwi, Universitas Dian Nuswantoro

Lecturer, Informatics Engineering, Faculty of Computer Science, Universitas Dian Nuswantoro, Semarang, Indonesia

References

[1] J. Ferlay et al., “Cancer site ranking,” Lyon, France, 2024. Accessed: Apr. 25, 2026. [Online]. Available: https://gco.iarc.who.int/today

[2] L. Davies and H. G. Welch, “Current thyroid cancer trends in the United States,” JAMA Otolaryngol. Head Neck Surg., vol. 140, no. 4, pp. 317–322, 2014, doi: 10.1001/jamaoto.2014.1.

[3] R. M. Tuttle et al., “Estimating risk of recurrence in differentiated thyroid cancer after total thyroidectomy and radioactive iodine remnant ablation: Using response to therapy variables to modify the initial risk estimates predicted by the new American thyroid association staging system,” Thyroid, vol. 20, no. 12, pp. 1341–1349, Dec. 2010, doi: 10.1089/thy.2010.0178.

[4] Y. Li et al., “A machine learning-based model for predicting recurrence in intermediate- and high-risk differentiated thyroid cancer: insights from a retrospective single-center study of 2388 patients,” Front. Endocrinol. (Lausanne)., vol. 16, 2025, doi: 10.3389/fendo.2025.1552479.

[5] E. Clark, S. Price, T. Lucena, B. Haberlein, A. Wahbeh, and R. Seetan, “Predictive Analytics for Thyroid Cancer Recurrence: A Machine Learning Approach,” Knowledge, vol. 4, no. 4, pp. 557–570, Nov. 2024, doi: 10.3390/knowledge4040029.

[6] D. Thakur, T. Gera, V. Bhardwaj, R. Mazen, A. Lasisi, and T. Engida, “A comparative study on advanced predictive modeling of thyroid cancer recurrence using multi algorithmic machine learning frameworks,” Sci. Rep., Dec. 2025, doi: 10.1038/s41598-025-33396-7.

[7] G. Hu et al., “Machine learning prediction of thyroid cancer recurrence for early screening and clinical decision pathways: a retrospective cohort study,” Discover Oncology, Jan. 2026, doi: 10.1007/s12672-025-04293-2.

[8] A. A. Hanani, T. B. Donmez, M. Kutlu, and M. Mansour, “Predicting thyroid cancer recurrence using supervised CatBoost A SHAP-based explainable AI approach,” Medicine (United States), vol. 104, no. 22, May 2025, doi: 10.1097/MD.0000000000042667.

[9] H. Wen, X. Li, and X. Zhao, “TC check: a web app for thyroid cancer recurrence prediction using explainable machine learning,” J. Cancer Res. Clin. Oncol., vol. 152, no. 1, Jan. 2026, doi: 10.1007/s00432-025-06377-6.

[10] A. Woźniacki, W. Książek, and P. Mrowczyk, “A Novel Approach for Predicting the Survival of Colorectal Cancer Patients Using Machine Learning Techniques and Advanced Parameter Optimization Methods,” Cancers (Basel)., vol. 16, no. 18, Sep. 2024, doi: 10.3390/cancers16183205.

[11] D. A. Murtiningsih, W. Sari, and I. N. Fajri, “Comparison of Light Gradient Boosting Machine, eXtreme Gradient Boosting, and CatBoost with Balancing and Hyperparameter Tuning for Hypertension Risk Prediction on Clinical Dataset,” 2025. [Online]. Available: http://jurnal.polibatam.ac.id/index.php/JAIC

[12] A. Ichwani, R. Indra Kesuma, A. Setiawan, E. Wicaksono, and R. Hanifah, “Preventing Data Leakage in Classification via Integrated Machine Learning Pipelines: Preprocessing, Feature Transformation, and Hyperparameter Tuning,” Jurnal Teknik Informatika (JUTIF), vol. 7, no. 1, 2026, doi: 10.52436/1.jutif.2026.7.2.5490.

[13] P. Netayawijit, W. Chansanam, and K. Sorn-In, “Interpretable Machine Learning Framework for Diabetes Prediction: Integrating SMOTE Balancing with SHAP Explainability for Clinical Decision Support,” Healthcare (Switzerland), vol. 13, no. 20, Oct. 2025, doi: 10.3390/healthcare13202588.

[14] G. Varoquaux and O. Colliot, “Evaluating Machine Learning Models and Their Diagnostic Value,” in Neuromethods, vol. 197, Humana Press Inc., 2023, pp. 601–630. doi: 10.1007/978-1-0716-3195-9_20.

[15] S. Borzooei and A. Tarokhian, “Differentiated Thyroid Cancer Recurrence,” UCI Machine Learning Repository. Accessed: May 02, 2026. [Online]. Available: https://doi.org/10.24432/C5632J

[16] T. Akiba, S. Sano, T. Yanase, T. Ohta, and M. Koyama, “Optuna: A Next-generation Hyperparameter Optimization Framework,” in Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Association for Computing Machinery, Jul. 2019, pp. 2623–2631. doi: 10.1145/3292500.3330701.

[17] A. Niculescu-Mizil and R. Caruana, “Predicting good probabilities with supervised learning,” in Proceedings of the 22nd International Conference on Machine Learning, in ICML ’05. New York, NY, USA: Association for Computing Machinery, 2005, pp. 625–632. doi: 10.1145/1102351.1102430.

[18] Bradley. Efron and Robert. Tibshirani, An introduction to the bootstrap. Chapman & Hall/CRC, 1998.

[19] B. Efron and B. Narasimhan, “The Automatic Construction of Bootstrap Confidence Intervals,” Journal of Computational and Graphical Statistics, vol. 29, no. 3, pp. 608–619, Jul. 2020, doi: 10.1080/10618600.2020.1714633.

[20] S. M. Lundberg and S.-I. Lee, “A Unified Approach to Interpreting Model Predictions,” in Advances in Neural Information Processing Systems, I. Guyon, U. Von Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, Eds., Curran Associates, Inc., 2017. [Online]. Available: https://proceedings.neurips.cc/paper_files/paper/2017/file/8a20a8621978632d76c43dfd28b67767-Paper.pdf

[21] S. M. Lundberg et al., “From local explanations to global understanding with explainable AI for trees,” Nat. Mach. Intell., vol. 2, no. 1, pp. 56–67, 2020, doi: 10.1038/s42256-019-0138-9.

Downloads

Published

2026-06-18

How to Cite

[1]
D. Rosadi and S. Rakasiwi, “Anti-Data Leakage Pipeline for Differentiated Thyroid Cancer Recurrence Prediction: Integrating SMOTE, Optuna-based Optimization, and Bootstrap BCa Validation”, JAIC, vol. 10, no. 3, pp. 2973–2985, Jun. 2026.

Most read articles by the same author(s)

Similar Articles

1 2 3 4 5 > >> 

You may also start an advanced similarity search for this article.