Binary Classification for Predicting the Investment Trends of The Younger Generation Based on Machine Learning
DOI:
https://doi.org/10.30871/jaic.v9i6.11549Keywords:
Machine Learning, Feature Engineering, Binary Classification, Multi-class Classification, Hybrid Modeling, Data PreprocessingAbstract
This computational study examines investment behavior patterns among a specialized cohort of 115 final year and thesis writing university students, implementing sophisticated feature engineering to transform categorical survey responses into quantifiable financial metrics. The research methodology leverages this unique dataset where respondents' advanced academic standing provides particularly relevant insights into near-term investment decisions. Experimental outcomes reveal distinct algorithmic performance patterns: Random Forest achieved 69.6% accuracy in multi-class classification with weighted averages of 0.662 precision, 0.696 recall, and 0.678 F1-score, while Logistic Regression demonstrated superior binary classification capability with 82.6% accuracy, supported by 0.818 precision, 0.826 recall, and 0.814 F1-score (weighted averages). The hybrid architecture integrating machine learning with business rules achieved peak performance of 85.2% accuracy, successfully balancing predictive power with operational interpretability. These findings underscore how strategically engineered features combined with a carefully selected respondent pool can effectively decode complex financial behaviors, providing financial institutions with actionable frameworks for developing targeted investment solutions for the graduate student demographic while advancing methodological approaches for specialized survey data in fintech applications.
Downloads
References
[1] M. A. Majid, P. D. Saputri, dan S. Soehardjoepri, “Stock Market Index Prediction using Bi-directional Long Short-Term Memory,” J. Appl. Informatics Comput., vol. 8, no. 1, hal. 55–61, 2024, doi: 10.30871/jaic.v8i1.7195.
[2] V. Chang, Q. A. Xu, A. Chidozie, dan H. Wang, “Predicting Economic Trends and Stock Market Prices with Deep Learning and Advanced Machine Learning Techniques,” Electron., vol. 13, no. 17, 2024, doi: 10.3390/electronics13173396.
[3] Nigar Sultana et al., “Machine Learning Solutions for Predicting Stock Trends in BRICS amid Global Economic Shifts and Decoding Market Dynamics,” J. Econ. Financ. Account. Stud., vol. 6, no. 6, hal. 84–101, 2024, doi: 10.32996/jefas.2024.6.6.7.
[4] M. Miranda dan S. Sriani, “Implementation of K-Means Clustering in Grouping Sales Data at Zura Mart,” J. Appl. Informatics Comput., vol. 9, no. 2, hal. 547–555, 2025, doi: 10.30871/jaic.v9i2.9160.
[5] A. Agung, K. Agung, C. Wiranatha, dan A. Info, “Stock Sentiment Prediction of LQ-45 Based on News Articles Using,” J. Appl. Informatics Comput., vol. 9, no. 4, hal. 1154–1162, 2025.
[6] M. Baratchi et al., Automated machine learning: past, present and future, vol. 57, no. 5. Springer Netherlands, 2024. doi: 10.1007/s10462-024-10726-1.
[7] I. Price et al., “Probabilistic weather forecasting with machine learning,” Nature, vol. 637, no. 8044, hal. 84–90, 2025, doi: 10.1038/s41586-024-08252-9.
[8] G. Long et al., “Machine learning on national shopping data reliably estimates childhood obesity prevalence and socio-economic deprivation,” Food Policy, vol. 131, no. February, hal. 102826, 2025, doi: 10.1016/j.foodpol.2025.102826.
[9] D. Hidayatullah dan I. Jatnika, “Development of a Classification Model for Underpriced Issuers Using Machine Learning Algorithms,” J. Appl. Informatics Comput., vol. 9, no. 4, hal. 1648–1654, 2025.
[10] P. Giudici, “Safe machine learning,” Statistics (Ber)., vol. 58, no. 3, hal. 473–477, 2024, doi: 10.1080/02331888.2024.2361481.
[11] G. Lăzăroiu, T. Gedeon, E. Rogalska, M. Andronie, K. F. Michalikova, dan Z. Musova, The economics of deep and machine learning-based algorithms for COVID-19 prediction, detection, and diagnosis shaping the organizational management of hospitals, vol. 15, no. 1. 2024. doi: 10.24136/oc.2984.
[12] R. Alsabt, W. Alkhaldi, Y. A. Adenle, dan H. M. Alshuwaikhat, “Optimizing waste management strategies through artificial intelligence and machine learning - An economic and environmental impact study,” Clean. Waste Syst., vol. 8, no. April, hal. 100158, 2024, doi: 10.1016/j.clwas.2024.100158.
[13] A. A. Ahmed, S. Sayed, A. Abdoulhalik, S. Moutari, dan L. Oyedele, “Applications of machine learning to water resources management: A review of present status and future opportunities,” J. Clean. Prod., vol. 441, no. August 2023, hal. 140715, 2024, doi: 10.1016/j.jclepro.2024.140715.
[14] J. Tian, C. Yin, M. Wang, H. Li, dan H. Xu, Predicting Property Tax Classifications: An Empirical Study Using Multiple Machine Learning Algorithms on U.S. State-Level Data, no. Icemed 2025. Atlantis Press International BV, 2025. doi: 10.2991/978-94-6463-811-0_36.
[15] Md Fakhrul Islam Sumon et al., “Environmental and Socio-Economic Impact Assessment of Renewable Energy Using Machine Learning Models,” J. Econ. Financ. Account. Stud., vol. 6, no. 5, hal. 112–122, 2024, doi: 10.32996/jefas.2024.6.5.13.
[16] M. Izza dan M. Lutfi, “Detection of Sugarcane Leaf Disease Using Pre-Trained Feature Extraction and SVM Method,” vol. 9, no. 5, hal. 2296–2302, 2025.
[17] S. Mujiyono, U. P. Sanjaya, I. S. Wibisono, dan H. Setyowati, “Prediksi Fluktuasi Berat Badan Berdasarkan Pola Hidup Menggunakan Model XGBoost dan Deep Learning,” J. Algoritm., vol. 22, no. 1, hal. 221–233, 2025, doi: 10.33364/algoritma/v.22-1.2253.
[18] R. M. A. A. Bhirawa, U. P. Sanjaya, I. Engineering, S. Programme, N. Waluyo, dan C. Java, “From Data Imbalance To Precision : Smote-Driven Machine Learning For Early Detection Of Kidney Disease Optimasi Klasifikasi Data Tidak Seimbang Pada,” J. Inovtek Polbeng, vol. 10, no. 1, hal. 514–525, 2025.
[19] U. P. Ais, Salma Rihadatul Sanjaya, “Perbandingan Algoritma Random Forest, XGBoost, dan Logistic Regression untuk Prediksi Risiko Kekambuhan Kanker Tiroid,” Edumatic J. Pendidik. Inform., vol. 9, no. 1, hal. 236–245, Apr 2025, doi: 10.29408/edumatic.v9i1.29644.
[20] R. S. Ilhamy dan U. P. Sanjaya, “Algoritma K-Nearest Neighbors ( KNN ) untuk Klasifikasi Citra Buah Pisang dengan Ekstraksi Ciri Gray Level Co-Occurrence,” J. Telemat., vol. 17, no. 2, hal. 88–93, 2022.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Weka Brilliant Jaya Oktana, Ucta Pradema Sanjaya

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) ) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).








