Fraud Detection Analysis in Supplementary Health Insurance Using the LSTM Model
Keywords:
Fraud detection, supplementary health insurance, Long Short-Term Memory (LSTM), deep learning, sequential modeling, feature engineeringAbstract
This study aimed to design and evaluate an advanced Long Short-Term Memory (LSTM) deep learning model to accurately detect fraudulent claims in supplementary health insurance by leveraging sequential data patterns and domain-specific engineered features. An applied research design was used to build a robust fraud detection framework. A dataset of 20,000 health insurance claims was obtained from a supplementary insurance provider, containing both legitimate and fraudulent cases. Raw relational tables — including policy, insured individuals, claims, disease registry, and branch information — were merged into a single structured dataset using SQL. Rigorous data preprocessing was performed: irrelevant variables were removed, highly correlated features were eliminated through correlation analysis, dimensionality reduction was applied via Principal Component Analysis (PCA), and extreme outliers were excluded using the interquartile range (IQR) method. All numerical features were standardized, and class imbalance was addressed by weighting fraudulent cases during training. The processed data were reshaped into sequences suitable for LSTM input and divided into training and testing sets. An LSTM model with 32 hidden units and a Sigmoid output layer was trained using the Adam optimizer and binary cross-entropy loss, with performance validated through k-fold cross-validation. The LSTM model achieved outstanding predictive performance, with an overall accuracy of 100% on the test dataset. Both fraudulent and non-fraudulent claims reached perfect precision, recall, and F1-scores. Macro and weighted averages also recorded 1.00 across all metrics, indicating the model’s ability to detect rare fraudulent events without sacrificing specificity. By combining advanced deep learning with systematic data preparation and domain-informed feature engineering, the proposed LSTM framework effectively identified complex fraud patterns in supplementary health insurance. This approach offers a scalable and reliable solution to strengthen fraud risk management and reduce financial losses in the insurance sector.
Downloads
References
Alarfaj, F. K., Malik, I., Khan, H. U., Almusallam, N., Ramzan, M., & Ahmed, M. (2022). Credit card fraud detection using state-of-the-art machine learning and deep learning algorithms. IEEE Access, 10, 39700-39715. https://doi.org/10.1109/ACCESS.2022.3166891
Ali, A., Abd Razak, S., Othman, S. H., Eisa, T. A. E., Al-Dhaqm, A., Nasser, M., Elhassan, T., Elshafie, H., & Saif, A. (2022). Financial fraud detection based on machine learning: a systematic literature review. Applied Sciences, 12(19), 9637. https://doi.org/10.3390/app12199637
Chang, V., Doan, L. M. T., Di Stefano, A., Sun, Z., & Fortino, G. (2022). Digital payment fraud detection methods in digital ages and Industry 4.0. Computers and Electrical Engineering, 100, 107734. https://doi.org/10.1016/j.compeleceng.2022.107734
Chen, Y., & Wu, Z. (2022). Financial fraud detection of listed companies in China: A machine learning approach. Sustainability, 15(1), 105. https://doi.org/10.3390/su15010105
Hashemi, S. K., Mirtaheri, S. L., & Greco, S. (2023). Fraud Detection in Banking Data by Machine Learning Techniques. IEEE Access, 11, 3034-3043.
Isangediok, M., & Gajamannage, K. (2022). Fraud Detection Using Optimized Machine Learning Tools Under Imbalance Classes. https://doi.org/10.1109/bigdata55660.2022.10020723
Jabbar, M. s. A., & Suharjito, S. (2020). Fraud Detection Call Detail Record Using Machine Learning in Telecommunications Company. Advances in Science Technology and Engineering Systems Journal, 5(4), 63-69. https://doi.org/10.25046/aj050409
Le Khac, N. A., & Kechadi, T. (2020). Application of big data and machine learning in financial forensics and fraud detection. Forensic Science International: Reports, 2, 100088. https://doi.org/10.1016/j.fsir.2020.100088
Maheshwari, M., & Begde, P. (2025). The role of artificial intelligence and machine learning in enhancing tax compliance and fraud detection in India. Journal of Informatics Education and Research, 5(2).
Mahmoudi, M., & Shahrokh, M. (2024). Machine Learning in Fraud Detection.
Minastireanu, E. A., & Mesnita, G. (2019). An Analysis of the Most Used Machine Learning Algorithms for Online Fraud Detection. Informatica Economica, 23(1). https://doi.org/10.12948/issn14531305/23.1.2019.01
Mohamed, A. H., Abourezka, M. A., & Maghraby, F. A. (2021). A Comparative Analysis of Credit Card Fraud Detection Using Machine Learning and Deep Learning Techniques. 267-282. https://doi.org/10.1007/978-981-16-2275-5_16
Mukherjee, U., Thakkar, V., Dutta, S., Mukherjee, U., & Bandyopadhyay, S. K. (2021). Emerging Approach for Detection of Financial Frauds Using Machine Learning. Asian Journal of Research in Computer Science, 9-22. https://doi.org/10.9734/ajrcos/2021/v11i330263
Psychoula, I., Gutmann, A., Mainali, P., Lee, S. H., Dunphy, P., & Petitcolas, F. (2021). Explainable machine learning for fraud detection. Computer, 54(10), 49-59. https://doi.org/10.1109/MC.2021.3081249
Pumsirirat, A., & Liu, Y. (2018). Credit Card Fraud Detection Using Deep Learning Based on Auto-Encoder and Restricted Boltzmann Machine. International Journal of Advanced Computer Science and Applications, 9(1). https://doi.org/10.14569/ijacsa.2018.090103
Sadeghi, T., & Nodehi, A. (2023). Fraud Detection in Bank Cards Based on Image Processing Using Machine Learning Algorithms.
Zhao, Z., & Bai, T. (2022). Financial fraud detection and prediction in listed companies using SMOTE and machine learning algorithms. Entropy, 24(8), 1157. https://doi.org/10.3390/e24081157
Downloads
Published
Submitted
Revised
Accepted
Issue
Section
License
Copyright (c) 2025 Masoumeh Esmaeili (Author); Mohammad Malekinia (Corresponding author); Alireza Pourebrahimi (Author)

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.