Fraud Detection Analysis in Supplementary Health Insurance Using the LSTM Model

Authors

    Masoumeh Esmaeili Department of Management, Ki.c.,, Islamic Azad University, Kish, Iran
    Mohammad Malekinia * Department of Management, ST.C., Islamic Azad University, Tehran, Iran M_malekinia@azad.ac.ir
    Alireza Pourebrahimi Department of Industrial Management, Ka.C., Islamic Azad University, Karaj, Iran

Keywords:

Fraud detection, supplementary health insurance, Long Short-Term Memory (LSTM), deep learning, sequential modeling, feature engineering

Abstract

This study aimed to design and evaluate an advanced Long Short-Term Memory (LSTM) deep learning model to accurately detect fraudulent claims in supplementary health insurance by leveraging sequential data patterns and domain-specific engineered features. An applied research design was used to build a robust fraud detection framework. A dataset of 20,000 health insurance claims was obtained from a supplementary insurance provider, containing both legitimate and fraudulent cases. Raw relational tables — including policy, insured individuals, claims, disease registry, and branch information — were merged into a single structured dataset using SQL. Rigorous data preprocessing was performed: irrelevant variables were removed, highly correlated features were eliminated through correlation analysis, dimensionality reduction was applied via Principal Component Analysis (PCA), and extreme outliers were excluded using the interquartile range (IQR) method. All numerical features were standardized, and class imbalance was addressed by weighting fraudulent cases during training. The processed data were reshaped into sequences suitable for LSTM input and divided into training and testing sets. An LSTM model with 32 hidden units and a Sigmoid output layer was trained using the Adam optimizer and binary cross-entropy loss, with performance validated through k-fold cross-validation. The LSTM model achieved outstanding predictive performance, with an overall accuracy of 100% on the test dataset. Both fraudulent and non-fraudulent claims reached perfect precision, recall, and F1-scores. Macro and weighted averages also recorded 1.00 across all metrics, indicating the model’s ability to detect rare fraudulent events without sacrificing specificity. By combining advanced deep learning with systematic data preparation and domain-informed feature engineering, the proposed LSTM framework effectively identified complex fraud patterns in supplementary health insurance. This approach offers a scalable and reliable solution to strengthen fraud risk management and reduce financial losses in the insurance sector.

Downloads

Download data is not yet available.

References

Alarfaj, F. K., Malik, I., Khan, H. U., Almusallam, N., Ramzan, M., & Ahmed, M. (2022). Credit card fraud detection using state-of-the-art machine learning and deep learning algorithms. IEEE Access, 10, 39700-39715. https://doi.org/10.1109/ACCESS.2022.3166891

Ali, A., Abd Razak, S., Othman, S. H., Eisa, T. A. E., Al-Dhaqm, A., Nasser, M., Elhassan, T., Elshafie, H., & Saif, A. (2022). Financial fraud detection based on machine learning: a systematic literature review. Applied Sciences, 12(19), 9637. https://doi.org/10.3390/app12199637

Chang, V., Doan, L. M. T., Di Stefano, A., Sun, Z., & Fortino, G. (2022). Digital payment fraud detection methods in digital ages and Industry 4.0. Computers and Electrical Engineering, 100, 107734. https://doi.org/10.1016/j.compeleceng.2022.107734

Chen, Y., & Wu, Z. (2022). Financial fraud detection of listed companies in China: A machine learning approach. Sustainability, 15(1), 105. https://doi.org/10.3390/su15010105

Hashemi, S. K., Mirtaheri, S. L., & Greco, S. (2023). Fraud Detection in Banking Data by Machine Learning Techniques. IEEE Access, 11, 3034-3043.

Isangediok, M., & Gajamannage, K. (2022). Fraud Detection Using Optimized Machine Learning Tools Under Imbalance Classes. https://doi.org/10.1109/bigdata55660.2022.10020723

Jabbar, M. s. A., & Suharjito, S. (2020). Fraud Detection Call Detail Record Using Machine Learning in Telecommunications Company. Advances in Science Technology and Engineering Systems Journal, 5(4), 63-69. https://doi.org/10.25046/aj050409

Le Khac, N. A., & Kechadi, T. (2020). Application of big data and machine learning in financial forensics and fraud detection. Forensic Science International: Reports, 2, 100088. https://doi.org/10.1016/j.fsir.2020.100088

Maheshwari, M., & Begde, P. (2025). The role of artificial intelligence and machine learning in enhancing tax compliance and fraud detection in India. Journal of Informatics Education and Research, 5(2).

Mahmoudi, M., & Shahrokh, M. (2024). Machine Learning in Fraud Detection.

Minastireanu, E. A., & Mesnita, G. (2019). An Analysis of the Most Used Machine Learning Algorithms for Online Fraud Detection. Informatica Economica, 23(1). https://doi.org/10.12948/issn14531305/23.1.2019.01

Mohamed, A. H., Abourezka, M. A., & Maghraby, F. A. (2021). A Comparative Analysis of Credit Card Fraud Detection Using Machine Learning and Deep Learning Techniques. 267-282. https://doi.org/10.1007/978-981-16-2275-5_16

Mukherjee, U., Thakkar, V., Dutta, S., Mukherjee, U., & Bandyopadhyay, S. K. (2021). Emerging Approach for Detection of Financial Frauds Using Machine Learning. Asian Journal of Research in Computer Science, 9-22. https://doi.org/10.9734/ajrcos/2021/v11i330263

Psychoula, I., Gutmann, A., Mainali, P., Lee, S. H., Dunphy, P., & Petitcolas, F. (2021). Explainable machine learning for fraud detection. Computer, 54(10), 49-59. https://doi.org/10.1109/MC.2021.3081249

Pumsirirat, A., & Liu, Y. (2018). Credit Card Fraud Detection Using Deep Learning Based on Auto-Encoder and Restricted Boltzmann Machine. International Journal of Advanced Computer Science and Applications, 9(1). https://doi.org/10.14569/ijacsa.2018.090103

Sadeghi, T., & Nodehi, A. (2023). Fraud Detection in Bank Cards Based on Image Processing Using Machine Learning Algorithms.

Zhao, Z., & Bai, T. (2022). Financial fraud detection and prediction in listed companies using SMOTE and machine learning algorithms. Entropy, 24(8), 1157. https://doi.org/10.3390/e24081157

Downloads

Published

2026-01-01

Submitted

2025-06-11

Revised

2025-09-14

Accepted

2025-09-23

Issue

Section

Articles

How to Cite

Esmaeili , M. ., Malekinia, M., & Pourebrahimi , A. (2026). Fraud Detection Analysis in Supplementary Health Insurance Using the LSTM Model. Digital Transformation and Administration Innovation, 1-9. https://www.journaldtai.com/index.php/jdtai/article/view/209

Similar Articles

21-30 of 76

You may also start an advanced similarity search for this article.