Proceedings of the 2025 International Conference on Hybrid Commerce, Human Capital, and Economic Dynamics (ICHCH 2025)

Evaluating the Impact of Feature Engineering on Auto Insurance Claim Prediction Models

Authors
Wenyi Fang1, *
1Department of Mathematics and Statistics, McMaster University, Hamilton, ON, L8S 4K1, Canada
*Corresponding author. Email: fangw10@mcmaster.ca
Corresponding Author
Wenyi Fang
Available Online 18 June 2026.
DOI
10.2991/978-2-38476-585-0_17How to use a DOI?
Keywords
Insurance Claim Prediction; Feature Engineering; Machine Learning Models
Abstract

Predicting automobile insurance claim is essential for risk assessment, premium calculating, and fraud detection. All of which help ensure the profitability and stability of insurance companies. This study introduced a novel engineered feature, the Collision Index, designed to quantify localized automobile collision risk, which is aggregated into an insurance claim dataset to evaluate its impact on model performance. Four machine learning models, Random Forest, Extreme Gradient Boosting (XGBoost), Support Vector Machine and Neural Network, were trained using the dataset with and without the engineered feature. Their F1 scores are compared using the paired Student’s t-test. Results indicate that only the Random Forest model witnessed a statistically significant improvement with the inclusion of the collision index at 0.05 significance level. The other models fail to observe significant performance gain, or imply a decisive conclusion due to computational constraints. Limitations of this paper include the imbalanced dataset, the estimated nature of collision index and possible incorrect assumption made during paired t-test.

Copyright
© 2026 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

Volume Title
Proceedings of the 2025 International Conference on Hybrid Commerce, Human Capital, and Economic Dynamics (ICHCH 2025)
Series
Advances in Economics, Business and Management Research
Publication Date
18 June 2026
ISBN
978-2-38476-585-0
ISSN
2352-5428
DOI
10.2991/978-2-38476-585-0_17How to use a DOI?
Copyright
© 2026 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

TY  - CONF
AU  - Wenyi Fang
PY  - 2026
DA  - 2026/06/18
TI  - Evaluating the Impact of Feature Engineering on Auto Insurance Claim Prediction Models
BT  - Proceedings of the 2025 International Conference on Hybrid Commerce, Human Capital, and Economic Dynamics (ICHCH 2025)
PB  - Atlantis Press
SP  - 142
EP  - 151
SN  - 2352-5428
UR  - https://doi.org/10.2991/978-2-38476-585-0_17
DO  - 10.2991/978-2-38476-585-0_17
ID  - Fang2026
ER  -