Proceedings of the International Conference on Intelligent Data Analysis and Applications (IDAA 2025)

Enhancing BdSL Recognition: Comparative Evaluation of CNN, VGG16, ResNet50, and MobileNet Architectures

Authors
Jannatul Ferdoss Faria1, Sadia Akter1, Fatema Tuj Tarannom Esty1, *
1University of Information Technology and Sciences, Holding 190, Road 5, Block J, BaridharaMaddhaNaya Nagar Rd, Dhaka, 1212, Bangladesh
*Corresponding author. Email: fatema.tarannom@uits.edu.bd
Corresponding Author
Fatema Tuj Tarannom Esty
Available Online 8 June 2026.
DOI
10.2991/978-94-6239-664-7_89How to use a DOI?
Keywords
Bangla Sign Language (BdSL); Deep Learning; CNN; VGG16; ResNet50; MobileNet; Transfer Learning; Gesture Recognition
Abstract

Sign language is an important form of communication for the hearing-impaired. Unfortunately, Bangla Sign Language (BdSL) has been largely neglected in the field of technology studies and applications. We presented a framework to incorporate deep learning into BdSL recognition to create an inclusive means of communication in Bangladesh by translating sign gestures into text in real time. A dataset of more than 3,600 samples was prepared from 36 BdSL gestures in various lighting and hand-shape conditions with the help of 10 volunteers. The data was pre-processed with the following steps: frame extraction, resizing to 224 × 224 pixels, normalization to grayscale, data augmentation, and normalization of landmarks using MediaPipe. Four models were utilized in the study: baseline CNN, VGG16, ResNet50, and MobileNet. The baseline CNN revealed moderate performance, whereas VGG16 with transfer learning bagged higher scores for recognition accuracy. The fine-tuned and augmented MobileNet model outperformed all other techniques at 92% accuracy, which shows its ability to reliably reduce the gesture recognition task. The findings of this study demonstrate the potential for the system to be deployed on mobile devices and embedded systems to create an efficient and effective approach for real-time BdSL-to-text translation for communication with hearing-impaired people in Bangladesh, ultimately supporting their social inclusion.

Copyright
© 2026 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

Volume Title
Proceedings of the International Conference on Intelligent Data Analysis and Applications (IDAA 2025)
Series
Advances in Intelligent Systems Research
Publication Date
8 June 2026
ISBN
978-94-6239-664-7
ISSN
1951-6851
DOI
10.2991/978-94-6239-664-7_89How to use a DOI?
Copyright
© 2026 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

TY  - CONF
AU  - Jannatul Ferdoss Faria
AU  - Sadia Akter
AU  - Fatema Tuj Tarannom Esty
PY  - 2026
DA  - 2026/06/08
TI  - Enhancing BdSL Recognition: Comparative Evaluation of CNN, VGG16, ResNet50, and MobileNet Architectures
BT  - Proceedings of the International Conference on Intelligent Data Analysis and Applications (IDAA 2025)
PB  - Atlantis Press
SP  - 1317
EP  - 1328
SN  - 1951-6851
UR  - https://doi.org/10.2991/978-94-6239-664-7_89
DO  - 10.2991/978-94-6239-664-7_89
ID  - Faria2026
ER  -