Proceedings of the International Conference on Intelligent Data Analysis and Applications (IDAA 2025)

BdSL-Net: A Hybrid CNN-LSTM-Attention Framework for Real Time Bangla Sign Language Recognition

Authors
Md Faisal Hasan1, Md. Nazmus Sakib Sheam1, Uzzwal Kumar Biswas1, Jarin Tasnim Tonvi1, *, Syed Ahsanul Kabir1
1Department of Computer Science and Engineering, Green University of Bangladesh, Narayanganj, Dhaka, 1461, Bangladesh
*Corresponding author. Email: jarin@cse.green.edu.bd
Corresponding Author
Jarin Tasnim Tonvi
Available Online 8 June 2026.
DOI
10.2991/978-94-6239-664-7_90How to use a DOI?
Keywords
Bengali Sign Language (BdSL); Real-time Gesture Recognition; Hybrid Deep Learning Model; CNN-LSTM-Attention Network; Explainable Artificial Intelligence (XAI)
Abstract

Communication barriers between the Deaf and hard-of-hearing (DHH) community and the hearing population in Bangladesh persist due to the lack of automated Bengali Sign Language (BdSL) translation tools. This study proposes BdSL-Net, a real-time BdSL recognition framework based on computer vision and deep learning. A custom dataset of video 2,000 samples covering 40 BdSL signs was developed, from which 1,662 skeletal keypoints were extracted per frame using MediaPipe Holistic. The proposed hybrid neural architecture integrates a 1D Convolutional Neural Network (CNN) for spatial feature extraction, a Long Short-Term Memory (LSTM) for temporal sequence modeling, and an Attention mechanism to highlight the most discriminative motion segments. BdSL-Net achieved 96.08% accuracy and was implemented as a realtime prototype. Explainable AI (XAI) analysis further validated that the model effectively attends to crucial temporal features within gesture sequences. The results demonstrate BdSL-Net’s potential as a visionbased assistive technology for bridging communication gaps and enabling future continuous BdSL translation. The findings confirm that the CNN–LSTM–Attention hybrid model offers high-accuracy recognition of BdSL gestures and provides a viable proof of concept for vision-based assistive communication technologies in low-resource linguistic contexts.

Copyright
© 2026 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

Volume Title
Proceedings of the International Conference on Intelligent Data Analysis and Applications (IDAA 2025)
Series
Advances in Intelligent Systems Research
Publication Date
8 June 2026
ISBN
978-94-6239-664-7
ISSN
1951-6851
DOI
10.2991/978-94-6239-664-7_90How to use a DOI?
Copyright
© 2026 The Author(s)
Open Access
Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

TY  - CONF
AU  - Md Faisal Hasan
AU  - Md. Nazmus Sakib Sheam
AU  - Uzzwal Kumar Biswas
AU  - Jarin Tasnim Tonvi
AU  - Syed Ahsanul Kabir
PY  - 2026
DA  - 2026/06/08
TI  - BdSL-Net: A Hybrid CNN-LSTM-Attention Framework for Real Time Bangla Sign Language Recognition
BT  - Proceedings of the International Conference on Intelligent Data Analysis and Applications (IDAA 2025)
PB  - Atlantis Press
SP  - 1329
EP  - 1344
SN  - 1951-6851
UR  - https://doi.org/10.2991/978-94-6239-664-7_90
DO  - 10.2991/978-94-6239-664-7_90
ID  - Hasan2026
ER  -