An Explainable Deep Learning Framework for Robust Deepfake Detection in Video Streams

Amitoj Kaur; Shivangi Sharma

doi:10.2991/978-94-6239-697-5_11

<Previous Article In Volume

Next Article In Volume>

An Explainable Deep Learning Framework for Robust Deepfake Detection in Video Streams

Authors

Amitoj Kaur¹^{, *}, Shivangi Sharma²

¹Gulzar Group of Institutes, Khanna, 141401, Punjab, India

²Gulzar Group of Institutes, Khanna, 141401, India

^*Corresponding author. Email: amitojbdwn@gmail.com

Corresponding Author

Amitoj Kaur

Available Online 4 June 2026.

DOI: 10.2991/978-94-6239-697-5_11 How to use a DOI?
Keywords: Deepfake detection; explainable artificial Intelligence; CNN-LSTM; Grad-CAM; video forensics
Abstract: The creation of deepfakes with the help of state-of-the-art deep learning methods has become a significant challenge to the authenticity of digital media, cybersecurity and trust in the society. As the Generative Adversarial Networks (GANs) and diffusion models of synthesis continue to progress quickly, fake videos are becoming more and more difficult to be detected as fake. Even though detection methods based on deep learning have shown high accuracy, their non-transparency prevents their application in the real world in forensics and legal fields. The proposed paper offers an Explainable Deep Learning Framework of Robust Deepfake Detection in Video Streams that is based on integrating spatial-temporal learning with post-hoc explainability. The suggested model combines a Convolutional Neural Network (CNN) based on spatial artifact extraction and a Long Short-Term Memory (LSTM) network based on time inconsistency modelling on video frames. Gradient-weighted Class Activation Mapping (Grad-CAM) is also used to provide model decisions explanations visually to enhance interpretability. Four benchmark datasets, including FaceForensics++, Celeb-DF, DFDC and DeeperForensics-1.0, are experimented on with a variety of measures. The proposed solution is more accurate and robust and has a better generalization performance as compared to the baseline CNN and CNN-RNN. Moreover, the explainability analysis shows that the model always concentrates on manipulated regions of the face including eye boundaries, mouth contours, and facial boundaries. The findings support the fact that explainable AI should be incorporated into deepfake detection because it increases the level of trust and forensic utility.
Copyright: © 2026 The Author(s)
Open Access: Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Download article (PDF)

<Previous Article In Volume

Next Article In Volume>

Volume Title: Proceedings of the Conference on Bridging Engineering Disciplines with AI and Machine Learning (BEDAIML 2026)
Series: Advances in Intelligent Systems Research
Publication Date: 4 June 2026
ISBN: 978-94-6239-697-5
ISSN: 1951-6851
DOI: 10.2991/978-94-6239-697-5_11 How to use a DOI?
Open Access: Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

Cite this article

ris enw bib

TY  - CONF
AU  - Amitoj Kaur
AU  - Shivangi Sharma
PY  - 2026
DA  - 2026/06/04
TI  - An Explainable Deep Learning Framework for Robust Deepfake Detection in Video Streams
BT  - Proceedings of the Conference on Bridging Engineering Disciplines with AI and Machine Learning (BEDAIML 2026)
PB  - Atlantis Press
SP  - 117
EP  - 124
SN  - 1951-6851
UR  - https://doi.org/10.2991/978-94-6239-697-5_11
DO  - 10.2991/978-94-6239-697-5_11
ID  - Kaur2026
ER  -

download .riscopy to clipboard