DeepFake Video

Detection System

AI · Machine Learning · Computer Vision · Deep Learning

MODEL: ResNet50 + LSTM

TASK: Binary Classification

OUTPUT: REAL / FAKE

01

Introduction

⚠

The rapid advancement of AI has made it possible to create highly realistic synthetic videos — commonly known as deepfakes. These manipulated videos can closely mimic real human faces and voices, making it increasingly difficult to distinguish authentic from fake content.

🎯

This project focuses on building a deepfake video detection system using deep learning. The system classifies videos as REAL or FAKE by analyzing visual patterns extracted from video frames.

🔬

The approach leverages preprocessing (frame extraction + face detection) followed by feature learning using neural network models — contributing to trust and authenticity in digital content.

[Illustrative graphic: sample detector readout showing FAKE DETECTED with Confidence 94%, Artifacts 87%, Warping 76%]

03

Summary of Notable DeepFake Tools

Tool	Repository	Key Features
Faceswap	github.com/deepfakes/faceswap	Two encoder-decoder pairs; shared encoder parameters
Faceswap-GAN	github.com/shaoanlu/faceswap-GAN	Adversarial loss + perceptual loss (VGGface) on auto-encoder architecture
Few-Shot Face Translation	github.com/shaoanlu/fewshot-facetranslation-GAN	Pre-trained face recognition for latent embedding; FUNIT + SPADE semantic priors
DeepFaceLab	github.com/iperov/DeepFaceLab	Extended Faceswap with H64, H128, LIAEF128, SAE models; S3FD, MTCNN, dlib extraction

04

Literature Survey Part I

[6]

Face Warping Artifact Detection

A dedicated CNN compares the generated face region with its surrounding regions to detect artifacts. Current deepfake (DF) algorithms can only generate faces at limited resolution, which must then be warped to match the face in the source video; this transformation leaves detectable warping artifacts.

CNN · Artifact Detection · Face Warping

[7]

Eye Blinking Detection

Uncovers fake face recordings by detecting eye blinking patterns, a physiological signal that is often missing or poorly reproduced in synthesized videos. Tested on benchmark datasets with promising results on DNN-generated recordings. The absence of blinking is used as the primary detection cue.

Physiological Signal · Eye Blink · DNN

NOTE: Our strategy extends beyond single-parameter detection — considering teeth, wrinkles, and multiple facial parameters simultaneously.

04

Literature Survey Part II

[8]

Capsule Network Detection

Uses a capsule network to detect forged images and videos across several scenarios, including replay attacks and computer-generated videos. The method adds random noise during training, which we consider undesirable; our method instead trains on a noise-free, real-time dataset for improved robustness.

Capsule Network · Replay Attacks · Forgery Detection

[9]

Biological Signal Detection

Detects fake portrait videos using biological signals (PPG maps) extracted from genuine/fake video pairs. Trains a probabilistic SVM and a CNN on features that capture spatial coherence and temporal consistency. Reports high accuracy regardless of generator, content, or resolution.

PPG Signals · SVM + CNN · Temporal Consistency

05

Problem Statement

🎬

Task Definition

Design and develop a deep learning algorithm to classify video as deepfake or pristine. Predict the probability that a video is fake — a binary classification problem.

📥

Input / Output

INPUT: Video (.mp4), 30 frames @ 1920×1080 px

↓

OUTPUT: Label L ∈ {REAL, FAKE}
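The mapping from a predicted probability to the output label can be sketched as follows. This is a minimal illustration, not the project's code: the function name `label_from_probability` and the 0.5 decision threshold are assumptions, since the slides do not state a threshold.

```python
def label_from_probability(p_fake: float, threshold: float = 0.5) -> str:
    """Map the predicted probability that a video is fake to a label.

    The 0.5 threshold is an assumed default; it can be tuned on a
    validation set to trade precision against recall.
    """
    if not 0.0 <= p_fake <= 1.0:
        raise ValueError("p_fake must be a probability in [0, 1]")
    return "FAKE" if p_fake >= threshold else "REAL"


print(label_from_probability(0.94))  # FAKE
print(label_from_probability(0.12))  # REAL
```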

Loss Function

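Binary cross-entropy loss, minimized over every training sample:

$$BL(V, I) = -I \cdot \log p(Y=1 \mid V) - (1 - I) \cdot \log p(Y=0 \mid V)$$

where I ∈ {0, 1} is the ground-truth label of video V (1 for FAKE, 0 for REAL) and p(Y=i|V) is the probability that the network labels the video as class i.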
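As a worked example of the binary cross-entropy loss above, the sketch below evaluates it for a single video. The function name, the clipping constant `eps`, and the sample probability 0.94 are illustrative assumptions; the label convention (1 = FAKE, 0 = REAL) follows the class encoding used in this deck.

```python
import math


def binary_cross_entropy(p_fake: float, label: int, eps: float = 1e-7) -> float:
    """Per-video loss: -I*log p(Y=1|V) - (1-I)*log p(Y=0|V)."""
    # Clip the probability so that log(0) can never occur.
    p = min(max(p_fake, eps), 1.0 - eps)
    return -label * math.log(p) - (1 - label) * math.log(1.0 - p)


# The same prediction (p = 0.94) scored against each possible true label.
loss_if_fake = binary_cross_entropy(0.94, 1)  # prediction agrees with label
loss_if_real = binary_cross_entropy(0.94, 0)  # prediction contradicts label
print(round(loss_if_fake, 4), round(loss_if_real, 4))  # 0.0619 2.8134
```

A confident correct prediction costs little, while the same confidence on the wrong class is penalized heavily, which is what pushes the network toward calibrated outputs.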

REAL ✅: V* = 0   vs.   FAKE ❌: V* = 1

06

Methodology — Overview

Many tools exist to create DeepFakes, but few can reliably detect them. Our approach targets the following types of deepfakes:

Replacement DF

Retrenchment DF

Interpersonal DF

System Architecture Pipeline

Video Input (.mp4) → Frame Extraction (OpenCV) → Face Detection (Preprocessing) → ResNet50 Features (CNN) → LSTM Sequence (Temporal) → Classification (REAL / FAKE)
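The frame-extraction stage of the pipeline could look roughly like the sketch below. It assumes frames are sampled at evenly spaced positions, which the slides do not specify; `sample_frame_indices` and `extract_frames` are hypothetical helper names. OpenCV is imported inside `extract_frames` so that the index logic can be checked without it.

```python
from typing import List


def sample_frame_indices(total_frames: int, num_samples: int = 30) -> List[int]:
    """Pick up to num_samples evenly spaced frame indices from a video."""
    if total_frames <= 0 or num_samples <= 0:
        return []
    if total_frames <= num_samples:
        return list(range(total_frames))  # short clip: keep every frame
    step = total_frames / num_samples
    return [int(i * step) for i in range(num_samples)]


def extract_frames(video_path: str, num_samples: int = 30) -> list:
    """Read the sampled frames from a video file as BGR arrays."""
    import cv2  # imported lazily; only needed when a video is actually read

    cap = cv2.VideoCapture(video_path)
    total = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
    frames = []
    for idx in sample_frame_indices(total, num_samples):
        cap.set(cv2.CAP_PROP_POS_FRAMES, idx)  # seek to the chosen frame
        ok, frame = cap.read()
        if ok:
            frames.append(frame)
    cap.release()
    return frames


print(sample_frame_indices(300, 30)[:5])  # [0, 10, 20, 30, 40]
```

Sampling across the whole clip, rather than taking the first 30 frames, gives the LSTM a view of the full video at the same sequence length.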

06

Methodology — Technologies & Methods

Methods Used

01 CNN (Convolutional Neural Network)

02 Transfer Learning

03 Binary Classification

04 Frame Extraction Technique

05 ResNet50

06 LSTM

07 Deep Learning-based Detection

Technologies Used

Programming Language	Python
Deep Learning Framework	TensorFlow / Keras
Computer Vision	OpenCV (cv2)
Deep Learning Model	ResNet50
Sequence Model	LSTM
Data Processing	NumPy, Pandas
Visualization	Matplotlib, Seaborn
ML Utilities	Scikit-learn
Dev Environment	Jupyter Notebook

06

Methodology — Algorithm

01 Start

02 Load Real and Fake Video Dataset

03 Extract Frames from Each Video using OpenCV

04 Resize and Preprocess Frames

05 Apply ResNet50 for Feature Extraction

06 Store Extracted Features Sequentially

07 Pass Feature Sequences into LSTM Network

08 Train Deep Learning Model

09 Classify Video as REAL or FAKE

10 Evaluate Model Performance

11 Display Prediction Results

12 End
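The "Resize and Preprocess Frames" step can be sketched as below. The slides name face detection and cropping but not a specific detector, so the OpenCV Haar cascade used here is an assumption, as are the helper names, the 20% margin, and the 224×224 output size.

```python
def expand_box(x, y, w, h, frame_w, frame_h, margin=0.2):
    """Grow a face box by `margin` on each side, clamped to the frame."""
    dx, dy = int(w * margin), int(h * margin)
    x0, y0 = max(x - dx, 0), max(y - dy, 0)
    x1, y1 = min(x + w + dx, frame_w), min(y + h + dy, frame_h)
    return x0, y0, x1, y1


def crop_face(frame, size=224):
    """Return the largest detected face as a size x size RGB crop, or None."""
    import cv2  # imported lazily; only needed when frames are processed

    # Assumed detector: the frontal-face Haar cascade bundled with OpenCV.
    detector = cv2.CascadeClassifier(
        cv2.data.haarcascades + "haarcascade_frontalface_default.xml"
    )
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    faces = detector.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
    if len(faces) == 0:
        return None
    x, y, w, h = max(faces, key=lambda b: b[2] * b[3])  # largest face
    x0, y0, x1, y1 = expand_box(x, y, w, h, frame.shape[1], frame.shape[0])
    face = cv2.resize(frame[y0:y1, x0:x1], (size, size))
    return cv2.cvtColor(face, cv2.COLOR_BGR2RGB).astype("float32")


print(expand_box(100, 100, 200, 200, 1920, 1080))  # (60, 60, 340, 340)
```

The margin keeps some context around the face, where blending boundaries tend to appear. Before the crops are fed to ResNet50, the matching Keras `preprocess_input` function would normally be applied.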

deepfake_detector.py

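import cv2, numpy as np
from tensorflow.keras import Sequential
from tensorflow.keras.applications import ResNet50
from tensorflow.keras.layers import Input, TimeDistributed, LSTM, Dense

# Feature Extractor: global average pooling gives one 2048-d vector per frame
resnet = ResNet50(weights='imagenet',
                  include_top=False,
                  pooling='avg')

# Sequence Model: each video is a (frames, height, width, channels) tensor.
# 30 frames per video; the 224x224 frame size is an assumed input resolution.
model = Sequential([
  Input(shape=(30, 224, 224, 3)),
  TimeDistributed(resnet),  # run ResNet50 on every frame
  LSTM(256),                # learn temporal patterns across frames
  Dense(1, activation='sigmoid')
])

# Binary Classification
model.compile(
  loss='binary_crossentropy',
  optimizer='adam',
  metrics=['accuracy']
)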

06

Methodology — Flow Chart

START → Load Dataset (Real + Fake Videos) → Frame Extraction (OpenCV, 30 frames/video) → Preprocessing (Resize + Face Crop + Normalize) → ResNet50 (CNN Feature Extraction) → LSTM Network (Temporal Sequence Learning) → Classify Video (REAL ✅ / FAKE ❌) → Evaluate Performance (Accuracy · Precision · Recall · F1) → END
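The evaluation step at the end of the flow chart can be illustrated with a small sketch that computes the four metrics from predicted and true labels. The helper name and the eight sample labels are made up for illustration (1 = FAKE, 0 = REAL); they are not results from this project.

```python
def classification_metrics(y_true, y_pred):
    """Accuracy, precision, recall and F1, with FAKE (1) as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Toy labels: 3 true positives, 3 true negatives, 1 false positive, 1 false negative.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 0, 1, 1, 0]
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

Reporting precision and recall alongside accuracy matters here because a missed fake and a falsely flagged real video carry different costs.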

07

Result

Training Accuracy by Epoch

Epoch 1	65%
Epoch 5	78%
Epoch 10	85%

Summary

Final Train Accuracy	92%
Test Accuracy	80%
Architecture	ResNet50 + LSTM
Loss Function	Binary Cross Entropy

08

Comparison Table

Model	Train Accuracy	Test Accuracy	Status
Custom Model	0.8923	0.8027	Baseline
ResNet50 + LSTM ⭐	~0.92	~0.80	Our Model
MesoNet	0.9568	0.8997	Comparison
DenseNet121	0.9699	0.8881	Comparison
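One way to read the comparison table is to look at the gap between train and test accuracy, a rough indicator of overfitting. The sketch below does that arithmetic on the table's values; the approximate figures for ResNet50 + LSTM are taken as 0.92 and 0.80, and the variable names are illustrative.

```python
# (train accuracy, test accuracy) as listed in the comparison table.
results = {
    "Custom Model": (0.8923, 0.8027),
    "ResNet50 + LSTM": (0.92, 0.80),  # table lists these as approximate
    "MesoNet": (0.9568, 0.8997),
    "DenseNet121": (0.9699, 0.8881),
}

# Train-minus-test gap per model; a larger gap suggests more overfitting.
gaps = {name: round(train - test, 4) for name, (train, test) in results.items()}
best_test = max(results, key=lambda name: results[name][1])

for name, gap in sorted(gaps.items(), key=lambda item: item[1]):
    print(f"{name:<18} gap = {gap:.4f}")
print("Highest test accuracy:", best_test)
```

On these numbers, ResNet50 + LSTM shows the largest gap (about 0.12), which is consistent with the Future Work items on larger, more diverse data and improved accuracy.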

09

Future Work

01

📈

Improving Model Accuracy

Enhance detection precision through advanced architectures and fine-tuning strategies.

02

⚡

Real-Time Detection

Develop low-latency inference pipelines for live video stream analysis.

03

🗄

Larger & Diverse Dataset

Expand training data with diverse demographics, lighting, and manipulation techniques.

04

📱

Mobile & Edge Optimization

Optimize model for deployment on mobile devices and edge computing hardware.

05

🔍

Explainable AI

Integrate XAI techniques to visualize and explain model detection decisions.

06

🎵

Multi-Modal Detection

Combine audio + visual signals for more robust cross-modal deepfake detection.

07

🛡

Robustness Against New Techniques

Continuously adapt to emerging deepfake generation methods and adversarial attacks.

10

Conclusion

01

System Design

A deepfake video detection system was developed using deep learning on a large-scale video dataset. After preprocessing (frame extraction and face cropping), the system distinguishes real from manipulated videos by learning spatial patterns with ResNet50 and temporal patterns with an LSTM.

02

Results & Performance

The proposed approach identifies deepfake content with roughly 92% training accuracy and 80% test accuracy, demonstrating the potential of combining convolutional and sequential models. Careful preprocessing and balanced dataset preparation play a key role in achieving reliable results.

03

Broader Impact

This project contributes toward addressing the growing challenge of digital misinformation caused by synthetic media — providing a foundation for advanced deepfake detection in security, media verification, and online content moderation.

Digital Trust: Security · Media · Justice · Privacy · Trust

11

References Part I

[1]

Joshua Brockschmidt, Jiacheng Shang, and Jie Wu. On the Generality of Facial Forgery Detection. IEEE 16th International Conference on Mobile Ad Hoc and Sensor Systems Workshops (MASSW), pp. 43–47. IEEE, 2019.

[2]

Yuezun Li, Ming-Ching Chang, and Siwei Lyu. In Ictu Oculi: Exposing AI Generated Fake Face Videos by Detecting Eye Blinking. arXiv:1806.02877v2, 2018.

[3]

TackHyun Jung, SangWon Kim, and KeeCheon Kim. Deep-Vision: Deepfakes Detection Using Human Eye Blinking Pattern. IEEE Access, 8:83144–83154, 2020.

[4]

Konstantinos Vougioukas, Stavros Petridis, and Maja Pantic. Realistic Speech-Driven Facial Animation with GANs. International Journal of Computer Vision, 128:1398–1413, 2020.

[5]

Hai X. Pham, Yuting Wang, and Vladimir Pavlovic. Generative Adversarial Talking Head: Bringing Portraits to Life with a Weakly Supervised Neural Network. arXiv:1803.07716, 2018.

[6]

Yuezun Li and Siwei Lyu. Exposing DeepFake Videos by Detecting Face Warping Artifacts. arXiv:1811.00656v3.

11

References Part II

[7]

Yuezun Li, Ming-Ching Chang, and Siwei Lyu. Exposing AI Created Fake Videos by Detecting Eye Blinking. arXiv.

[8]

Huy H. Nguyen, Junichi Yamagishi, and Isao Echizen. Using Capsule Networks to Detect Forged Images and Videos.

[9]

Umur Aybars Ciftci, İlke Demir, and Lijun Yin. Detection of Synthetic Portrait Videos using Biological Signals. arXiv:1901.02212v2.

[10]

Ming-Yu Liu, Xun Huang, Arun Mallya, Tero Karras, Timo Aila, Jaakko Lehtinen, and Jan Kautz. Few-Shot Unsupervised Image-to-Image Translation. Proceedings of the IEEE International Conference on Computer Vision, pp. 10551–10560, 2019.

Thank You

DeepFake Video Detection AI/ML Project
