Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection

Last update: Dec 02, 2022

Overview

LMFD-PAD

Note

This is the official repository of the paper: LMFD-PAD: Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection. The paper can be found here.

Pipeline Overview

Data preparation

Since all PAD datasets used in our work consist of videos, we sample 10 frames at evenly spaced time intervals from each video. In addition, the ratio of bona fide to attack samples is balanced by simple duplication. Finally, CSV files are generated for training and evaluation. The format of the dataset CSV file is:

image_path,label
/image_dir/image_file_1.png, bonafide
/image_dir/image_file_2.png, bonafide
/image_dir/image_file_3.png, attack
/image_dir/image_file_4.png, attack
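
The preparation steps described above can be sketched roughly as follows. This is a minimal illustration and not the repository's own preprocessing code: the function names (evenly_spaced_indices, balance_by_duplication, protocol_csv_text) are hypothetical, the exact frame-selection rule is an assumption, and the sketch writes fields without a space after the comma.

```python
import csv
import io


def evenly_spaced_indices(num_frames, num_samples=10):
    """Pick frame indices at roughly equal intervals (assumed sampling rule)."""
    if num_frames <= 0:
        return []
    if num_frames <= num_samples:
        # Short video: keep every frame rather than repeating indices.
        return list(range(num_frames))
    step = num_frames / num_samples
    return [int(i * step) for i in range(num_samples)]


def balance_by_duplication(rows):
    """Repeat minority-class rows until both classes have the same count.

    rows is a list of (image_path, label) pairs with label in
    {"bonafide", "attack"}.
    """
    bona = [r for r in rows if r[1] == "bonafide"]
    attack = [r for r in rows if r[1] == "attack"]
    if not bona or not attack:
        return list(rows)  # nothing to balance against
    minority, majority = sorted((bona, attack), key=len)
    missing = len(majority) - len(minority)
    extra = [minority[i % len(minority)] for i in range(missing)]
    return majority + minority + extra


def protocol_csv_text(rows):
    """Render rows as CSV text with the image_path,label header."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["image_path", "label"])
    writer.writerows(rows)
    return buffer.getvalue()
```

Extracting the frames themselves (for example with a video library) is left out on purpose, since the source does not say which tool was used.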

Training

The training code for intra-dataset and cross-dataset experiments is the same; intra_db_main.py and cross_db_main.py differ only in their evaluation metrics.

Example of intra-dataset training and testing:

python intra_db_main.py \
  --protocol_dir 'dir_containing_csv_files' \
  --backbone resnet50 \
  --pretrain True \
  --lr 0.001 \
  --batch_size 64 \
  --prefix 'custom_note'

The cross-dataset training and testing example is similar:

python cross_db_main.py \
  --protocol_dir 'dir_containing_csv_files' \
  --backbone resnet50 \
  --pretrain True \
  --lr 0.001 \
  --batch_size 64 \
  --prefix 'custom_note'
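
This README does not list which metrics each script reports. Purely as an illustration of the kind of metric often used in cross-dataset PAD evaluation, the sketch below computes the Half Total Error Rate (HTER), the mean of the false acceptance rate and the false rejection rate at a fixed threshold. The function name hter, the label encoding (1 = bona fide, 0 = attack), the score direction (higher means more likely bona fide), and the default threshold of 0.5 are all assumptions and may not match what cross_db_main.py does.

```python
def hter(scores, labels, threshold=0.5):
    """Half Total Error Rate at a fixed decision threshold.

    Assumed conventions: labels use 1 for bona fide and 0 for attack,
    and a sample is accepted as bona fide when score >= threshold.
    """
    attack_scores = [s for s, y in zip(scores, labels) if y == 0]
    bona_scores = [s for s, y in zip(scores, labels) if y == 1]
    if not attack_scores or not bona_scores:
        raise ValueError("need at least one sample of each class")
    # FAR: fraction of attacks wrongly accepted as bona fide.
    far = sum(s >= threshold for s in attack_scores) / len(attack_scores)
    # FRR: fraction of bona fide samples wrongly rejected.
    frr = sum(s < threshold for s in bona_scores) / len(bona_scores)
    return (far + frr) / 2
```

In practice the threshold is usually fixed on held-out data from the training datasets rather than hard-coded, so the default here is only a placeholder.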

Results

Cross-dataset evaluation was carried out under different experimental settings on four face PAD datasets. More details can be found in the paper.

Models

Four models, pre-trained under the four cross-dataset experimental settings, can be downloaded via Google Drive.

If you use the LMFD-HAM architecture in this repository, please cite the following paper:

@misc{fang2021learnable,
    title={Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection},
    author={Meiling Fang and Naser Damer and Florian Kirchbuchner and Arjan Kuijper},
    year={2021},
    eprint={2109.07950},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}
