Face-Transformer
This is the code for Face Transformer for Recognition (https://arxiv.org/abs/2103.14803v2).
Recently there has been great interest in Transformers, not only in NLP but also in computer vision. We wonder whether Transformers can be used for face recognition and whether they are better than CNNs. Therefore, we investigate the performance of Transformer models in face recognition. The models are trained on the large-scale face recognition database MS-Celeb-1M and evaluated on several mainstream benchmarks, including the LFW, SLLFW, CALFW, CPLFW, TALFW, CFP-FP, AGEDB, and IJB-C databases. We demonstrate that Transformer models achieve performance comparable to CNNs with a similar number of parameters and MACs.
Usage Instructions
1. Preparation
The code is mainly adapted from Vision Transformer and DeiT. In addition to PyTorch and torchvision, install vit_pytorch by Phil Wang and the package timm==0.3.2 by Ross Wightman. We sincerely appreciate their contributions.
pip install vit-pytorch
pip install timm==0.3.2
Copy the files in the folder "copy-to-vit_pytorch-path" to the installed vit-pytorch path (a quick import check is sketched below the file list).
.
├── __init__.py
├── vit_face.py
└── vits_face.py
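After copying, the new modules should be importable from the installed package. The snippet below is a minimal sanity check; the class names ViT_face and ViTs_face are assumptions based on the file names and the -n VIT / -n VITs options used for training.
# Minimal import check (a sketch; class names are assumed from the file names above)
from vit_pytorch.vit_face import ViT_face    # backbone used with -n VIT  (ViT-P8S8)
from vit_pytorch.vits_face import ViTs_face  # backbone used with -n VITs (ViT-P12S8)
print(ViT_face, ViTs_face)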
2. Databases
You can download the training database, MS-Celeb-1M (version ms1m-retinaface), and put it in the folder 'Data'.
You can download the testing databases listed below and put them in the folder 'eval' (a quick load check for the downloaded files is sketched after the list).
- LFW: Baidu Netdisk(password: dfj0), Google Drive
- SLLFW: Baidu Netdisk(password: l1z6), Google Drive
- CALFW: Baidu Netdisk(password: vvqe), Google Drive
- CPLFW: Baidu Netdisk(password: jyp9), Google Drive
- TALFW: Baidu Netdisk(password: izrg), Google Drive
- CFP_FP: Baidu Netdisk(password: 4fem), Google Drive (refer to Insightface)
- AGEDB: Baidu Netdisk(password: rlqf), Google Drive (refer to Insightface)
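As a sanity check after downloading, each evaluation set can be loaded in Python. This is a sketch that assumes the files follow the InsightFace-style .bin pair format (a pickled tuple of encoded image bytes and same/different labels); the file name lfw.bin is an assumption about how the downloaded file is named.
import pickle

# Sketch, assuming InsightFace-style .bin pair files placed in ./eval
with open('./eval/lfw.bin', 'rb') as f:
    bins, issame_list = pickle.load(f, encoding='bytes')
print(len(bins), 'encoded images,', len(issame_list), 'pair labels')  # LFW: 12000 images, 6000 pairs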
3. Train Models
Training is run in three stages with decreasing learning rates; start each later stage from the previous stage's checkpoint via -r path_to_model. An illustrative sketch of the CosFace head selected by -head CosFace follows the command list.
- ViT-P8S8
CUDA_VISIBLE_DEVICES='0,1,2,3' python3 -u train.py -b 480 -w 0,1,2,3 -d retina -n VIT -head CosFace --outdir ./results/ViT-P8S8_ms1m_cosface_s1 --warmup-epochs 1 --lr 3e-4
CUDA_VISIBLE_DEVICES='0,1,2,3' python3 -u train.py -b 480 -w 0,1,2,3 -d retina -n VIT -head CosFace --outdir ./results/ViT-P8S8_ms1m_cosface_s2 --warmup-epochs 0 --lr 1e-4 -r path_to_model
CUDA_VISIBLE_DEVICES='0,1,2,3' python3 -u train.py -b 480 -w 0,1,2,3 -d retina -n VIT -head CosFace --outdir ./results/ViT-P8S8_ms1m_cosface_s3 --warmup-epochs 0 --lr 5e-5 -r path_to_model
- ViT-P12S8
CUDA_VISIBLE_DEVICES='0,1,2,3' python3 -u train.py -b 480 -w 0,1,2,3 -d retina -n VITs -head CosFace --outdir ./results/ViT-P12S8_ms1m_cosface_s1 --warmup-epochs 1 --lr 3e-4
CUDA_VISIBLE_DEVICES='0,1,2,3' python3 -u train.py -b 480 -w 0,1,2,3 -d retina -n VITs -head CosFace --outdir ./results/ViT-P12S8_ms1m_cosface_s2 --warmup-epochs 0 --lr 1e-4 -r path_to_model
CUDA_VISIBLE_DEVICES='0,1,2,3' python3 -u train.py -b 480 -w 0,1,2,3 -d retina -n VITs -head CosFace --outdir ./results/ViT-P12S8_ms1m_cosface_s3 --warmup-epochs 0 --lr 5e-5 -r path_to_model
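For reference, -head CosFace selects a large margin cosine (CosFace) classification head. The following is an illustrative sketch of the idea, not the repository's implementation; the values s=64.0 and m=0.35 are common defaults and are assumptions here.
import torch
import torch.nn as nn
import torch.nn.functional as F

class CosFaceHead(nn.Module):
    # Illustrative CosFace head: logits are s * (cos(theta) - m) for the target class
    # and s * cos(theta) for all other classes.
    def __init__(self, embedding_size, num_classes, s=64.0, m=0.35):
        super().__init__()
        self.s, self.m = s, m
        self.weight = nn.Parameter(torch.empty(num_classes, embedding_size))
        nn.init.xavier_uniform_(self.weight)

    def forward(self, embeddings, labels):
        # Cosine similarity between L2-normalized embeddings and class weights.
        cosine = F.linear(F.normalize(embeddings), F.normalize(self.weight))
        # Subtract the margin only from the target-class logit.
        one_hot = F.one_hot(labels, num_classes=cosine.size(1)).float()
        return self.s * (cosine - one_hot * self.m)  # feed into nn.CrossEntropyLoss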
4. Pretrained Models and Testing (on LFW, SLLFW, CALFW, CPLFW, TALFW, CFP_FP, AGEDB)
You can download the following pretrained models:
- ViT-P8S8: Baidu Netdisk(password: spkf), Google Drive
- ViT-P12S8: Baidu Netdisk(password: 7caa), Google Drive
You can test the models as follows. Use --network VIT for ViT-P8S8 checkpoints and --network VITs for ViT-P12S8 checkpoints:
python test.py --model path_to_model --network VIT
python test.py --model ./results/ViT-P12S8_ms1m_cosface/Backbone_VITs_Epoch_2_Batch_12000_Time_2021-03-17-04-05_checkpoint.pth --network VITs
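A trained checkpoint can also be loaded directly for feature extraction. The snippet below is a sketch: build_backbone stands for constructing the same architecture that was trained (for example ViTs_face from vit_pytorch.vits_face with the training hyperparameters) and is not a helper provided by this repository; the 112x112 input size is an assumption based on the ms1m-retinaface preprocessing.
import torch

def extract_embeddings(build_backbone, checkpoint_path, images):
    # Sketch: rebuild the trained architecture, load its saved state_dict, and
    # return embeddings for a batch of aligned face crops.
    model = build_backbone()  # e.g. a ViTs_face instance (hypothetical helper)
    state_dict = torch.load(checkpoint_path, map_location='cpu')
    model.load_state_dict(state_dict)
    model.eval()
    with torch.no_grad():
        # images: float tensor of shape (N, 3, 112, 112) (assumed input size)
        return model(images)
Embeddings extracted from two images can then be compared with cosine similarity for face verification.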