CV backbones including GhostNet, TinyNet and TNT, developed by Huawei Noah's Ark Lab.

Overview

CV Backbones

This repo collects CV backbones including GhostNet, TinyNet, and TNT (Transformer in Transformer), developed by Huawei Noah's Ark Lab.

News

2022/01/05 PyramidTNT: An improved TNT baseline is released.

2021/09/28 The paper of TNT (Transformer in Transformer) is accepted by NeurIPS 2021.

2021/09/18 The extended version of Versatile Filters is accepted by T-PAMI.

2021/08/30 The GhostNet paper is selected as one of the Most Influential CVPR 2020 Papers.

2021/08/26 The code of LegoNet and Versatile Filters has been merged into this repo.

2021/06/15 The code of TNT (Transformer in Transformer) has been released in this repo.

2020/10/31 GhostNet+TinyNet achieves better performance. See details in our NeurIPS 2020 paper: arXiv.

2020/06/10 GhostNet is included in PyTorch Hub.


GhostNet Code

This repo provides GhostNet pretrained models and inference code for TensorFlow and PyTorch:

For training, please refer to tinynet or timm.
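
For a quick sanity check without cloning this repo, GhostNet can also be loaded through its timm port (see "Other versions of GhostNet" below). A minimal inference sketch, assuming your timm build registers the ghostnet_100 variant; the commented PyTorch Hub call corresponds to the hub entry announced in the News, with the repo path and entry name also assumed:

# Minimal GhostNet inference sketch (assumption: timm registers 'ghostnet_100').
import torch
import timm

model = timm.create_model('ghostnet_100', pretrained=True)  # GhostNet 1.0x, ImageNet weights
# Alternative via PyTorch Hub (repo path and entry name assumed):
# model = torch.hub.load('huawei-noah/ghostnet', 'ghostnet_1x', pretrained=True)
model.eval()

x = torch.randn(1, 3, 224, 224)          # dummy ImageNet-sized input
with torch.no_grad():
    logits = model(x)                    # shape (1, 1000)
print(logits.argmax(dim=1))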

TinyNet Code

This repo provides TinyNet pretrained models and inference code for PyTorch:
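
A minimal TinyNet inference sketch via the timm port; the tinynet_a .. tinynet_e registry names (and their pretrained weights) are assumed to be available in a recent timm release:

# Minimal TinyNet inference sketch (assumption: recent timm registers 'tinynet_a'..'tinynet_e').
import torch
import timm

model = timm.create_model('tinynet_e', pretrained=True)   # smallest TinyNet variant
model.eval()

# TinyNets use input resolutions below 224; read the expected size from timm's config.
c, h, w = model.default_cfg['input_size']
x = torch.randn(1, c, h, w)
with torch.no_grad():
    probs = model(x).softmax(dim=1)
print(probs.topk(5, dim=1))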

TNT Code

This repo provides training code and pretrained models of TNT (Transformer in Transformer) for PyTorch:

The code of PyramidTNT is also released:
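
A minimal TNT inference sketch via the timm port; the tnt_s_patch16_224 registry name is assumed, and pretrained weights may not be hosted for every timm version (fall back to pretrained=False if needed):

# Minimal TNT inference sketch (assumption: timm registers 'tnt_s_patch16_224').
import torch
import timm

model = timm.create_model('tnt_s_patch16_224', pretrained=True)
model.eval()

x = torch.randn(1, 3, 224, 224)
with torch.no_grad():
    logits = model(x)
print(logits.shape)   # expected: torch.Size([1, 1000])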

LegoNet Code

This repo provides the implementation of the paper LegoNet: Efficient Convolutional Neural Networks with Lego Filters (ICML 2019).

Versatile Filters Code

This repo provides the implementation of the paper Learning Versatile Filters for Efficient Convolutional Neural Networks (NeurIPS 2018).

Citation

@inproceedings{ghostnet,
  title={GhostNet: More Features from Cheap Operations},
  author={Han, Kai and Wang, Yunhe and Tian, Qi and Guo, Jianyuan and Xu, Chunjing and Xu, Chang},
  booktitle={CVPR},
  year={2020}
}
@inproceedings{tinynet,
  title={Model Rubik’s Cube: Twisting Resolution, Depth and Width for TinyNets},
  author={Han, Kai and Wang, Yunhe and Zhang, Qiulin and Zhang, Wei and Xu, Chunjing and Zhang, Tong},
  booktitle={NeurIPS},
  year={2020}
}
@inproceedings{tnt,
  title={Transformer in transformer},
  author={Han, Kai and Xiao, An and Wu, Enhua and Guo, Jianyuan and Xu, Chunjing and Wang, Yunhe},
  booktitle={NeurIPS},
  year={2021}
}
@inproceedings{legonet,
  title={LegoNet: Efficient Convolutional Neural Networks with Lego Filters},
  author={Yang, Zhaohui and Wang, Yunhe and Liu, Chuanjian and Chen, Hanting and Xu, Chunjing and Shi, Boxin and Xu, Chao and Xu, Chang},
  booktitle={ICML},
  year={2019}
}
@inproceedings{wang2018learning,
  title={Learning versatile filters for efficient convolutional neural networks},
  author={Wang, Yunhe and Xu, Chang and Chunjing, XU and Xu, Chao and Tao, Dacheng},
  booktitle={NeurIPS},
  year={2018}
}

Other versions of GhostNet

This repo provides the TensorFlow/PyTorch code of GhostNet. Other versions and applications can be found in the following:

  1. timm: code with pretrained model
  2. Darknet: cfg file, and description
  3. Gluon/Keras/Chainer: code
  4. Paddle: code
  5. Bolt inference framework: benchmark
  6. Human pose estimation: code
  7. YOLO with GhostNet backbone: code
  8. Face recognition: cavaface, FaceX-Zoo, TFace
Comments
  • TypeError: __init__() got an unexpected keyword argument 'bn_tf'

    Hello, I want to ask what caused the following error when running the train.py file? Thank you. "TypeError: __init__() got an unexpected keyword argument 'bn_tf'"

    opened by ModeSky 16
  • Counting ReLU vs HardSwish FLOPs

    Thank you very much for sharing the source code. I have a question related to FLOPs counting for ReLU and HardSwish. I saw in the paper that the FLOPs are the same for ReLU and HardSwish. Can you explain this?

    opened by jahongir7174 10
  • kernel size in primary convolution of Ghost module

    Hi, it is said in your paper that the primary convolution in the Ghost module can have a customized kernel size, which is a major difference from existing efficient convolution schemes. However, it seems that in this code all the kernel sizes of the primary convolution in the Ghost module are set to [1, 1], and the kernels set in _CONV_DEFS_0 are only used in blocks of stride=2. Is this set intentionally? (A condensed GhostModule sketch is included after these comments.)

    opened by YUHAN666 9
  • Replacing Conv2d with GhostModule: why does the loss decrease so slowly?

    I directly replaced the Conv2d inside EfficientNet's MBConvBlock with GhostModule: Conv2d(in_channels=inp, out_channels=oup, kernel_size=1, bias=False) was replaced with GhostModule(inp, oup), with all other parameters unchanged. Why does the loss now converge more slowly than before and never come down? Do I need to modify any other parameters?

    opened by yc-cui 8
  • Training hyperparams on ImageNet

    Hi, thanks for sharing such a wonderful work. I'd like to reproduce your results on ImageNet; could you please specify training parameters such as the initial learning rate, how it is decayed, batch size, etc.? It would be even better if you could share the tricks used to train GhostNet, such as label smoothing and data augmentation. Thanks!

    good first issue 
    opened by sean-zhuh 8
  • Why did you exclude EfficientNetB0 from Accuracy-Latency chart?

    @iamhankai Hi,

    Great work!

    1. Why did you exclude EfficientNetB0 (0.390 BFlops - 76.3% Top1) from Accuracy-Latency chart?

    2. Also what mini_batch_size did you use for training GhostNet?

    opened by AlexeyAB 8
  • VIG pretrained weights

    @huawei-noah-admin can you please share the ViG pretrained model on Google Drive or OneDrive, as Baidu is not accessible from our end?

    Thanks in advance

    opened by abhigoku10 7
  • The implementation of Isotropic architecture

    Hi, thanks for sharing this impressive work. The paper mentions two architectures, an isotropic one and a pyramid one. I noticed that in the code there is a reduce_ratios, and these reduce_ratios are used by an avg_pooling operation before building the graph. I am wondering whether all I need to do is set reduce_ratios to [1, 1, 1, 1] if I want to implement the isotropic architecture. Thanks.

    self.n_blocks = sum(blocks)
    channels = opt.channels
    reduce_ratios = [4, 2, 1, 1]
    dpr = [x.item() for x in torch.linspace(0, drop_path, self.n_blocks)]
    num_knn = [int(x.item()) for x in torch.linspace(k, k, self.n_blocks)]

    opened by buptxiaofeng 6
  • Gradient overflow occurs while training tnt-ti model

    Train: 41 [   0/625 (  0%)]  Loss: 4.564162 (4.5642)  Time: 96.744s, 21.17/s (96.744s, 21.17/s)  LR: 8.284e-04  Data: 94.025 (94.025)
    Train: 41 [  50/625 (  8%)]  Loss: 4.395192 (4.4797)  Time: 2.742s, 746.96/s (7.383s, 277.38/s)  LR: 8.284e-04  Data: 0.057 (4.683)
    Train: 41 [ 100/625 ( 16%)]  Loss: 4.424296 (4.4612)  Time: 2.741s, 747.15/s (6.529s, 313.66/s)  LR: 8.284e-04  Data: 0.056 (3.831)
    Gradient overflow. Skipping step, loss scaler 0 reducing loss scale to 16384.0
    Gradient overflow. Skipping step, loss scaler 0 reducing loss scale to 16384.0
    Gradient overflow. Skipping step, loss scaler 0 reducing loss scale to 16384.0
    Gradient overflow. Skipping step, loss scaler 0 reducing loss scale to 16384.0

    And the top-1 acc is only 0.2 after 40 epochs.

    Any tips available here? @iamhankai @yitongh

    opened by jimmyflycv 6
  • Bloated model

    Hi, I am using the GhostNet backbone for training a YOLOv3 model in TensorFlow, but I am getting a bloated model. The checkpoint data size is approx. 68 MB, but the checkpoint given here is approx. 20 MB: https://github.com/huawei-noah/ghostnet/blob/master/tensorflow/models/ghostnet_checkpoint.data-00000-of-00001

    I am also training an EfficientNet model with YOLOv3 and that seems to be working fine, without any bloated size.

    Could anyone or the author please confirm whether this is the correct architecture, or whether anything seems off? I have attached the GhostNet architecture file produced by the code.

    Thanks. ghostnet_model_arch.txt

    opened by ghost 6
  • Replace Conv2d in my network, however it becomes slower, why?

    Above all, thanks for your great work! It really inspires me a lot! But now I have a question.

    I replaced all the Conv2d operations in my network except the final ones, and the number of model parameters really does become much smaller. However, when testing, I found that the average forward speed drops a lot after the replacement (from 428 FPS down to 354 FPS). Is this a normal phenomenon? Or is it because of the concat operation?

    opened by FunkyKoki 6
  • ViG for segmentation

    @iamhankai thanks for open-sourcing the code base. Can you please let me know how to use pvig for segmentation-related tasks? It would be really helpful.

    Thanks in advance

    opened by abhigoku10 0
  • higher performance of ViG

    I tried to train ViG-S on ImageNet and got 80.54% top-1 accuracy, which is higher than the 80.4% reported in the paper. I wonder if 80.4 is the average of multiple trainings? If yes, how many runs did you use?

    opened by tdzdog 9
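
For the Ghost module questions above (the customized kernel size of the primary convolution, and swapping a 1x1 Conv2d for a GhostModule), here is a condensed sketch of the Ghost module as it appears in common PyTorch ports. It is an illustration only and may differ in details from the exact code in this repo:

# Condensed Ghost module sketch: a primary convolution produces part of the output
# channels, and cheap depthwise convolutions generate the remaining "ghost" features;
# the two are concatenated. Details may differ from the code in this repo.
import math
import torch
import torch.nn as nn

class GhostModule(nn.Module):
    def __init__(self, inp, oup, kernel_size=1, ratio=2, dw_size=3, stride=1, relu=True):
        super().__init__()
        self.oup = oup
        init_channels = math.ceil(oup / ratio)       # channels from the primary conv
        new_channels = init_channels * (ratio - 1)   # channels from the cheap operation

        self.primary_conv = nn.Sequential(
            nn.Conv2d(inp, init_channels, kernel_size, stride, kernel_size // 2, bias=False),
            nn.BatchNorm2d(init_channels),
            nn.ReLU(inplace=True) if relu else nn.Identity(),
        )
        self.cheap_operation = nn.Sequential(
            nn.Conv2d(init_channels, new_channels, dw_size, 1, dw_size // 2,
                      groups=init_channels, bias=False),  # depthwise "cheap" conv
            nn.BatchNorm2d(new_channels),
            nn.ReLU(inplace=True) if relu else nn.Identity(),
        )

    def forward(self, x):
        x1 = self.primary_conv(x)
        x2 = self.cheap_operation(x1)
        out = torch.cat([x1, x2], dim=1)
        return out[:, :self.oup, :, :]                # trim to the requested width

# Usage, e.g. as a drop-in for Conv2d(inp, oup, kernel_size=1, bias=False):
# layer = GhostModule(inp, oup, kernel_size=1)
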
Releases

GhostNetV2
Owner
HUAWEI Noah's Ark Lab
Working with and contributing to the open source community in data mining, artificial intelligence, and related fields.