Great job!
However, I think there may be two small issues in your code.
The first one is in bridge.py (line 24 and line 53). The following two lines of code don't seem to be equivalent:
x = x.reshape(b, c, h*w).transpose(1,2).unsqueeze(1)
x = x.contiguous().view(b, h * w, c).unsqueeze(1)
Maybe the first line is the correct one?
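To illustrate why I think they differ, here is a minimal sketch (not taken from bridge.py, and assuming x is a standard (b, c, h, w) feature map): the first version actually permutes the channel and spatial axes, while the second only reinterprets the contiguous memory, so the contents end up different even though the shapes match.

```python
import torch

# Minimal sketch, assuming x is a (b, c, h, w) feature map.
b, c, h, w = 1, 2, 2, 2
x = torch.arange(b * c * h * w, dtype=torch.float32).reshape(b, c, h, w)

# Version 1: transpose swaps the channel and spatial axes,
# so each of the h*w rows holds the c-dim feature of one pixel.
v1 = x.reshape(b, c, h * w).transpose(1, 2).unsqueeze(1)

# Version 2: view only reinterprets the contiguous (b, c, h, w) memory,
# so channel values from different pixels get mixed within a row.
v2 = x.contiguous().view(b, h * w, c).unsqueeze(1)

print(v1.shape, v2.shape)   # both torch.Size([1, 1, 4, 2])
print(torch.equal(v1, v2))  # False -- same shape, different contents
```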
The second one is in config.py. According to the original paper, on page 13:
Figure 7. Visualization of cross attention on the two-way bridge: Mobile→Former and Mobile←Former. Mobile-Former-294M is used, which includes 6 tokens (each corresponds to a column) and 11 Mobile-Former blocks (block 2–12) across 4 stages. Each block has two attention heads that are visualized in two rows. Attention in Mobile→Former (left half) is normalized over pixels, showing the focused region per token. Attention in Mobile←Former (right half) is normalized over tokens, showing the contribution per token at each pixel.
But in config.py, some stages are configured with only one head.
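For reference, this is a purely hypothetical sketch of what I would expect if every block followed the paper's two-head description (the dict name and keys below are made up, not the actual structure of config.py):

```python
# Hypothetical illustration only; the real keys/structure in config.py differ.
# Per the Figure 7 caption, every block of Mobile-Former-294M uses 2 heads
# for the Mobile->Former / Mobile<-Former cross attention.
cross_attention_heads = {
    'stage1': 2,
    'stage2': 2,
    'stage3': 2,
    'stage4': 2,
}
```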
I'm not sure whether the above is correct. Looking forward to your reply!