Transformers based fully on MLPs

Fawaz Sammani

Last update: Dec 30, 2022


Awesome MLP-based Transformers papers

An up-to-date list of Transformers based fully on MLPs without attention!

Why this repo?

After transformers and fully attention-based models took over most of the deep learning world since 2019, it now appears that their power does not come from attention; indeed, replacing the feed-forward network in a transformer with attention performs horribly (~30% top-1 on ImageNet). It appears that attention is not all we need. After all, we may no longer need models with strong inductive biases such as CNNs, and we can fall back on MLPs since (1) we have enough data and (2) we have powerful optimization, regularization and data augmentation techniques. Just as we saw a big hype around transformers (see the awesome vision transformer and BERT-related paper lists), we expect to see a big hype around fully MLP-based networks without attention, and the research focus has now shifted to finding efficient ways of mixing tokens without involving attention mechanisms. This repository aims to gather all papers of this kind.
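To make "mixing tokens without attention" concrete, here is a minimal NumPy sketch in the spirit of the MLP-Mixer block from the first paper in the list below: one MLP applied along the token axis, followed by one MLP applied along the channel axis, each with a residual connection. It is an illustration under simplifying assumptions, not the authors' implementation: the weights are random, biases and the learnable LayerNorm scale/shift are omitted, GELU uses the tanh approximation, and all names and sizes (`mixer_block`, `w_tok1`, 16 tokens, 32 channels, ...) are hypothetical.

```python
import numpy as np

def gelu(x):
    # tanh approximation of GELU (assumption: chosen for brevity)
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x**3)))

def layer_norm(x, eps=1e-6):
    # Normalize over the channel (last) axis; learnable scale/shift omitted.
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return (x - mean) / np.sqrt(var + eps)

def mixer_block(x, w_tok1, w_tok2, w_ch1, w_ch2):
    """x has shape (num_tokens, channels)."""
    # Token mixing: transpose so the MLP acts along the token axis.
    # The same weights are shared by every channel.
    y = layer_norm(x).T                 # (channels, num_tokens)
    y = gelu(y @ w_tok1) @ w_tok2       # (channels, num_tokens)
    x = x + y.T                         # residual connection
    # Channel mixing: an ordinary per-token MLP over the channel axis.
    z = gelu(layer_norm(x) @ w_ch1) @ w_ch2
    return x + z                        # residual connection

# Hypothetical sizes and random weights, for illustration only.
rng = np.random.default_rng(0)
num_tokens, channels, tok_hidden, ch_hidden = 16, 32, 64, 128
x = rng.normal(size=(num_tokens, channels))
w_tok1 = rng.normal(scale=0.02, size=(num_tokens, tok_hidden))
w_tok2 = rng.normal(scale=0.02, size=(tok_hidden, num_tokens))
w_ch1 = rng.normal(scale=0.02, size=(channels, ch_hidden))
w_ch2 = rng.normal(scale=0.02, size=(ch_hidden, channels))

out = mixer_block(x, w_tok1, w_tok2, w_ch1, w_ch2)
print(out.shape)  # (16, 32)
```

The token-mixing weights have a fixed shape tied to the number of tokens, which is one reason several of the papers listed below explore shifts, permutations, or Fourier transforms as alternative mixing operations.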

Contributing

Please contribute to this list by submitting an issue or a pull request in the following format:

- Paper Name [[pdf]](link) [[code]](link)

Papers

MLP-Mixer: An all-MLP Architecture for Vision [pdf] [official code] [code] [code] [code] [Yannic Kilcher Video]
Do You Even Need Attention? A Stack of Feed-Forward Layers Does Surprisingly Well on ImageNet [pdf] [code]
ResMLP: Feedforward networks for image classification with data-efficient training [pdf] [code] [code] [code]
Pay Attention to MLPs [pdf] [code] [code] [code]
FNet: Mixing Tokens with Fourier Transforms [pdf] [code] [Yannic Kilcher Video]
Can Attention Enable MLPs To Catch Up With CNNs? [pdf]
MixerGAN: An MLP-Based Architecture for Unpaired Image-to-Image Translation [pdf]
On the Bias Against Inductive Biases [pdf]
S² MLP: Spatial-Shift MLP Architecture for Vision [pdf]
Vision Permutator: A Permutable MLP-Like Architecture for Visual Recognition [pdf] [code]
Rethinking Token-Mixing MLP for MLP-based Vision Backbone [pdf]
Global Filter Networks for Image Classification [pdf] [code]
What Makes for Hierarchical Vision Transformer? [pdf]
AS-MLP: An Axial Shifted MLP Architecture for Vision [pdf] [code]
CycleMLP: A MLP-like Architecture for Dense Prediction [pdf] [code]
S² MLPv2: Improved Spatial-Shift MLP Architecture for Vision [pdf]
RaftMLP: Do MLP-based Models Dream of Winning Over Computer Vision? [pdf] [code]
Hire-MLP: Vision MLP via Hierarchical Rearrangement [pdf]
Sparse-MLP: A Fully-MLP Architecture with Conditional Computation [pdf]
Sparse MLP for Image Recognition: Is Self-Attention Really Necessary? [pdf]
Patches Are All You Need? [pdf] [code]
Exploring the Limits of Large Scale Pre-training [pdf]
Adversarial Robustness Comparison of Vision Transformer and MLP-Mixer to CNNs [pdf] [code]
Cascaded Cross MLP-Mixer GANs for Cross-View Image Translation [pdf] [code]
Are We Ready for a New Paradigm Shift? A Survey on Visual Deep MLP [pdf]
MetaFormer is Actually What You Need for Vision [pdf] [code]
An Image Patch is a Wave: Phase-Aware Vision MLP [pdf]
MorphMLP: A Self-Attention Free, MLP-Like Backbone for Image and Video [pdf]
SWAT: Spatial Structure Within and Among Tokens [pdf]
MLP Architectures for Vision-and-Language Modeling: An Empirical Study [pdf] [code]
RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality [pdf] [code]


Comments

FNet: Mixing Tokens with Fourier Transforms

FNet: Mixing Tokens with Fourier Transforms https://arxiv.org/abs/2105.03824 Does FNet belong? After all, DCT can be implemented (and is on TPUs) as a single matrix multiplication across the patch dimension.

opened by etienne87 1
