A PyTorch library for Vision Transformers

Society for Artificial Intelligence and Deep Learning

Last update: Nov 28, 2022

Related tags

Deep Learning vformer

Overview

VFormer

A PyTorch library for Vision Transformers

Getting Started

Read the contributing guidelines in CONTRIBUTING.rst to learn how to start contributing.

Comments

Add attention visualization methods
This article details different ways of visualizing a transformer's attention. It also discusses how such visualizations can aid model explainability.

They also provide their code here.

We would like to have such visualization methods in the viz module.
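As a rough illustration of what one such method could look like, here is a minimal sketch of attention rollout, a commonly used visualization technique. Whether the linked article covers this exact method is not stated here, and the function name `attention_rollout` is hypothetical, not part of VFormer. The sketch uses plain Python lists so it runs without PyTorch; it assumes each layer's attention has already been averaged over heads and that every row sums to 1.

```python
def matmul(a, b):
    """Multiply two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def attention_rollout(layer_attentions):
    """Combine per-layer attention maps into one token-to-token map.

    layer_attentions: list of (tokens x tokens) matrices, first layer first.
    Assumption: each matrix is head-averaged and row-stochastic.
    """
    n = len(layer_attentions[0])
    # Start from the identity: before any layer, each token attends to itself.
    rollout = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for attn in layer_attentions:
        # Account for the residual connection by mixing in the identity.
        # Rows still sum to 1 after mixing.
        mixed = [[0.5 * attn[i][j] + (0.5 if i == j else 0.0)
                  for j in range(n)] for i in range(n)]
        # Propagate attention from this layer down through the earlier ones.
        rollout = matmul(mixed, rollout)
    return rollout
```

In a ViT, the row of the result that belongs to the class token could be reshaped to the patch grid and overlaid on the input image as a heatmap.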

good first issue
opened by NeelayS 7
Remove _Projection class

We can replace the _Projection class with a one-line if-else expression.

Should we make that replacement, or keep the current implementation?

cc: @NeelayS @aditya-agrawal-30502 @alvanli

opened by abhi-glitchhg 6
Enhanced docstring

I had to revert the last PR (#45) because of compatibility issues.

In this PR I have added some docstrings and made minor changes, such as renaming variables.

This PR is the same as #48, with an edited title :)

@NeelayS

opened by abhi-glitchhg 3
Restructuring AbsolutePositionEmbedding class

The AbsolutePositionEmbedding class was structured specifically for PVT, but we can use it in other models too if we restructure it properly. It should also support sinusoidal position embeddings; alternatively, a separate class for sinusoidal embeddings would also work.
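For reference, a minimal sketch of the standard fixed sinusoidal table follows. The function name `sinusoidal_position_embedding` and its parameters are hypothetical and do not reflect VFormer's existing API. It is written in plain Python so it runs without PyTorch; in the library, the table would presumably be converted to a tensor and stored as a non-trainable buffer.

```python
import math


def sinusoidal_position_embedding(num_positions, dim, base=10000.0):
    """Build a (num_positions x dim) table of fixed sinusoidal embeddings.

    Even columns hold sin(pos / base^(i / dim)); the following odd column
    holds the cosine of the same angle. `dim` must be even.
    """
    if dim % 2 != 0:
        raise ValueError("dim must be even")
    table = []
    for pos in range(num_positions):
        row = []
        for i in range(0, dim, 2):
            # i is the even column index, so i / dim equals 2k / dim
            # for the k-th sin/cos pair.
            angle = pos / (base ** (i / dim))
            row.append(math.sin(angle))
            row.append(math.cos(angle))
        table.append(row)
    return table
```

Because the table depends only on the sequence length and embedding size, a restructured class could select between a learned parameter and this fixed table with a single constructor argument.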
enhancement

opened by abhi-glitchhg 2
Add sharpness-aware optimizer

This paper describes how promoting smoothness with a recently proposed sharpness-aware optimizer substantially improves the performance of ViTs.

It would be good to have an implementation of this optimizer in our library. It would fit in the functional module.

A couple of PyTorch implementations are here and here.
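The core update of sharpness-aware minimization can be sketched in a few lines. This is an illustrative sketch only, not the linked implementations: `sam_step` and `grad_fn` are hypothetical names, and plain Python lists stand in for parameter tensors. The idea is to take the gradient at the current weights, step a distance `rho` along it to a nearby "worst case" point, and then update the original weights using the gradient measured at that point.

```python
import math


def sam_step(params, grad_fn, lr=0.1, rho=0.05):
    """One sharpness-aware update on a flat list of parameters.

    grad_fn: callable returning the loss gradient for a parameter list
             (an assumption of this sketch; a real optimizer would read
             gradients from the framework's autograd).
    """
    grads = grad_fn(params)
    norm = math.sqrt(sum(g * g for g in grads))
    if norm == 0.0:
        # Zero gradient: no ascent direction exists, so leave params as-is.
        return list(params)
    # Ascent step: move to the nearby point where the loss is (to first
    # order) highest within a ball of radius rho.
    perturbed = [p + rho * g / norm for p, g in zip(params, grads)]
    # Descent step: apply the gradient from the perturbed point to the
    # original parameters.
    sharp_grads = grad_fn(perturbed)
    return [p - lr * g for p, g in zip(params, sharp_grads)]
```

Each update needs two gradient evaluations, so a PyTorch version would typically run two forward-backward passes per batch.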

opened by NeelayS 2
Documentation related to visualization methods

I have added some fixes for page breaks in #86.

Still, we need to enhance the docs for visualization methods.
We can include the license/copyright disclaimer for visualization methods in our license or have a separate file.

Additionally, we can add the sample outputs from these methods into the doc.

CC : @NeelayS @aditya-agrawal-30502 @alvanli
documentation enhancement good first issue

opened by abhi-glitchhg 1
[Paper] Visual Attention Network

Paper: https://arxiv.org/abs/2202.09741
Code: https://github.com/Visual-Attention-Network/VAN-Classification and https://github.com/Visual-Attention-Network/VAN-Segmentation
Paper implementation

opened by abhi-glitchhg 0

Releases (v0.1.3)

v0.1.3 (Jul 3, 2022)

v0.1.2 (Apr 7, 2022)

v0.1.0 (Feb 9, 2022)

First release of VFormer!

Owner

Society for Artificial Intelligence and Deep Learning

GitHub

PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers

CvT: Introducing Convolutions to Vision Transformers Pytorch implementation of CvT: Introducing Convolutions to Vision Transformers Usage: img = torch

193 Jan 3, 2023

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Self-Supervised Vision Transformers with DINO PyTorch implementation and pretrained models for DINO. For details, see Emerging Properties in Self-Supe

4.2k Jan 3, 2023

This repository contains PyTorch code for Robust Vision Transformers.

117 Dec 7, 2022

Official PyTorch implementation of Less is More: Pay Less Attention in Vision Transformers.

Less is More: Pay Less Attention in Vision Transformers Official PyTorch implementation of Less is More: Pay Less Attention in Vision Transformers. By

73 Jan 1, 2023

PyTorch evaluation code for Delving Deep into the Generalization of Vision Transformers under Distribution Shifts.

Out-of-distribution Generalization Investigation on Vision Transformers This repository contains PyTorch evaluation code for Delving Deep into the Gen

72 Dec 13, 2022

A PyTorch implementation of ViTGAN based on paper ViTGAN: Training GANs with Vision Transformers.

ViTGAN: Training GANs with Vision Transformers A PyTorch implementation of ViTGAN based on paper ViTGAN: Training GANs with Vision Transformers. Refer

127 Dec 23, 2022

Many Class Activation Map methods implemented in Pytorch for CNNs and Vision Transformers. Including Grad-CAM, Grad-CAM++, Score-CAM, Ablation-CAM and XGrad-CAM

Class Activation Map methods implemented in Pytorch pip install grad-cam ⭐ Tested on many Common CNN Networks and Vision Transformers. ⭐ Includes smoo

6.6k Jan 6, 2023

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

12.6k Jan 9, 2023

Multivariate Time Series Forecasting with efficient Transformers. Code for the paper "Long-Range Transformers for Dynamic Spatiotemporal Forecasting."

Spacetimeformer Multivariate Forecasting This repository contains the code for the paper, "Long-Range Transformers for Dynamic Spatiotemporal Forecast

440 Jan 2, 2023

[Preprint] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang, Zhangyang Wang

Chasing Sparsity in Vision Transformers: An End-to-End Exploration Codes for [Preprint] Chasing Sparsity in Vision Transformers: An End-to-End Explora

64 Dec 8, 2022

Implementation of various Vision Transformers I found interesting

Twins: Revisiting the Design of Spatial Attention in Vision Transformers

Exploring whether attention is necessary for vision transformers

Contains code for the paper "Vision Transformers are Robust Learners".

This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.

Official repository for "Intriguing Properties of Vision Transformers" (2021)

DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification

Official repository for "On Improving Adversarial Transferability of Vision Transformers" (2021)
