Unofficial PyTorch implementation of TokenLearner by Google AI

Rishabh Anand

Last update: Dec 20, 2022

Related tags

Deep Learning tokenlearner-pytorch

Overview

tokenlearner-pytorch

Unofficial PyTorch implementation of TokenLearner by Ryoo et al. from Google AI (abs, pdf)

Installation

You can install TokenLearner via pip:

pip install tokenlearner-pytorch

Usage

You can access the TokenLearner class from the tokenlearner_pytorch package. You can use this layer with a Vision Transformer, MLPMixer, or Video Vision Transformer as done in the paper.

import torch
from tokenlearner_pytorch import TokenLearner

tklr = TokenLearner(S=8)
x = torch.rand(512, 32, 32, 3)
y = tklr(x) # [512, 8, 3]

You can also use TokenLearner and TokenFuser together with Multi-head Self-Attention as done in the paper:

import torch
import torch.nn as nn
from tokenlearner_pytorch import TokenLearner, TokenFuser

mhsa = nn.MultiheadAttention(3, 1)
tklr = TokenLearner(S=8)
tkfr = TokenFuser(H=32, W=32, C=3, S=8)

x = torch.rand(512, 32, 32, 3) # a batch of images

y = tklr(x)
y = y.view(8, 512, 3)
y, _ = mhsa(y, y, y) # ignore attn weights
y = y.view(512, 8, 3)

out = tkfr(y, x) # [512, 32, 23, 3]

TODO

Add support for temporal dimension T
Implement TokenFuser with ViT
Implement TokenFuser with ViViT

Contributions

If I've made any errors or you have any suggestions, feel free to raise an Issue or PR. All contributions welcome!!

License

MIT

You might also like...

Unofficial Pytorch Implementation of WaveGrad2

WaveGrad 2 — Unofficial PyTorch Implementation WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis Unofficial PyTorch+Lightning Implementati

104 Nov 29, 2022

The author's officially unofficial PyTorch BigGAN implementation.

BigGAN-PyTorch The author's officially unofficial PyTorch BigGAN implementation. This repo contains code for 4-8 GPU training of BigGANs from Large Sc

2.6k Jan 2, 2023

Unofficial PyTorch Implementation of UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation

UnivNet UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation This is an unofficial PyTorch

170 Jan 4, 2023

StarGAN-ZSVC: Unofficial PyTorch Implementation

This repository is an unofficial PyTorch implementation of StarGAN-ZSVC by Matthew Baas and Herman Kamper. This repository provides both model architectures and the code to inference or train them.

11 Aug 28, 2022

Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.

An Image Captioning codebase This is a codebase for image captioning research. It supports: Self critical training from Self-critical Sequence Trainin

906 Jan 3, 2023

Unofficial PyTorch Implementation of UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation

UnivNet UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation This is an unofficial PyTorch

54 Aug 30, 2021

Unofficial PyTorch implementation of Fastformer based on paper

Comments

Implementation details of TokenFuser

In the paper, it said:

where X^j_{t} is the residual input to the previous TokenLearner module

So the fuser output = BY + X^j

But in the code, the fuser output = BY + SpatialAttention(X^j) https://github.com/rish-16/tokenlearner-pytorch/blob/a6908107c5b53b837127806fc1d46c64694bffc5/tokenlearner_pytorch/tokenlearner_pytorch.py#L59-L62

Why does the residual structure add to SpatialAttention(X^j) instead of X^j?

opened by leijue222 1
Error when using TokenLearner

An error will be reported during the SpatialAttention conv：

RuntimeError: Input type (torch.cuda.FloatTensor) and weight type (torch.FloatTensor) should be the same

opened by leijue222 1
y = y.view(8, 512, 3) is wrong

y = y.view(8, 512, 3) y, _ = mhsa(y, y, y) # ignore attn weights y = y.view(512, 8, 3)

I think it's wrong.

It should be corrected as y = y.transpose(0, 1)

opened by xiamenwcy 1

Unofficial PyTorch implementation of TokenLearner by Google AI

Related tags

Overview

tokenlearner-pytorch

Installation

Usage

TODO

Contributions

License

You might also like...

Unofficial Pytorch Implementation of WaveGrad2

The author's officially unofficial PyTorch BigGAN implementation.

Unofficial PyTorch Implementation of UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation

StarGAN-ZSVC: Unofficial PyTorch Implementation

Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.

Unofficial PyTorch Implementation of UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation

Unofficial PyTorch implementation of Fastformer based on paper "Fastformer: Additive Attention Can Be All You Need"."

Unofficial pytorch implementation of paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing"

Unofficial Pytorch Lightning implementation of Contrastive Syn-to-Real Generalization (ICLR, 2021)

Comments

Implementation details of TokenFuser

Error when using TokenLearner

y = y.view(8, 512, 3) is wrong

Owner

Rishabh Anand

Unofficial PyTorch implementation of SimCLR by Google Brain

Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms

Red Team tool for exfiltrating files from a target's Google Drive that you have access to, via Google's API.

A large dataset of 100k Google Satellite and matching Map images, resembling pix2pix's Google Maps dataset.

Google-drive-to-sqlite - Create a SQLite database containing metadata from Google Drive

This is an unofficial PyTorch implementation of Meta Pseudo Labels

An unofficial PyTorch implementation of a federated learning algorithm, FedAvg.

Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.

Unofficial PyTorch implementation of Neural Additive Models (NAM) by Agarwal, et al.

Unofficial implementation of Alias-Free Generative Adversarial Networks. (https://arxiv.org/abs/2106.12423) in PyTorch