Fastformer
Notes from the authors
PyTorch and Keras implementations of Fastformer. The Keras version includes only the core Fastformer attention module; the PyTorch version is written in the style of Hugging Face Transformers. The Jupyter notebooks contain quick-start code for text classification on AG's News (without pretrained word embeddings, for simplicity) and can be run directly.

We noticed in our experiments that NOT all tasks need the FFNN, residual connections, layer normalization, or even position embeddings. For example, we find that in news recommendation it is better to use Fastformer directly, without layer normalization or position embeddings, whereas in Ad CVR prediction both position embeddings and layer normalization are needed.
Keras version: 2.2.4 (may not be compatible with higher versions)
TensorFlow version: 1.12 to 1.15 (may also be compatible with lower versions)
PyTorch version: 1.6.0 (may be compatible with higher or lower versions)
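
For reference, here is a minimal single-head sketch of the core Fastformer additive attention described in the paper: a learned global query summarizes all query vectors, interacts element-wise with the keys, a global key summarizes the result, and that in turn modulates the values. The module and parameter names (FastSelfAttention, hidden_dim, w_q, w_k) are illustrative assumptions, not the actual API of this repo, which implements the multi-head version.

import math
import torch
import torch.nn as nn
import torch.nn.functional as F

class FastSelfAttention(nn.Module):
    """Single-head Fastformer additive attention (illustrative sketch)."""
    def __init__(self, hidden_dim: int):
        super().__init__()
        self.query = nn.Linear(hidden_dim, hidden_dim)
        self.key = nn.Linear(hidden_dim, hidden_dim)
        self.value = nn.Linear(hidden_dim, hidden_dim)
        # Learnable vectors that score each position when forming the
        # global query/key summaries.
        self.w_q = nn.Linear(hidden_dim, 1)
        self.w_k = nn.Linear(hidden_dim, 1)
        self.out = nn.Linear(hidden_dim, hidden_dim)
        self.scale = math.sqrt(hidden_dim)

    def forward(self, x, mask=None):
        # x: (batch, seq_len, hidden_dim); mask: (batch, seq_len), 1 = keep.
        q, k, v = self.query(x), self.key(x), self.value(x)

        # Global query: additive attention over the query vectors.
        alpha = self.w_q(q).squeeze(-1) / self.scale      # (batch, seq_len)
        if mask is not None:
            alpha = alpha.masked_fill(mask == 0, -1e9)
        alpha = F.softmax(alpha, dim=-1)
        global_q = torch.einsum("bs,bsd->bd", alpha, q)   # (batch, hidden_dim)

        # Element-wise interaction between the global query and every key.
        p = k * global_q.unsqueeze(1)                     # (batch, seq_len, hidden_dim)

        # Global key: additive attention over the interacted keys.
        beta = self.w_k(p).squeeze(-1) / self.scale
        if mask is not None:
            beta = beta.masked_fill(mask == 0, -1e9)
        beta = F.softmax(beta, dim=-1)
        global_k = torch.einsum("bs,bsd->bd", beta, p)

        # Interact the global key with the values, transform, and add the
        # query residual, as in the paper.
        u = v * global_k.unsqueeze(1)
        return self.out(u) + q

For example, FastSelfAttention(256) applied to a (2, 16, 256) input returns a tensor of the same shape in linear time with respect to sequence length; whether to wrap it with the FFNN, layer normalization, and position embeddings depends on the task, as noted above.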
Citation
@article{wu2021fastformer,
  title={Fastformer: Additive Attention Can Be All You Need},
  author={Wu, Chuhan and Wu, Fangzhao and Qi, Tao and Huang, Yongfeng},
  journal={arXiv preprint arXiv:2108.09084},
  year={2021}
}