Python Transformers Libraries (517 repositories)
DALL-E in Pytorch
Implementation / replication of DALL-E, OpenAI's Text-to-Image Transformer, in PyTorch. It will also contain CLIP for ranking the generations.
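As a rough sketch of how the repo is meant to be used, based on its README at the time (argument names may have changed since), training couples a discrete VAE with the DALL-E transformer:

    import torch
    from dalle_pytorch import DiscreteVAE, DALLE

    # Discrete VAE that compresses images into a grid of codebook tokens
    vae = DiscreteVAE(
        image_size = 256,
        num_layers = 3,
        num_tokens = 8192,
        codebook_dim = 512,
        hidden_dim = 64,
    )

    # The DALL-E transformer models text tokens followed by image tokens autoregressively
    dalle = DALLE(
        dim = 1024,
        vae = vae,
        num_text_tokens = 10000,
        text_seq_len = 256,
        depth = 12,
        heads = 16,
    )

    text = torch.randint(0, 10000, (4, 256))   # dummy text token ids
    images = torch.randn(4, 3, 256, 256)       # dummy image batch

    loss = dalle(text, images, return_loss = True)
    loss.backward()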
DeiT: Data-efficient Image Transformers
This repository contains PyTorch evaluation code, training code, and pretrained models for DeiT (Data-efficient Image Transformers).
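A quick way to try the pretrained models is via torch.hub; the hub path and entry-point name below are taken from the README at the time, so verify them before relying on this sketch:

    import torch

    # Pretrained DeiT-base, patch size 16, 224x224 input
    model = torch.hub.load('facebookresearch/deit:main',
                           'deit_base_patch16_224', pretrained=True)
    model.eval()

    x = torch.randn(1, 3, 224, 224)  # dummy image batch
    with torch.no_grad():
        logits = model(x)            # (1, 1000) ImageNet class scores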
Bottleneck Transformer - Pytorch
Implementation of Bottleneck Transformer, a SotA visual recognition model with convolution + attention that outperforms EfficientNet and DeiT in terms of the performance-compute trade-off, in Pytorch.
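A usage sketch following the README's example (parameter names are from the README at the time; treat the exact values as assumptions). The BottleStack stands in for the final ResNet stage, mixing convolutional projection with multi-head self-attention:

    import torch
    from torch import nn
    from bottleneck_transformer_pytorch import BottleStack

    layer = BottleStack(
        dim = 256,            # input channels
        fmap_size = 64,       # input feature-map resolution
        dim_out = 2048,       # output channels
        proj_factor = 4,
        downsample = True,    # halves the spatial resolution
        heads = 4,
        dim_head = 128,
        rel_pos_emb = True,   # relative positional embeddings
        activation = nn.ReLU(),
    )

    fmap = torch.randn(2, 256, 64, 64)  # e.g. output of a ResNet backbone
    out = layer(fmap)                   # (2, 2048, 32, 32)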
GPT-NeoX
An implementation of model-parallel GPT-3-like models on GPUs, based on the DeepSpeed library. Designed to be able to train models in the hundreds of billions of parameters or larger.
DALL-E in Mesh-Tensorflow [WIP]
OpenAI's DALL-E in Mesh-TensorFlow, for large-scale training. If this is similarly efficient to GPT-Neo, this repo should be able to train models up to, and larger than, the size of OpenAI's DALL-E (12B parameters).
Deep Daze
Simple command-line tool for text-to-image generation using OpenAI's CLIP and SIREN (an implicit neural representation network). Sample prompts from the README gallery: "mist over green hills", "shattered plates on the grass", "cosmic love and attention", "a time traveler in the crowd", "life during the plague".
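Besides the imagine command-line tool, the package exposes a small Python API; a sketch from the README (assuming the package is installed via pip install deep-daze):

    from deep_daze import Imagine

    # CLIP scores renders of a SIREN network against the text prompt,
    # and the SIREN's weights are optimized to maximize that similarity.
    imagine = Imagine(
        text = 'cosmic love and attention',
        num_layers = 24,
    )
    imagine()  # runs the optimization and periodically saves generated images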
Graph Transformer Architecture
Source code for the paper "A Generalization of Transformer Networks to Graphs" by Vijay Prakash Dwivedi and Xavier Bresson.
Big Bird: Transformers for Longer Sequences
BigBird is a sparse-attention-based transformer that extends Transformer-based models such as BERT to much longer sequences. Moreover, BigBird comes with a theoretical understanding of the capabilities of a full transformer that the sparse model can handle.
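If you just want to experiment with BigBird, checkpoints are also exposed through Hugging Face transformers; a sketch independent of this repo's own API, assuming a transformers version with BigBird support:

    from transformers import BigBirdTokenizer, BigBirdModel

    tokenizer = BigBirdTokenizer.from_pretrained("google/bigbird-roberta-base")
    model = BigBirdModel.from_pretrained("google/bigbird-roberta-base")

    # Sparse attention keeps memory roughly linear in sequence length,
    # so inputs far beyond BERT's 512-token limit become feasible.
    inputs = tokenizer("A very long document ... " * 100, return_tensors="pt",
                       truncation=True, max_length=4096)
    outputs = model(**inputs)
    print(outputs.last_hidden_state.shape)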
Segmentation Transformer
Implementation of Segmentation Transformer (SETR) in PyTorch, from "Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers": a new model that achieves SOTA in semantic segmentation while using a transformer-based encoder.
Bottleneck Transformers for Visual Recognition
Experiments:
Model                     Params (M)   Acc (%)
ResNet50 baseline (ref)   23.5         93.62
BoTNet-50                 18.8         95.11
Explainability for Vision Transformers (in PyTorch)
This repository implements methods for explainability in Vision Transformers.
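One method commonly implemented for ViT explainability is attention rollout (Abnar & Zuidema); below is a minimal, self-contained sketch of the idea, not necessarily this repo's exact implementation:

    import torch

    def attention_rollout(attentions):
        """attentions: list of per-layer maps, each (batch, heads, tokens, tokens)."""
        n = attentions[0].size(-1)
        result = torch.eye(n)
        for attn in attentions:
            a = attn.mean(dim=1)[0]              # average over heads, first batch item
            a = 0.5 * a + 0.5 * torch.eye(n)     # add identity for residual connections
            a = a / a.sum(dim=-1, keepdim=True)  # renormalize rows
            result = a @ result                  # accumulate attention across layers
        return result  # row 0 = CLS token's attribution over all tokens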
SE3 Transformer - Pytorch
Implementation of SE3-Transformers for Equivariant Self-Attention, in PyTorch. May be needed for replicating Alphafold2 results.
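A usage sketch loosely following the README at the time; the constructor arguments and input shapes here are assumptions, so check the repo before relying on them:

    import torch
    from se3_transformer_pytorch import SE3Transformer

    model = SE3Transformer(
        dim = 512,
        heads = 8,
        depth = 6,
        dim_head = 64,
        num_degrees = 4,   # number of equivariant feature degrees
    )

    feats = torch.randn(1, 1024, 512)  # per-point features
    coors = torch.randn(1, 1024, 3)    # 3D coordinates
    mask  = torch.ones(1, 1024).bool()

    out = model(feats, coors, mask)    # (1, 1024, 512), equivariant to rotations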
Pytorch Implementation of Various Point Transformers
Recently, various methods have applied transformers to point clouds, e.g. PCT: Point Cloud Transformer (Meng-Hao Guo et al.).
KoBART-Transformers
KoBART, released by SKT, ported to Hugging Face transformers for convenient use. Install (optional): if you use BartModel and PreTrainedTokenizerFast directly, no separate installation is required.
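A sketch of the transformers-only route the description mentions; the Hub checkpoint id below is an assumption, so check the repo's README for the exact name:

    from transformers import BartModel, PreTrainedTokenizerFast

    MODEL_ID = "hyunwoongko/kobart"  # assumed checkpoint id; verify against the README

    tokenizer = PreTrainedTokenizerFast.from_pretrained(MODEL_ID)
    model = BartModel.from_pretrained(MODEL_ID)

    inputs = tokenizer("안녕하세요.", return_tensors="pt")  # "Hello."
    outputs = model(**inputs)
    print(outputs.last_hidden_state.shape)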
NERDA
Not only is NERDA a mesmerizing muppet-like character; NERDA is also a Python package that offers a slick, easy-to-use interface for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks.
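The interface is roughly as below, a sketch from the README at the time; the dataset helper and argument names are assumptions and may differ in the current release:

    from NERDA.datasets import get_dane_data   # assumed helper from the README
    from NERDA.models import NERDA

    model = NERDA(
        dataset_training = get_dane_data('train'),
        dataset_validation = get_dane_data('dev'),
        transformer = 'bert-base-multilingual-uncased',
    )
    model.train()
    print(model.predict_text('Jens Hansen har en bondegård'))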
SAINT-pytorch
A simple PyTorch implementation of SAINT, "Towards an Appropriate Query, Key, and Value Computation for Knowledge Tracing", based on the paper's arXiv preprint.
🚀 Gated Graph Transformers
Gated Graph Transformers for graph-level property prediction, i.e. graph classification and regression. Associated article: "Transformers are Graph Neural Networks!".