A GPT, made only of MLPs, in Jax

Phil Wang

Last update: Sep 27, 2022

Related tags

Overview

MLP GPT - Jax (wip)

A GPT, made only of MLPs, in Jax. The specific MLP to be used are gMLPs with the Spatial Gating Units.

Working Pytorch implementation

Install

$ pip install mlp-gpt-jax

Usage

from jax import random, numpy as np
from mlp_gpt_jax import MLPGpt

gpt = MLPGpt(
    num_tokens = 20000,
    dim = 512,
    depth = 6,
    seq_len = 512
)

key    = random.PRNGKey(0)
seq    = random.randint(key, (512,), 0, 20000)

params = gpt.init(key, seq)
logits = gpt.apply(params, seq) # (512, 20000)

Citations

@misc{liu2021pay,
    title   = {Pay Attention to MLPs}, 
    author  = {Hanxiao Liu and Zihang Dai and David R. So and Quoc V. Le},
    year    = {2021},
    eprint  = {2105.08050},
    archivePrefix = {arXiv},
    primaryClass = {cs.LG}
}

Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Loop Story Generation"

Storium GPT-2 Models This is the official repository for the GPT-2 models described in the EMNLP 2020 paper [STORIUM: A Dataset and Evaluation Platfor

27 Dec 20, 2022

Training data extraction on GPT-2

Training data extraction from GPT-2 This repository contains code for extracting training data from GPT-2, following the approach outlined in the foll

62 Dec 7, 2022

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

GPT2-Pytorch with Text-Generator Better Language Models and Their Implications Our model, called GPT-2 (a successor to GPT), was trained simply to pre

775 Jan 8, 2023

ChatBot-Pytorch - A GPT-2 ChatBot implemented using Pytorch and Huggingface-transformers

ChatBot-Pytorch A GPT-2 ChatBot implemented using Pytorch and Huggingface-transf

42 Dec 9, 2022

AI-Bot - 一个基于watermelon改造的OpenAI-GPT-2的智能机器人

AI-Bot 一个基于watermelon改造的OpenAI-GPT-2的智能机器人在Binder上直接运行测试目前有两种实现方式 TF2的GPT-2 TF

9 Nov 16, 2022

Building Ellee — A GPT-3 and Computer Vision Powered Talking Robotic Teddy Bear With Human Level Conversation Intelligence

Using an object detection and facial recognition system built on MobileNetSSDV2 and Dlib and running on an NVIDIA Jetson Nano, a GPT-3 model, Google Speech Recognition, Amazon Polly and servo motors, I built Ellee - a robotic teddy bear who can move her head and converse naturally.

24 Oct 26, 2022

MAGMA - a GPT-style multimodal model that can understand any combination of images and language

MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based Finetuning Authors repo (alphabetical) Constantin (CoEich), Mayukh (Mayukh

331 Jan 3, 2023

FedJAX is a library for developing custom Federated Learning (FL) algorithms in JAX.

FedJAX: Federated learning with JAX What is FedJAX? FedJAX is a library for developing custom Federated Learning (FL) algorithms in JAX. FedJAX priori

208 Dec 14, 2022

Flax is a neural network ecosystem for JAX that is designed for flexibility.

Flax: A neural network library and ecosystem for JAX designed for flexibility Overview | Quick install | What does Flax look like? | Documentation See

3.9k Jan 2, 2023

Comments

mistake in parameter initialization

floor division will always return 0 :(

https://github.com/lucidrains/mlp-gpt-jax/blob/c8a6d7738562e44d3c0b3018c83ae577f7931e78/mlp_gpt_jax/mlp_gpt_jax.py#L75

opened by guyd1995 1

Releases(0.0.19)

0.0.19(Jun 23, 2021)

Source code(tar.gz)
Source code(zip)
0.0.18(Jun 22, 2021)

Source code(tar.gz)
Source code(zip)
0.0.17(Jun 22, 2021)

Source code(tar.gz)
Source code(zip)
0.0.16(Jun 3, 2021)

Source code(tar.gz)
Source code(zip)
0.0.15(Jun 3, 2021)

Source code(tar.gz)
Source code(zip)
0.0.14(Jun 2, 2021)

Source code(tar.gz)
Source code(zip)
0.0.12(Jun 2, 2021)

Source code(tar.gz)
Source code(zip)
0.0.11(Jun 2, 2021)

Source code(tar.gz)
Source code(zip)
0.0.10(Jun 2, 2021)

Source code(tar.gz)
Source code(zip)
0.0.9(Jun 2, 2021)

Source code(tar.gz)
Source code(zip)
0.0.8(May 29, 2021)

Source code(tar.gz)
Source code(zip)
0.0.7(May 27, 2021)

Source code(tar.gz)
Source code(zip)
0.0.6(May 26, 2021)

Source code(tar.gz)
Source code(zip)
0.0.5(May 25, 2021)

Source code(tar.gz)
Source code(zip)
0.0.4(May 23, 2021)

Source code(tar.gz)
Source code(zip)
0.0.3(May 22, 2021)

Source code(tar.gz)
Source code(zip)
0.0.2(May 21, 2021)

Source code(tar.gz)
Source code(zip)
0.0.1(May 21, 2021)

Source code(tar.gz)
Source code(zip)

Owner

Phil Wang

Working with Attention

GitHub

GPT-Code-Clippy (GPT-CC) is an open source version of GitHub Copilot

GPT-Code-Clippy (GPT-CC) is an open source version of GitHub Copilot, a language model -- based on GPT-3, called GPT-Codex -- that is fine-tuned on publicly available code from GitHub.

2.3k Jan 9, 2023

Code for KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs Check out the paper on arXiv: https://arxiv.org/abs/2103.13744 This repo cont

373 Dec 20, 2022

PyTorch implementation of Pay Attention to MLPs

gMLP PyTorch implementation of Pay Attention to MLPs. Quickstart Clone this repository. git clone https://github.com/jaketae/g-mlp.git Navigate to th

34 Dec 13, 2022

[Preprint] ConvMLP: Hierarchical Convolutional MLPs for Vision, 2021

Convolutional MLP ConvMLP: Hierarchical Convolutional MLPs for Vision Preprint link: ConvMLP: Hierarchical Convolutional MLPs for Vision By Jiachen Li

143 Jan 3, 2023

GAN JAX - A toy project to generate images from GANs with JAX

GAN JAX - A toy project to generate images from GANs with JAX This project aims to bring the power of JAX, a Python framework developped by Google and

14 Nov 29, 2022

Mini-hmc-jax - A simple implementation of Hamiltonian Monte Carlo in JAX

mini-hmc-jax This is a simple implementation of Hamiltonian Monte Carlo in JAX t

6 Mar 3, 2022

CLOOB training (JAX) and inference (JAX and PyTorch)

cloob-training Pretrained models There are two pretrained CLOOB models in this repo at the moment, a 16 epoch and a 32 epoch ViT-B/16 checkpoint train

64 Nov 27, 2022

Fine-Tune EleutherAI GPT-Neo to Generate Netflix Movie Descriptions in Only 47 Lines of Code Using Hugginface And DeepSpeed

GPT-Neo-2.7B Fine-Tuning Example Using HuggingFace & DeepSpeed Installation cd venv/bin ./pip install -r ../../requirements.txt ./pip install deepspe

180 Jan 5, 2023

Few-shot Learning of GPT-3

Few-shot Learning With Language Models This is a codebase to perform few-shot "in-context" learning using language models similar to the GPT-3 paper.

224 Dec 28, 2022

A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.

P-tuning A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''. How to use our code We have released the code

562 Dec 27, 2022

A GPT, made only of MLPs, in Jax

Related tags

Overview

MLP GPT - Jax (wip)

Install

Usage

Citations

You might also like...

Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Loop Story Generation"

Training data extraction on GPT-2

Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation

ChatBot-Pytorch - A GPT-2 ChatBot implemented using Pytorch and Huggingface-transformers

AI-Bot - 一个基于watermelon改造的OpenAI-GPT-2的智能机器人

Building Ellee — A GPT-3 and Computer Vision Powered Talking Robotic Teddy Bear With Human Level Conversation Intelligence

MAGMA - a GPT-style multimodal model that can understand any combination of images and language

FedJAX is a library for developing custom Federated Learning (FL) algorithms in JAX.

Flax is a neural network ecosystem for JAX that is designed for flexibility.

Comments

mistake in parameter initialization

Releases(0.0.19)

0.0.19(Jun 23, 2021)

0.0.18(Jun 22, 2021)

0.0.17(Jun 22, 2021)

0.0.16(Jun 3, 2021)

0.0.15(Jun 3, 2021)

0.0.14(Jun 2, 2021)

0.0.12(Jun 2, 2021)

0.0.11(Jun 2, 2021)

0.0.10(Jun 2, 2021)

0.0.9(Jun 2, 2021)

0.0.8(May 29, 2021)

0.0.7(May 27, 2021)

0.0.6(May 26, 2021)

0.0.5(May 25, 2021)

0.0.4(May 23, 2021)

0.0.3(May 22, 2021)

0.0.2(May 21, 2021)

0.0.1(May 21, 2021)

Owner

Phil Wang

GPT-Code-Clippy (GPT-CC) is an open source version of GitHub Copilot

Code for KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

PyTorch implementation of Pay Attention to MLPs

[Preprint] ConvMLP: Hierarchical Convolutional MLPs for Vision, 2021

GAN JAX - A toy project to generate images from GANs with JAX

Mini-hmc-jax - A simple implementation of Hamiltonian Monte Carlo in JAX

CLOOB training (JAX) and inference (JAX and PyTorch)

Fine-Tune EleutherAI GPT-Neo to Generate Netflix Movie Descriptions in Only 47 Lines of Code Using Hugginface And DeepSpeed

Few-shot Learning of GPT-3

A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.