Implementation of FitVid video prediction model in JAX/Flax.

Google Research

Last update: Nov 25, 2022

FitVid Video Prediction Model

If you find this code useful, please cite it in your paper:

@article{babaeizadeh2021fitvid,
  title={FitVid: Overfitting in Pixel-Level Video Prediction},
  author={Babaeizadeh, Mohammad and Saffar, Mohammad Taghi and Nair, Suraj
  and Levine, Sergey and Finn, Chelsea and Erhan, Dumitru},
  journal={arXiv preprint arXiv:2106.13195},
  year={2021}
}

Method

FitVid is a new architecture for conditional variational video prediction. It has ~300 million parameters and can be trained with minimal training tricks.
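
To make "conditional variational video prediction" concrete, here is a minimal sketch of the kind of per-frame objective such models are commonly trained with: a reconstruction error plus a weighted KL divergence between a posterior and a prior over the latent variable. This is not code from this repository. The function names, the diagonal-Gaussian latents, the mean-squared-error reconstruction term, and the `beta` weight are all assumptions made for illustration.

```python
import math


def gaussian_kl(mu_q, logvar_q, mu_p, logvar_p):
    """KL(q || p) between two diagonal Gaussians, summed over latent dimensions."""
    total = 0.0
    for mq, lq, mp, lp in zip(mu_q, logvar_q, mu_p, logvar_p):
        total += 0.5 * (lp - lq + (math.exp(lq) + (mq - mp) ** 2) / math.exp(lp) - 1.0)
    return total


def frame_loss(pred, target, mu_q, logvar_q, mu_p, logvar_p, beta=1.0):
    """Hypothetical per-frame loss: pixel MSE plus a beta-weighted KL term."""
    mse = sum((p - t) ** 2 for p, t in zip(pred, target)) / len(target)
    return mse + beta * gaussian_kl(mu_q, logvar_q, mu_p, logvar_p)


# Toy example with flattened two-pixel "frames" and a one-dimensional latent.
loss = frame_loss(
    pred=[1.0, 2.0], target=[1.0, 4.0],
    mu_q=[1.0], logvar_q=[0.0], mu_p=[0.0], logvar_p=[0.0],
    beta=2.0,
)
print(loss)
```

In a real model the predicted frame, the posterior, and the prior would all come from neural networks conditioned on the previous frames. Here they are plain lists so the arithmetic is easy to follow.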

Sample Videos

[Sample videos: Human3.6M and RoboNet]

For more samples, please visit the FitVid website: https://sites.google.com/view/fitvidpaper

Instructions

Get dependencies:

pip3 install --user tensorflow
pip3 install --user tensorflow_addons
pip3 install --user flax
pip3 install --user ffmpeg
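
After installing, a quick check can confirm that the Python packages are importable before starting a training run. The sketch below uses only the standard library. It assumes each import name matches its pip package name, and it skips `ffmpeg` because the import name for that package is not stated here.

```python
import importlib.util

# Import names assumed to match the pip packages listed above.
REQUIRED_MODULES = ["tensorflow", "tensorflow_addons", "flax"]


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# An empty list means every required module was found.
print("Missing modules:", missing_modules(REQUIRED_MODULES))
```

Because `find_spec` only locates a module without importing it, the check runs quickly and does not initialize TensorFlow.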

Train on RoboNet:

python -m fitvid.train --output_dir /tmp/output

Disclaimer: Not an official Google product.

Comments

what tensorflow version?

Hi, really interested in trying this out.

I see that the augment file uses tensorflow.contrib, which I assume means TensorFlow 1.x. However, when I install a 1.x version and run python -m fitvid.train.py --help, it says it requires TensorFlow 2.2 or greater.

opened by dvschultz 0
