A simple improvement to VQVAE that allows generating images twice the size of the baseline's

Sergei Belousov

Last update: Jul 19, 2022

Related tags

Deep Learning vqvae_dwt_distiller.pytorch

Overview

vqvae_dwt_distiller.pytorch

A simple improvement to VQVAE that allows generating images twice the size of the baseline's. It makes it possible to generate 512x512 images with ruDALL-E.

POC checkpoint: https://drive.google.com/file/d/1GjGXs1l0mOiFxKJwutjTQyCHEaF-wrIL/view?usp=sharing
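
The overview does not say how the doubling is achieved. The repository name mentions DWT (discrete wavelet transform), so the sketch below only illustrates why a wavelet transform is a natural fit for a 2x size increase. It is an assumption-laden illustration, not the repository's code: the function names `haar_dwt2` and `haar_idwt2` are hypothetical, the Haar wavelet is an arbitrary choice, and the 256x256 baseline size is assumed. A one-level 2D DWT splits a 512x512 image into four 256x256 sub-bands, and the inverse transform rebuilds the 512x512 image from them.

```python
import numpy as np


def haar_dwt2(x):
    """One-level 2D Haar DWT of an (H, W) array with even H and W.

    Returns four (H/2, W/2) sub-bands: an approximation band and three
    detail bands. Sub-band naming conventions vary between libraries.
    """
    a = x[0::2, 0::2]  # top-left pixel of each 2x2 block
    b = x[0::2, 1::2]  # top-right
    c = x[1::2, 0::2]  # bottom-left
    d = x[1::2, 1::2]  # bottom-right
    approx = (a + b + c + d) / 2.0
    detail_1 = (a - b + c - d) / 2.0  # difference between columns
    detail_2 = (a + b - c - d) / 2.0  # difference between rows
    detail_3 = (a - b - c + d) / 2.0  # diagonal difference
    return approx, detail_1, detail_2, detail_3


def haar_idwt2(approx, detail_1, detail_2, detail_3):
    """Inverse of haar_dwt2: four (h, w) sub-bands -> one (2h, 2w) array."""
    h, w = approx.shape
    x = np.empty((2 * h, 2 * w), dtype=approx.dtype)
    x[0::2, 0::2] = (approx + detail_1 + detail_2 + detail_3) / 2.0
    x[0::2, 1::2] = (approx - detail_1 + detail_2 - detail_3) / 2.0
    x[1::2, 0::2] = (approx + detail_1 - detail_2 - detail_3) / 2.0
    x[1::2, 1::2] = (approx - detail_1 - detail_2 + detail_3) / 2.0
    return x


# Hypothetical example: a random array stands in for a 512x512 image channel.
rng = np.random.default_rng(0)
image = rng.random((512, 512))
bands = haar_dwt2(image)
restored = haar_idwt2(*bands)
```

Under these assumptions, a model that produces the four 256x256 sub-bands would be enough to recover a 512x512 image through the inverse transform. Whether the repository works this way is not stated in the overview.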

Comments

How to use this repository?

I'm very interested in using this repository to get higher resolution images out of ruDALL-E. I already have ruDALL-E set up. Can you provide any instructions?

opened by njbbaer 4

Owner

Sergei Belousov
