PyTorch reimplementation of Diffusion Models

Patrick Esser

Last update: Jan 1, 2023

Related tags

Deep Learning pytorch_diffusion

Overview

PyTorch pretrained Diffusion Models

A PyTorch reimplementation of Denoising Diffusion Probabilistic Models with checkpoints converted from the author's TensorFlow implementation.

Quickstart

Running

pip install -e git+https://github.com/pesser/pytorch_diffusion.git#egg=pytorch_diffusion
pytorch_diffusion_demo

will start a Streamlit demo. It is recommended to run the demo with a GPU available.

Usage

Diffusion models with pretrained weights for cifar10, lsun-bedroom, lsun_cat or lsun_church can be loaded as follows:

from pytorch_diffusion import Diffusion

diffusion = Diffusion.from_pretrained("lsun_church")
samples = diffusion.denoise(4)
diffusion.save(samples, "lsun_church_sample_{:02}.png")

Prefix the name with ema_ to load the averaged weights that produce better results. The U-Net model used for denoising is available via diffusion.model and can also be instantiated on its own:

from pytorch_diffusion import Model

model = Model(resolution=32,
              in_channels=3,
              out_ch=3,
              ch=128,
              ch_mult=(1,2,2,2),
              num_res_blocks=2,
              attn_resolutions=(16,),
              dropout=0.1)

This configuration example corresponds to the model used on CIFAR-10.

Producing samples

If you installed directly from github, you can find the cloned repository in <venv path>/src/pytorch_diffusion for virtual environments, and <cwd>/src/pytorch_diffusion for global installs. There, you can run

python pytorch_diffusion/diffusion.py <name> <bs> <nb>

where <name> is one of cifar10, lsun-bedroom, lsun_cat, lsun_church, or one of these names prefixed with ema_, <bs> is the batch size and <nb> the number of batches. This will produce samples from the PyTorch models and save them to results/<name>/.

Results

Evaluating 50k samples with torch-fidelity gives

Dataset	EMA	Framework	Model	FID
CIFAR10 Train	no	PyTorch	`cifar10`	12.13775
		TensorFlow	`tf_cifar10`	12.30003
	yes	PyTorch	`ema_cifar10`	3.21213
		TensorFlow	`tf_ema_cifar10`	3.245872
CIFAR10 Validation	no	PyTorch	`cifar10`	14.30163
		TensorFlow	`tf_cifar10`	14.44705
	yes	PyTorch	`ema_cifar10`	5.274105
		TensorFlow	`tf_ema_cifar10`	5.325035

To reproduce, generate 50k samples from the converted PyTorch models provided in this repo with

`python pytorch_diffusion/diffusion.py <Model> 500 100`

and with

python -c "import convert as m; m.sample_tf(500, 100, which=['cifar10', 'ema_cifar10'])"

for the original TensorFlow models.

Running conversions

The converted pytorch checkpoints are provided for download. If you want to convert them on your own, you can follow the steps described here.

Setup

This section assumes your working directory is the root of this repository. Download the pretrained TensorFlow checkpoints. It should follow the original structure,

diffusion_models_release/
  diffusion_cifar10_model/
    model.ckpt-790000.data-00000-of-00001
    model.ckpt-790000.index
    model.ckpt-790000.meta
  diffusion_lsun_bedroom_model/
    ...
  ...

Set the environment variable TFROOT to the directory where you want to store the author's repository, e.g.

export TFROOT=".."

Clone the diffusion repository,

git clone https://github.com/hojonathanho/diffusion.git ${TFROOT}/diffusion

and install their required dependencies (pip install ${TFROOT}/requirements.txt). Then add the following to your PYTHONPATH:

export PYTHONPATH=".:./scripts:${TFROOT}/diffusion:${TFROOT}/diffusion/scripts:${PYTHONPATH}"

Testing operations

To test the pytorch implementations of the required operations against their TensorFlow counterparts under random initialization and random inputs, run

python -c "import convert as m; m.test_ops()"

Converting checkpoints

To load the pretrained TensorFlow models, copy the weights into the pytorch models, check for equality on random inputs and finally save the corresponding pytorch checkpoints, run

python -c "import convert as m; m.transplant_cifar10()"
python -c "import convert as m; m.transplant_cifar10(ema=True)"
python -c "import convert as m; m.transplant_lsun_bedroom()"
python -c "import convert as m; m.transplant_lsun_bedroom(ema=True)"
python -c "import convert as m; m.transplant_lsun_cat()"
python -c "import convert as m; m.transplant_lsun_cat(ema=True)"
python -c "import convert as m; m.transplant_lsun_church()"
python -c "import convert as m; m.transplant_lsun_church(ema=True)"

Pytorch checkpoints will be saved in

diffusion_models_converted/
  diffusion_cifar10_model/
    model-790000.ckpt
  ema_diffusion_cifar10_model/
    model-790000.ckpt
  diffusion_lsun_bedroom_model/
    model-2388000.ckpt
  ema_diffusion_lsun_bedroom_model/
    model-2388000.ckpt
  diffusion_lsun_cat_model/
    model-1761000.ckpt
  ema_diffusion_lsun_cat_model/
    model-1761000.ckpt
  diffusion_lsun_church_model/
    model-4432000.ckpt
  ema_diffusion_lsun_church_model/
    model-4432000.ckpt

Sample TensorFlow models

To produce N samples from each of the pretrained TensorFlow models, run

python -c "import convert as m; m.sample_tf(N)"

Pass a list of model names as keyword argument which to specify which models to sample from. Samples will be saved in results/.

Comments

implemetation for likelihood bits/dim

Really appreciate your work on a PyTorch code loading the TF model.

I use your code, and the lib denoising-diffusion-pytorch and the original DDPM code https://github.com/hojonathanho/diffusion/blob/master/diffusion_tf/diffusion_utils_2.py try to reproduce their NLL(bits per dim) result in Table 1.

However, my code gives some results far away from the official result, and I can't interpret the problem.

Are you interested in implementing the computing of NLL?

Thank you again for this repo. It's so useful!

opened by sndnyang 1
Could not reproduce the DDPM results on LSUN-Church.
Hi, I tried using your converted model on LSUN-Church dataset (ema_diffusion_lsun_church_model/model-4432000.ckpt), but I found my FID results did not match the paper results of the original DDPM paper (10.44 vs 7.89). Specifically, I generated 50k samples with DDIM repo. The evaluation protocol is the original DDPM with 1k steps, whose running command is

python main.py --config church.yml --exp ./exp/church-ddpm --doc church --sample --fid --timesteps 1000 --eta 1 --ni --use_pretrained --sample_type ddpm_noisy

I use clean-fid for my FID computation, whose code is

from cleanfid import fid score = fid.compute_fid('exp/church-ddpm/image_samples/images/', dataset_res=256, dataset_name='lsun_church', dataset_split='train', mode='legacy_tensorflow') print(score)

This FID computation protocol should be the same as the protocol reported in DDPM paper. But I found my reproduced FID is 10.44, while the reported FID is 7.89. I think this is a large gap.

I wonder if you could reproduce this number. Is there anything wrong with using your model? Thanks!
opened by lmxyy 2

Pytorch reimplementation of PSM-Net: "Pyramid Stereo Matching Network"

This is a Pytorch Lightning version PSMNet which is based on JiaRenChang/PSMNet. use python main.py to start training. PSM-Net Pytorch reimplementatio

1 Nov 25, 2021

Unofficial PyTorch reimplementation of the paper Swin Transformer V2: Scaling Up Capacity and Resolution

PyTorch reimplementation of the paper Swin Transformer V2: Scaling Up Capacity and Resolution [arXiv 2021].

122 Dec 12, 2022

PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Reproducibility and Smooth Activations" [arXiv 2022].

Smooth ReLU in PyTorch Unofficial PyTorch reimplementation of the Smooth ReLU (SmeLU) activation function proposed in the paper Real World Large Scale

10 Jan 2, 2023

Official PyTorch implementation for FastDPM, a fast sampling algorithm for diffusion probabilistic models

Official PyTorch implementation for "On Fast Sampling of Diffusion Probabilistic Models". FastDPM generation on CIFAR-10, CelebA, and LSUN datasets. S

68 Dec 26, 2022

Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch

Retrieval-Augmented Denoising Diffusion Probabilistic Models (wip) Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in P

55 Jan 1, 2023

[ECCV 2020] Reimplementation of 3DDFAv2, including face mesh, head pose, landmarks, and more.

Stable Head Pose Estimation and Landmark Regression via 3D Dense Face Reconstruction Reimplementation of (ECCV 2020) Towards Fast, Accurate and Stable

221 Dec 30, 2022

Reimplementation of the paper `Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words? (ACL2020)`

Human Attention for Text Classification Re-implementation of the paper Human Attention Maps for Text Classification: Do Humans and Neural Networks Foc

15 Dec 13, 2021

An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.

Speech Resynthesis from Discrete Disentangled Self-Supervised Representations Implementation of the method described in the Speech Resynthesis from Di

253 Jan 6, 2023

Reimplementation of the paper "Attention, Learn to Solve Routing Problems!" in jax/flax.

JAX + Attention Learn To Solve Routing Problems Reinplementation of the paper Attention, Learn to Solve Routing Problems! using Jax and Flax. Fully su

7 Dec 1, 2022

PyTorch reimplementation of Diffusion Models

Related tags

Overview

PyTorch pretrained Diffusion Models

Quickstart

Usage

Producing samples

Results

Running conversions

Setup

Testing operations

Converting checkpoints

Sample TensorFlow models

You might also like...

Pytorch reimplementation of PSM-Net: "Pyramid Stereo Matching Network"

Unofficial PyTorch reimplementation of the paper Swin Transformer V2: Scaling Up Capacity and Resolution

PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Reproducibility and Smooth Activations" [arXiv 2022].

Official PyTorch implementation for FastDPM, a fast sampling algorithm for diffusion probabilistic models

Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch

[ECCV 2020] Reimplementation of 3DDFAv2, including face mesh, head pose, landmarks, and more.

Reimplementation of the paper `Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words? (ACL2020)`

An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.

Reimplementation of the paper "Attention, Learn to Solve Routing Problems!" in jax/flax.

Comments

implemetation for likelihood bits/dim

Could not reproduce the DDPM results on LSUN-Church.

Owner

Patrick Esser

text_recognition_toolbox: The reimplementation of a series of classical scene text recognition papers with Pytorch in a uniform way.

PyTorch reimplementation of minimal-hand (CVPR2020)

PyTorch reimplementation of the paper Involution: Inverting the Inherence of Convolution for Visual Recognition [CVPR 2021].

PyTorch reimplementation of hand-biomechanical-constraints (ECCV2020)

A PyTorch Reimplementation of TecoGAN: Temporally Coherent GAN for Video Super-Resolution

a reimplementation of Optical Flow Estimation using a Spatial Pyramid Network in PyTorch

PyTorch reimplementation of REALM and ORQA

a reimplementation of UnFlow in PyTorch that matches the official TensorFlow version

a reimplementation of LiteFlowNet in PyTorch that matches the official Caffe version

a reimplementation of Holistically-Nested Edge Detection in PyTorch