Photographic Image Synthesis with Cascaded Refinement Networks - Pytorch Implementation

Soumya Tripathy

Last update: Mar 27, 2022

Related tags

Deep Learning deep-learning neural-network high-resolution vgg19 semantic-segmentation cityscapes photorealistic-based-rendering

Overview

Photographic Image Synthesis with Cascaded Refinement Networks-Pytorch (https://arxiv.org/abs/1707.09405)

This is a PyTorch implementation of cascaded refinement networks (CRN), which synthesize photographic images from semantic layouts. The pretrained model and the code for training the network from scratch are now available for 256x512 resolution. Thanks to Qifeng Chen for his TensorFlow implementation, which helped a lot in developing this PyTorch version.

Testing

1. Download this package and keep all the files mentioned in the following steps in the same folder.
2. Download the pretrained VGG19 network from VGG19.
3. Download the pretrained CRN weights for 256x512 resolution from CRN.
4. Keep mode=test and set the name of the semantic image to be tested in Cascadaed_Network_LM_256.py.
5. The synthesized images will be saved in the current folder.

Training

1. Follow steps 1 to 3 of the testing steps.
2. Resize all training images to 256x512. Keep the semantic segmentation training images in the Label256Full folder and the RGB training images in RGB256Full (without any subfolders).
3. Set mode=train in Cascadaed_Network_LM_256.py and run it for the desired number of epochs (default is 200).
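Before starting a long training run, it can help to check the folder layout described above. The following is a minimal sketch, not part of this repository: the helper name `check_dataset` is hypothetical, and it assumes that each label image and its RGB counterpart share the same file name, which the README does not state.

```python
from pathlib import Path


def check_dataset(root=".", label_dir="Label256Full", rgb_dir="RGB256Full"):
    """Return the sorted file names found in both folders.

    Raises ValueError if a folder contains subfolders or if a file has no
    partner with the same name in the other folder (an assumed convention).
    """
    names = {}
    for folder in (label_dir, rgb_dir):
        entries = list((Path(root) / folder).iterdir())
        if any(entry.is_dir() for entry in entries):
            raise ValueError(f"{folder} must not contain subfolders")
        names[folder] = {entry.name for entry in entries}

    # Symmetric difference: names present in exactly one of the two folders.
    unpaired = names[label_dir] ^ names[rgb_dir]
    if unpaired:
        raise ValueError(f"files without a partner: {sorted(unpaired)}")
    return sorted(names[label_dir])
```

Calling `check_dataset()` from the folder that holds Label256Full and RGB256Full returns the list of paired file names, or raises with the names that need fixing.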

Future Work

Pretrained weights for 512x1024 and 1024x2048 resolution will be available soon, along with training scripts.

Note

All the code is written to run on a GPU. Suitable changes are needed if you want to run it on a CPU. Feel free to customize it according to your needs.
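One common way to make PyTorch code run on either a GPU or a CPU is to pick a device once and move the model and tensors to it. This is a generic sketch rather than a patch for this repository: `pick_device` is a hypothetical helper, the small convolution stands in for the CRN, and `weights.pth` is a placeholder file name.

```python
import torch
import torch.nn as nn


def pick_device():
    """Use the GPU when one is available, otherwise fall back to the CPU."""
    return torch.device("cuda" if torch.cuda.is_available() else "cpu")


device = pick_device()

# Stand-in for the real network; the same .to(device) call applies to any nn.Module.
model = nn.Conv2d(3, 8, kernel_size=3, padding=1).to(device)

# Create inputs directly on the chosen device instead of calling .cuda().
x = torch.randn(1, 3, 16, 32, device=device)
y = model(x)

# A checkpoint saved on a GPU usually needs map_location to load on a CPU-only machine:
# state = torch.load("weights.pth", map_location=device)  # placeholder file name
```

Replacing hard-coded `.cuda()` calls with `.to(device)` in this way is typically the main change needed for CPU execution, although training at these resolutions on a CPU is likely to be very slow.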


Comments

Information on training process

Hi there,

I'm just wondering if it is possible to get some information on what hardware you used to train the model, as well as how long it took? I am currently writing my own version of the CRN, but I am encountering extremely long training times and massive memory requirements, so I thought it worthwhile to find out more about other implementations and their requirements.

Kind regards

opened by Swidilator 2

Pretrained VGG19 and CRN weights can no longer be downloaded

Hello, I am trying to reproduce your work by following the README instructions. I found that I currently do not have access to your pretrained models on Google Drive. Could you release them?

opened by ConvMech 1

The way that you get D and D_m seems computation heavy.

import torch.nn as nn

def recursive_img(label, res):  # res may refer to the final output resolution, i.e. 256x512 or 512x1024
    dim = 512 if res >= 128 else 1024
    # M_low starts at 4x8 and goes up to res x 2*res
    if res == 4:
        # Coarsest level: use the label map as it is
        downsampled = label
    else:
        # Halve the resolution with 2x2 average pooling, then recurse
        pool = nn.AvgPool2d(kernel_size=2, padding=0, stride=2)
        downsampled = pool(label)
        recursive_img(downsampled, res // 2)

    # D, D_m and count are module-level globals that collect the results
    global D, D_m, count
    D.insert(count, downsampled)
    D_m.insert(count, dim)
    count += 1
    return downsampled

Why not directly assign specific values to each D_i and D_m_i?

opened by Naruto-Sasuke 1
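One way to read this suggestion is a plain loop that builds both lists without recursion or globals. The sketch below is an interpretation, not the repository's code: `build_pyramid` is a hypothetical name, and it assumes the intent is one label map per resolution from 4x8 up to the input resolution, with the same channel rule as the quoted function (512 when the height is at least 128, otherwise 1024).

```python
import torch
import torch.nn.functional as F


def build_pyramid(label, min_height=4):
    """Return (D, D_m) for a label tensor of shape (N, C, H, W).

    D lists the label maps from coarsest to finest; D_m gives a channel
    width for each level.
    """
    maps = [label]
    # Keep halving with 2x2 average pooling until the coarsest height is reached.
    while maps[-1].shape[-2] > min_height:
        maps.append(F.avg_pool2d(maps[-1], kernel_size=2, stride=2))

    D = maps[::-1]  # coarsest first
    D_m = [512 if m.shape[-2] >= 128 else 1024 for m in D]
    return D, D_m


label = torch.zeros(1, 20, 256, 512)  # 20 label channels is an arbitrary choice
D, D_m = build_pyramid(label)
```

The pooling cost is the same as in the recursive version, so the gain here is mostly readability: no global state, and the lists are returned instead of mutated.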

Several questions

Can we use a recursive_generator similar to the one in the official code?

LayerNorm is available here: http://pytorch.org/docs/master/nn.html#layernorm (it seems v0.4 will be released this week).

opened by Naruto-Sasuke 1
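For context, `torch.nn.LayerNorm` can normalize a convolutional feature map over its channel and spatial dimensions when given the full `[C, H, W]` shape. The block below is a hedged sketch of how it might be combined with a convolution; `ConvLNBlock` is a hypothetical name, and the 3x3 kernel and LeakyReLU slope of 0.2 are illustrative choices, not values taken from this repository.

```python
import torch
import torch.nn as nn


class ConvLNBlock(nn.Module):
    """3x3 convolution, layer normalization over (C, H, W), then LeakyReLU."""

    def __init__(self, in_ch, out_ch, height, width):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1)
        # LayerNorm needs the exact shape of the dimensions it normalizes,
        # so the spatial size is fixed per block.
        self.norm = nn.LayerNorm([out_ch, height, width])
        self.act = nn.LeakyReLU(0.2)

    def forward(self, x):
        return self.act(self.norm(self.conv(x)))


block = ConvLNBlock(in_ch=3, out_ch=8, height=4, width=8)
out = block(torch.randn(2, 3, 4, 8))
```

Because the normalized shape is fixed, a cascaded network would need one such block per resolution level.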


You might also like...

Official implementation for (Refine Myself by Teaching Myself : Feature Refinement via Self-Knowledge Distillation, CVPR-2021)

Official Implementation for "ReStyle: A Residual-Based StyleGAN Encoder via Iterative Refinement" https://arxiv.org/abs/2104.02699

Paddle implementation for "Cross-Lingual Word Embedding Refinement by ℓ1 Norm Optimisation" (NAACL 2021)

Implementation of the paper "Self-Promoted Prototype Refinement for Few-Shot Class-Incremental Learning"

π-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis

Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis (CVPR2022)

Pytorch implementation of few-shot semantic image synthesis

PyTorch implementation for SDEdit: Image Synthesis and Editing with Stochastic Differential Equations

A PyTorch implementation of the paper "Semantic Image Synthesis via Adversarial Learning" in ICCV 2017


Owner

Soumya Tripathy

Face Detection and Alignment using Multi-task Cascaded Convolutional Networks (MTCNN)

Unofficial implementation about Image Super-Resolution via Iterative Refinement by Pytorch

QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection

Cascaded Pyramid Network (CPN) based on Keras (Tensorflow backend)

Cascaded Deep Video Deblurring Using Temporal Sharpness Prior and Non-local Spatial-Temporal Similarity

Pytorch implementation for reproducing StackGAN_v2 results in the paper StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks

PyTorch 1.5 implementation for paper DECOR-GAN: 3D Shape Detailization by Conditional Refinement.

Pytorch implementation of “Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement”

Implementation of Invariant Point Attention, used for coordinate refinement in the structure module of Alphafold2, as a standalone Pytorch module

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models