Manifold Alignment for Semantically Aligned Style Transfer

Last update: Nov 14, 2022

Related tags

Deep Learning MAST

Overview

Manifold Alignment for Semantically Aligned Style Transfer

Getting Started

MAST has been tested on CentOS 7.6 with python >= 3.6. It supports both GPU and CPU inference. If you don't have a suitable device, try running our Colab demo.

Clone the repo:

git clone https://github.com/NJUHuoJing/MAST.git

prepare the checkpoints:

cd MAST
chmod 777 scripts/prepare_data.sh
scripts/prepare_data.sh

Install the requirements:

conda create -n mast-env python=3.6
conda activate mast-env
pip install -r requirements.txt

# If you want to use post smoothing as the same as PhotoWCT, then install the requirements below;
# You can also just skip it to use fast post smoothing, remember to change cfg.TEST.PHOTOREALISTIC.FAST_SMOOTHING=true
pip install -U setuptools
pip install cupy
pip install pynvrtc

Running the Demo

Artistic style transfer

First set MAST_CORE.ORTHOGONAL_CONSTRAINT=false in configs/config.yaml. Then use the script test_artistic.py to generate the artistic stylized image by following the command below:

# not use seg
python test_artistic.py --cfg_path configs/config.yaml --content_path data/default/content/4.png --style_path data/default/style/4.png --output_dir results/test/default

# use --content_seg_path and --style_seg_path to user edited style transfer
python test_artistic.py --cfg_path configs/config.yaml --content_path data/default/content/4.png --style_path data/default/style/4.png --output_dir results/test/default --content_seg_path data/default/content_segmentation/4.png --style_seg_path data/default/style_segmentation/4.png --seg_type labelme --resize 512

Photo-realistic style transfer

First set MAST_CORE.ORTHOGONAL_CONSTRAINT=true in configs/config.yaml. Then use the script test_photorealistic.py to generate the photo-realistic stylized image by following the command below:

# not use seg
python test_photorealistic.py --cfg_path configs/config.yaml --content_path data/photo_data/content/in1.png --style_path data/photo_data/style/tar1.png --output_dir results/test/photo --resize 512

# or use --content_seg_path and --style_seg_path to user edited style transfer
python test_photorealistic.py --cfg_path configs/config.yaml --content_path data/photo_data/content/in1.png --style_path data/photo_data/style/tar1.png --output_dir results/test/photo --content_seg_path data/photo_data/content_segmentation/in1.png --style_seg_path data/photo_data/style_segmentation/tar1.png --seg_type dpst --resize 512

GUI For Artistic style transfer and User Editing

We provide a gui for user-controllable artistic image stylization. Just use the command below to run test_gui.py

python test_gui.py --cfg_path configs/config.yaml

Features

You can use different colors to control the style transfer in different semantic areas.
The button Expand and Expand num respectively control whether to expand the selected semantic area and the degree of expansion.

See the gif demo for more details.

Google Colab

If you do not have a suitable environment to run this project then you could give Google Colab a try. It allows you to run the project in the cloud, free of charge. You may try our Colab demo using the notebook we have prepared: Colab Demo

Citation

@inproceedings{huo2021manifold,
    author = {Jing Huo and Shiyin Jin and Wenbin Li and Jing Wu and Yu-Kun Lai and Yinghuan Shi and Yang Gao},
    title = {Manifold Alignment for Semantically Aligned Style Transfer},
    booktitle = {IEEE International Conference on Computer Vision},
    pages     = {14861-14869},
    year = {2021}
}

References

The post smoothing module is borrowed from PhotoWCT

You might also like...

Robot Reinforcement Learning on the Constraint Manifold

Implementation of "Robot Reinforcement Learning on the Constraint Manifold"

31 Dec 5, 2022

This respository includes implementations on Manifoldron: Direct Space Partition via Manifold Discovery

Manifoldron: Direct Space Partition via Manifold Discovery This respository includes implementations on Manifoldron: Direct Space Partition via Manifo

4 Apr 28, 2022

Only a Matter of Style: Age Transformation Using a Style-Based Regression Model

Only a Matter of Style: Age Transformation Using a Style-Based Regression Model The task of age transformation illustrates the change of an individual

444 Dec 30, 2022

Fast Neural Style for Image Style Transform by Pytorch

FastNeuralStyle by Pytorch Fast Neural Style for Image Style Transform by Pytorch This is famous Fast Neural Style of Paper Perceptual Losses for Real

81 Sep 3, 2022

Multi-Scale Aligned Distillation for Low-Resolution Detection (CVPR2021)

MSAD Multi-Scale Aligned Distillation for Low-Resolution Detection Lu Qi*, Jason Kuen*, Jiuxiang Gu, Zhe Lin, Yi Wang, Yukang Chen, Yanwei Li, Jiaya J

115 Dec 23, 2022

Code repository for paper `Skeleton Merger: an Unsupervised Aligned Keypoint Detector`.

Skeleton Merger Skeleton Merger, an Unsupervised Aligned Keypoint Detector. The paper is available at https://arxiv.org/abs/2103.10814. A map of the r

48 Nov 14, 2022

Multi-Scale Aligned Distillation for Low-Resolution Detection (CVPR2021)

MSAD Multi-Scale Aligned Distillation for Low-Resolution Detection Lu Qi*, Jason Kuen*, Jiuxiang Gu, Zhe Lin, Yi Wang, Yukang Chen, Yanwei Li, Jiaya J

115 Dec 23, 2022

PyTorch implementation for Partially View-aligned Representation Learning with Noise-robust Contrastive Loss (CVPR 2021)

2021-CVPR-MvCLN This repo contains the code and data of the following paper accepted by CVPR 2021 Partially View-aligned Representation Learning with

33 Nov 1, 2022

Learning RAW-to-sRGB Mappings with Inaccurately Aligned Supervision (ICCV 2021)

Learning RAW-to-sRGB Mappings with Inaccurately Aligned Supervision (ICCV 2021) PyTorch implementation of Learning RAW-to-sRGB Mappings with Inaccurat

53 Dec 20, 2022

Comments

pre-trained model for testing

Hi, thanks for sharing the code! I can't find your pre-trained models. can you please share it? also - do you have plans to share your train code? Thanks. Ofer

opened by oferidan1 1

Torchvision Encoder

Hi @NJUHuoJing, thanks for sharing your great code and congratulations on your amazing work. I have a question about the encoder that you applied in your framework, I can get similar results as yours when I use your code with your provided encoder (vgg_r51.pth). However, when I use the pre-trained VGG-19 model provided by pytorch-torchvision, I cannot get any reasonable result. I have tried the torchvision VGG-19 with the exact same layers as you. For instance, in the test_artistic.py code, I used the r41,r31,r21, and I defined the following code for the encoder.

from torchvision import models 
class VGG19_encoder(torch.nn.Module):
    def __init__(self,requires_grad=False,n_layers=[1,6,11,20,29]):       
        super(VGG19_encoder,self).__init__()
        vgg_pretrained_features=models.vgg19(pretrained=True).features   
        self.slice0=torch.nn.Sequential()
        self.slice1=torch.nn.Sequential()
        self.slice2=torch.nn.Sequential()
        self.slice3=torch.nn.Sequential()
        self.slice4=torch.nn.Sequential()
        for x in range(n_layers[0]):#relu1_1
            self.slice0.add_module(str(x),vgg_pretrained_features[x])        
        for x in range(n_layers[0],n_layers[1]): #relu2_1
            self.slice1.add_module(str(x),vgg_pretrained_features[x])        
        for x in range(n_layers[1],n_layers[2]): #relu3_1
            self.slice2.add_module(str(x),vgg_pretrained_features[x])
        for x in range(n_layers[2],n_layers[3]):#relu4_1
            self.slice3.add_module(str(x),vgg_pretrained_features[x])
        for x in range(n_layers[3],n_layers[4]):#relu5_1
            self.slice4.add_module(str(x),vgg_pretrained_features[x])        
        if not requires_grad:
            for param in self.parameters():
                param.requires_grad=False            
    def forward(self,x):
        h0=self.slice0(x)    #[1,64,512,512]
        h1=self.slice1(h0)   #[1,128,256,256]
        h2=self.slice2(h1)   #[1,256,128,128]
        h3=self.slice3(h2)   #[1,512,64,64]
        h4=self.slice4(h3)   #[1,512,32,32]
        return {'r11':h0,'r21':h1,'r31':h2,'r41':h3,'r51':h4}

Then, I defined the encoder as the following line and added it in multi_level_test by using just the r41,r31,r21 layers:

encoder = VGG19_encoder(n_layers=[1,6,11,20,29]).to(device)

I am wondering how did you train your encoder and what is the difference between your VGG-19 model and the VGG-19 model from torchvision since both of them are trained on ImageNet.

I would appreciate a lot any help that you could provide regarding this issue. Thank you very much.

opened by denabazazian 0

Owner

GitHub

Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis Implementation

Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis Implementation This project attempted to implement the paper Putting NeRF on a

254 Dec 27, 2022

The Official Implementation of the ICCV-2021 Paper: Semantically Coherent Out-of-Distribution Detection.

SCOOD-UDG (ICCV 2021) This repository is the official implementation of the paper: Semantically Coherent Out-of-Distribution Detection Jingkang Yang,

62 Nov 21, 2022

From this paper "SESNet: A Semantically Enhanced Siamese Network for Remote Sensing Change Detection"

SESNet for remote sensing image change detection It is the implementation of the paper: "SESNet: A Semantically Enhanced Siamese Network for Remote Se

1 May 24, 2022

SnapMix: Semantically Proportional Mixing for Augmenting Fine-grained Data (AAAI 2021)

SnapMix: Semantically Proportional Mixing for Augmenting Fine-grained Data (AAAI 2021) PyTorch implementation of SnapMix | paper Method Overview Cite

126 Dec 30, 2022

SeMask: Semantically Masked Transformers for Semantic Segmentation.

SeMask: Semantically Masked Transformers Jitesh Jain, Anukriti Singh, Nikita Orlov, Zilong Huang, Jiachen Li, Steven Walton, Humphrey Shi This repo co

186 Dec 30, 2022

Official repository for the ICLR 2021 paper Evaluating the Disentanglement of Deep Generative Models with Manifold Topology

Official repository for the ICLR 2021 paper Evaluating the Disentanglement of Deep Generative Models with Manifold Topology Sharon Zhou, Eric Zelikman

34 Nov 16, 2022

Research Artifact of USENIX Security 2022 Paper: Automated Side Channel Analysis of Media Software with Manifold Learning

Manifold-SCA Research Artifact of USENIX Security 2022 Paper: Automated Side Channel Analysis of Media Software with Manifold Learning The repo is org

172 Dec 29, 2022

Manifold Alignment for Semantically Aligned Style Transfer

Related tags

Overview

Manifold Alignment for Semantically Aligned Style Transfer

Getting Started

Running the Demo

Artistic style transfer

Photo-realistic style transfer

GUI For Artistic style transfer and User Editing

Features

Google Colab

Citation

References

You might also like...

Robot Reinforcement Learning on the Constraint Manifold

This respository includes implementations on Manifoldron: Direct Space Partition via Manifold Discovery

Only a Matter of Style: Age Transformation Using a Style-Based Regression Model

Fast Neural Style for Image Style Transform by Pytorch

Multi-Scale Aligned Distillation for Low-Resolution Detection (CVPR2021)

Code repository for paper `Skeleton Merger: an Unsupervised Aligned Keypoint Detector`.

Multi-Scale Aligned Distillation for Low-Resolution Detection (CVPR2021)

PyTorch implementation for Partially View-aligned Representation Learning with Noise-robust Contrastive Loss (CVPR 2021)

Learning RAW-to-sRGB Mappings with Inaccurately Aligned Supervision (ICCV 2021)

Comments

pre-trained model for testing

Torchvision Encoder

Owner

Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis Implementation

The Official Implementation of the ICCV-2021 Paper: Semantically Coherent Out-of-Distribution Detection.

From this paper "SESNet: A Semantically Enhanced Siamese Network for Remote Sensing Change Detection"

SnapMix: Semantically Proportional Mixing for Augmenting Fine-grained Data (AAAI 2021)

SeMask: Semantically Masked Transformers for Semantic Segmentation.

Official repository for the ICLR 2021 paper Evaluating the Disentanglement of Deep Generative Models with Manifold Topology

Code for Learning Manifold Patch-Based Representations of Man-Made Shapes, in ICLR 2021.

Hierarchical Uniform Manifold Approximation and Projection

A Pytorch implementation of "Manifold Matching via Deep Metric Learning for Generative Modeling" (ICCV 2021)

Research Artifact of USENIX Security 2022 Paper: Automated Side Channel Analysis of Media Software with Manifold Learning