Styled text-to-drawing synthesis method. Featured at the 2021 NeurIPS Workshop on Machine Learning for Creativity and Design

Peter Schaldenbrand

Last update: Dec 23, 2022

Overview

StyleCLIPDraw

Peter Schaldenbrand, Zhixuan Liu, Jean Oh September 2021

To be featured in the 2021 NeurIPS Workshop on Machine Learning for Creativity and Design

StyleCLIPDraw adds a style loss to the CLIPDraw (Frans et al. 2021) (code) text-to-drawing synthesis model to allow artistic control of the synthesized drawings in addition to control of the content via text. Whereas performing decoupled style transfer on a generated image only affects the texture, our proposed coupled approach is able to capture a style in both texture and shape, suggesting that the style of the drawing is coupled with the drawing process itself.

Check out our code on Colab

Method

Unlike most other image generation models, CLIPDraw produces drawings consisting of a series of Bezier curves, each defined by a list of coordinates, a color, and an opacity. The drawing begins as randomized Bezier curves on a canvas and is optimized to fit the given style and text. The StyleCLIPDraw model architecture is shown above. The brush strokes are rendered into a raster image via a differentiable model.

StyleCLIPDraw has two losses, one for each input. The text input and the augmented raster drawing are fed to the CLIP model, and the cosine distance between their embeddings gives a loss that encourages the drawing to fit the text input. The image is augmented to avoid finding shallow solutions when optimizing through the CLIP model. The raster image and the style image are fed through early layers of the VGG-16 model, and the difference in extracted features forms the loss that encourages the drawing to fit the style of the style image.
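The two losses described above can be illustrated with a minimal sketch in plain Python. This is not the repository's implementation: the embeddings and feature vectors are stand-in lists of floats (in practice they would come from CLIP and VGG-16), the names `cosine_distance`, `feature_difference`, `total_loss`, and `style_weight` are hypothetical, the mean squared difference is one possible reading of "difference in extracted features", and the weighted sum is an assumption about how the two terms could be combined.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def feature_difference(drawing_feats, style_feats):
    """Mean squared difference, averaged over layers.

    Each argument is a list of per-layer feature vectors (assumed to be
    flattened activations from the same early layers of a network).
    """
    per_layer = []
    for fd, fs in zip(drawing_feats, style_feats):
        per_layer.append(sum((x - y) ** 2 for x, y in zip(fd, fs)) / len(fd))
    return sum(per_layer) / len(per_layer)


def total_loss(text_emb, drawing_emb, drawing_feats, style_feats,
               style_weight=1.0):
    """Combine the content (text) loss and the style loss.

    The weighted sum and `style_weight` are assumptions for illustration.
    """
    content = cosine_distance(text_emb, drawing_emb)
    style = feature_difference(drawing_feats, style_feats)
    return content + style_weight * style


# Toy usage with made-up numbers standing in for real embeddings/features.
loss = total_loss(
    text_emb=[1.0, 0.0],
    drawing_emb=[0.0, 1.0],
    drawing_feats=[[0.0, 0.0]],
    style_feats=[[1.0, 1.0]],
)
```

In a real pipeline both terms would be computed on tensors so that gradients can flow back through the differentiable rendering step to the curve parameters. The plain-float version here only shows what each term measures.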

Results

StyleCLIPDraw vs. CLIPDraw then Style Transfer

Comments

RuntimeError: radix_sort: failed on 1st step: cudaErrorInvalidDeviceFunction: invalid device function

Hi, this research is really fantastic and exciting! Following the Colab instructions, when I run

img = style_clip_draw('A man is watching TV', 'https://raw.githubusercontent.com/pschaldenbrand/StyleCLIPDraw/master/images/fruit.jpg',\
                          num_iter=1000, style_opt_freq=5, style_opt_iter=50)
show_img(img)

the runtime returns:

Downloading: "https://download.pytorch.org/models/vgg16-397923af.pth" to /root/.cache/torch/hub/checkpoints/vgg16-397923af.pth
100% 528M/528M [00:05<00:00, 103MB/s]
RuntimeError                              Traceback (most recent call last)
<ipython-input-6-c7386fef0808> in <module>()
----> 1 img = style_clip_draw('A man is watching TV', 'https://raw.githubusercontent.com/pschaldenbrand/StyleCLIPDraw/master/images/fruit.jpg', num_iter=1000, style_opt_freq=5, style_opt_iter=50)
      2 show_img(img)

4 frames
/usr/local/lib/python3.7/dist-packages/diffvg-0.0.1-py3.7-linux-x86_64.egg/pydiffvg/render_pytorch.py in backward(ctx, grad_img)
    707         use_prefiltering,
    708         diffvg.float_ptr(eval_positions.data_ptr()),
--> 709         eval_positions.shape[0])
    710     time_elapsed = time.time() - start
    711     global print_timing
RuntimeError: radix_sort: failed on 1st step: cudaErrorInvalidDeviceFunction: invalid device function

Rerunning two or three times gives the same error. Could someone familiar with this work help me? Thanks!

opened by lizekui 5

AttributeError: module 'diffvg' has no attribute 'FilterType'

Hi, thanks for sharing the project, idea, and code. I tried to run the Colab and it stops at the pydiffvg import, showing the following error:

AttributeError: module 'diffvg' has no attribute 'FilterType'

I searched online and tried installing diffvg manually from GitHub, but still no luck. Do you have any idea how I could make it work? Thanks for your help!

opened by chikiuso 4

Colab StyleCLIPDraw error

Using Colab pro with a GPU 0: Tesla P100-PCIE-16GB

Executing:

img = style_clip_draw('A man is watching TV', 'https://raw.githubusercontent.com/pschaldenbrand/StyleCLIPDraw/master/images/fruit.jpg',\
                          num_iter=1000, style_opt_freq=5, style_opt_iter=50, debug=True) 
show_img(img)

I got this error

/usr/local/lib/python3.7/dist-packages/torch/autograd/__init__.py in backward(tensors, grad_tensors, retain_graph, create_graph, grad_variables, inputs)
    154     Variable._execution_engine.run_backward(
    155         tensors, grad_tensors_, retain_graph, create_graph, inputs,
--> 156         allow_unreachable=True, accumulate_grad=True)  # allow_unreachable flag
    157 
    158 
RuntimeError: merge_sort: failed to synchronize: cudaErrorIllegalAddress: an illegal memory access was encountered

opened by mcanet 3

Error on colab

Hey guys,

Thank you for sharing the code :) I am trying to use it in Colab, but after the installation cell finishes I get the following error:

AttributeError: module 'diffvg' has no attribute 'FilterType'

opened by FrancescoSaverioZuppichini 1

use compatible cuda version; yield progressive outputs

Hi Peter! I was running into issues with Torch, so I upgraded to 1.10.0 and it worked locally. However, when pushing to Replicate I needed to set CUDA==10.2 because the NVIDIA driver version on Replicate nodes is older than CUDA 11.3 requires.

opened by andreasjansson 0
