Generate vector graphics from a textual caption

Ajay Jain

Last update: Dec 15, 2022

Related tags

Text Data & NLP VectorAscent

Overview

VectorAscent: Generate vector graphics from a textual description

Example

"a painting of an evergreen tree"

python text_to_painting.py --prompt "a painting of an evergreen tree" --num_iter 2500 --use_blob --subdir vit_rn50_useblob

We rely on CLIP for its aligned text and image encoders, and diffvg, a differentiable vector graphics rasterizer. Differentiable rendering allows us to generate raster images from vector paths, but isn't provided textual descriptions. We use CLIP to score the similarity between raster graphics and textual captions. Using gradient ascent, we can then optimize for a vector graphic whose rasterization has high similarity with a user-provided caption, backpropagating through CLIP and diffvg to the vector graphics parameters. This project is partially inspired by Deep Daze, a caption-guided raster graphics generator.

Quick start

Requirements:

torch
torchvision
matplotlib
numpy
scikit-image
clip
diffvg

Install our dependencies and CLIP.

conda install --yes -c pytorch pytorch=1.7.1 torchvision cudatoolkit=11.0
pip install ftfy regex tqdm numpy matplotlib scikit-image
pip install git+https://github.com/openai/CLIP.git

Then follow these instructions to install diffvg.

You might also like...

Generate product descriptions, blogs, ads and more using GPT architecture with a single request to TextCortex API a.k.a Hemingwai

TextCortex - HemingwAI Generate product descriptions, blogs, ads and more using GPT architecture with a single request to TextCortex API a.k.a Hemingw

27 Nov 28, 2022

Clone a voice in 5 seconds to generate arbitrary speech in real-time

This repository is forked from Real-Time-Voice-Cloning which only support English. English | 中文 Features 🌍 Chinese supported mandarin and tested with

25.6k Jan 6, 2023

Generate text line images for training deep learning OCR model (e.g. CRNN)

532 Jan 6, 2023

"elect", "electoral", "electorate" etc." data-original="https://github.com/gutfeeling/word_forms/raw/master/logo.png" >

Accurately generate all possible forms of an English word e.g "election" -- "elect", "electoral", "electorate" etc.

Accurately generate all possible forms of an English word Word forms can accurately generate all possible forms of an English word. It can conjugate v

570 Dec 31, 2022

Script to generate VAD dataset used in Asteroid recipe

About the dataset LibriVAD is an open source dataset for voice activity detection in noisy environments. It is derived from LibriSpeech signals (clean

11 Sep 15, 2022

Creating an LSTM model to generate music

Music-Generation Creating an LSTM model to generate music music-generator Used to create basic sin wave sounds music-ai Contains the functions to conv

2 Dec 2, 2021

xFormers is a modular and field agnostic library to flexibly generate transformer architectures by interoperable and optimized building blocks.

Description xFormers is a modular and field agnostic library to flexibly generate transformer architectures by interoperable and optimized building bl

2.3k Jan 8, 2023

Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!

Auto-Research A no-code utility to generate a detailed well-cited survey with topic clustered sections (draft paper format) and other interesting arti

20 Dec 14, 2022

A relatively simple python program to generate one of those reddit text to speech videos dominating youtube.

Reddit text to speech generator A basic reddit tts video generator Current functionality Generate videos for subs based on comments,(askreddit) so rea

17 Dec 19, 2022

Comments

Installing diffvg with GPU acceleration?

I realize this isn't the scope of this repo, but the authors of diffVG seem to not respond to issues. If you could share what environment you used to install diffvg with GPU acceleration, I'd be very gfrateful. So far I've tried cuda 10.2, 11.0 + pytorch 1.8 with no luck.

opened by urimerhav 0
Doesn't seem to converge to anything meaningful for me

I tried a bunch of simple and complicated prompts with different numbers of iterations and different numbers of paths but don't really get anything meaningful. Even simple things like "cat". Could there be some problem with the versions?

opened by voodoohop 0

Generate vector graphics from a textual caption

Related tags

Overview

VectorAscent: Generate vector graphics from a textual description

Example

Quick start

You might also like...

Generate product descriptions, blogs, ads and more using GPT architecture with a single request to TextCortex API a.k.a Hemingwai

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Generate text line images for training deep learning OCR model (e.g. CRNN)

Accurately generate all possible forms of an English word e.g "election" -- "elect", "electoral", "electorate" etc.

Script to generate VAD dataset used in Asteroid recipe

Creating an LSTM model to generate music

xFormers is a modular and field agnostic library to flexibly generate transformer architectures by interoperable and optimized building blocks.

Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!

A relatively simple python program to generate one of those reddit text to speech videos dominating youtube.

Comments

Installing diffvg with GPU acceleration?

Doesn't seem to converge to anything meaningful for me

Owner

Ajay Jain

NLPIR tutorial: pretrain for IR. pre-train on raw textual corpus, fine-tune on MS MARCO Document Ranking

Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge

Semi-automated vocabulary generation from semantic vector models

Correctly generate plurals, ordinals, indefinite articles; convert numbers to words

Shirt Bot is a discord bot which uses GPT-3 to generate text

Correctly generate plurals, ordinals, indefinite articles; convert numbers to words

Generate a cool README/About me page for your Github Profile

📔️ Generate a text-based journal from a template file.

A method to generate speech across multiple speakers