pytorch_clip_guided_loss: PyTorch implementation of the CLIP guided loss for Text-To-Image, Image-To-Image, or Image-To-Text generation.
A simple library that implements the CLIP guided loss in PyTorch.
Install package
pip install pytorch_clip_guided_loss
Install the latest version
pip install --upgrade git+https://github.com/bes-dev/pytorch_clip_guided_loss.git
Features
- The library supports multiple prompts (images or texts) as targets for optimization.
- The library automatically detects the language of the input text and translates non-English prompts via Google Translate.
- The library supports the original CLIP model by OpenAI and the ruCLIP model by SberAI (see the sketch below).
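A minimal sketch combining these features. Note the assumptions: clip_type="clip" for selecting the OpenAI backbone is not confirmed by this README (the Usage section below demonstrates "ruclip"), and the Russian prompt simply exercises the automatic translation described above.

import torch
from pytorch_clip_guided_loss import get_clip_guided_loss

# clip_type="clip" is assumed here to select the original OpenAI backbone;
# the Usage example below uses clip_type="ruclip" for the SberAI model.
loss_fn = get_clip_guided_loss(clip_type="clip", input_range=(-1, 1)).eval().requires_grad_(False)

# several prompts (texts and/or images) can be registered as joint targets
loss_fn.add_prompt(text="a red apple on a table")
loss_fn.add_prompt(text="закат над морем")  # non-English text is auto-translated
loss_fn.add_prompt(image=torch.randn(1, 3, 224, 224))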
Usage
Simple code
import torch
from pytorch_clip_guided_loss import get_clip_guided_loss
loss_fn = get_clip_guided_loss(clip_type="ruclip", input_range=(-1, 1)).eval().requires_grad_(False)
# text prompt
loss_fn.add_prompt(text="text description of what we would like to generate")
# image prompt
loss_fn.add_prompt(image=torch.randn(1, 3, 224, 224))
# image variable to optimize
var = torch.randn(1, 3, 224, 224).requires_grad_(True)
loss = loss_fn(image=var)["loss"]
loss.backward()
print(var.grad)
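Since the returned loss is an ordinary differentiable scalar, the variable can be refined with any standard PyTorch optimizer. A minimal sketch of such a loop, continuing the example above (the learning rate and step count are arbitrary choices, not library defaults):

import torch
from pytorch_clip_guided_loss import get_clip_guided_loss

loss_fn = get_clip_guided_loss(clip_type="ruclip", input_range=(-1, 1)).eval().requires_grad_(False)
loss_fn.add_prompt(text="a photo of a sunset over the sea")

# start from random noise and optimize the pixels directly
var = torch.randn(1, 3, 224, 224, requires_grad=True)
optimizer = torch.optim.Adam([var], lr=0.05)

for step in range(100):
    optimizer.zero_grad()
    loss = loss_fn(image=var)["loss"]
    loss.backward()
    optimizer.step()
    if step % 10 == 0:
        print(f"step {step}: loss = {loss.item():.4f}")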
VQGAN-CLIP
We provide a tiny implementation of the VQGAN-CLIP pipeline for image generation as an example of how to use the library. To get started with our VQGAN-CLIP implementation, please follow the documentation.
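The idea behind the pipeline: instead of optimizing raw pixels, optimize a VQGAN latent code and decode it to an image before scoring it with the CLIP guided loss. A rough sketch of that idea, not our actual implementation: the decoder below is a toy stand-in for a pretrained VQGAN decoder, and the latent shape is an assumption.

import torch
import torch.nn as nn
from pytorch_clip_guided_loss import get_clip_guided_loss

# Toy stand-in for a pretrained VQGAN decoder (latent -> image in (-1, 1)).
# The real pipeline uses an actual VQGAN; this dummy network only
# illustrates the data flow and will not produce meaningful images.
decoder = nn.Sequential(
    nn.ConvTranspose2d(256, 64, kernel_size=4, stride=2, padding=1),  # 16x16 -> 32x32
    nn.ReLU(),
    nn.ConvTranspose2d(64, 3, kernel_size=7, stride=7, padding=0),    # 32x32 -> 224x224
    nn.Tanh(),  # keep the output in (-1, 1), matching input_range
).eval().requires_grad_(False)

loss_fn = get_clip_guided_loss(clip_type="ruclip", input_range=(-1, 1)).eval().requires_grad_(False)
loss_fn.add_prompt(text="an impressionist painting of a city at night")

# optimize the latent code instead of raw pixels (latent shape is assumed)
latent = torch.randn(1, 256, 16, 16, requires_grad=True)
optimizer = torch.optim.Adam([latent], lr=0.1)

for step in range(200):
    optimizer.zero_grad()
    image = decoder(latent)              # decode latent -> image
    loss = loss_fn(image=image)["loss"]  # score the image against the prompts
    loss.backward()
    optimizer.step()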