Contrastively Disentangled Sequential Variational Autoencoder (C-DSVAE)
Overview
This is the implementation of our C-DSVAE, a self-supervised method for disentangled sequential representation learning.
Requirements
- Python 3
- PyTorch 1.7
- Numpy 1.18.5
Dataset
Sprites
We provide the raw Sprites .npy files. The dataset can also be found in a third-party repo.
For each split (train/test), we expect the following components for each sequence sample (a minimal loading sketch follows this list):
- x: raw sample of shape [8, 3, 64, 64]
- c_aug: content augmentation of shape [8, 3, 64, 64]
- m_aug: motion augmentation of shape [8, 3, 64, 64]
- motion factors: action (3 classes), direction (3 classes)
- content factors: skin, tops, pants, hair (each with 6 classes)
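The loader in this repo may be organized differently; the following is a minimal sketch of how one split could be wrapped as a PyTorch Dataset, assuming each .npy file stores per-sequence dicts with keys x, c_aug, and m_aug of the shapes listed above (the file layout and the class name SpritesSequenceDataset are assumptions, not the repo's actual interface).

```python
import numpy as np
import torch
from torch.utils.data import Dataset

class SpritesSequenceDataset(Dataset):
    """Illustrative wrapper; the .npy layout and key names are assumptions."""

    def __init__(self, npy_path):
        # Assumed layout: an array/list of dicts, each with keys
        # 'x', 'c_aug', 'm_aug' of shape [8, 3, 64, 64].
        self.data = np.load(npy_path, allow_pickle=True)

    def __len__(self):
        return len(self.data)

    def __getitem__(self, idx):
        item = self.data[idx]
        x = torch.as_tensor(item['x'], dtype=torch.float32)          # raw sequence
        c_aug = torch.as_tensor(item['c_aug'], dtype=torch.float32)  # content augmentation
        m_aug = torch.as_tensor(item['m_aug'], dtype=torch.float32)  # motion augmentation
        return x, c_aug, m_aug
```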
Running
Train
./run_cdsvae.sh
Test
./run_test_sprite.sh
Classification Judge
The judge classifiers are pretrained separately with full supervision.
- Sprites judge
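The judge is used at evaluation time to predict the motion factors (action, direction) and content factors (skin, tops, pants, hair) from generated or swapped sequences. A hedged usage sketch follows; the checkpoint file name, the input layout, and the dict of per-factor logits are hypothetical, not the repo's actual interface.

```python
import torch

# Hypothetical judge checkpoint and output format, for illustration only.
judge = torch.load('sprites_judge.pth', map_location='cpu')
judge.eval()

x = torch.rand(16, 8, 3, 64, 64)  # a batch of 16 generated sequences
with torch.no_grad():
    logits = judge(x)  # assumed: {'action': [16, 3], 'direction': [16, 3], 'skin': [16, 6], ...}
    action_pred = logits['action'].argmax(dim=-1)  # predicted action class per sequence
```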
C-DSVAE Checkpoints
We provide a sample Sprites checkpoint. The checkpoint parameters can be found in ./run_test_sprite.sh.
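If you want to inspect the provided checkpoint outside of the test script, the standard PyTorch loading pattern applies. Below is a minimal sketch assuming the checkpoint was written with torch.save; the file name is a placeholder and the exact keys depend on how it was saved.

```python
import torch

# Placeholder file name; point this at the provided Sprites checkpoint.
ckpt = torch.load('cdsvae_sprites.pth', map_location='cpu')

# The checkpoint may be a raw state_dict or a dict wrapping one;
# listing the top-level keys shows which.
if isinstance(ckpt, dict):
    print(list(ckpt.keys())[:10])
```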
Paper
If you are inspired by our work, please cite the following paper:
@inproceedings{bai2021contrastively,
title={Contrastively Disentangled Sequential Variational Autoencoder},
author={Bai, Junwen and Wang, Weiran and Gomes, Carla},
booktitle={Advances in Neural Information Processing Systems},
volume={},
year={2021}
}