PyTorch implementation of VAEs for heterogeneous likelihoods.

Adrián Javaloy

Last update: Nov 29, 2022

Overview

Heterogeneous VAEs

Beware: This repository is under construction 🛠️

PyTorch implementation of different VAE models for heterogeneous data. Here, heterogeneous data means data in which each feature is assumed to be of a possibly different type, so each feature is assigned its own likelihood. Heterogeneous data is also known as mixed-type data or tabular data.

Usage

This repository is not meant to be a library that you can install and use as is, but rather ML project code that you can freely fork and modify to fit your particular needs.

Dependencies

We are working on providing a conda requirements file. For the moment, there is a Dockerfile which you can build and use, or simply look at the project dependencies from there.

Example

You can find information about all the available arguments via python main.py --help. For example, you can train a heterogeneous VAE on the Wine dataset with default arguments using:

python main.py -model=vae -dataset=datasets/Wine -seed=2 -miss-perc=20 -miss-suffix=1

Models

This repository contains implementations of the following models, adapted for heterogeneous likelihoods (if you use them in your work, make sure to cite the original authors):

Auto-Encoding Variational Bayes (VAE): https://arxiv.org/abs/1312.6114
Importance Weighted Autoencoder (IWAE): http://arxiv.org/abs/1509.00519
Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives (DReG): http://arxiv.org/abs/1810.04152
Handling Incomplete Heterogeneous Data using VAEs (HI-VAE): http://arxiv.org/abs/1807.03653
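
As a rough illustration of how the first two objectives differ, the sketch below computes both bounds from the same set of log importance weights, log w_k = log p(x, z_k) - log q(z_k | x), for K samples z_k drawn from the encoder. The function names are hypothetical and not taken from this repository, and the numbers are made up. DReG targets the same importance-weighted bound as IWAE but changes how its gradient is estimated, which this sketch does not cover.

```python
import math


def log_mean_exp(values):
    """Numerically stable log of the mean of exp(values)."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values) / len(values))


def elbo_estimate(log_weights):
    """Standard VAE bound: the average of the log importance weights."""
    return sum(log_weights) / len(log_weights)


def iwae_estimate(log_weights):
    """Importance-weighted bound: the log of the average importance weight."""
    return log_mean_exp(log_weights)


# Made-up log weights for K = 3 samples (illustrative values only).
log_weights = [-3.0, -1.0, -2.0]
print("ELBO estimate:", elbo_estimate(log_weights))
print("IWAE estimate:", iwae_estimate(log_weights))
```

By Jensen's inequality the IWAE estimate is never smaller than the ELBO estimate computed from the same weights, and the two coincide when K = 1.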

Likelihoods

The code supports the following likelihoods at the moment:

Gaussian, for real-valued features.
Log-normal, for positive-valued real features.
Bernoulli, for binary features.
Categorical, for categorical features.
Poisson, for non-negative integer (count) features.
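
To make the idea of one likelihood per feature concrete, here is a minimal, framework-free sketch that scores a single row with a different log-likelihood for each column and sums the results. All names (LIKELIHOODS, row_log_likelihood, and the individual functions) are hypothetical and do not reflect the repository's actual code, which works with PyTorch; in a VAE the parameters would come from the decoder rather than being fixed by hand as they are here.

```python
import math


def gaussian_logpdf(x, mean, std):
    # Log-density of a Gaussian, for real-valued features.
    return -0.5 * math.log(2 * math.pi) - math.log(std) - 0.5 * ((x - mean) / std) ** 2


def lognormal_logpdf(x, mean, std):
    # log(x) is Gaussian; the extra -log(x) is the change-of-variables term.
    return gaussian_logpdf(math.log(x), mean, std) - math.log(x)


def bernoulli_logpmf(x, p):
    # x is 0 or 1, p is the probability of x == 1.
    return math.log(p) if x == 1 else math.log(1.0 - p)


def categorical_logpmf(x, probs):
    # x is a class index, probs is a list of class probabilities.
    return math.log(probs[x])


def poisson_logpmf(x, rate):
    # x is a non-negative integer count.
    return x * math.log(rate) - rate - math.lgamma(x + 1)


# Hypothetical registry mapping a feature type to its log-likelihood.
LIKELIHOODS = {
    "gaussian": gaussian_logpdf,
    "lognormal": lognormal_logpdf,
    "bernoulli": bernoulli_logpmf,
    "categorical": categorical_logpmf,
    "poisson": poisson_logpmf,
}


def row_log_likelihood(row, types, params):
    """Sum the per-feature log-likelihoods of one data row."""
    return sum(LIKELIHOODS[t](x, *p) for x, t, p in zip(row, types, params))


# One made-up row with five features, one per supported likelihood.
row = [0.0, 1.0, 1, 2, 0]
types = ["gaussian", "lognormal", "bernoulli", "categorical", "poisson"]
params = [(0.0, 1.0), (0.0, 1.0), (0.5,), ([0.25, 0.25, 0.5],), (2.0,)]
total = row_log_likelihood(row, types, params)
print("log-likelihood of the row:", total)
```

Summing the per-feature terms corresponds to assuming the features are conditionally independent given the latent variable, which is the usual factorisation in this kind of model.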

Datasets

This code ships with some example datasets taken from UCI and from R package datasets. You can use any other dataset as long as it follows the same format.

Contributing

The code can be further simplified and polished, and it still contains some legacy code. Pull requests and issues are more than welcome, as long as they contribute to making the code clean, simple, general, and elegant.

Comments

Question on model evaluation

Thanks for the implementation! I was curious as to why both mask and mask_bc are set to dataset[:][2] in the function test_mie_ll in miscelanea.py. Shouldn't it be mask_bc = dataset[:][1]? That is, we need to apply the mask when encoding and then generate the data. dataset[:][2] refers to the variable nan_mask (in the file datasets.py), which is essentially all ones. It would be great if you could clarify this.

Thanks

opened by VRM1 1
Evaluate only on missing data

Resolves #1.

Adds an option to choose whether to use only the missing elements for evaluation. It still allows evaluating reconstruction with the training dataset.
bug

opened by adrianjav 0
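
The evaluation option described in the pull request above can be illustrated with a small, framework-free sketch. It is not the repository's code: the function name, the only_missing flag, the use of squared error as the metric, and the convention that a mask value of 1 marks an observed entry and 0 a missing one are all assumptions made for the example.

```python
def masked_mean_squared_error(targets, predictions, observed_mask, only_missing=True):
    """Mean squared error over missing entries only, or over all entries.

    Assumed convention: observed_mask[i][j] == 1 means the entry was observed
    by the model, and 0 means it was hidden from the model (missing).
    """
    total, count = 0.0, 0
    for t_row, p_row, m_row in zip(targets, predictions, observed_mask):
        for t, p, m in zip(t_row, p_row, m_row):
            if only_missing and m == 1:
                continue  # skip entries the model was allowed to see
            total += (t - p) ** 2
            count += 1
    if count == 0:
        raise ValueError("no entries selected for evaluation")
    return total / count


# Made-up ground truth, model outputs, and mask for a 2 x 2 dataset.
targets = [[1.0, 2.0], [3.0, 4.0]]
predictions = [[1.0, 0.0], [3.0, 5.0]]
observed_mask = [[1, 0], [1, 1]]

imputation_error = masked_mean_squared_error(targets, predictions, observed_mask)
reconstruction_error = masked_mean_squared_error(
    targets, predictions, observed_mask, only_missing=False
)
print("error on missing entries:", imputation_error)
print("error on all entries:", reconstruction_error)
```

Restricting the metric to missing entries measures imputation quality, while including every entry measures reconstruction, which is why keeping both behaviours behind a switch can be useful.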
