Semi-Supervised Learning with Ladder Networks in Keras. Get 98% test accuracy on MNIST with just 100 labeled examples !

Divam Gupta

Last update: Sep 7, 2022

Related tags

Deep Learning ladder_network_keras

Overview

Semi-Supervised Learning with Ladder Networks in Keras

This is an implementation of Ladder Network in Keras. Ladder network is a model for semi-supervised learning. Refer to the paper titled Semi-Supervised Learning with Ladder Networks by A Rasmus, H Valpola, M Honkala,M Berglund, and T Raiko

This implementation was used in the official code of our paper Unsupervised Clustering using Pseudo-semi-supervised Learning . The code can be found here and the blog post can be found here

The model achives 98% test accuracy on MNIST with just 100 labeled examples.

The code only works with Tensorflow backend.

Requirements

Python 2.7+/3.6+
Tensorflow (1.4.0)
numpy
keras (2.1.4)

Note that other versions of tensorflow/keras should also work.

How to use

Load the dataset

from keras.datasets import mnist
import keras
import random

# get the dataset
(x_train, y_train), (x_test, y_test) = mnist.load_data()

x_train = x_train.reshape(60000, 28*28).astype('float32')/255.0
x_test = x_test.reshape(10000, 28*28).astype('float32')/255.0

y_train = keras.utils.to_categorical( y_train )
y_test = keras.utils.to_categorical( y_test )

# only select 100 training samples 
idxs_annot = range( x_train.shape[0])
random.seed(0)
random.shuffle( idxs_annot )
idxs_annot = idxs_annot[ :100 ]

x_train_unlabeled = x_train
x_train_labeled = x_train[ idxs_annot ]
y_train_labeled = y_train[ idxs_annot  ]

Repeat the labeled dataset to match the shapes

n_rep = x_train_unlabeled.shape[0] / x_train_labeled.shape[0]
x_train_labeled_rep = np.concatenate([x_train_labeled]*n_rep)
y_train_labeled_rep = np.concatenate([y_train_labeled]*n_rep)

Initialize the model

from ladder_net import get_ladder_network_fc
inp_size = 28*28 # size of mnist dataset 
n_classes = 10
model = get_ladder_network_fc( layer_sizes = [ inp_size , 1000, 500, 250, 250, 250, n_classes ]  )

Train the model

model.fit([ x_train_labeled_rep , x_train_unlabeled   ] , y_train_labeled_rep , epochs=100)

Get the test accuracy

from sklearn.metrics import accuracy_score
y_test_pr = model.test_model.predict(x_test , batch_size=100 )

print "test accuracy" , accuracy_score(y_test.argmax(-1) , y_test_pr.argmax(-1)  )

Comments

ladder net g_guass unclear
Can't find any reference to that formula in call and in g_guass in the paper and online, can you explain the origin of the formula

mu = a1 * tf.sigmoid(a2 * u + a3) + a4 * u + a5 v = a6 * tf.sigmoid(a7 * u + a8) + a9 * u + a10 z_est = (z_c - mu) * v + mu return z_est
opened by thebeancounter 3
Adapt to convolutional networks

Hi

Thank for making your code availiable. I'm trying to adapt the model into a convolutional network but i'm failing due to shape mismatching. Have you tried to do this?

BR.

opened by DBAFC 2
where is the “skip connection” in the codes

The paper Semi-Supervised Learning with Ladder Networks talks about "skip connection". I‘m wondering where shows “skip connection” in this codes？

opened by ShaoPengyang 2
How to modify the code to be compatible with tf.keras
When I replace the keras with tensorflow.keras, the code doesn't work:

File "ladder_net.py", line 140, in get_ladder_network_fc tr_m.metrics_tensors.append(u_cost) AttributeError: 'Model' object has no attribute 'metrics_tensors'

My tensorflow version is 1.14.
opened by wuliytTaotao 1
How to save and load the model

Can someone tell how to save and load the model? I tried to use the keras .save and .load_model functions. It saves the model but when I try to load it it shows a weird error saying

Two checkpoint references resolved to different objects (<tensorflow.python.keras.engine.functional.Functional object at 0x7f9b6c1c9100> and <tensorflow.python.keras.engine.input_layer.InputLayer object at 0x7f9b6e3efd30>). WARNING:tensorflow:Inconsistent references when loading the checkpoint into this object graph. Either the Trackable object references in the Python program have changed in an incompatible way, or the checkpoint was generated in an incompatible program.

Do, we need to define a custom saving and loading function for your ladder_net model?

opened by Abhinav-Singhal-8130 0
error when using validation set in fit function

model.fit([x_train_labeled_rep_LN, final_X_unlabel_train_LN], y_train_labeled_rep_LN,validation_data=(final_X_validation_LN,y_validation_LN) , epochs=20) when using validation data it gave me this error why? and how I can fix it, please

File "C:\Users\DANYA\Anaconda3\envs\test\lib\site-packages\keras\engine\training.py", line 972, in fit batch_size=batch_size) File "C:\Users\DANYA\Anaconda3\envs\test\lib\site-packages\keras\engine\training.py", line 751, in _standardize_user_data exception_prefix='input') File "C:\Users\DANYA\Anaconda3\envs\test\lib\site-packages\keras\engine\training_utils.py", line 102, in standardize_input_data str(len(data)) + ' arrays: ' + str(data)[:200] + '...') ValueError: Error when checking model input: the list of Numpy arrays that you are passing to your model is not the size the model expected. Expected to see 2 array(s), but instead got the following list of 1 arrays: [array([[0., 0., 0., ..., 0., 0., 0.], [0., 0., 0., ..., 0., 0., 0.], [0., 0., 0., ..., 0., 0., 0.], ..., [0., 0., 0., ..., 0., 0., 0.], [0., 0., 0., ..., 0., 0., 0....

opened by daniadag 0
Adapt to CNN2

Whoops, accidentally closed the issue.

Basically, I'm trying to replace the fully connected (Dense) layers in the encoder with Conv2D layers in order to classify (28,28,1) input volumes as opposed to (784,) inputs. Hope it makes sense.

opened by DBAFC 4

Owner

Divam Gupta

Graduate student at Carnegie Mellon University | Former Research Fellow at Microsoft Research

GitHub

Everything you want about DP-Based Federated Learning, including Papers and Code. (Mechanism: Laplace or Gaussian, Dataset: femnist, shakespeare, mnist, cifar-10 and fashion-mnist. )

Differential Privacy (DP) Based Federated Learning (FL) Everything about DP-based FL you need is here. （所有你需要的DP-based FL的信息都在这里） Code Tip: the code o

83 Dec 24, 2022

BESS: Balanced Evolutionary Semi-Stacking for Disease Detection via Partially Labeled Imbalanced Tongue Data

Balanced-Evolutionary-Semi-Stacking Code for the paper ''BESS: Balanced Evolutionary Semi-Stacking for Disease Detection via Partially Labeled Imbalan

0 Jan 16, 2022

LeafSnap replicated using deep neural networks to test accuracy compared to traditional computer vision methods.

Deep-Leafsnap Convolutional Neural Networks have become largely popular in image tasks such as image classification recently largely due to to Krizhev

48 Nov 27, 2022

Patch Rotation: A Self-Supervised Auxiliary Task for Robustness and Accuracy of Supervised Models

Patch-Rotation(PatchRot) Patch Rotation: A Self-Supervised Auxiliary Task for Robustness and Accuracy of Supervised Models Submitted to Neurips2021 To

4 Jul 12, 2021

TeST: Temporal-Stable Thresholding for Semi-supervised Learning

TeST: Temporal-Stable Thresholding for Semi-supervised Learning TeST Illustration Semi-supervised learning (SSL) offers an effective method for large-

1 Jul 14, 2022

Covid-19 Test AI (Deep Learning - NNs) Software. Accuracy is the %96.5, loss is the 0.09 :)

Covid-19 Test AI (Deep Learning - NNs) Software I developed a segmentation algorithm to understand whether Covid-19 Test Photos are positive or negati

28 Dec 4, 2021

Project looking into use of autoencoder for semi-supervised learning and comparing data requirements compared to supervised learning.

2 Dec 17, 2021

The Rich Get Richer: Disparate Impact of Semi-Supervised Learning

The Rich Get Richer: Disparate Impact of Semi-Supervised Learning Preprocess file of the dataset used in implicit sub-populations: (Demographic groups

4 Oct 14, 2022

UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning

UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning This is the official PyTorch implementation for UniMoCo pape

49 Jan 2, 2023

[CVPR2021] DoDNet: Learning to segment multi-organ and tumors from multiple partially labeled datasets

DoDNet This repo holds the pytorch implementation of DoDNet: DoDNet: Learning to segment multi-organ and tumors from multiple partially labeled datase

116 Dec 12, 2022

Semi-supervised Representation Learning for Remote Sensing Image Classification Based on Generative Adversarial Networks

SSRL-for-image-classification Semi-supervised Representation Learning for Remote Sensing Image Classification Based on Generative Adversarial Networks

2 Nov 19, 2021

This is the repository for the AAAI 21 paper [Contrastive and Generative Graph Convolutional Networks for Graph-based Semi-Supervised Learning].

CG3 This is the repository for the AAAI 21 paper [Contrastive and Generative Graph Convolutional Networks for Graph-based Semi-Supervised Learning]. R

12 Oct 28, 2022

This is the repository for the NeurIPS-21 paper [Contrastive Graph Poisson Networks: Semi-Supervised Learning with Extremely Limited Labels].

CGPN This is the repository for the NeurIPS-21 paper [Contrastive Graph Poisson Networks: Semi-Supervised Learning with Extremely Limited Labels]. Req

10 Sep 12, 2022

ColossalAI-Examples - Examples of training models with hybrid parallelism using ColossalAI

ColossalAI-Examples This repository contains examples of training models with Co

185 Jan 9, 2023

Hybrid CenterNet - Hybrid-supervised object detection / Weakly semi-supervised object detection

Hybrid-Supervised Object Detection System Object detection system trained by hybrid-supervision/weakly semi-supervision (HSOD/WSSOD): This project is

5 Dec 10, 2022

We evaluate our method on different datasets (including ShapeNet, CUB-200-2011, and Pascal3D+) and achieve state-of-the-art results, outperforming all the other supervised and unsupervised methods and 3D representations, all in terms of performance, accuracy, and training time.

An Effective Loss Function for Generating 3D Models from Single 2D Image without Rendering Papers with code | Paper Nikola Zubić Pietro Lio University

213 Dec 27, 2022

Semi-Supervised Learning with Ladder Networks in Keras. Get 98% test accuracy on MNIST with just 100 labeled examples !

Related tags

Overview

Semi-Supervised Learning with Ladder Networks in Keras

Requirements

How to use

Comments

ladder net g_guass unclear

Adapt to convolutional networks

where is the “skip connection” in the codes

How to modify the code to be compatible with tf.keras

How to save and load the model

error when using validation set in fit function

Adapt to CNN2

Owner

Divam Gupta

Everything you want about DP-Based Federated Learning, including Papers and Code. (Mechanism: Laplace or Gaussian, Dataset: femnist, shakespeare, mnist, cifar-10 and fashion-mnist. )

BESS: Balanced Evolutionary Semi-Stacking for Disease Detection via Partially Labeled Imbalanced Tongue Data

LeafSnap replicated using deep neural networks to test accuracy compared to traditional computer vision methods.

Patch Rotation: A Self-Supervised Auxiliary Task for Robustness and Accuracy of Supervised Models

TeST: Temporal-Stable Thresholding for Semi-supervised Learning

Covid-19 Test AI (Deep Learning - NNs) Software. Accuracy is the %96.5, loss is the 0.09 :)

Project looking into use of autoencoder for semi-supervised learning and comparing data requirements compared to supervised learning.

The Rich Get Richer: Disparate Impact of Semi-Supervised Learning

UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning

[CVPR2021] DoDNet: Learning to segment multi-organ and tumors from multiple partially labeled datasets

Semi-supervised Representation Learning for Remote Sensing Image Classification Based on Generative Adversarial Networks

This is the repository for the AAAI 21 paper [Contrastive and Generative Graph Convolutional Networks for Graph-based Semi-Supervised Learning].

This is the repository for the NeurIPS-21 paper [Contrastive Graph Poisson Networks: Semi-Supervised Learning with Extremely Limited Labels].

ColossalAI-Examples - Examples of training models with hybrid parallelism using ColossalAI

Hybrid CenterNet - Hybrid-supervised object detection / Weakly semi-supervised object detection

We evaluate our method on different datasets (including ShapeNet, CUB-200-2011, and Pascal3D+) and achieve state-of-the-art results, outperforming all the other supervised and unsupervised methods and 3D representations, all in terms of performance, accuracy, and training time.

Keras udrl - Keras implementation of Upside Down Reinforcement Learning

The toolkit to generate auto labeled datasets

DatasetGAN: Efficient Labeled Data Factory with Minimal Human Effort