Reference implementation for Structured Prediction with Deep Value Networks

Michael Gygli

Last update: Feb 2, 2022

Related tags

Deep Learning dvn

Overview

Deep Value Network (DVN)

This code is a python reference implementation of DVNs introduced in

Deep Value Networks Learn to Evaluate and Iteratively Refine Structured Outputs. Michael Gygli, Mohammad Norouzi, Anelia Angelova. ICML 2017. PDF

Note: This code implements the multi-layer perceptron version used for the multi-label classification experiments only (Section 5.1). The segmentation code was written while inside Google and thus not available.

Requirements

To run this code you need to have tensorflow, numpy, liac-arff, scikit-learn and torchfile installed. Install with

pip install -r requirements.txt

Playing around with a pre-trained Value Net

The pre-trained model for the Bibtex dataset is included in this repository. This allows you do play around with it and it's predictions, using our jupyter notebook.

Replicating the experiments in the paper

Bibtex

To replicate the numbers for bibtex provided in the paper, run:

import reproduce_results
# Reproduce results on the bibtex dataset
reproduce_results.run_bibtex()

By default, the model weights and logs are stored to ./bibtex_dvn. You can monitor the process using tensorboard with

tensorboard --logdir ./bibtex_dvn/

In order to understand the training process two quantities are important:

loss: The loss in estimating the true value of an output hypothesis
gt_f1_scores: The true f1 scores of the generated output hypothesis.

As training progresses, the generated output hypothesis should get better and better. As such, the validation performance reported here closely matches the performance of the test set. The curve should look something like this:

Bookmarks

For Bookmarks the splits are not provided on http://mulan.sourceforge.net/datasets-mlc.html. Thus, we use the splits provided by SPEN. To get the data, run:

cd mlc_datasets
wget http://www.cics.umass.edu/~belanger/icml_mlc_data.tar.gz
tar -xvf icml_mlc_data.tar.gz
cd ..

Then, you can reproduce the results with

import reproduce_results
# Reproduce results on the bookmarks dataset
reproduce_results.run_bookmarks()

The model weights and logs are stored to ./bookmarks_dvn/.

Contributors

Michael Gygli, Mohammad Norouzi, Anelia Angelova

Code by Michael Gygli

You might also like...

PyTorch implementation of ARM-Net: Adaptive Relation Modeling Network for Structured Data.

A ready-to-use framework of latest models for structured (tabular) data learning with PyTorch. Applications include recommendation, CRT prediction, healthcare analytics, and etc.

48 Nov 30, 2022

Unofficial implementation of Perceiver IO: A General Architecture for Structured Inputs & Outputs

Perceiver IO Unofficial implementation of Perceiver IO: A General Architecture for Structured Inputs & Outputs Usage import torch from src.perceiver.

111 Nov 15, 2022

Pytorch implementation of the paper Progressive Growing of Points with Tree-structured Generators (BMVC 2021)

PGpoints Pytorch implementation of the paper Progressive Growing of Points with Tree-structured Generators (BMVC 2021) Hyeontae Son, Young Min Kim Pre

9 Jun 6, 2022

Functional TensorFlow Implementation of Singular Value Decomposition for paper Fast Graph Learning

tf-fsvd TensorFlow Implementation of Functional Singular Value Decomposition for paper Fast Graph Learning with Unique Optimal Solutions Cite If you f

14 Nov 25, 2021

Pytorch Implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.

Value Retrieval with Arbitrary Queries for Form-like Documents Introduction Pytorch Implementation of Value Retrieval with Arbitrary Queries for Form-

13 Sep 15, 2022

Unofficial TensorFlow implementation of Protein Interface Prediction using Graph Convolutional Networks.

[TensorFlow] Protein Interface Prediction using Graph Convolutional Networks Unofficial TensorFlow implementation of Protein Interface Prediction usin

9 Oct 25, 2022

A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer from NNAISENSE.

PGPElib A mini library for Policy Gradients with Parameter-based Exploration [1] and friends. This library serves as a clean re-implementation of the

56 Jan 1, 2023

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

Faster R-CNN and Mask R-CNN in PyTorch 1.0 maskrcnn-benchmark has been deprecated. Please see detectron2, which includes implementations for all model

9k Jan 4, 2023

Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pretrained models.

This repository is a toolkit to do machine learning for programming languages. It implements tokenization, dataset preprocessing, model training and m

408 Jan 1, 2023

Comments

Questions about DVN network structure for Weizmann horses
I am trying to replicate the experiments of DVN for image segmentation on Weizmann horse. I have some confusion about the network structure on Figure 2 in the paper.

How is the last convolution layer and the first fully connected layer connected? The paper said that there are three convolution layers, but on the output of “6x6”, there is still a “5x5 convolution kernel” with stride of 2. Is it a fourth convolution layer? Or a pooling? What exactly the input dimension of the first fully connected layer? Is it 6x6x128? Or 3x3x128?

For the input, in Figure 2, there are 3+k channels. The 3 channels are for the input image, k is for the musk. What does k represents here? Is it related to the number of classes? For example, for Weizmann horse, is k = 1 (0 as background and 1 for horse)?

Thanks!
opened by nkg114mc 0

Reference implementation for Structured Prediction with Deep Value Networks

Related tags

Overview

Deep Value Network (DVN)

Requirements

Playing around with a pre-trained Value Net

Replicating the experiments in the paper

Contributors

You might also like...

PyTorch implementation of ARM-Net: Adaptive Relation Modeling Network for Structured Data.

Unofficial implementation of Perceiver IO: A General Architecture for Structured Inputs & Outputs

Pytorch implementation of the paper Progressive Growing of Points with Tree-structured Generators (BMVC 2021)

Functional TensorFlow Implementation of Singular Value Decomposition for paper Fast Graph Learning

Pytorch Implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.

Unofficial TensorFlow implementation of Protein Interface Prediction using Graph Convolutional Networks.

A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer from NNAISENSE.

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pretrained models.

Comments

Questions about DVN network structure for Weizmann horses

Owner

Michael Gygli

Viperdb - A tiny log-structured key-value database written in pure Python

TANL: Structured Prediction as Translation between Augmented Natural Languages

PyTorch implementation of the Value Iteration Networks (VIN) (NIPS '16 best paper)

PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.

Pytorch implementation of Value Iteration Networks (NIPS 2016 best paper)

Deep Structured Instance Graph for Distilling Object Detectors (ICCV 2021)

Intel® Nervana™ reference deep learning framework committed to best performance on all hardware

Intel® Nervana™ reference deep learning framework committed to best performance on all hardware

Price-Prediction-For-a-Dream-Home - A machine learning based linear regression trained model for house price prediction.

Doge-Prediction - Coding Club prediction ig