Deeplab-resnet-101 in Pytorch with Jaccard loss

Maxim Berman

Last update: Apr 15, 2022

Related tags

Deep Learning jaccardSegment

Overview

Deeplab-resnet-101 Pytorch with Lovász hinge loss

Train deeplab-resnet-101 with binary Jaccard loss surrogate, the Lovász hinge, as described in http://arxiv.org/abs/1705.08790.

Parts of the code is adapted from tensorflow-deeplab-resnet (in particular the conversion from caffe to tensorflow with kaffe).

The code has not been tested for full training of Deeplab-Resnet yet. Refer to tensorflow-deeplab-resnet and possibly extract the weights after training with that framework.

Code status

The code is in early stage. Pull requests welcome.

Citation

Please cite

@ARTICLE{2017arXiv170508790B,
   author = {{Berman}, M. and {Blaschko}, M.~B.},
    title = "{Optimization of the Jaccard index for image segmentation with the Lov\'asz hinge}",
  journal = {ArXiv e-prints},
archivePrefix = "arXiv",
   eprint = {1705.08790},
 primaryClass = "cs.CV",
 keywords = {Computer Science - Computer Vision and Pattern Recognition},
     year = 2017,
    month = may,
   adsurl = {http://adsabs.harvard.edu/abs/2017arXiv170508790B},
}

if you use the code.

Dependencies and weights

Relies notably on Pytorch and the standalone tensorboard package

Using anaconda, install the full requirements using the provided conda environment file:

conda env create --f environemnt.yml
source activate jaccard-segment

Convert the Deeplab Caffe weights to tensorflow ckpt using caffe-tensorflow, then convert them to hdf5 using ckpt_to_dd.py and use our wrapper to load in Pytorch.

Important switches in the settings

By default, finetunes with cross-entropy loss. Use --binary class switch for selecting a particular class in the binary case, --jaccard for training with the Jaccard hinge loss described in the arxiv paper, --hinge to use the Hinge loss, and --proximal to use the prox. operator optimization variant for the Jaccard loss as described in the arxiv paper.

For the prox. operator, use a learning rate of 1. and set an equivalent regularization of 1/lr instead.

You might also like...

Pretrained models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet.

169 Dec 26, 2022

improvement of CLIP features over the traditional resnet features on the visual question answering, image captioning, navigation and visual entailment tasks.

CLIP-ViL In our paper "How Much Can CLIP Benefit Vision-and-Language Tasks?", we show the improvement of CLIP features over the traditional resnet fea

310 Dec 28, 2022

Reproduce ResNet-v2(Identity Mappings in Deep Residual Networks) with MXNet

Reproduce ResNet-v2 using MXNet Requirements Install MXNet on a machine with CUDA GPU, and it's better also installed with cuDNN v5 Please fix the ran

531 Dec 4, 2022

NFT-Price-Prediction-CNN - Using visual feature extraction, prices of NFTs are predicted via CNN (Alexnet and Resnet) architectures.

5 Nov 3, 2022

In this project we use both Resnet and Self-attention layer for cat, dog and flower classification.

Comments

Clarification for the greedy algorithm validity

Hello! Thank you for these fascinating research results. I guess this is a silly question, but still: Could you please provide an argument for the greediness in the algorithm used in the proximal operator computation? Precisely, what I am interested in is why, once reaching an edge of the polyhedron, we should move along that edge (not considering the other face) and why the edge direction is given by averaging the "clashed" components of the previous direction? The rest of the algorithm is well-grounded and clear to me. Thanks in advance!

opened by kartynnik 0

Deeplab-resnet-101 in Pytorch with Jaccard loss

Related tags

Overview

Deeplab-resnet-101 Pytorch with Lovász hinge loss

Code status

Citation

Dependencies and weights

Important switches in the settings

You might also like...

Pretrained models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet.

improvement of CLIP features over the traditional resnet features on the visual question answering, image captioning, navigation and visual entailment tasks.

Reproduce ResNet-v2(Identity Mappings in Deep Residual Networks) with MXNet

NFT-Price-Prediction-CNN - Using visual feature extraction, prices of NFTs are predicted via CNN (Alexnet and Resnet) architectures.

In this project we use both Resnet and Self-attention layer for cat, dog and flower classification.

Pretrained models for Jax/Haiku; MobileNet, ResNet, VGG, Xception.

Official implementation of our paper "LLA: Loss-aware Label Assignment for Dense Pedestrian Detection" in Pytorch.

PyTorch implementation of Soft-DTW: a Differentiable Loss Function for Time-Series in CUDA

PyTorch implementation for Partially View-aligned Representation Learning with Noise-robust Contrastive Loss (CVPR 2021)

Comments

Clarification for the greedy algorithm validity

Owner

Maxim Berman

Models Supported: AlbUNet [18, 34, 50, 101, 152] (1D and 2D versions for Single and Multiclass Segmentation, Feature Extraction with supports for Deep Supervision and Guided Attention)

Implement of "Training deep neural networks via direct loss minimization" in PyTorch for 0-1 loss

Recall Loss for Semantic Segmentation (This repo implements the paper: Recall Loss for Semantic Segmentation)

An implementation for the loss function proposed in Decoupled Contrastive Loss paper.

PyTorch implementation of the R2Plus1D convolution based ResNet architecture described in the paper "A Closer Look at Spatiotemporal Convolutions for Action Recognition"

PyTorch implementation of MoCo v3 for self-supervised ResNet and ViT.

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

Reproduces ResNet-V3 with pytorch

3D ResNet Video Classification accelerated by TensorRT

Quickly comparing your image classification models with the state-of-the-art models (such as DenseNet, ResNet, ...)