Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

Abhay Gupta

Last update: Dec 8, 2022

Related tags

Overview

Segmentation Transformer

PyTorch implementation of Segmentation Transformer (SETR), a model that achieves state-of-the-art (SOTA) semantic segmentation results with transformer-style encoders.

Features

To Do:

Training Scripts

Installation

Create the environment:

conda env create -f environment.yml

UnpNet - Rethinking 3-D LiDAR Point Cloud Segmentation(IEEE TNNLS)

UnpNet Citation Please cite the following paper if you use this repository in your research. @article {PMID:34914599, Title = {Rethinking 3-D LiDAR Po

4 Jul 15, 2022

Official implementation of "SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers"

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers Figure 1: Performance of SegFormer-B0 to SegFormer-B5. Project page

1.4k Dec 31, 2022

SeMask: Semantically Masked Transformers for Semantic Segmentation.

SeMask: Semantically Masked Transformers Jitesh Jain, Anukriti Singh, Nikita Orlov, Zilong Huang, Jiachen Li, Steven Walton, Humphrey Shi This repo co

186 Dec 30, 2022

Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation, CVPR 2018

Learning Pixel-level Semantic Affinity with Image-level Supervision This code is deprecated. Please see https://github.com/jiwoon-ahn/irn instead. Int

337 Dec 15, 2022

Segmentation in Style: Unsupervised Semantic Image Segmentation with Stylegan and CLIP

Segmentation in Style: Unsupervised Semantic Image Segmentation with Stylegan and CLIP Abstract: We introduce a method that allows to automatically se

134 Dec 19, 2022

TorchDistiller - a collection of the open source pytorch code for knowledge distillation, especially for the perception tasks, including semantic segmentation, depth estimation, object detection and instance segmentation.

This project is a collection of the open source pytorch code for knowledge distillation, especially for the perception tasks, including semantic segmentation, depth estimation, object detection and instance segmentation.

147 Dec 3, 2022

Mae segmentation - Reproduction of semantic segmentation using masked autoencoder (mae)

ADE20k Semantic segmentation with MAE Getting started Install the mmsegmentation

97 Dec 17, 2022

Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021)

Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning (ICLR 2021) Citation Please cite as: @inproceedings{liu2020understan

22 Nov 25, 2022

Sequence to Sequence Models with PyTorch

Sequence to Sequence models with PyTorch This repository contains implementations of Sequence to Sequence (Seq2Seq) models in PyTorch At present it ha

708 Dec 19, 2022

Comments

redundant dropout at the same place

Line 103 in Transformer.py:

PreNormDrop(dim, dropout_rate, SelfAttention(dim, heads=heads, dropout_rate=attn_dropout_rate))

The PreNormDrop module applies dropout after SelfAttention. However, the SelfAttention module already ends with a dropout layer, so dropout is applied twice in a row at the same place. I believe PreNorm should be used instead of PreNormDrop.
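The point raised above can be checked with a minimal sketch. The class bodies below are assumptions written for illustration (the repository's own PreNorm, PreNormDrop, and SelfAttention are not shown on this page), and SelfAttentionStub is a hypothetical stand-in that only mimics "a projection followed by its own dropout".

```python
import torch
import torch.nn as nn


class SelfAttentionStub(nn.Module):
    """Hypothetical stand-in for SelfAttention: ends with its own dropout."""

    def __init__(self, dim, dropout_rate=0.1):
        super().__init__()
        self.proj = nn.Linear(dim, dim)
        self.proj_drop = nn.Dropout(dropout_rate)

    def forward(self, x):
        return self.proj_drop(self.proj(x))


class PreNorm(nn.Module):
    """Assumed behaviour: LayerNorm, then the wrapped function."""

    def __init__(self, dim, fn):
        super().__init__()
        self.norm = nn.LayerNorm(dim)
        self.fn = fn

    def forward(self, x):
        return self.fn(self.norm(x))


class PreNormDrop(nn.Module):
    """Assumed behaviour: LayerNorm, the wrapped function, then dropout."""

    def __init__(self, dim, dropout_rate, fn):
        super().__init__()
        self.norm = nn.LayerNorm(dim)
        self.dropout = nn.Dropout(dropout_rate)
        self.fn = fn

    def forward(self, x):
        return self.dropout(self.fn(self.norm(x)))


def count_dropouts(module):
    # Count every nn.Dropout reachable from this module, including nested ones.
    return sum(isinstance(m, nn.Dropout) for m in module.modules())


dim = 16
with_drop = PreNormDrop(dim, 0.1, SelfAttentionStub(dim, 0.1))
without_drop = PreNorm(dim, SelfAttentionStub(dim, 0.1))
print(count_dropouts(with_drop), count_dropouts(without_drop))  # prints: 2 1
```

Under these assumptions, the PreNormDrop wrapper stacks two dropout layers back to back, so with independent masks at rate p each activation survives with probability (1 - p)^2 rather than 1 - p.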

opened by zaocan666 3

model building problem

Hi~ I think there is a problem in how the model is built. In the decode function of SETR_Naive/SETR_PUP/SETR_MLA, some layers, such as nn.Conv2d, are constructed. However, decode is called from the model's forward function, so these layers are re-initialized every time the model is fed data. As a result, their weights are never trained. These layers should be constructed in the model's __init__ function instead.
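A small sketch of the difference, using two hypothetical decoder classes (DecoderInForward and DecoderInInit are illustrative names, not classes from the repository): a layer created inside forward is never registered as a parameter, so an optimizer built from model.parameters() cannot update it.

```python
import torch
import torch.nn as nn


class DecoderInForward(nn.Module):
    """Illustrative anti-pattern: the conv layer is rebuilt on every call."""

    def forward(self, x):
        # New, randomly initialized weights each time forward runs.
        conv = nn.Conv2d(x.shape[1], 2, kernel_size=1)
        return conv(x)


class DecoderInInit(nn.Module):
    """Illustrative fix: the conv layer is created once and registered."""

    def __init__(self, in_channels, num_classes):
        super().__init__()
        self.conv = nn.Conv2d(in_channels, num_classes, kernel_size=1)

    def forward(self, x):
        return self.conv(x)


x = torch.randn(1, 8, 4, 4)
bad, good = DecoderInForward(), DecoderInInit(8, 2)

# Parameters visible to an optimizer built from .parameters().
n_bad = len(list(bad.parameters()))
n_good = len(list(good.parameters()))  # weight and bias

# Repeated calls on the same input: only the registered layer is stable.
same_bad = torch.equal(bad(x), bad(x))
same_good = torch.equal(good(x), good(x))
print(n_bad, n_good, same_bad, same_good)
```

Here n_bad is 0 because nothing was assigned to the module in __init__, which is the situation the comment describes for the decode functions.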

opened by zaocan666 2

"pass the intermediate layers for MLA"

Thanks for the implementation. Can you please give an example of how to specify the intermediate layers for MLA?

assert intmd_layers is not None, "pass the intermediate layers for MLA"
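As a rough illustration of what such an argument might look like, the sketch below uses a hypothetical TinyEncoder that collects the outputs of selected layers. Only the intmd_layers name and the assertion message come from the line above; the class, its 1-based layer indexing, and the choice of layers [2, 4, 6, 8] are assumptions, not the repository's API.

```python
import torch
import torch.nn as nn


class TinyEncoder(nn.Module):
    """Hypothetical stand-in encoder, not the repository's model."""

    def __init__(self, dim=32, depth=8):
        super().__init__()
        self.layers = nn.ModuleList(
            [
                nn.Sequential(nn.LayerNorm(dim), nn.Linear(dim, dim), nn.GELU())
                for _ in range(depth)
            ]
        )

    def forward(self, x, intmd_layers=None):
        assert intmd_layers is not None, "pass the intermediate layers for MLA"
        wanted = set(intmd_layers)
        intmd = {}
        # Assumption: layers are numbered from 1, so depth is the last layer.
        for idx, layer in enumerate(self.layers, start=1):
            x = x + layer(x)  # residual block
            if idx in wanted:
                intmd[idx] = x
        return x, intmd


torch.manual_seed(0)
tokens = torch.randn(2, 16, 32)  # (batch, sequence length, embedding dim)
encoder = TinyEncoder()
final, intmd = encoder(tokens, intmd_layers=[2, 4, 6, 8])
print(sorted(intmd))  # prints: [2, 4, 6, 8]
```

A multi-level decoder could then consume the collected feature maps; how the repository expects the indices to be passed should be confirmed against its own code.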

opened by rawmean 4

Owner

Abhay Gupta

Engineer with an AI background.

GitHub https://arxiv.org/abs/2012.15840

[CVPR 2021] Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

897 Jan 5, 2023

GB-CosFace: Rethinking Softmax-based Face Recognition from the Perspective of Open Set Classification

GB-CosFace: Rethinking Softmax-based Face Recognition from the Perspective of Open Set Classification This is the official pytorch implementation of t

5 Nov 14, 2022

Pytorch Implementation for NeurIPS (oral) paper: Pixel Level Cycle Association: A New Perspective for Domain Adaptive Semantic Segmentation

Pixel-Level Cycle Association This is the Pytorch implementation of our NeurIPS 2020 Oral paper Pixel-Level Cycle Association: A New Perspective for D

87 Oct 19, 2022

CVPR2022 (Oral) - Rethinking Semantic Segmentation: A Prototype View

Recall Loss for Semantic Segmentation (This repo implements the paper: Recall Loss for Semantic Segmentation)

Spectralformer: Rethinking hyperspectral image classification with transformers

Paddle pit - Rethinking Spatial Dimensions of Vision Transformers

Rethinking the U-Net architecture for multimodal biomedical image segmentation

Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation

[CVPR 2021] Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach