[ICCV2021] Official Pytorch implementation for SDGZSL (Semantics Disentangling for Generalized Zero-Shot Learning)

Last update: Dec 6, 2022

Related tags

Deep Learning SDGZSL

Overview

Semantics Disentangling for Generalized Zero-shot Learning

This is the official implementation for paper

Zhi Chen, Yadan Luo, Ruihong Qiu, Zi Huang, Jingjing Li, Zheng Zhang.
Semantics Disentangling for Generalized Zero-shot Learning
International Conference on Computer Vision (ICCV) 2021.

Abstract: Generalized zero-shot learning (GZSL) aims to classify samples under the assumption that some classes are not observable during training. To bridge the gap between the seen and unseen classes, most GZSL methods attempt to associate the visual features of seen classes with attributes or to generate unseen samples directly. Nevertheless, the visual features used in the prior approaches do not necessarily encode semantically related information that the shared attributes refer to, which degrades the model generalization to unseen classes. To address this issue, in this paper, we propose a novel semantics disentangling framework for the generalized zero-shot learning task (SDGZSL), where the visual features of unseen classes are firstly estimated by a conditional VAE and then factorized into semantic-consistent and semantic-unrelated latent vectors. In particular, a total correlation penalty is applied to guarantee the independence between the two factorized representations, and the semantic consistency of which is measured by the derived relation network. Extensive experiments conducted on four GZSL benchmark datasets have evidenced that the semantic-consistent features disentangled by the proposed SDGZSL are more generalizable in tasks of canonical and generalized zero-shot learning.

Requirements

The implementation runs on

Python 3.6
torch 1.3.1
Numpy
Sklearn
Scipy

Usage

Put your datasets in SDGZSL_data folder and run the scripts:

The extracted features for APY and AWA datasets are from [1], FLO and CUB datasets are from [2]. For the fine-tuned features, AWA,FLO and CUB are from [3]. The APY fine-tuned features are extracted from us.

[1] Xian, Yongqin, et al. "Feature generating networks for zero-shot learning." Proceedings of the IEEE conference on computer vision and pattern recognition. 2018.

[2] Yu, Yunlong, et al. "Episode-based prototype generating network for zero-shot learning." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020.

[3] Narayan, Sanath, et al. "Latent embedding feedback and discriminative features for zero-shot classification." ECCV 2020.

Citation:

If you find this useful, please cite our work as follows:

@inproceedings{chen2021semantics,
	title={Semantics Disentangling for Generalized Zero-shot Learning},
	author={Chen, Zhi and Luo, Yadan and Qiu, Ruihong and Huang, Zi and Li, Jingjing and Zhang, Zheng},
	booktitle={ICCV},
	year={2021}
}

Comments

Could you provide the results of CUB using attributes

I directly used your code to train the model using CUB attributes, but the results were pretty poor (H is less than 0.50). I guess it's a matter of parameter settings. So could you provide the results of CUB using attributes, or the corresponding parameter settings? It would be better if you could also provide the results or the parameters of the SUN dataset. Thank you!

opened by NanAlbert 14
Question about optimizing the discriminator

I wanted to pair the TC and Ldis described in your paper with the tc_loss in your code. I felt confused about the discriminator in your code. Why does the output of the discriminator have two dim? Furthermore, there's no log operation in your code for tc_loss.

opened by tbw19970424 3
Dataset

"missing the CUB dataset." How can I get the visual features of CUB dataset(not the visual features extracted from the pre-trained deep models)

opened by KernLC 3
FLO-GBU and FLO-EPGN

Dear Author， May I ask what is the difference between the visual features given by FLO-GBU and FLO-EPGN? The paper you mentioned (Zero-Shot Learning The Good, the Bad and the Ugly) does not have FLO settings, so what is FLO-GBU ?

opened by yangjingqi99 2
About sent_splits.mat

Hello. I would like to know how to generate the CNN-RNN sentence embeddings (1024D), or where to get the sentence embeddings of other datasets, e.g., AwA2, SUN, APY. Thank you so much!

opened by NanAlbert 2

[ICCV2021] Official Pytorch implementation for SDGZSL (Semantics Disentangling for Generalized Zero-Shot Learning)

Related tags

Overview

Semantics Disentangling for Generalized Zero-shot Learning

Requirements

Usage

Citation:

Comments

Could you provide the results of CUB using attributes

Question about optimizing the discriminator

Dataset

FLO-GBU and FLO-EPGN

About sent_splits.mat

Owner

Official Pytorch Implementation of: "Semantic Diversity Learning for Zero-Shot Multi-label Classification"(2021) paper

[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

[ICCV 2021] Official Pytorch implementation for Discriminative Region-based Multi-Label Zero-Shot Learning SOTA results on NUS-WIDE and OpenImages

Multiview Neural Surface Reconstruction by Disentangling Geometry and Appearance

Code to reproduce the results for Compositional Attention: Disentangling Search and Retrieval.

Official Implementation of LARGE: Latent-Based Regression through GAN Semantics

PyTorch implementation of 1712.06087 "Zero-Shot" Super-Resolution using Deep Internal Learning

Audio-Visual Generalized Few-Shot Learning with Prototype-Based Co-Adaptation

Official code for "Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer. ICCV2021".

GPU-accelerated PyTorch implementation of Zero-shot User Intent Detection via Capsule Neural Networks

Pytorch Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic

Implementation for "Seamless Manga Inpainting with Semantics Awareness" (SIGGRAPH 2021 issue)

Official PyTorch Implementation of Rank & Sort Loss [ICCV2021]

Zero-shot Synthesis with Group-Supervised Learning (ICLR 2021 paper)

Shared Attention for Multi-label Zero-shot Learning

EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections

ZSL-KG is a general-purpose zero-shot learning framework with a novel transformer graph convolutional network (TrGCN) to learn class representation from common sense knowledge graphs.

IntraQ: Learning Synthetic Images with Intra-Class Heterogeneity for Zero-Shot Network Quantization

Public repository of the 3DV 2021 paper "Generative Zero-Shot Learning for Semantic Segmentation of 3D Point Clouds"