I ran into this error when running ./scripts/zest_grouped_bart_large_from_trained.sh:
Epoch 0: 0% 0/538 [00:00<?, ?it/s]
Traceback (most recent call last):
File "cli_grouped.py", line 142, in
main()
File "cli_grouped.py", line 139, in main
run(args, logger)
File "/content/drive/MyDrive/hypter/run_grouped.py", line 87, in run
train(args, logger, model, train_data, dev_data, optimizer, scheduler)
File "/content/drive/MyDrive/hypter/run_grouped.py", line 154, in train
is_training=True)
File "/content/drive/MyDrive/hypter/growing_bart.py", line 119, in forward
is_training=is_training
File "/usr/local/lib/python3.7/dist-packages/torch/nn/modules/module.py", line 1110, in _call_impl
return forward_call(*input, **kwargs)
File "/content/drive/MyDrive/hypter/bart_with_adapter.py", line 298, in forward
use_cache=use_cache,
File "/usr/local/lib/python3.7/dist-packages/torch/nn/modules/module.py", line 1110, in _call_impl
return forward_call(*input, **kwargs)
File "/usr/local/lib/python3.7/dist-packages/transformers/modeling_bart.py", line 835, in forward
encoder_outputs = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
File "/usr/local/lib/python3.7/dist-packages/torch/nn/modules/module.py", line 1110, in _call_impl
return forward_call(*input, **kwargs)
File "/usr/local/lib/python3.7/dist-packages/transformers/modeling_bart.py", line 309, in forward
x, attn = encoder_layer(x, attention_mask)
File "/usr/local/lib/python3.7/dist-packages/torch/nn/modules/module.py", line 1110, in _call_impl
return forward_call(*input, **kwargs)
File "/content/drive/MyDrive/hypter/bart_with_adapter.py", line 138, in forward
query=x, key=x, key_padding_mask=encoder_padding_mask, need_weights=self.output_attentions
File "/usr/local/lib/python3.7/dist-packages/torch/nn/modules/module.py", line 1110, in _call_impl
return forward_call(*input, **kwargs)
File "/usr/local/lib/python3.7/dist-packages/transformers/modeling_bart.py", line 646, in forward
attn_weights = attn_weights.masked_fill(reshaped, float("-inf"))
RuntimeError: CUDA out of memory. Tried to allocate 256.00 MiB (GPU 0; 14.76 GiB total capacity; 13.24 GiB already allocated; 81.75 MiB free; 13.29 GiB reserved in total by PyTorch)
Is the model simply too large for a forward pass on the GPU available in Colab?