code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification

Zonghan Yang

Last update: Nov 30, 2022

Related tags

Deep Learning Robust-Prefix-Tuning

Overview

On Robust Prefix-Tuning for Text Classification

Prefix-tuning has drawed much attention as it is a parameter-efficient and modular alternative to adapting pretrained language models to downstream tasks. However, we find that prefix-tuning suffers from adversarial attacks. While, unfortunately, current robust NLP methods are unsuitable for prefix-tuning as they will inevitably hamper the modularity of prefix-tuning. In our ICLR'22 paper, we propose robust prefix-tuning for text classification. Our method leverages the idea of test-time tuning, which preserves the strengths of prefix-tuning and improves its robustness at the same time. This repository contains the code for the proposed robust prefix-tuning method.

Prerequisite

PyTorch>=1.2.0, pytorch-transformers==1.2.0, OpenAttack==2.0.1, and GPUtil==1.4.0.

Train the original prefix P_θ

For the training phase of standard prefix-tuning, the command is:

  source train.sh --preseqlen [A] --learning_rate [B] --tasks [C] --n_train_epochs [D] --device [E]

where

[A]: The length of the prefix P_θ.
[B]: The (initial) learning rate.
[C]: The benchmark. Default: sst.
[D]: The total epochs during training.
[E]: The id of the GPU to be used.

We can also use adversarial training to improve the robustness of the prefix. For the training phase of adversarial prefix-tuning, the command is:

  source train_adv.sh --preseqlen [A] --learning_rate [B] --tasks [C] --n_train_epochs [D] --device [E] --pgd_ball [F]

where

[A]~[E] have the same meanings with above.
[F]: where norm ball is word-wise or sentence-wise.

Note that the DATA_DIR and MODEL_DIR in train_adv.sh are different from those in train.sh. When experimenting with the adversarially trained prefix P_θ's in the following steps, remember to switch the DATA_DIR and MODEL_DIR in the corresponding scripts as well.

Generate Adversarial Examples

We use the OpenAttack package to generate in-sentence adversaries. The command is:

  source generate_adv_insent.sh --preseqlen [A] --learning_rate [B] --tasks [C] --device [E] --test_ep [G] --attack [H]

where

[A],[B],[C],[E] have the same meanings with above.
[G]: Load the prefix P_θ parameters trained for [G] epochs for testing. We set G=D.
[H]: Generate adversarial examples based on clean test set with the in-sentence attack [H].

We also implement the Universal Adversarial Trigger attack. The command is:

  source generate_adv_uat.sh --preseqlen [A] --learning_rate [B] --tasks [C] --device [E] --test_ep [G] --attack clean-[H2] --uat_len [I] --uat_epoch [J]

where

[A],[B],[C],[E],[G] have the same meanings with above.
[H2]: We should search for UATs for each class in the benchmark, and H2 indicates the class id. H2=0/1 for SST, 0/1/2/3 for AG News, and 0/1/2 for SNLI.
[I]: The length of the UAT.
[J]: The epochs for exploiting UAT.

Test the performance of P_θ

The command for performance testing of P_θ under clean data and in-sentence attacks is:

  source test_prefix_theta_insent.sh --preseqlen [A] --learning_rate [B] --tasks [C] --device [E] --test_ep [G] --attack [H] --test_batch_size [K]

Under UAT attack, the test command is:

  source test_prefix_theta_uat.sh --preseqlen [A] --learning_rate [B] --tasks [C] --device [E] --test_ep [G] --attack clean --uat_len [I] --test_batch_size [K]

where

[A]~[I] have the same meanings with above.
[K]: The test batch size. when K=0, the batch size is adaptive (determined by GPU memory); when K>0, the batch size is fixed.

Robust Prefix P'_ψ: Constructing the canonical manifolds

By constructing the canonical manifolds with PCA, we get the projection matrices. The command is:

  source get_proj.sh --preseqlen [A] --learning_rate [B] --tasks [C] --device [E] --test_ep [G]

where [A]~[G] have the same meanings with above.

Robust Prefix P'_ψ: Test its performance

Under clean data and in-sentence attacks, the command is:

  source test_robust_prefix_psi_insent.sh --preseqlen [A] --learning_rate [B] --tasks [C] --device [E] --test_ep [G] --attack [H] --test_batch_size [K] --PMP_lr [L] --PMP_iter [M]

Under UAT attack, the test command is:

  source test_robust_prefix_psi_uat.sh --preseqlen [A] --learning_rate [B] --tasks [C] --device [E] --test_ep [G] --attack clean --uat_len [I] --test_batch_size [K] --PMP_lr [L] --PMP_iter [M]

where

[A]~[K] have the same meanings with above.
[L]: The learning rate for test-time P'_ψ tuning.
[M]: The iterations for test-time P'_ψ tuning.

Running Example

# Train the original prefix P_θ
source train.sh --tasks sst --n_train_epochs 100 --device 0
source train_adv.sh --tasks sst --n_train_epochs 100 --device 1 --pgd_ball word

# Generate Adversarial Examples
source generate_adv_insent.sh --tasks sst --device 0 --test_ep 100 --attack bug
source generate_adv_uat.sh --tasks sst --device 0 --test_ep 100 --attack clean-0 --uat_len 3 --uat_epoch 10
source generate_adv_uat.sh --tasks sst --device 0 --test_ep 100 --attack clean-1 --uat_len 3 --uat_epoch 10

# Test the performance of P_θ
source test_prefix_theta_insent.sh --tasks sst --device 0 --test_ep 100 --attack bug --test_batch_size 0
source test_prefix_theta_uat.sh --tasks sst --device 0 --test_ep 100 --attack clean --uat_len 3 --test_batch_size 0

# Robust Prefix P'_ψ: Constructing the canonical manifolds
source get_proj.sh --tasks sst --device 0 --test_ep 100

# Robust Prefix P'_ψ: Test its performance
source test_robust_prefix_psi_insent.sh --tasks sst --device 0 --test_ep 100 --attack bug --test_batch_size 0 --PMP_lr 0.15 --PMP_iter 10
source test_robust_prefix_psi_uat.sh --tasks sst --device 0 --test_ep 100 --attack clean --uat_len 3 --test_batch_size 0 --PMP_lr 0.05 --PMP_iter 10

Released Data & Models

The training the original prefix P_θ and the process of generating adversarial examples can be time-consuming. As shown in our paper, the adversarial prefix-tuning is particularly slow. Efforts need to be paid on generating adversaries as well, since different attacks are to be performed on the test set based on each trained prefix. We also found that OpenAttack is now upgraded to v2.1.1, which causes compatibility issues in our codes (test_prefix_theta_insent.py).

In order to facilitate research on the robustness of prefix-tuning, we release the prefix checkpoints P_θ (with both std. and adv. training), the processed test sets that are perturbed by in-sentence attacks (including PWWS and TextBugger), as well as the generated projection matrices of the canonical manifolds in our runs for reproducibility and further enhancement. We have also hard-coded the exploited UAT tokens in test_prefix_theta_uat.py and test_robust_prefix_psi_uat.py. All the materials can be found here.

Acknowledgements:

The implementation of robust prefix tuning is based on the LAMOL repo, which is the code of LAMOL: LAnguage MOdeling for Lifelong Language Learning that studies NLP lifelong learning with GPT-style pretrained language models.

Bibtex

If you find this repository useful for your research, please consider citing our work:

@inproceedings{
  yang2022on,
  title={On Robust Prefix-Tuning for Text Classification},
  author={Zonghan Yang and Yang Liu},
  booktitle={International Conference on Learning Representations},
  year={2022},
  url={https://openreview.net/forum?id=eBCmOocUejf}
}

Comments

Collect activations

Excuse me but where did you implement the step of "collect activations by correctly classified data" ? Accordingly, where is the distance (step 3 in Figure 1) calculated? Thanks!

opened by QLiu-NLP 2
Question about generating adversarial examples

Hi,

In your experimental setting, are the adversarial examples generated in advance (statically) and then optimize robust prefix embeddings accordingly to defend against them?

Looking forward to your reply, Thanks.

opened by ruizheng20 1
Question about prefix embedding.
Hi, thanks for your novel work and release the code! Here I have a few question about prefix embedding. I'm forward your anwser! the batch of text classification is orgnaized as :

[[text1], [text2], [text3], ... [text_n]]

where n is batch_size when we conduct prefix-tuning, we get prefix weight, they concat with text embedding

[ [prefix1] | [text1], [prefix2] | [text2], [prefix3] | [text3], ... [prefix_n] | [text_n]]

Is this cancat way right or same as the paper? I want to know prefix1 to prefix_n is same or different？ If they are same, in the backward propagarion, all prefix_n will be gradient descent in same value? Thanks for your anwser!
opened by jiaruHithub 1
error when run train.py

when run train.py , I got error message

['NVIDIA A40'] 2022-04-25 07:39:28,908 - 0:01:22 - 0.0s - INFO - main - args = Namespace(PMP_iter=5, PMP_lr=5e-05, REG_TYPE_KEYS=['mas', 'ewc'], adam_epsilon=0.0001, add_task_tokens=False, attack='clean', control_len=1, data_dir='data', debug=False, decay_style='linear', device=2, device_ids=[2], dynamic_epochs=False, fp32=True, gen_lm_sample_percentage=0.05, learning_rate=5e-05, lm_lambda=0.25, logging_steps=1000, lr_schedule='warmup_linear', max_grad_norm=1, max_len=1024, max_n_epochs=9, memory_sizes=[45634.0], mid_dim=512, min_batch_size=4, min_n_steps=1500, model_dir_root='saved_models/gpt2-medium/finetune/sst', model_name='gpt2-medium', n_gpus=1, n_train_epochs={'sst': 100}, n_warmup_ratio=0.005, n_workers=4, olayer=24, pgd_ball='word', preseqlen=10, qp_margin=0.5, real_sample=False, reg_lambda=1.0, seed=101, seq_train_type='finetune', skip_tasks=None, tasks=['sst'], temperature_lm=1.0, temperature_qa=1.0, test_batch_size=[5476], test_ep=100, tokens_weight=5, top_k_lm=20, top_k_qa=20, top_p_lm=0.0, top_p_qa=0.0, train_batch_size=[5476], tune_layer=3, uat_epoch=10, uat_len=3, unbound=0, use_sep=False, weight_decay=0.01) 2022-04-25 07:39:28,909 - 0:01:22 - 0.0s - INFO - main - start to train { task: ['sst'], seq train type: finetune } 2022-04-25 07:39:28,909 - 0:01:22 - 0.0s - INFO - main - extra training data size: 0 2022-04-25 07:40:05,900 - 0:01:59 - 37.0s - INFO - main - gen token = gen , gen token id = 50260 torch.Size([1024]) tensor([-0.0115, 0.0031, -0.0073, ..., -0.0526, -0.1757, 0.0257], device='cuda:2') torch.Size([10, 1024]) torch.Size([512, 1024]) torch.Size([49152, 512]) Traceback (most recent call last): File "/root/Robust-Prefix-Tuning/train.py", line 149, in model = train([task_id], model) File "/root/Robust-Prefix-Tuning/train.py", line 69, in train train_qadata = QADataset(train_dataset, "train", SPECIAL_TOKEN_IDS[tasks[0]], train_extra_data) File "/root/Robust-Prefix-Tuning/utils.py", line 162, in init with open(data_path, "r") as f: FileNotFoundError: [Errno 2] No such file or directory: 'data/sst_to_squad-train-v2.0.json'

It seems that train data is lost

opened by baiyuting 1

Code and datasets for the paper "KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction"

KnowPrompt Code and datasets for our paper "KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction" Requireme

137 Dec 31, 2022

Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (CVPR 2022)

SwinTextSpotter This is the pytorch implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text R

183 Jan 3, 2023

Simple-Image-Classification - Simple Image Classification Code (PyTorch)

Simple-Image-Classification Simple Image Classification Code (PyTorch) Yechan Kim This repository contains: Python3 / Pytorch code for multi-class ima

8 Oct 29, 2022

Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer"

Transformer-vocabulary-transfer Implementation of the paper "Fine-Tuning Transfo

13 Nov 30, 2022

Code release for "Self-Tuning for Data-Efficient Deep Learning" (ICML 2021)

Self-Tuning for Data-Efficient Deep Learning This repository contains the implementation code for paper: Self-Tuning for Data-Efficient Deep Learning

101 Dec 11, 2022

Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"

T-Few This repository contains the official code for the paper: "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learni

220 Dec 31, 2022

[CVPR2021] The source code for our paper 《Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning》.

TBE The source code for our paper "Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Le

150 Dec 28, 2022

Code for CVPR2021 paper "Robust Reflection Removal with Reflection-free Flash-only Cues"

Robust Reflection Removal with Reflection-free Flash-only Cues (RFC) Paper | To be released: Project Page | Video | Data Tensorflow implementation for

162 Jan 5, 2023

Code for the paper: Learning Adversarially Robust Representations via Worst-Case Mutual Information Maximization (https://arxiv.org/abs/2002.11798)

Representation Robustness Evaluations Our implementation is based on code from MadryLab's robustness package and Devon Hjelm's Deep InfoMax. For all t

19 Dec 7, 2022

code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification

Related tags

Overview

On Robust Prefix-Tuning for Text Classification

Prerequisite

Train the original prefix P_θ

Generate Adversarial Examples

Test the performance of P_θ

Robust Prefix P'_ψ: Constructing the canonical manifolds

Robust Prefix P'_ψ: Test its performance

Running Example

Released Data & Models

Acknowledgements:

Bibtex

You might also like...

Code and datasets for the paper "KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction"

Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (CVPR 2022)

Simple-Image-Classification - Simple Image Classification Code (PyTorch)

Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer"

Code release for "Self-Tuning for Data-Efficient Deep Learning" (ICML 2021)

Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"

[CVPR2021] The source code for our paper 《Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning》.

Code for CVPR2021 paper "Robust Reflection Removal with Reflection-free Flash-only Cues"

Code for the paper: Learning Adversarially Robust Representations via Worst-Case Mutual Information Maximization (https://arxiv.org/abs/2002.11798)

Comments

Collect activations

Question about generating adversarial examples

Question about prefix embedding.

error when run train.py

Owner

Zonghan Yang

SUPERVISED-CONTRASTIVE-LEARNING-FOR-PRE-TRAINED-LANGUAGE-MODEL-FINE-TUNING - The Facebook paper about fine tuning RoBERTa with contrastive loss

Simple image captioning model - CLIP prefix captioning.

Black-Box-Tuning - Black-Box Tuning for Language-Model-as-a-Service

Code for EMNLP 2021 main conference paper "Text AutoAugment: Learning Compositional Augmentation Policy for Text Classification"

The official implementation of the IEEE S&P`22 paper "SoK: How Robust is Deep Neural Network Image Classification Watermarking".

Adversarial-Information-Bottleneck - Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck (NeurIPS21)

Code to reproduce the results for Statistically Robust Neural Network Classification, published in UAI 2021

A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution (CVPR2022)

Code for ACL2021 paper Consistency Regularization for Cross-Lingual Fine-Tuning.

Code release for NeurIPS 2020 paper "Co-Tuning for Transfer Learning"