Towards Debiasing NLU Models from Unknown Biases
Abstract: NLU models often exploit biased features to achieve high dataset-specific performance without properly learning the intended task. Recently proposed debiasing methods are shown to be effective in mitigating this tendency. However, these methods rely on a major assumption that the type of biased features is known a priori, which limits their application to many NLU tasks and datasets. In this work, we present the first step to bridge this gap by introducing a self-debiasing framework that prevents models from mainly utilizing biases without knowing them in advance. The proposed framework is general and complementary to the existing debiasing methods. We show that the proposed framework allows these existing methods to retain the improvement on the challenge datasets (i.e., sets of examples designed to expose models' reliance on biases) without specifically targeting certain biases. Furthermore, the evaluation suggests that applying the framework results in improved overall robustness.
This repository contains the code to reproduce our work on debiasing NLU models without prior information on biases. We provide three experiment configurations reported in our paper (a simplified sketch of the corresponding loss functions follows the list):
- Debias the MNLI model from syntactic bias and evaluate on HANS as the out-of-distribution data, using example reweighting.
- Debias the MNLI model from syntactic bias and evaluate on HANS as the out-of-distribution data, using product of experts.
- Debias the MNLI model from syntactic bias and evaluate on HANS as the out-of-distribution data, using confidence regularization.
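For orientation, below is a simplified, hypothetical PyTorch sketch of how these three recipes can combine the main model's loss with a shallow model's probabilities (and, for confidence regularization, a standard teacher's probabilities). The function names are illustrative; the exact formulations used in our experiments are implemented in src/train_distill_bert.py.

```python
import torch
import torch.nn.functional as F

def reweight_loss(logits, labels, shallow_probs):
    """Example reweighting: down-weight examples that the shallow model
    already classifies correctly with high confidence."""
    per_example = F.cross_entropy(logits, labels, reduction="none")
    weights = 1.0 - shallow_probs.gather(1, labels.unsqueeze(1)).squeeze(1)
    return (weights * per_example).mean()

def product_of_experts_loss(logits, labels, shallow_probs):
    """Product of experts: combine the main model's and the shallow model's
    log-probabilities; cross_entropy renormalizes the combined scores."""
    combined = F.log_softmax(logits, dim=-1) + torch.log(shallow_probs + 1e-12)
    return F.cross_entropy(combined, labels)

def confidence_regularization_loss(logits, labels, shallow_probs, teacher_probs):
    """Confidence regularization: distill from a teacher whose probabilities
    are smoothed more strongly on examples the shallow model finds easy."""
    scale = 1.0 - shallow_probs.gather(1, labels.unsqueeze(1)).squeeze(1)
    smoothed = teacher_probs ** scale.unsqueeze(1)
    target = smoothed / smoothed.sum(dim=1, keepdim=True)
    return -(target * F.log_softmax(logits, dim=-1)).sum(dim=1).mean()
```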
Requirements
The code requires Python >= 3.6 and PyTorch >= 1.1.0. Additional dependencies are listed in requirements.txt. Install them all by running:
pip install -r requirements.txt
Data
Our experiments use the MNLI dataset version provided by the GLUE benchmark. Download the file from here and unzip it under the ./dataset directory.
The dataset directory should be structured as the following:
└── dataset
    └── MNLI
        ├── train.tsv
        ├── dev_matched.tsv
        └── dev_mismatched.tsv
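As an optional sanity check (not part of the repository scripts), the snippet below verifies that the expected files are in place before training:

```python
import os

# Illustrative check: confirm the MNLI files are where the training script
# expects them, i.e. under ./dataset/MNLI/.
expected = ["train.tsv", "dev_matched.tsv", "dev_mismatched.tsv"]
mnli_dir = os.path.join("dataset", "MNLI")
missing = [name for name in expected
           if not os.path.isfile(os.path.join(mnli_dir, name))]
if missing:
    raise FileNotFoundError(f"Missing MNLI files under {mnli_dir}: {missing}")
print("MNLI dataset layout looks correct.")
```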
Running the experiments
For each evaluation setting, use the --mode argument to set the appropriate loss function. Choose the annealed version of the loss function to reproduce the annealed results.
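The exact annealing schedule is defined in the training script; as a rough illustration only (the output directory name in the command below suggests a schedule from 1.0 to 0.8), one way to anneal is to soften the shallow model's probabilities with an exponent that decreases over training:

```python
import torch

def annealed_shallow_probs(shallow_probs, step, total_steps, start=1.0, end=0.8):
    """Hypothetical annealing schedule: linearly move an exponent from
    `start` to `end` over training. An exponent below 1 flattens the shallow
    model's probabilities, which changes how strongly the debiasing loss
    reacts to examples the shallow model is confident about."""
    frac = min(step / max(total_steps, 1), 1.0)
    exponent = start + frac * (end - start)
    scaled = shallow_probs ** exponent
    return scaled / scaled.sum(dim=-1, keepdim=True)

# Example: halfway through training the exponent is 0.9.
probs = torch.tensor([[0.9, 0.05, 0.05]])
print(annealed_shallow_probs(probs, step=5000, total_steps=10000))
```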
To reproduce our result on MNLI ⮕ HANS, run the following:
cd src/
CUDA_VISIBLE_DEVICES=9 python train_distill_bert.py \
--output_dir ../experiments_self_debias_mnli_seed111/bert_reweighted_sampled2K_teacher_seed111_annealed_1to08 \
--do_train --do_eval --mode reweight_by_teacher_annealed \
--custom_teacher ../teacher_preds/mnli_trained_on_sample2K_seed111.json --seed 111 --which_bias hans
Biased examples identification
To obtain the predictions of the shallow models, we train the same model architecture on a small fraction of the dataset. For MNLI, we subsample 2,000 examples and train the model for 5 epochs. For obtaining shallow models of other datasets, please see the appendix of our paper. The shallow model can be obtained with the command below:
cd src/
CUDA_VISIBLE_DEVICES=9 python train_distill_bert.py \
--output_dir ../experiments_shallow_mnli/bert_base_sampled2K_seed111 \
--do_train --do_eval --do_eval_on_train --mode none \
--seed 111 --which_bias hans --debug --num_train_epochs 5 --debug_num 2000
Once the training and the evaluation on the train set are done, copy the probability JSON file from the output directory to ../teacher_preds/mnli_trained_on_sample2K_seed111.json.
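To confirm that the copied file is usable, a quick illustrative check (it only assumes the file is valid JSON with one probability entry per training example) is:

```python
import json

# Path relative to src/, matching the commands above.
with open("../teacher_preds/mnli_trained_on_sample2K_seed111.json") as f:
    teacher_preds = json.load(f)

print(f"Loaded shallow-model predictions for {len(teacher_preds)} examples.")
```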
Expected results
Results (accuracy, %) on the MNLI ⮕ HANS setting without annealing:
Mode | Seed | MNLI-m | MNLI-mm | HANS avg. |
---|---|---|---|---|
None | 111 | 84.57 | 84.72 | 62.04 |
reweighting | 111 | 81.8 | 82.3 | 72.1 |
PoE | 111 | 81.5 | 81.1 | 70.3 |
conf-reg | 222 | 83.7 | 84.1 | 68.7 |