Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines and results.

Overview

Few-Shot-Intent-Detection

Few-Shot-Intent-Detection is a repository designed for few-shot intent detection with/without Out-of-Scope (OOS) intents. It includes popular, challenging intent detection datasets and baselines. For more details on the newly released OOS datasets, please check our paper.

Intent detection datasets

We process the data based on previously published resources; all data are in the same format as DNNC.

| Dataset | Description | #Train | #Valid | #Test | Processed Data Link |
|---|---|---|---|---|---|
| BANKING77 | one banking domain with 77 intents | 8622 | 1540 | 3080 | Link |
| CLINC150 | 10 domains and 150 intents | 15000 | 3000 | 4500 | Link |
| HWU64 | personal assistant with 64 intents and several domains | 8954 | 1076 | 1076 | Link |
| SNIPS | snips voice platform with 7 intents | 13084 | 700 | 700 | Link |
| ATIS | airline travel information system | 4478 | 500 | 893 | Link |

Intent detection datasets with OOS queries

What are OOS queries?

OOD-OOS: i.e., out-of-domain OOS. General out-of-scope queries that are not supported by the dialog system at all. For instance, requesting an online NBA/TV show service in a banking system.

ID-OOS: i.e., in-domain OOS. Out-of-scope queries that are closely related to the in-scope intents, which makes the intent detection task more challenging. For instance, requesting a banking service that is not supported by the banking system.

| Dataset | Description | #Train | #Valid | #Test | #OOD-OOS-Train | #OOD-OOS-Valid | #OOD-OOS-Test | #ID-OOS-Train | #ID-OOS-Valid | #ID-OOS-Test | Processed Data Link |
|---|---|---|---|---|---|---|---|---|---|---|---|
| CLINC150 | A dataset with general OOD-OOS queries | 15000 | 3000 | 4500 | 100 | 100 | 1000 | - | - | - | Link |
| CLINC-Single-Domain-OOS | Two domains with both general OOD-OOS queries and ID-OOS queries | 500 | 500 | 500 | - | 200 | 1000 | - | 400 | 350 | Link |
| BANKING77-OOS | One banking domain with both general OOD-OOS queries and ID-OOS queries | 5905 | 1506 | 2000 | - | 200 | 1000 | 2062 | 530 | 1080 | Link |

Data structure:

Datasets/
├── BANKING77
│   ├── train
│   ├── train_10
│   ├── train_5
│   ├── valid
│   └── test
├── CLINC150
│   ├── train
│   ├── train_10
│   ├── train_5
│   ├── valid
│   ├── test
│   └── oos
│       ├── train
│       ├── valid
│       └── test
├── HWU64
│   ├── train
│   ├── train_10
│   ├── train_5
│   ├── valid
│   └── test
├── SNIPS
│   ├── train
│   ├── valid
│   └── test
├── ATIS
│   ├── train
│   ├── valid
│   └── test
├── BANKING77-OOS
│   ├── train
│   ├── valid
│   ├── test
│   ├── id-oos
│   │   ├── train
│   │   ├── valid
│   │   └── test
│   └── ood-oos
│       ├── valid
│       └── test
└── CLINC-Single-Domain-OOS
    ├── banking
    │   ├── train
    │   ├── valid
    │   ├── test
    │   ├── id-oos
    │   │   ├── valid
    │   │   └── test
    │   └── ood-oos
    │       ├── valid
    │       └── test
    └── credit_cards
        ├── train
        ├── valid
        ├── test
        ├── id-oos
        │   ├── valid
        │   └── test
        └── ood-oos
            ├── valid
            └── test

Briefly describe the BANKING77-OOS dataset.

  • A dataset with a single banking domain that includes both general Out-of-Scope (OOD-OOS) queries and In-Domain but Out-of-Scope (ID-OOS) queries, where ID-OOS queries are semantically similar to the in-scope intents. BANKING77 originally includes 77 intents; BANKING77-OOS keeps 50 of them as in-scope intents, and the ID-OOS queries are built from the 27 held-out, semantically similar intents.

Briefly describe the CLINC-Single-Domain-OOS dataset.

  • A dataset with two separate domains, i.e., the "Banking" domain and the "Credit cards" domain, with both general Out-of-Scope (OOD-OOS) queries and In-Domain but Out-of-Scope (ID-OOS) queries, where ID-OOS queries are semantically similar to the in-scope intents. Each domain in CLINC150 originally includes 15 intents; each domain in the new dataset keeps ten as in-scope intents, and the ID-OOS queries are built from the five held-out, semantically similar intents.

Both datasets can be used to conduct intent detection with and without OOD-OOS and ID-OOS queries.
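For orientation, the OOS splits follow the directory layout shown above. Below is a minimal sketch of how the BANKING77-OOS split paths could be assembled (it assumes the Datasets/ folder of a cloned copy of this repository is the data root; the dictionary keys are just illustrative names, not part of the repository):

import os

# Assumed data root: the Datasets/ folder of a cloned copy of this repository.
DATA_ROOT = "Datasets"

# Split directories for BANKING77-OOS, mirroring the tree above.
banking77_oos_splits = {
    "in_scope_train": os.path.join(DATA_ROOT, "BANKING77-OOS", "train"),
    "in_scope_valid": os.path.join(DATA_ROOT, "BANKING77-OOS", "valid"),
    "in_scope_test": os.path.join(DATA_ROOT, "BANKING77-OOS", "test"),
    "id_oos_train": os.path.join(DATA_ROOT, "BANKING77-OOS", "id-oos", "train"),
    "id_oos_valid": os.path.join(DATA_ROOT, "BANKING77-OOS", "id-oos", "valid"),
    "id_oos_test": os.path.join(DATA_ROOT, "BANKING77-OOS", "id-oos", "test"),
    "ood_oos_valid": os.path.join(DATA_ROOT, "BANKING77-OOS", "ood-oos", "valid"),
    "ood_oos_test": os.path.join(DATA_ROOT, "BANKING77-OOS", "ood-oos", "test"),
}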

You can easily load the processed data:

class IntentExample:
    def __init__(self, text, label, do_lower_case):
        self.original_text = text
        self.text = text
        self.label = label

        if do_lower_case:
            self.text = self.text.lower()
        
def load_intent_examples(file_path, do_lower_case=True):
    examples = []

    # Each split directory contains a seq.in file (one utterance per line)
    # and a parallel label file (one intent label per line).
    with open('{}/seq.in'.format(file_path), 'r', encoding="utf-8") as f_text, \
         open('{}/label'.format(file_path), 'r', encoding="utf-8") as f_label:
        for text, label in zip(f_text, f_label):
            e = IntentExample(text.strip(), label.strip(), do_lower_case)
            examples.append(e)

    return examples

For more details, check the code for loading the data and doing random sampling for few-shot learning; a small illustration follows.
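As a quick illustration of both steps, here is a minimal sketch (not the repository's exact sampling utility; the function name sample_few_shot and the fixed seed are assumptions made for this example) that loads a split with load_intent_examples and draws a K-shot subset per intent:

import random
from collections import defaultdict

def sample_few_shot(examples, k, seed=42):
    # Illustrative only: keep at most k randomly chosen examples per intent label.
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for example in examples:
        by_label[example.label].append(example)

    few_shot = []
    for items in by_label.values():
        rng.shuffle(items)
        few_shot.extend(items[:k])
    return few_shot

# Example usage: build a 5-shot training set for BANKING77
# (the path follows the data structure shown above).
train_examples = load_intent_examples('Datasets/BANKING77/train')
train_5_shot = sample_few_shot(train_examples, k=5)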

State-of-the-art models and baselines

DNNC

Download pre-trained RoBERTa NLI checkpoint:

wget https://storage.googleapis.com/sfr-dnnc-few-shot-intent/roberta_nli.zip

Access to public code: Link

CONVERT

Download pre-trained checkpoint:

wget https://github.com/connorbrinton/polyai-models/releases/download/v1.0/model.tar.gz

Access to public code:

wget https://github.com/connorbrinton/polyai-models/archive/refs/tags/v1.0.zip

CONVBERT

Download pre-trained checkpoints:

Step 1: install the AWS CLI v2 (e.g., install the macOS PKG)

Step 2:

aws s3 cp s3://dialoglue/ Your_folder_name --no-sign-request --recursive

The checkpoints are then downloaded into Your_folder_name.

Few-shot intent detection baselines/leaderboard:

5-shot learning

| Model | BANKING77 | CLINC150 | HWU64 |
|---|---|---|---|
| RoBERTa+Classifier (EMNLP 2020) | 74.04 | 87.99 | 75.56 |
| USE (ACL 2020 NLP4ConvAI) | 76.29 | 87.82 | 77.79 |
| CONVERT (ACL 2020 NLP4ConvAI) | 75.32 | 89.22 | 76.95 |
| USE+CONVERT (ACL 2020 NLP4ConvAI) | 77.75 | 90.49 | 80.01 |
| CONVBERT+MLM+Example+Observers (NAACL 2021) | - | - | - |
| DNNC (EMNLP 2020) | 80.40 | 91.02 | 80.46 |
| CPFT (EMNLP 2021) | 80.86 | 92.34 | 82.03 |

10-shot learning

| Model | BANKING77 | CLINC150 | HWU64 |
|---|---|---|---|
| RoBERTa+Classifier (EMNLP 2020) | 84.27 | 91.55 | 82.90 |
| USE (ACL 2020 NLP4ConvAI) | 84.23 | 90.85 | 83.75 |
| CONVERT (ACL 2020 NLP4ConvAI) | 83.32 | 92.62 | 82.65 |
| USE+CONVERT (ACL 2020 NLP4ConvAI) | 85.19 | 93.26 | 85.83 |
| CONVBERT (ArXiv 2020) | 83.63 | 92.10 | 83.77 |
| CONVBERT+MLM (ArXiv 2020) | 83.99 | 92.75 | 84.52 |
| CONVBERT+MLM+Example+Observers (NAACL 2021) | 85.95 | 93.97 | 86.28 |
| DNNC (EMNLP 2020) | 86.71 | 93.76 | 84.72 |
| CPFT (EMNLP 2021) | 87.20 | 94.18 | 87.13 |

Note: the 5-shot learning results of RoBERTa+Classifier, DNNC and CPFT, and the 10-shot learning results of all the models, are the numbers reported by the corresponding papers.

Citation

Please cite our papers if you use the above resources in your work:

@article{zhang2020discriminative,
  title={Discriminative nearest neighbor few-shot intent detection by transferring natural language inference},
  author={Zhang, Jian-Guo and Hashimoto, Kazuma and Liu, Wenhao and Wu, Chien-Sheng and Wan, Yao and Yu, Philip S and Socher, Richard and Xiong, Caiming},
  journal={EMNLP},
  pages={5064--5082},
  year={2020}
}

@article{zhang2021pretrained,
  title={Are Pretrained Transformers Robust in Intent Classification? A Missing Ingredient in Evaluation of Out-of-Scope Intent Detection},
  author={Zhang, Jian-Guo and Hashimoto, Kazuma and Wan, Yao and Liu, Ye and Xiong, Caiming and Yu, Philip S},
  journal={arXiv preprint arXiv:2106.04564},
  year={2021}
}

@article{zhang2021few,
  title={Few-Shot Intent Detection via Contrastive Pre-Training and Fine-Tuning},
  author={Zhang, Jianguo and Bui, Trung and Yoon, Seunghyun and Chen, Xiang and Liu, Zhiwei and Xia, Congying and Tran, Quan Hung and Chang, Walter and Yu, Philip},
  journal={EMNLP},
  year={2021}
}

Comments
  • Any plan to release the source code of EMNLP 2021's CPFT?

    Hi, Jian-Guo, thanks for your excellent work on few-shot learning. It is really interesting and insightful. I would like to know if there is a chance I can access your source code in order to study the details of this interesting work. Do you have any plan to release the source code of CPFT?

    opened by Doragd 5
  • About baseline scores

    I wonder where the baseline scores came from? I cannot find the corresponding scores in the original papers, and my reproduced results are quite different. For example, the RoBERTa+Classifier performance I get on CLINC150 5-shot is much lower than the score here, since mine is just a simple RobertaForSequenceClassification call. I don't think my code is wrong. Can you please tell me where the scores come from?

    opened by everks 3
  • Hyperparameters for reproducing DNNC results

    Hi, Jian-Guo. Sorry to bother you again. I would like to know the hyperparameters used when running the DNNC model on BANKING77, CLINC150 and HWU64 to reproduce the results in the table in the README. In the original paper, I only found the following table: [image]. However, I don't know which dataset these hyperparameters are for, and other hyperparameters such as gradient_accumulation_steps are not given. Could you help me at your convenience?

    opened by Doragd 2
  • Open-source CPFT code?

    Hi, Jianguo,

    Congratulations on becoming a Research Scientist at Salesforce.

    It would be highly appreciated if you could open-source the CPFT code. I have failed to reproduce the results in the paper. Are there any critical training details, such as the learning rate scheduler?

    Best Regards,

    opened by hdzhang-code 1