ICLR 2021: Pre-Training for Context Representation in Conversational Semantic Parsing

Microsoft

Last update: Oct 2, 2022

Related tags

Deep Learning SCoRE

Overview

SCoRe: Pre-Training for Context Representation in Conversational Semantic Parsing

This repository contains code for the ICLR 2021 paper "SCoRE: Pre-Training for Context Representation in Conversational Semantic Parsing".

If you use SCoRE in your work, please cite it as follows:

@inproceedings{yu2021SCoRE,
  title={{SCoRE}: Pre-Training for Context Representation in Conversational Semantic Parsing},
  author={Tao Yu and Rui Zhang and Oleksandr Polozov and Christopher Meek and Ahmed Hassan Awadallah},
  booktitle={International Conference on Learning Representations},
  year={2021},
  url={https://openreview.net/forum?id=oyZxhRI2RiE}
}

Environment Setup

At the time of development, we used the same environment setup as RAT-SQL. It assumes Python 3.7+ and CUDA 10.1. Thus, the simplest environment setup for all the experiments except SQA (find SQA's environment setup in sqa/README.md) is:

docker pull pytorch/pytorch:1.5-cuda10.1-cudnn7-devel
docker tag pytorch/pytorch:1.5-cuda10.1-cudnn7-devel score
docker run -it -v /path/to/this/repo:/workspace score
# or using GPUs
docker run --gpus 2 -it -v /path/to/this/repo:/workspace score

Run Experiments

Code and running commands for running all the experiments can be found in the following dirs. First, synthesize (or download) pre-training data and train a SCoRE checkpoint:

data_synthesis: Synthesize Contextual Pre-Training Data
SCoRE: Pre-Training SCoRE Using Synthesized Data

Then, to use the trained checkpoint as a base language model for conversational semantic parsing tasks:

mwoz: SCoRE for Dialog State Tracking (MWoZ)
sqa: SCoRE for Sequential Question Answering (SQA)
sparc_cosql: SCoRE for Context-Dependent Semantic Parsing (SParC and CoSQL)

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

You might also like...

Based on the paper "Geometry-aware Instance-reweighted Adversarial Training" ICLR 2021 oral

Geometry-aware Instance-reweighted Adversarial Training This repository provides codes for Geometry-aware Instance-reweighted Adversarial Training (ht

47 Dec 22, 2022

Official PyTorch implementation for paper Context Matters: Graph-based Self-supervised Representation Learning for Medical Images

Context Matters: Graph-based Self-supervised Representation Learning for Medical Images Official PyTorch implementation for paper Context Matters: Gra

49 Nov 23, 2022

Implementation of the paper "Language-agnostic representation learning of source code from structure and context".

Code Transformer This is an official PyTorch implementation of the CodeTransformer model proposed in: D. Zügner, T. Kirschstein, M. Catasta, J. Leskov

131 Dec 13, 2022

The official implementation of CSG-Stump: A Learning Friendly CSG-Like Representation for Interpretable Shape Parsing

CSGStumpNet The official implementation of CSG-Stump: A Learning Friendly CSG-Like Representation for Interpretable Shape Parsing Paper | Project page

39 Dec 26, 2022

A Python framework for conversational search

Chatty Goose Multi-stage Conversational Passage Retrieval: An Approach to Fusing Term Importance Estimation and Neural Query Rewriting Installation Ma

36 Oct 23, 2022

This is the repo for our work "Towards Persona-Based Empathetic Conversational Models" (EMNLP 2020)

Towards Persona-Based Empathetic Conversational Models (PEC) This is the repo for our work "Towards Persona-Based Empathetic Conversational Models" (E

35 Nov 17, 2022

The Adapter-Bot: All-In-One Controllable Conversational Model

The Adapter-Bot: All-In-One Controllable Conversational Model This is the implementation of the paper: The Adapter-Bot: All-In-One Controllable Conver

37 Nov 4, 2022

NUANCED is a user-centric conversational recommendation dataset that contains 5.1k annotated dialogues and 26k high-quality user turns.

NUANCED: Natural Utterance Annotation for Nuanced Conversation with Estimated Distributions Overview NUANCED is a user-centric conversational recommen

18 Dec 28, 2021

Facestar dataset. High quality audio-visual recordings of human conversational speech.

Facestar Dataset Description Existing audio-visual datasets for human speech are either captured in a clean, controlled environment but contain only a

87 Dec 21, 2022

Comments

Could not find some files for synthesizing data

Thank for your great work! I'm trying synthesize data by myself, however i can't find some files in the provided data for synthesize: context_templates.json, nlsql_templates_context.txt, spider/db_data.json

opened by Challenging6 2

ICLR 2021: Pre-Training for Context Representation in Conversational Semantic Parsing

Related tags

Overview

SCoRe: Pre-Training for Context Representation in Conversational Semantic Parsing

Environment Setup

Run Experiments

Contributing

Trademarks

You might also like...

Based on the paper "Geometry-aware Instance-reweighted Adversarial Training" ICLR 2021 oral

Official PyTorch implementation for paper Context Matters: Graph-based Self-supervised Representation Learning for Medical Images

Implementation of the paper "Language-agnostic representation learning of source code from structure and context".

The official implementation of CSG-Stump: A Learning Friendly CSG-Like Representation for Interpretable Shape Parsing

A Python framework for conversational search

This is the repo for our work "Towards Persona-Based Empathetic Conversational Models" (EMNLP 2020)

The Adapter-Bot: All-In-One Controllable Conversational Model

NUANCED is a user-centric conversational recommendation dataset that contains 5.1k annotated dialogues and 26k high-quality user turns.

Facestar dataset. High quality audio-visual recordings of human conversational speech.

Comments

Could not find some files for synthesizing data

Owner

Microsoft

Release of SPLASH: Dataset for semantic parse correction with natural language feedback in the context of text-to-SQL parsing

Code for our paper "Graph Pre-training for AMR Parsing and Generation" in ACL2022

Implementation of fast algorithms for Maximum Spanning Tree (MST) parsing that includes fast ArcMax+Reweighting+Tarjan algorithm for single-root dependency parsing.

PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".

Build upon neural radiance fields to create a scene-specific implicit 3D semantic representation, Semantic-NeRF

[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning

Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CVPR 2021)

Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CVPR 2021)

Code for the paper "Training GANs with Stronger Augmentations via Contrastive Discriminator" (ICLR 2021)

[ICLR 2021] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yining Ding, Vikas Chandra, Yingyan Lin