SCAPT-ABSA

Code for EMNLP 2021 paper: "Learning Implicit Sentiment in Aspect-based Sentiment Analysis with Supervised Contrastive Pre-Training"

Overview

In this repository, we provide the code for Supervised ContrAstive Pre-Training (SCAPT) and aspect-aware fine-tuning, the sentiment corpora retrieved from YELP/Amazon reviews, and the SemEval2014 Restaurant/Laptop datasets with additional implicit_sentiment labeling.

SCAPT aims to tackle implicit sentiment expressions in aspect-based sentiment analysis (ABSA). In our work, we define implicit sentiment as sentiment expressions that contain no polarity markers but still convey clear human-aware sentiment polarity.

Here are examples of explicit and implicit sentiment in ABSA:

[Figure: examples of explicit and implicit sentiment in ABSA]

SCAPT

SCAPT yields aligned representations for sentiment expressions that share the same sentiment label. It consists of three objectives:

  • Supervised Contrastive Learning (SCL)
  • Review Reconstruction (RR)
  • Masked Aspect Prediction (MAP)
[Figure: SCAPT pre-training]
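
As a concrete illustration of the SCL objective, below is a minimal sketch of a supervised contrastive loss over a batch of sentence representations, in which examples sharing a sentiment label are pulled together. The function name, temperature, and masking details are illustrative assumptions, not the repository's exact implementation.

    import torch
    import torch.nn.functional as F

    def supervised_contrastive_loss(features, labels, temperature=0.1):
        # features: (batch, dim) sentence representations; labels: (batch,) sentiment labels
        features = F.normalize(features, dim=-1)
        sim = features @ features.t() / temperature
        # exclude self-similarity from both the positives and the normalization term
        not_self = ~torch.eye(len(labels), dtype=torch.bool, device=features.device)
        pos_mask = (labels.unsqueeze(0) == labels.unsqueeze(1)) & not_self
        sim = sim.masked_fill(~not_self, float('-inf'))
        log_prob = sim - torch.logsumexp(sim, dim=1, keepdim=True)
        # average log-likelihood of positives, for anchors that have at least one positive
        pos_count = pos_mask.sum(dim=1)
        pos_log_prob = log_prob.masked_fill(~pos_mask, 0.0).sum(dim=1) / pos_count.clamp(min=1)
        return -pos_log_prob[pos_count > 0].mean()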

Aspect-aware Fine-tuning

In aspect-aware fine-tuning, both the sentiment representation and the aspect-based representation are taken into account for sentiment prediction.

[Figure: aspect-aware fine-tuning]
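
Roughly, the prediction step can be pictured as follows: the sentence-level sentiment representation (the [CLS] hidden state) is combined with an aspect representation pooled from the aspect tokens and passed to a polarity classifier. This is only a sketch under assumed layer names and mean pooling, not the repository's exact module.

    import torch
    import torch.nn as nn

    class AspectAwareHead(nn.Module):
        # Combines the [CLS] sentiment representation with a mean-pooled aspect representation.
        def __init__(self, hidden_size, num_polarities=3):
            super().__init__()
            self.classifier = nn.Linear(2 * hidden_size, num_polarities)

        def forward(self, hidden_states, aspect_mask):
            # hidden_states: (batch, seq_len, hidden); aspect_mask: (batch, seq_len), 1 on aspect tokens
            cls_repr = hidden_states[:, 0]
            mask = aspect_mask.unsqueeze(-1).float()
            aspect_repr = (hidden_states * mask).sum(dim=1) / mask.sum(dim=1).clamp(min=1e-6)
            return self.classifier(torch.cat([cls_repr, aspect_repr], dim=-1))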

Requirements

  • cuda 11.0
  • python 3.7.9
    • lxml 4.6.2
    • numpy 1.19.2
    • pytorch 1.8.0
    • pyyaml 5.3.1
    • tqdm 4.55.0
    • transformers 4.2.2

Data Preparation & Preprocessing

For Pre-training

The retrieved sentiment corpora contain millions of reviews. We provide download links for both the original corpora and the preprocessed data; download them if you want to run pre-training and use the corpora further:

File                    Google Drive Link   Baidu Wangpan Link   Baidu Wangpan Code
scapt_yelp_json.zip     link                link                 q7fs
scapt_amazon_json.zip   link                link                 i1da
scapt_yelp_pkl.zip      link                link                 j9ce
scapt_amazon_pkl.zip    link                link                 3b8t

These pickle files can also be generated from the JSON files with the preprocessing script:

python preprocess.py --pretrain

For Fine-tuning

We have already added the opinion term labeling to the original SemEval2014 datasets. For example:

    <sentence id="1634">
        <text>The food is uniformly exceptional, with a very capable kitchen which will proudly whip up whatever you feel like eating, whether it's on the menu or not.</text>
        <aspectTerms>
            <aspectTerm term="food" polarity="positive" from="4" to="8" implicit_sentiment="False" opinion_words="exceptional"/>
            <aspectTerm term="kitchen" polarity="positive" from="55" to="62" implicit_sentiment="False" opinion_words="capable"/>
            <aspectTerm term="menu" polarity="neutral" from="141" to="145" implicit_sentiment="True"/>
        </aspectTerms>
        <aspectCategories>
            <aspectCategory category="food" polarity="positive"/>
        </aspectCategories>
    </sentence>

implicit_sentiment indicates whether the aspect term is an implicit sentiment expression; when the sentiment is not implicit, the corresponding opinion_words are also given. The opinion_words labeling is credited to TOWE.
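
For instance, the extended XML can be read with lxml (listed in the requirements). The helper below and the example path are only illustrative, not part of the repository:

    from lxml import etree

    def load_aspect_terms(xml_path):
        # Collect aspect terms with polarity, implicit_sentiment flag, and opinion words (if any).
        tree = etree.parse(xml_path)
        examples = []
        for sentence in tree.iter("sentence"):
            text = sentence.findtext("text")
            for term in sentence.iter("aspectTerm"):
                examples.append({
                    "text": text,
                    "aspect": term.get("term"),
                    "polarity": term.get("polarity"),
                    "implicit": term.get("implicit_sentiment") == "True",
                    "opinion_words": term.get("opinion_words"),  # None for implicit expressions
                })
        return examples

    # e.g. load_aspect_terms("restaurants/Restaurants_Train_v2_Implicit_Labeled.xml")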

Both original and extended fine-tuning data and preprocessed dumps are uploaded to this repository.

Consequently, the structure of your data directory should be:

├── Amazon
│   ├── amazon_laptops.json
│   └── amazon_laptops_preprocess_pretrain.pkl
├── laptops
│   ├── Laptops_Test_Gold_Implicit_Labeled_preprocess_finetune.pkl
│   ├── Laptops_Test_Gold_Implicit_Labeled.xml
│   ├── Laptops_Test_Gold.xml
│   ├── Laptops_Train_v2_Implicit_Labeled_preprocess_finetune.pkl
│   ├── Laptops_Train_v2_Implicit_Labeled.xml
│   └── Laptops_Train_v2.xml
├── MAMS
│   ├── test_preprocess_finetune.pkl
│   ├── test.xml
│   ├── train_preprocess_finetune.pkl
│   ├── train.xml
│   ├── val_preprocess_finetune.pkl
│   └── val.xml
├── restaurants
│   ├── Restaurants_Test_Gold_Implicit_Labeled_preprocess_finetune.pkl
│   ├── Restaurants_Test_Gold_Implicit_Labeled.xml
│   ├── Restaurants_Test_Gold.xml
│   ├── Restaurants_Train_v2_Implicit_Labeled_preprocess_finetune.pkl
│   ├── Restaurants_Train_v2_Implicit_Labeled.xml
│   └── Restaurants_Train_v2.xml
└── YELP
    ├── yelp_restaurants.json
    └── yelp_restaurants_preprocess_pretrain.pkl

Pre-training

The pre-training is conducted on multiple GPUs.

  • Pre-training [TransEnc|BERT] on [YELP|Amazon]:

    python -m torch.distributed.launch --nproc_per_node=${THE_CARD_NUM_YOU_HAVE} multi_card_train.py --config config/[yelp|amazon]_[TransEnc|BERT]_pretrain.yml
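
For example, pre-training BERT on the YELP corpus with 4 GPUs (adjust the card count to the GPUs you have):

    python -m torch.distributed.launch --nproc_per_node=4 multi_card_train.py --config config/yelp_BERT_pretrain.yml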

Model checkpoints are saved in results.

Fine-tuning

  • Directly train [TransEnc|BERT] on [Restaurants|Laptops|MAMS] as [TransEncAsp|BERTAsp]:

    python train.py --config config/[restaurants|laptops|mams]_[TransEnc|BERT]_finetune.yml
  • Fine-tune the pre-trained [TransEnc|BERT] on [Restaurants|Laptops|MAMS] as [TransEncAsp+SCAPT|BERTAsp+SCAPT]:

    python train.py --config config/[restaurants|laptops|mams]_[TransEnc|BERT]_finetune.yml --checkpoint PATH/TO/MODEL_CHECKPOINT
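
For example, fine-tuning the pre-trained BERT on the Restaurant dataset (the checkpoint path below is illustrative; point it to your own pre-training checkpoint):

    python train.py --config config/restaurants_BERT_finetune.yml --checkpoint results/yelp_BERT_pretrain/best_model.pt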

Model checkpoints are saved in results.

Evaluation

  • Evaluate [TransEnc|BERT]-based model on [Restaurants|Laptops|MAMS] dataset:

    python evaluate.py --config config/[restaurants|laptops|mams]_[TransEnc|BERT]_finetune.yml --checkpoint PATH/TO/MODEL_CHECKPOINT

Our model parameters:

Model              Dataset                 File                      Google Drive Link   Baidu Wangpan Link   Baidu Wangpan Code
TransEncAsp+SCAPT  SemEval2014 Restaurant  TransEnc_restaurants.zip  link                link                 5e5c
TransEncAsp+SCAPT  SemEval2014 Laptop      TransEnc_laptops.zip      link                link                 8amq
TransEncAsp+SCAPT  MAMS                    TransEnc_MAMS.zip         link                link                 bf2x
BERTAsp+SCAPT      SemEval2014 Restaurant  BERT_restaurants.zip      link                link                 1w2e
BERTAsp+SCAPT      SemEval2014 Laptop      BERT_laptops.zip          link                link                 zhte
BERTAsp+SCAPT      MAMS                    BERT_MAMS.zip             link                link                 1iva
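
For example, to evaluate the released BERTAsp+SCAPT checkpoint on the Restaurant dataset after downloading and unzipping BERT_restaurants.zip (the extracted checkpoint path below is assumed):

    python evaluate.py --config config/restaurants_BERT_finetune.yml --checkpoint BERT_restaurants/checkpoint.pt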

Citation

If you found this repository useful, please cite our paper:

@inproceedings{li-etal-2021-learning-implicit,
    title = "Learning Implicit Sentiment in Aspect-based Sentiment Analysis with Supervised Contrastive Pre-Training",
    author = "Li, Zhengyan  and
      Zou, Yicheng  and
      Zhang, Chong  and
      Zhang, Qi  and
      Wei, Zhongyu",
    booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2021",
    address = "Online and Punta Cana, Dominican Republic",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.emnlp-main.22",
    pages = "246--256",
    abstract = "Aspect-based sentiment analysis aims to identify the sentiment polarity of a specific aspect in product reviews. We notice that about 30{\%} of reviews do not contain obvious opinion words, but still convey clear human-aware sentiment orientation, which is known as implicit sentiment. However, recent neural network-based approaches paid little attention to implicit sentiment entailed in the reviews. To overcome this issue, we adopt Supervised Contrastive Pre-training on large-scale sentiment-annotated corpora retrieved from in-domain language resources. By aligning the representation of implicit sentiment expressions to those with the same sentiment label, the pre-training process leads to better capture of both implicit and explicit sentiment orientation towards aspects in reviews. Experimental results show that our method achieves state-of-the-art performance on SemEval2014 benchmarks, and comprehensive analysis validates its effectiveness on learning implicit sentiment.",
}
Comments
  • Reproduction performance issue

    Hello, I used the pre-trained model you provided for the fine-tuning experiments, but the reproduced results are far below those reported in the paper: on ISE the score drops by up to 9 points. In addition, on the Laptop dataset, fine-tuning BERTAsp from your pre-trained SCAPT checkpoint gives worse results than training BERTAsp directly without SCAPT. My hyperparameter settings are the same as yours. Could you tell me why? Thank you!

    opened by wangyuting0628 4
  • Pre-training error

    Hello, I get the following CUDA error when I run the pre-training code:

    CUDA error: CUBLAS_STATUS_INTERNAL_ERROR when calling `cublasCreate(handle)`
    

    I installed the requirements, downloaded the data, and followed the pre-training instructions. Is there a way I could fix this error? And is there a way I could run the pre-training code on CPU instead of GPU?

    opened by cozymichelle 2
  • question about review reconstruction loss

    https://github.com/Tribleave/SCAPT-ABSA/blob/5f341fd811af62e7c0c8c8417c3a89f45179d663/model/module/misc.py#L27

    You add the encoder's [CLS] hidden state to every input word embedding as the transformer decoder's input, which may not be consistent with your claim that "hi acts as a beginning-of-sentence input embedding in the decoding process to control the whole generation". Thanks for your reply in advance!

    opened by wuyaoxuehun 2
  • where can i find the pretrained models?

    Hi, nice work! I am wondering where I can get the pre-trained models you mentioned? I only found the raw data links for pre-training in the README.

    thanks! jackie

    opened by jackie930 1
  • Could you release the model weights before fine-tuning?

    Hello! I noticed that you released the weights of models that were pre-trained and then fine-tuned on downstream tasks. Could you also release the pre-training-only weights (BERT)?

    Many thanks to your team for proposing this contrastive pre-training method for ABSA. I am also working on pre-training and need to compare several pre-trained models, including yours, on multiple downstream tasks. However, models that have already been fine-tuned on a downstream task carry a risk of data leakage, so if you could provide the pre-training-only weights (BERT pre-trained on YELP and Amazon), my comparison would be fairer and more objective.

    I hope you can help!

    opened by 1140310118 0
  • About the amount of pre-training data

    Hello, thank you for your work, which has given me a lot to think about. I would like to reproduce the results, so how much of the Amazon and YELP data did you use during pre-training? The paper says you "manually check a small portion", but the cloud-drive files contain about one million and four million reviews respectively. Could you release the subset you actually used for pre-training? Also, regarding the validation set: it seems your code does not use one. Do you select the model directly on the test set? If so, is that appropriate?

    opened by Surfing-a 3