Learning Logic Rules for Document-Level Relation Extraction

Last update: Dec 26, 2022

Overview

LogiRE

We propose to introduce logic rules to tackle the challenges of doc-level RE.

Equipped with logic rules, our LogiRE framework not only captures long-range semantic dependencies explicitly, but also offers better interpretability.

We combine logic rules and outputs of neural networks for relation extraction.

As shown in the example, the relation between Kate and Britain can be identified from the other relations and the listed logic rule.
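
To make the idea concrete, here is a minimal Python sketch of how one chain-style logic rule can derive a new triple from relations that are already known. The relation names (`spouse_of`, `royalty_of`), the intermediate entity `William`, and the helper `apply_chain_rule` are hypothetical illustrations, not the rules or code used by LogiRE.

```python
from typing import List, Set, Tuple

Triple = Tuple[str, str, str]  # (head entity, relation, tail entity)


def apply_chain_rule(facts: Set[Triple], body: List[str], head: str) -> Set[Triple]:
    """Derive head(x, z) whenever a path x -body[0]-> ... -body[-1]-> z exists in facts."""
    # Start from every (start, current) pair connected by the first body relation.
    frontier = {(h, t) for h, r, t in facts if r == body[0]}
    # Extend each partial path by one body relation at a time.
    for rel in body[1:]:
        frontier = {
            (start, t)
            for start, mid in frontier
            for h, r, t in facts
            if r == rel and h == mid
        }
    return {(start, head, end) for start, end in frontier}


# Hypothetical facts and rule: royalty_of(x, z) <- spouse_of(x, y) AND royalty_of(y, z)
facts = {
    ("Kate", "spouse_of", "William"),
    ("William", "royalty_of", "Britain"),
}
derived = apply_chain_rule(facts, body=["spouse_of", "royalty_of"], head="royalty_of")
print(derived)  # {('Kate', 'royalty_of', 'Britain')}
```

This shows only the symbolic derivation step. As stated above, the framework combines such rules with the outputs of a neural network rather than applying them in isolation.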

An overview of the LogiRE framework is shown below.

Data

Download the preprocessing script and metadata

DWIE
├── data
│   ├── annos
│   └── annos_with_content
├── en_core_web_sm-2.3.1
│   ├── build
│   ├── dist
│   ├── en_core_web_sm
│   ├── en_core_web_sm.egg-info
│   ├── MANIFEST.in
│   ├── meta.json
│   ├── PKG-INFO
│   ├── setup.cfg
│   └── setup.py
├── glove.6B.100d.txt
├── md5sum.txt
└── read_docred_style.py

Install the spaCy model (en_core_web_sm-2.3.1)
```
cd en_core_web_sm-2.3.1
pip install .
```
Download the original data from DWIE
Generate docred-style data
```
python3 read_docred_style.py
```
The docred-style doc-RE data will be generated at DWIE/data/docred-style. Compare the MD5 checksums of the generated files with the records in md5sum.txt to make sure the data was generated correctly.
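
The checksum comparison can be scripted. The sketch below is one possible way to do it in Python. It assumes that md5sum.txt uses the usual `md5sum` output format (`<hex digest>  <file name>` per line) and that the listed files sit directly in the data directory; both are assumptions, and the helper names are hypothetical.

```python
import hashlib
from pathlib import Path
from typing import Dict


def md5_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def parse_md5sum_text(text: str) -> Dict[str, str]:
    """Parse '<hex digest>  <file name>' lines (assumed format) into {name: digest}."""
    records = {}
    for line in text.splitlines():
        parts = line.split(maxsplit=1)
        if len(parts) == 2:
            digest, name = parts
            # md5sum marks binary mode with a leading '*' before the file name.
            records[name.lstrip("*").strip()] = digest.lower()
    return records


def verify(md5sum_file: Path, data_dir: Path) -> Dict[str, bool]:
    """Return {file name: True if the checksum matches} for every record."""
    records = parse_md5sum_text(md5sum_file.read_text())
    results = {}
    for name, expected in records.items():
        target = data_dir / Path(name).name  # assumption: files sit directly in data_dir
        results[name] = target.is_file() and md5_of_file(target) == expected
    return results


# Example usage (paths follow the layout described above):
# for name, ok in verify(Path("DWIE/md5sum.txt"), Path("DWIE/data/docred-style")).items():
#     print("OK" if ok else "MISMATCH", name)
```

A file that is missing is reported the same way as a file with a wrong checksum, so any `False` entry means the generation step should be rerun or inspected.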

Train & Eval

Requirements

pytorch >= 1.7.1
tqdm >= 4.62.3
transformers >= 4.4.2

Backbone Preparation

The LogiRE framework requires a backbone NN model for the initial probabilistic assessment of each triple.

The probabilistic assessments of the backbone model and other related metadata should be organized in the following format. In other words, train any doc-RE model on the docred-style RE data beforehand and dump its outputs as below.

{
    'train': [
        {
            'N': <int>,
            'logits': <torch.FloatTensor of size (N, N, R)>,
            'labels': <torch.BoolTensor of size (N, N, R)>,
            'in_train': <torch.BoolTensor of size (N, N, R)>,
        },
        ...
    ],
    'dev': [
        ...
    ],
    'test': [
        ...
    ]
}

Each example contains four items:

N: the number of entities in this example.
logits: the logits of all triples as a tensor of size (N, N, R). R is the number of relation types (Na excluded).
labels: the labels of all triples as a tensor of size (N, N, R).
in_train: the in_train masks of all triples as a tensor of size (N, N, R), used for Ign F1 evaluation. True indicates that the triple also exists in the training split.
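
To show how the `in_train` mask enters the evaluation, here is a minimal sketch of F1 and Ign F1 over flattened triples. It follows the convention commonly used for DocRED-style evaluation, where correct predictions that already appear in the training split are discounted from precision while recall is unchanged. This is an assumption about the metric, and the repository's own evaluation code may differ. The function name and the toy inputs are hypothetical, and how logits are turned into boolean predictions is left to the caller.

```python
from typing import Sequence, Tuple


def f1_and_ign_f1(
    preds: Sequence[bool], labels: Sequence[bool], in_train: Sequence[bool]
) -> Tuple[float, float]:
    """Compute (F1, Ign F1) over flattened triples."""
    n_pred = sum(preds)
    n_gold = sum(labels)
    correct = sum(p and g for p, g in zip(preds, labels))
    correct_in_train = sum(p and g and t for p, g, t in zip(preds, labels, in_train))

    def f1(precision: float, recall: float) -> float:
        total = precision + recall
        return 0.0 if total == 0 else 2 * precision * recall / total

    recall = correct / n_gold if n_gold else 0.0
    precision = correct / n_pred if n_pred else 0.0
    # Ign precision discounts correct predictions already seen in training.
    ign_denominator = n_pred - correct_in_train
    ign_precision = (correct - correct_in_train) / ign_denominator if ign_denominator else 0.0
    return f1(precision, recall), f1(ign_precision, recall)


# Toy example: 3 predictions, 3 gold triples, 2 correct, 1 of them seen in training.
preds = [True, True, True, False]
labels = [True, True, False, True]
in_train = [True, False, False, False]
plain_f1, ign_f1 = f1_and_ign_f1(preds, labels, in_train)
print(round(plain_f1, 4), round(ign_f1, 4))  # 0.6667 0.5714
```

With tensors shaped as in the dump, the three inputs could be obtained by flattening, for example `labels.flatten().tolist()`.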

For convenience, we provide the dump of ATLOP as an example. Feel free to download it and try it directly.
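
Before passing a dump to training, it can help to check it against the format described above. The following is a minimal sketch of such a check, not part of LogiRE. The function `check_dump` is hypothetical and only relies on each tensor exposing a `.shape` attribute, which PyTorch tensors do.

```python
from typing import Any, Dict, List

SPLITS = ("train", "dev", "test")
TENSOR_KEYS = ("logits", "labels", "in_train")


def check_dump(dump: Dict[str, List[Dict[str, Any]]], rel_num: int) -> List[str]:
    """Return a list of readable problems; an empty list means the dump looks fine."""
    problems = []
    for split in SPLITS:
        if split not in dump:
            problems.append(f"missing split: {split}")
            continue
        for i, example in enumerate(dump[split]):
            n = example.get("N")
            if not isinstance(n, int):
                problems.append(f"{split}[{i}]: 'N' must be an int")
                continue
            for key in TENSOR_KEYS:
                if key not in example:
                    problems.append(f"{split}[{i}]: missing '{key}'")
                elif tuple(example[key].shape) != (n, n, rel_num):
                    problems.append(
                        f"{split}[{i}]: '{key}' has shape {tuple(example[key].shape)}, "
                        f"expected {(n, n, rel_num)}"
                    )
    return problems
```

Assuming the dump was saved with `torch.save`, a call such as `check_dump(torch.load(path), rel_num)` would list any mismatches. The check covers shapes only and does not inspect dtypes or values.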

Train

python3 main.py --mode train \
    --save_dir <the directory for saving logs and checkpoints> \
    --rel_num <the number of relation types (Na excluded)> \
    --ent_num <the number of entity types> \
    --n_iters <the number of iterations for optimization> \
    --max_depth <the maximum depth of the logic rules> \
    --data_dir <the directory of the docred-style data> \
    --backbone_path <the path of the backbone model dump>

Evaluation

python3 main.py --mode test \
    --save_dir <the directory for saving logs and checkpoints> \
    --rel_num <the number of relation types (Na excluded)> \
    --ent_num <the number of entity types> \
    --n_iters <the number of iterations for optimization> \
    --max_depth <the maximum depth of the logic rules> \
    --data_dir <the directory of the docred-style data> \
    --backbone_path <the path of the backbone model dump>
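
The train and evaluation commands take the same flags and differ only in `--mode`, so one way to keep the two runs consistent is to build both from a single configuration. The sketch below is a hypothetical helper, not part of the repository. The flag names come from the commands above, while every value in `example_config` is a placeholder and not a setting used in the paper.

```python
import shlex
from typing import Dict, List, Union

# Flags shared by the Train and Evaluation commands above.
SHARED_FLAGS = (
    "save_dir", "rel_num", "ent_num", "n_iters",
    "max_depth", "data_dir", "backbone_path",
)


def build_command(mode: str, config: Dict[str, Union[str, int]]) -> List[str]:
    """Build the argument list for main.py in the given mode."""
    if mode not in ("train", "test"):
        raise ValueError(f"unsupported mode: {mode}")
    missing = [flag for flag in SHARED_FLAGS if flag not in config]
    if missing:
        raise ValueError(f"missing settings: {missing}")
    command = ["python3", "main.py", "--mode", mode]
    for flag in SHARED_FLAGS:
        command += [f"--{flag}", str(config[flag])]
    return command


# Placeholder values only; replace them with the settings for your own run.
example_config = {
    "save_dir": "runs/example",
    "rel_num": 10,
    "ent_num": 5,
    "n_iters": 1,
    "max_depth": 2,
    "data_dir": "DWIE/data/docred-style",
    "backbone_path": "path/to/backbone_dump",
}
print(shlex.join(build_command("train", example_config)))
```

The returned list can be passed to `subprocess.run`, once with `"train"` and once with `"test"`, so both runs use the same `save_dir` and settings.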

Results

The LogiRE framework outperforms strong baselines on both relation performance and logical consistency.
Injecting logic rules improves the modeling of long-range dependencies. We report the relation performance on each interval of entity pair distance: the LogiRE framework outperforms the baseline, and the gap widens as the entity pair distance increases. Logic rules effectively serve as shortcuts for capturing long-range semantics at the concept level instead of the token level.

Acknowledgements

We sincerely thank RNNLogic, which largely inspired us, and DWIE & DocRED for providing the benchmarks.

Reference

@inproceedings{ru-etal-2021-learning,
    title = "Learning Logic Rules for Document-Level Relation Extraction",
    author = "Ru, Dongyu  and
      Sun, Changzhi  and
      Feng, Jiangtao  and
      Qiu, Lin  and
      Zhou, Hao  and
      Zhang, Weinan  and
      Yu, Yong  and
      Li, Lei",
    booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2021",
    address = "Online and Punta Cana, Dominican Republic",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.emnlp-main.95",
    pages = "1239--1250",
}

Comments

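Question about training results

Hello!

Sorry to bother you again. While trying to reproduce BiLSTM-style models on the DWIE dataset recently, I found that with the default docred parameter settings, the model's F1 on the dev set only converges to about 40, which is still far from the dev F1 of 50 reported in your paper. I think this is partly related to the small size of the dev set and partly to the specific parameter settings. What is your view? If possible, could you share some of your training settings, such as the learning rate, batch_size, maximum length truncation, and whether you used techniques such as dropout or lr_schedule?

Looking forward to your reply. Thank you very much!

Best wishes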

opened by FDUyjx 2
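Document length issue in the DWIE dataset

Hello!

I have recently become very interested in your experiments on the DWIE dataset and have been trying to reproduce some model results on it, but the results all seem to differ somewhat from those reported in your paper. One possible reason is that many documents in DWIE exceed BERT's maximum input length of 512. Did you handle this with the sliding window from ATLOP? The sliding window in ATLOP also seems to extend the limit only to 1024, which still does not cover the length of many documents. How did you handle this?

Looking forward to your reply. Thank you very much!

Best wishes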

opened by FDUyjx 2
