Anuvada: Interpretable Models for NLP using PyTorch

So, you want to know why your classifier arrived at a particular decision or why your flashy new deep learning classification model is not performing in the way which you would want it to perform? Or there could be bias in your dataset towards a particular class and you want to understand if there are any such edge cases.

One of the common criticisms of deep learning has been it's black box nature (life itself is a big black box, not at all interpretable, don't even ask me about love). To address this issue, researchers have developed many ways to visualise and explain the inference. It is not necessary that a model has to be explainable, but when important decisions like which jobs to recommend to a person or whether to give a person loan are being made, it would be helpful to cross-check the model's claims. In such domains, self-explainable models are necessary.

This library is an ongoing effort to provide a high-level access to such models by building on top of PyTorch.

Here is what you can expect to visualize from a trained model.

Note: This model is a convolutional neural network trained on IMDB sentiment analysis dataset. I trained the model using SGD till validation loss stopped improving. Here is sensitivity analysis on some sample inputs. You can find more details about training the model in the Jupyter notebooks from the examples directory.

Positive review

Negative review

Installing

Clone this repo and add it to your python library path.

Requirements

PyTorch
NumPy
Pandas
Spacy
Gensim
tqdm

To do list

Acknowledgments

https://github.com/henryre/pytorch-fitmodule

A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)

A2T: Towards Improving Adversarial Training of NLP Models This is the source code for the EMNLP 2021 (Findings) paper "Towards Improving Adversarial T

17 Oct 15, 2022

Prompt-learning is the latest paradigm to adapt pre-trained language models (PLMs) to downstream NLP tasks

Prompt-learning is the latest paradigm to adapt pre-trained language models (PLMs) to downstream NLP tasks, which modifies the input text with a textual template and directly uses PLMs to conduct pre-trained tasks. This library provides a standard, flexible and extensible framework to deploy the prompt-learning pipeline. OpenPrompt supports loading PLMs directly from huggingface transformers. In the future, we will also support PLMs implemented by other libraries.

2.3k Jan 8, 2023

Behavioral Testing of Clinical NLP Models

Behavioral Testing of Clinical NLP Models This repository contains code for testing the behavior of clinical prediction models based on patient letter

2 Sep 20, 2022

A design of MIDI language for music generation task, specifically for Natural Language Processing (NLP) models.

MIDI Language Introduction Reference Paper: Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions: code This

3 May 25, 2022

TweebankNLP - Pre-trained Tweet NLP Pipeline (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Models + Tweebank-NER

TweebankNLP This repo contains the new Tweebank-NER dataset and Twitter-Stanza p

84 Dec 20, 2022

T‘rex Park is a Youzan sponsored project. Offering Chinese NLP and image models pretrained from E-commerce datasets

Anuvada: Interpretable Models for NLP using PyTorch

Related tags

Overview

Anuvada: Interpretable Models for NLP using PyTorch

Positive review

Negative review

Installing

Requirements

To do list

Acknowledgments

You might also like...

A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)

Prompt-learning is the latest paradigm to adapt pre-trained language models (PLMs) to downstream NLP tasks

Behavioral Testing of Clinical NLP Models

A design of MIDI language for music generation task, specifically for Natural Language Processing (NLP) models.

TweebankNLP - Pre-trained Tweet NLP Pipeline (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Models + Tweebank-NER

T‘rex Park is a Youzan sponsored project. Offering Chinese NLP and image models pretrained from E-commerce datasets

Twitter bot that uses NLP models to summarize news articles referenced in a user's twitter timeline

An easy-to-use framework for BERT models, with trainers, various NLP tasks and detailed annonations

Basic Utilities for PyTorch Natural Language Processing (NLP)

Owner

EDGE

Text-Summarization-using-NLP - Text Summarization using NLP to fetch BBC News Article and summarize its text and also it includes custom article Summarization

Grading tools for Advanced NLP (11-711)Grading tools for Advanced NLP (11-711)

GCRC: A Gaokao Chinese Reading Comprehension dataset for interpretable Evaluation

Super easy library for BERT based NLP models

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

Super easy library for BERT based NLP models

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Machine learning models from Singapore's NLP research community