PyTorch Implementation of "Non-Autoregressive Neural Machine Translation"

Overview

Non-Autoregressive Transformer

Code release for Non-Autoregressive Neural Machine Translation by Jiatao Gu, James Bradbury, Caiming Xiong, Victor O.K. Li, and Richard Socher.

Requires PyTorch 0.3, torchtext 0.2.1, and spaCy.

The pipeline for training a NAT model for a given language pair includes:

  1. run_alignment_wmt_LANG.sh (runs fast_align for alignment supervision)
  2. run_LANG.sh (trains an autoregressive model)
  3. run_LANG_decode.sh (produces the distillation corpus for training the NAT)
  4. run_LANG_fast.sh (trains the NAT model)
  5. run_LANG_fine.sh (fine-tunes the NAT model)
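Run end to end, the five steps above amount to invoking the scripts in order. A minimal driver sketch, assuming the repo's scripts are in the working directory; "ende" is an illustrative language-pair tag, and the actual `subprocess.run` call is left commented out:

```python
import subprocess

# "ende" is an illustrative language-pair tag; substitute your own.
lang = "ende"

# The five pipeline stages, in order (names follow the repo's run_*.sh pattern).
steps = [
    f"run_alignment_wmt_{lang}.sh",  # 1. fast_align alignment supervision
    f"run_{lang}.sh",                # 2. autoregressive teacher model
    f"run_{lang}_decode.sh",         # 3. distillation corpus for the NAT
    f"run_{lang}_fast.sh",           # 4. NAT training
    f"run_{lang}_fine.sh",           # 5. NAT fine-tuning
]

for script in steps:
    print("==>", script)
    # subprocess.run(["bash", script], check=True)  # uncomment with the repo checked out
```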
Comments
  • import copy in train.py for line 36

    flake8 testing of https://github.com/salesforce/nonauto-nmt on Python 3.6.3

    $ flake8 . --count --select=E901,E999,F821,F822,F823 --show-source --statistics

    ./model.py:92:75: F821 undefined name 'the_mask'
            return targets[input_mask], out[out_mask].view(-1, out.size(-1)), the_mask
                                                                              ^
    ./self_learn.py:626:48: F821 undefined name 'align_index'
                    decoding1 = unsorted(decoding, align_index)
                                                   ^
    ./self_learn.py:905:45: F821 undefined name 'loss_alter'
                loss_alter, loss_worse = export(loss_alter), export(loss_worse)
                                                ^
    ./self_learn.py:905:65: F821 undefined name 'loss_worse'
                loss_alter, loss_worse = export(loss_alter), export(loss_worse)
                                                                    ^
    ./self_learn.py:1342:78: F821 undefined name 'fertility_mode'
        names = ['dev.src.b{}={}.{}'.format(args.beam_size, args.load_from, args,fertility_mode),
                                                                                 ^
    ./self_learn.py:1343:77: F821 undefined name 'fertility_mode'
                'dev.trg.b{}={}.{}'.format(args.beam_size, args.load_from, args,fertility_mode),
                                                                                ^
    ./self_learn.py:1344:77: F821 undefined name 'fertility_mode'
                'dev.dec.b{}={}.{}'.format(args.beam_size, args.load_from, args,fertility_mode)]
                                                                                ^
    ./train.py:35:17: F821 undefined name 'copy'
        new_batch = copy.copy(batch)
                    ^
    8     F821 undefined name 'the_mask'
    8
    
    labels: cla:missing
    opened by cclauss · 2 comments
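The train.py finding above is just a missing `import copy` at the top of the file. A minimal sketch of why the shallow copy works once the import is added; `Batch` here is a hypothetical stand-in for the torchtext batch object:

```python
import copy  # the import flagged as missing by flake8

class Batch:
    """Hypothetical stand-in for the torchtext batch object used in train.py."""
    def __init__(self, src, trg):
        self.src, self.trg = src, trg

batch = Batch(src=[1, 2, 3], trg=[4, 5, 6])
new_batch = copy.copy(batch)  # shallow copy: attribute values are shared
new_batch.src = [0, 0, 0]     # rebinding an attribute leaves the original intact
```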
  • enumerate function in for loop

    Thank you for sharing your code. I have a problem when running it: no error is raised, but in debug mode the "for" loop at line 142 of train.py never executes, because the enumerate call over the "train" variable yields nothing. How can I fix this?

    I would appreciate any help.

    opened by VahidChahkandi · 0 comments
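The symptom in that issue can be reproduced with any empty or already-exhausted iterable: enumerate itself is fine, but the loop body is simply skipped when the iterator yields nothing. A small illustration, with a plain list iterator standing in for the torchtext training iterator:

```python
def count_batches(train_iter):
    """Return how many times the loop body runs for a given iterable."""
    steps = 0
    for i, batch in enumerate(train_iter):
        steps += 1
    return steps

data = iter([["a"], ["b"]])        # stand-in for the training data iterator
first_pass = count_batches(data)   # body runs once per batch
second_pass = count_batches(data)  # iterator exhausted: body never runs
```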
  • Transformer Architecture and other issues

    In this implementation, decoder layer i attends to the output of encoder layer i, which differs from Google's original Transformer implementation, where every decoder layer attends to the output of the last encoder layer.

    https://github.com/salesforce/nonauto-nmt/blob/efcbe4f2329b140ac3ce06abb6409457cebc8e49/model.py#L601

    In addition, the provided scripts seem to contain options not supported by run.py, and many of the train_fast.sh scripts rely on -load_from old_fast_model, which further makes the paper's results hard to reproduce.

    opened by da03 · 1 comment
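The wiring difference the issue describes can be sketched abstractly. This is not the repo's code: the "layers" below are placeholder arithmetic, and only the routing of encoder states to decoder layers reflects the two designs being contrasted:

```python
def encode(x, n_layers):
    """Return the output of every encoder layer (placeholder arithmetic)."""
    states = []
    for _ in range(n_layers):
        x = x + 1          # stand-in for one encoder layer
        states.append(x)
    return states

def decode(states, layerwise):
    """Decoder layer i reads encoder layer i (layerwise) or always the last layer."""
    y = 0
    for i in range(len(states)):
        src = states[i] if layerwise else states[-1]
        y = y + src        # stand-in for a decoder layer with cross-attention
    return y

states = encode(0, 3)                        # states == [1, 2, 3]
layerwise_out = decode(states, layerwise=True)   # this repo's wiring
standard_out = decode(states, layerwise=False)   # original Transformer wiring
```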
  • Dataset

    Thank you for sharing your code. I have a question about how the data is preprocessed. For example, for the IWSLT En-De dataset, the script run_alignment_iwslt.sh uses a file named train.tags.en-de.bpe.dev.en2, which does not seem to belong to the original dataset. Where does it come from?

    opened by Maggione · 4 comments
Owner
Salesforce
A variety of vendor-agnostic projects which power Salesforce