Argos Train
Trains an OpenNMT PyTorch model and SentencePiece tokenizer. Designed for use with Argos Translate and LibreTranslate.
Argos Translate packages are also available for download.
Training example
$ su argosopentech
$ source ~/argos-train-init
...
$ argos-train
From code (ISO 639): en
To code (ISO 639): es
From name: English
To name: Spanish
Package version: 1.0
Argos version: 1.0
...
Package saved to /home/argosopentech/argos-train/run/en_es.argosmodel
Data
Uses data from the Opus project in the Moses format stored in data index.
Environment
CUDA required, tested on vast.ai.
Docker
Docker image available at argosopentech/argostrain.
docker run -it argosopentech/argostrain /bin/bash
Run training
argos-train
Troubleshooting
- If you're running out of GPU memory reduce
batch_size
andvalid_batch_size
inconfig.yml
.
License
Licensed under either the MIT or CC0 License (same as Argos Translate).