420 Repositories
Python music-production Libraries
Music source separation is the task of separating audio recordings into individual sources
Music Source Separation Music source separation is the task of separating audio recordings into individual sources. This repository is a PyTorch implmeme
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
Serve an Angular production application from a Python Flask backend. Quick and easy.
Desktop music recognition application for Windows
MusicRecognizer Music recognition application for Windows. You can choose from which of the devices the recording will be made. If you choose speakers,
A bot that can play music on Telegram Group and Channel Voice Chats
Free and open-source channel/group voice chat music player for Telegram ❤️ with button support and Deezer and Saavn playback support @Sadew451
Stream Music A bot that can play music on Telegram Group and Channel Voice Chats Avail
Download Apple Music cover artwork in the best quality by providing an Apple Music link. It downloads the JPG, PNG, and WebP versions, since they often differ from one another.
amogus.py - Version 0.0.5 amogus - Apple Music Hi-Res Artwork Fetcher. This is my first real Python tool, so sorry if it's bad. amogus is a Python script
✨ Rubrix is a production-ready Python framework for exploring, annotating, and managing data in NLP projects.
✨ A Python framework to explore, label, and monitor data for NLP projects
Working repo for my xumx-sliCQ submissions to the ISMIR 2021 MDX
Music Demixing Challenge - xumx-sliCQ This repository is the GitHub mirror of my working submission repository for the AICrowd ISMIR 2021 Music Demixi
pedalboard is a Python library for adding effects to audio.
pedalboard is a Python library for adding effects to audio. It supports a number of common audio effects out of the box, and also allows the use of VST3® and Audio Unit plugin formats for third-party effects.
Learn to deploy a FastAPI application into production on the DigitalOcean App Platform
Learn to deploy a FastAPI application into production on the DigitalOcean App Platform. This is a microservice for our Try Django 3.2 project. The goal is to extract any and all text from images using a technique called OCR.
An advanced Telegram bot to play radio and music in voice chat. This is also the source code of the bot used for playing radio in the @AsmSafone channel ❤️
Telegram Radio Player V3 An Advanced Telegram Bot to Play Nonstop Radio/Music/YouTube Live in Channel or Group Voice Chats. This is also the source co
A Telegram Music Tag Editor Bot that can remove almost all usernames in the music tags and add your own username instead.
Music Tag Editor Bot A Telegram Music Tag Editor Bot that can remove almost all usernames in the music tags and add your own username instead. It can also
A Telegram Bot to manage your music channel with some cool features.
Music Channel Manager V2 A Telegram Bot to manage your music channel with some cool features like appending your predefined username to the musics tag
A CallsMusic-based Telegram bot and userbot to play music in your Telegram groups, with some cool extra features!
CallMusicPlus69 This Repo base on! A CallsMusic Based Telegram Bot and Userbot To Play Music in Your Telegram Groups With Some Cool Extra Features
Veez Music is a Telegram music bot project that allows you to play music in Telegram group voice chats.
VEEZ MUSIC BOT Veez Music is a Telegram bot project that allows you to play music in Telegram group voice chats. Requirements FFmpeg NodeJS node
Music source separation: training, evaluation, and inference pipelines and the pretrained models we used for the 2021 ISMIR MDX Challenge.
Music Source Separation with Channel-wise Subband Phase Aware ResUnet (CWS-PResUNet) Introduction This repo contains the pretrained Music Source Separ
PyTorch implementation of "Learn to Dance with AIST++: Music Conditioned 3D Dance Generation."
Learn to Dance with AIST++: Music Conditioned 3D Dance Generation. Installation pip install -r requirements.txt Prepare Dataset bash data/scripts/pre
EzilaX Music ❤ is the best and only Telegram VC player with playlists, multi playback, channel play, and more. Powered by SDBOTs
EzilaX-Music A bot that can play music on Telegram Group and Channel Voice Chats. Available on Telegram as @EzilaXMBot Features Thumbnail Support
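A Python 3-based bot. It currently supports deployment on a VPS via Docker. Supports Aria2, doujinshi downloads, NetEase Cloud Music downloads, Pixiv ranking downloads, youtube-dl, and image search.
Introduction A Python 3-based bot. It currently supports deployment on a VPS via Docker. Main features: File management The main interface is changed to filebrowser; the account is admin and the password is admin. Main interface path: http://ip:port. Please change the password yourself. FolderMagic's built-in WebDAV, path: http://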
Tune in is a collaborative music playing system where multiple guests can join a room and enjoy the song being played
✨ A collaborative music playing system where multiple guests can join a room and enjoy the song being played.
Source code for "Progressive Transformers for End-to-End Sign Language Production" (ECCV 2020)
Progressive Transformers for End-to-End Sign Language Production Source code for "Progressive Transformers for End-to-End Sign Language Production" (B
Emotion-conditioned music generation using a transformer-based model.
This is the official repository of EMOPIA: A Multi-Modal Pop Piano Dataset For Emotion Recognition and Emotion-based Music Generation. The paper has b
Production Grade Machine Learning Service
This project is made to help you scale from a basic Machine Learning project for research purposes to a production grade Machine Learning web service
PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos"
Foley Music: Learning to Generate Music from Videos This repo holds the code for the framework presented at ECCV 2020. Foley Music: Learning to Genera
The personal repository of the work: *DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer*.
DanceNet3D The personal repository of the work: DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer. Dataset and Results Pleas
PyTorch implementation of MuseMorphose, a Transformer-based model for music style transfer.
MuseMorphose This repository contains the official implementation of the following paper: Shih-Lun Wu, Yi-Hsuan Yang MuseMorphose: Full-Song and Fine-
MMDL (Mega Music Downloader) - A tool to easily download music.
mmdl - Mega Music Downloader What is mmdl? TLDR: MMDL is a CLI app which allows you to quickly and efficiently download one or multiple songs from Yo
Veez Music Bot is a Telegram music bot project that allows you to play music in Telegram group voice chats.
Veez Music Bot Music bot for playing music in Telegram group voice chats. Requirements FFmpeg NodeJS nodesource.com Python 3.7+ PyTgCalls Get
❤️ This is the EzilaXMusicPlayer advanced repo
Telegram EzilaXMusicPlayer Bot A bot that can play music in a Telegram group's voice chat ❤️ Requirements FFmpeg NodeJS nodesource.com Python 3.7+
Evidently helps analyze machine learning models during validation or production monitoring
Evidently helps analyze machine learning models during validation or production monitoring. The tool generates interactive visual reports and JSON profiles from pandas DataFrame or csv files. Currently 6 reports are available.
Brandnew-flask is a CLI tool used to generate a powerful and modern Flask app that supports the production environment.
Brandnew-flask is still in the initial stage and needs to be updated and improved continuously. Everyone is welcome to maintain and improve this CLI.
Official repository for the paper, MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding.
MidiBERT-Piano Authors: Yi-Hui (Sophia) Chou, I-Chun (Bronwin) Chen Introduction This is the official repository for the paper, MidiBERT-Piano: Large-
Telegram Radio - A userbot that continuously plays random audio files (from the famous Telegram music channel @mveargasm) in the intended voice chat.
MvEargasmDJ: This is my submission for the Telegram Radio Project of Baivaru, which required a userbot to continuously play random audio files from th
Stream music with ffmpeg and python
youtube-stream Stream music with ffmpeg and python original Usage set the KEY in stream.sh run server.py run stream.sh (You can use Git bash or WSL in
A module to complement discord.py that has Music, Paginator and Levelling.
discord-super-utils A modern python module including many useful features that make discord bot programming extremely easy. Features Modern leveling m
Minimal Telegram voice chat music bot, written with Pyrogram.
VCBOT Fully working VC (user)Bot, based on py-tgcalls and py-tgcalls-wrapper with minimal features. Deploying To heroku: Local machine/VPS: git clone
Pretrained SOTA Deep Learning models, callbacks and more for research and production with PyTorch Lightning and PyTorch
Connect Playground - an easy way to fill your account with production-like objects
Just a set of scripts to initialise an account with production-like data: A - Basic Distributor Account Initialization INPUT Distributor Account Token ACTI
Yandex Media Browser
Media browser for the Yandex Station plugin Play music, playlists, and radio on Yandex.Station from Home Assistant! Screenshot Root section: Library
TorchX is a library containing standard DSLs for authoring and running PyTorch related components for an E2E production ML pipeline.
Learn chords with your MIDI keyboard!
miditeach miditeach is a music learning tool that can be used to practice your chord skills with a MIDI keyboard! Features MIDI keyboard input se
spafe: Simplified Python Audio-Features Extraction
spafe aims to simplify feature extraction from mono audio files. The library can extract the following features: BFCC, LFCC, LPC, LPCC, MFCC, IMFCC, MSRCC, NGCC, PNCC, PSRCC, PLP, RPLP, Frequency-stats etc. It also provides various filterbank modules (Mel, Bark and Gammatone filterbanks) and other spectral statistics.
In this repository, I have developed an end to end Automatic speech recognition project. I have developed the neural network model for automatic speech recognition with PyTorch and used MLflow to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry.
End to End Automatic Speech Recognition In this repository, I have developed an end to end Automatic speech recognition project. I have developed the
Music downloader written in Python (uses the JioSaavn API).
YouTube Song Downloader Bot For Telegram
YouTube Song Downloader Bot For Telegram. Powered by TamilBots.
Source code and data from the RecSys 2020 article "Carousel Personalization in Music Streaming Apps with Contextual Bandits" by W. Bendada, G. Salha and T. Bontempelli
Carousel Personalization in Music Streaming Apps with Contextual Bandits - RecSys 2020 This repository provides Python code and data to reproduce expe
DaisyXmusic ❤ A bot that can play music on Telegram Group and Channel Voice Chats
DaisyXmusic ❤ is the best and only Telegram VC player with playlists, Multi Playback, Channel play and more
banditml is a lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.
banditml is a lightweight contextual bandit & reinforcement learning library designed to be used in production Python services. This library is developed by Bandit ML and ex-authors of Facebook's applied reinforcement learning platform, Reagent.
A bot that can play music on Telegram Group and Channel Voice Chats
DaisyXmusic ❤ is the best and only Telegram VC player with playlists, Multi Playback, Channel play and more
Play any song directly into your group voice chat.
Telegram VCPlayer Bot Play any song directly into your group voice chat. Official Bot : VCPlayerBot | Discussion Group : VoiceChat Music Player Suppor
A Telegram bot that can send you high-quality audio
Music downloader bot Still under development. Please report issues to improve this repo. I will try to fix bugs in the next update. Music downloader bot is a
Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)
MusCaps: Generating Captions for Music Audio Ilaria Manco1 2, Emmanouil Benetos1, Elio Quinton2, Gyorgy Fazekas1 1 Queen Mary University of London, 2
Music and video downloader, Made with love by Bryan Herrera
Python-Mp3Mp4-Downloader Music and video downloader, Made with love by Bryan Herrera Requirements CHOCOLATELY windows command If your system does not
This is an advanced version of the old Radio Player, a Telegram bot to play radio/music in channel or group voice chats.
Telegram Radio Player V2 A Telegram Bot to Play Radio/Music in Channel or Group Voice Chats. This is also the source code of the bot which is being u
Telegram bot + userbot for streaming audio in group calls.
Calls Music - Telegram bot + userbot for streaming audio in group calls Requirements FFmpeg Python 3.7+ Deployment Configuration Copy exampl
Starter kit for getting started in the Music Demixing Challenge.
Music Demixing Challenge - Starter Kit Challenge page This repository is the Music Demixing Challenge Submission template and Starter kit! Clone th
Multi-Track Music Generation with the Transformer and the Johann Sebastian Bach Chorales dataset
MMM: Exploring Conditional Multi-Track Music Generation with the Transformer and the Johann Sebastian Bach Chorales Dataset. Implementation of the pap
Self-Supervised Contrastive Learning of Music Spectrograms
Self-Supervised Music Analysis Self-Supervised Contrastive Learning of Music Spectrograms Dataset Songs on the Billboard Year End Hot 100 were collect
Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.
Word2Wave is a simple method for text-controlled GAN audio generation. You can either follow the setup instructions below and use the source code and CLI provided in this repo or you can have a play around in the Colab notebook provided. Note that, in both cases, you will need to train a WaveGAN model first
Cinder is Instagram's internal performance-oriented production version of CPython
Cinder is Instagram's internal performance-oriented production version of CPython 3.8. It contains a number of performance optimizations, including bytecode inline caching, eager evaluation of coroutines, a method-at-a-time JIT, and an experimental bytecode compiler that uses type annotations to emit type-specialized bytecode that performs better in the JIT.
A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-based singing voice separation." 21st International Society for Music Information Retrieval Conference, ISMIR. 2020.
Investigating U-NETS With Various Intermediate Blocks For Spectrogram-based Singing Voice Separation A Pytorch Implementation of the paper "Investigat
The VC Music source code of @DaisyXBot ❤️ v3 out now
DAISYXMUSIC V3 A bot that can play music on a Telegram group's voice call. Available on Telegram as @DaisyXbot What's new Thumbnail Support Playlist
JAKYM, Just Another Konsole YouTube-Music. A command-line YouTube music player written in Python with Spotify and YouTube playlist support
Just Another Konsole YouTube-Music Overview I wanted to create this application so that I could use the command line to play music easily. I often pla
A scriptable music downloader for Qobuz, Tidal, and Deezer
streamrip A scriptable stream downloader for Qobuz, Tidal, and Deezer. Features Downloads tracks, albums, playlists, discographies, and labels from Qo
Telegram Music Bot for YouTube/SoundCloud/Mixcloud
Telegram Music Bot Telegram Music Bot for YouTube/SoundCloud/Mixcloud This bot downloads and sends the audio when someone sends a YouTube/SoundCloud/Mi
Telegram Voice Chat UserBot made with Pyrogram and MarshalX/tgcalls with playlist and Heroku support
Telegram Voice Chat UserBot A Telegram UserBot to Play Audio in Voice Chats. This is also the source code of the userbot which is being used for playi
Nicotine+: A graphical client for the SoulSeek peer-to-peer system
Nicotine+ Nicotine+ is a graphical client for the Soulseek peer-to-peer file sharing network. Nicotine+ aims to be a pleasant, Free and Open Source (F
The first open-source PyTgCalls-based project.
SU Music Player - The first open-source PyTgCalls based Pyrogram bot to play music in voice chats Requirements FFmpeg NodeJS 15+ Python 3.7+ Deploymen
Python audio and music signal processing library
madmom Madmom is an audio signal processing library written in Python with a strong focus on music information retrieval (MIR) tasks. The library is i
A library for augmenting annotated audio data
muda A library for Musical Data Augmentation. muda package implements annotation-aware musical data augmentation, as described in the muda paper. The
Marsyas - Music Analysis, Retrieval and Synthesis for Audio Signals
Welcome to MARSYAS. MARSYAS is a software framework for rapid prototyping of audio applications, with flexibility and extensibility as primary concer
C++ library for audio and music analysis, description and synthesis, including Python bindings
Essentia Essentia is an open-source C++ library for audio analysis and audio-based music information retrieval released under the Affero GPL license.
a library for audio and music analysis
aubio aubio is a library to label music and sounds. It listens to audio signals and attempts to detect events. For instance, when a drum is hit, at wh
Build, test, deploy, iterate - Dev and prod tool for data science pipelines
Prodmodel is a build system for data science pipelines. Users, testers, contributors are welcome! Motivation · Concepts · Installation · Usage · Contr
Telegram Voice Chat Music Player UserBot Written with Pyrogram Smart Plugin and tgcalls
Telegram Voice Chat UserBot A Telegram UserBot to Play Audio in Voice Chats. This is also the source code of the userbot which is being used for playi
SU Music Player - The first open-source PyTgCalls based Pyrogram bot to play music in voice chats
SU Music Player - The first open-source PyTgCalls based Pyrogram bot to play music in voice chats Note Neither this, or PyTgCalls are fully
MoinMoin Wiki Development (2.0+), unstable, for production please use 1.9.x.
MoinMoin - a wiki engine in Python MoinMoin is an easy to use, full-featured and extensible wiki software package written in Python. It can fulfill a
Automatic music downloader for SABnzbd
Headphones Headphones is an automated music downloader for NZB and Torrent, written in Python. It supports SABnzbd, NZBget, Transmission, µTorrent, De
OpenShot Video Editor is an award-winning free and open-source video editor for Linux, Mac, and Windows, and is dedicated to delivering high quality video editing and animation solutions to the world.
OpenShot Video Editor is an award-winning free and open-source video editor for Linux, Mac, and Windows, and is dedicated to delivering high quality v
Supysonic is a Python implementation of the Subsonic server API.
Supysonic Supysonic is a Python implementation of the Subsonic server API. Current supported features are: browsing (by folders or tags) streaming of
Music player and music library manager for Linux, Windows, and macOS
Ex Falso / Quod Libet - A Music Library / Editor / Player Quod Libet is a music management program. It provides several different ways to view your au
MusicBrainz Picard
MusicBrainz Picard MusicBrainz Picard is a cross-platform (Linux/Mac OS X/Windows) application written in Python and is the official MusicBrainz tagge
Music player - endlessly plays your music
Music player First, if you wonder about what is supposed to be a music player or what makes a music player different from a simple media player, read
Mopidy is an extensible music server written in Python
Mopidy Mopidy is an extensible music server written in Python. Mopidy plays music from local disk, Spotify, SoundCloud, Google Play Music, and more. Y
Cross-platform music player
Exaile Exaile is a music player with a simple interface and powerful music management capabilities. Features include automatic fetching of album art,
FastAPI Skeleton App to serve machine learning models production-ready.
FastAPI Model Server Skeleton Serving machine learning models production-ready, fast, easy and secure powered by the great FastAPI by Sebastián Ramíre
The earliest beta version of pytgcalls on Linux x86_64 and ARM64! Use in production at your own risk!
Public beta test. Use in production at your own risk! tgcalls - a python binding for tgcalls (c++ lib by Telegram); pytgcalls - library connecting pyt
A minimalist production ready plugin system
pluggy - A minimalist production ready plugin system This is the core framework used by the pytest, tox, and devpi projects. Please read the docs to l
Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.
Kashgari Overview | Performance | Installation | Documentation | Contributing We released the 2.0.0 version with TF2 Support. If you
Fast State-of-the-Art Tokenizers optimized for Research and Production
Provides an implementation of today's most used tokenizers, with a focus on performance and versatility. Main features: Train new vocabularies and tok
Python tools for the corpus analysis of popular music.
CATCHY Corpus Analysis Tools for Computational Hook discovery Python tools for the corpus analysis of popular music recordings. The tools can be used
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Audiomentations A Python library for audio data augmentation. Inspired by albumentations. Useful for deep learning. Runs on CPU. Supports mono audio a
Python library for handling audio datasets.
AUDIOMATE Audiomate is a library for easy access to audio datasets. It provides the datastructures for accessing/loading different datasets in a gener
Read music metadata and length of MP3, OGG, OPUS, MP4, M4A, FLAC, WMA and Wave files with Python 2 or 3
tinytag tinytag is a library for reading music metadata of MP3, OGG, OPUS, MP4, M4A, FLAC, WMA and Wave files with Python Install pip install tinytag
Pyrogram bot to automate streaming music in voice chats
Pyrogram bot to automate streaming music in voice chats Help If you face an error, want to discuss this project or get support for it, join its group
LedFx is a network based LED effect controller with support for advanced real-time audio effects
Welcome to LedFx ✨ - Making music come alive! LedFx website: https://ledfx.app/ What is LedFx? What LedFx offers is the ability to take audio input, an