11 Python Wav2vec2 Libraries

SHAS: Approaching optimal Segmentation for End-to-End Speech Translation

In this repo you can find the code of the Supervised Hybrid Audio Segmentatio

21 Dec 20, 2022

Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downstream tasks like translation and summarisation.

PART 2: CHAIN LINKING AUDIO-TO-TEXT NLP TASKS 2A: TRANSCRIBE-TRANSLATE-SENTIMENT-ANALYSIS In notebook3.0, I demo a simple workflow to: transcribe a lo

30 Jul 13, 2022

Research code for the paper "Fine-tuning wav2vec2 for speaker recognition"

This is the code used to run the experiments in https://arxiv.org/abs/2109.15053. Detailed logs of each t

103 Dec 26, 2022

Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition

Wav2Vec2 STT Python (beta software): simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 mode

22 Dec 29, 2022

Python Wav2vec2 Resources

Python wav2vec2 Libraries

SHAS: Approaching optimal Segmentation for End-to-End Speech Translation

Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downstream tasks like translation and summarisation.

Research code for the paper "Fine-tuning wav2vec2 for speaker recognition"

Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition

Transformers Wav2Vec2 + Parlance's CTCDecode

Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode

Explore different ways to mix speech models (wav2vec2, HuBERT) and NLP models (BART, T5, GPT) together

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding

GSoC'2021 | TensorFlow implementation of Wav2Vec2

A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models
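
Several of the libraries above deal with decoding the CTC output of a Wav2Vec2 model, either greedily or with beam search and a language model. As a rough illustration of the simplest case only, here is a minimal greedy CTC decoding sketch in plain Python. It is not taken from any of the listed repositories; the function name `ctc_greedy_decode`, the toy vocabulary, and the choice of `blank_id=0` are all assumptions made for the example.

```python
from itertools import groupby


def ctc_greedy_decode(frame_ids, id_to_char, blank_id=0):
    """Greedy CTC decoding: collapse repeated frame labels, then drop blanks.

    frame_ids  -- per-frame argmax label ids (hypothetical model output)
    id_to_char -- mapping from label id to character (toy vocabulary here)
    blank_id   -- id of the CTC blank label (assumed to be 0 in this sketch)
    """
    # Step 1: merge runs of identical consecutive labels into one label.
    collapsed = [label for label, _ in groupby(frame_ids)]
    # Step 2: remove blanks, which only separate genuine repeated characters.
    return "".join(id_to_char[i] for i in collapsed if i != blank_id)


# Toy vocabulary for illustration; real checkpoints ship their own.
vocab = {0: "<pad>", 1: "h", 2: "e", 3: "l", 4: "o"}
frames = [1, 1, 0, 2, 2, 3, 0, 3, 3, 4, 0]
print(ctc_greedy_decode(frames, vocab))
```

The blank between the two runs of `3` is what lets the double "l" survive the collapse step. Beam search decoders with a language model, such as those the entries above refer to, replace the per-frame argmax with a search over several candidate label sequences.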
