Speech Algorithms Collections

Ryuk

Last update: Jan 6, 2023

Related tags

Audio speech-processing

Overview

Speech Algorithms

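Signal-processing speech algorithms

Title	Article	Code
A first look at speech denoising: spectral subtraction	Link	Code
Mask-based speech separation	Link	Code
Generating mixed speech samples with noise, echo, reverberation, and howling	Link	Code
Adaptive-filter echo cancellation explained	Link	Code
Generating VAD labels with the AMR codec	Link	Code
Sound source localization using TDOA	Link	Code
Resampling speech signals to an arbitrary rate	Link	Code
Embedding and extracting audio digital watermarks	Link	Code
Changing the speed and pitch of speech	Link	Code
Generating rain sounds	Link	Code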

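Machine/deep learning speech algorithms

Title	Article	Code
DNN-based single-channel speech enhancement	Link	Code
Endpoint detection using LSTM	Link	Code
Simple command recognition using a CNN	Link	Code
Speaker gender recognition	Link	Code
Environmental sound classification using XGBoost	Link	Code

Other

Title	Article	Code
Objective speech evaluation metrics: speech quality assessment	Link	Code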

You might also like...

Text to speech is the process of converting text into voice. This text-to-speech project takes words on digital devices and converts them into audio. It uses the Google Text-to-Speech library, commonly known as gTTS, to convert a text file to an .mp3 file. Hope you like my project!

Text to speech (using Python) Text to speech is a process to convert any text into voice. Text to speech project takes words on digital devices and co

19 Jun 30, 2022

The purpose of this code base is to add noise from the MUSAN dataset to a clean speech signal at a specified signal-to-noise ratio, and to generate far-field speech data using room impulse responses from the BUT Speech@FIT Reverb Database.

Add_noise_and_rir_to_speech The purpose of this code base is to add a specified signal-to-noise ratio noise from MUSAN dataset to a pure speech signal

7 Oct 30, 2022

Automatically creates genre collections for your Plex media

Plex Auto Genres Plex Auto Genres is a simple script that will add genre collection tags to your media making it much easier to search for genre speci

63 Dec 31, 2022

Command-line program to download image galleries and collections from several image hosting sites

gallery-dl gallery-dl is a command-line program to download image galleries and collections from several image hosting sites (see Supported Sites). It

6.4k Jan 6, 2023

Extensible memoizing collections and decorators

cachetools This module provides various memoizing collections and decorators, including variants of the Python Standard Library's @lru_cache function

1.5k Jan 5, 2023

Free and open-source digital preservation system designed to maintain standards-based, long-term access to collections of digital objects.

Archivematica By Artefactual Archivematica is a web- and standards-based, open-source application which allows your institution to preserve long-term

338 Dec 16, 2022

Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)

Open Semantic Search https://opensemanticsearch.org Integrated search server, ETL framework for document processing (crawling, text extraction, text a

684 Jan 6, 2023

Query multiple MongoDB database collections easily

leakscoop Perform queries across multiple MongoDB databases and collections, where the field names and the field content structure in each database ma

5 Jun 24, 2021

NeRD: Neural Reflectance Decomposition from Image Collections

NeRD: Neural Reflectance Decomposition from Image Collections Project Page | Video | Paper | Dataset Implementation for NeRD. A novel method which dec

Computergraphics (University of Tübingen)

195 Dec 29, 2022

EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections

Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections Ruiqi Zhong, Kristy Lee*, Zheng Zhang*, Dan Klein EMN

42 Nov 3, 2022

PyTorch implementation of "Representing Shape Collections with Alignment-Aware Linear Models" paper.

deep-linear-shapes PyTorch implementation of "Representing Shape Collections with Alignment-Aware Linear Models" paper. If you find this code useful i

27 Sep 24, 2022

A Blender python script for getting asset browser custom preview images for objects and collections.

asset_snapshot A Blender python script for getting asset browser custom preview images for objects and collections. Installation: Click the code butto

44 Nov 29, 2022

Collections of python projects

nppy, mostly contains projects written in Python. Some projects are very simple while some are a bit lengthy and difficult (for beginners) Requirements

75 Dec 20, 2022

Unique image & metadata generation using weighted layer collections.

nft-generator-py nft-generator-py is a Python-based NFT generator which programmatically generates unique images using weighted layer files. The progra

243 Dec 31, 2022

A utility for quickly cropping large collections of images.

Crop Tool A utility for quickly cropping large collections of images. Inspired by Derrick Schultz's dataset-tools. Setup It's suggested that you use A

6 Nov 14, 2021

Indobenchmark are collections of Natural Language Understanding (IndoNLU) and Natural Language Generation (IndoNLG)

Indobenchmark Toolkit Indobenchmark are collections of Natural Language Understanding (IndoNLU) and Natural Language Generation (IndoNLG) resources fo

11 Aug 26, 2022

SQL queries to collections

SQC SQL Queries to Collections Examples from sqc import sqc data = [ {"a": 1, "b": 1}, {"a": 2, "b": 1}, {"a": 3, "b": 2}, ] Simple filte

0 Jul 6, 2022

trackbranch is a tool for developers that can be used to store collections of branches in the form of profiles.

trackbranch trackbranch is a tool for developers that can be used to store collections of branches in the form of profiles. This can be useful for sit

1 Oct 21, 2021

Collections of pydantic models

pydantic-collections The pydantic-collections package provides BaseCollectionModel class that allows you to manipulate collections of pydantic models

20 Dec 26, 2022

Comments

Can this repo be used to retrieve a mixed series without knowing any prior information of signal2?

In the code I see that the mask is calculated as mask = np.around(snr, 0), where snr is the signal-to-noise ratio of the clean signal versus the combined signal. If the clean signal is unknown, can this method still be used to separate signal2 from the mixed signal? Thanks a lot.

opened by SuperCrystal 3
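
To make the question above concrete, here is a minimal sketch of the kind of mask it describes. It is not taken from the repository: the helper name `oracle_binary_mask`, the magnitude arrays, and the choice of a clean-to-mixed magnitude ratio as `snr` are all assumptions made for illustration. The only part quoted in the comment is the rounding step `np.around(snr, 0)`.

```python
import numpy as np

def oracle_binary_mask(clean_mag, mixed_mag, eps=1e-8):
    """Hypothetical helper: round a clean-to-mixed magnitude ratio to a 0/1 mask.

    Assumption: `snr` in the comment is treated here as a per-bin ratio of the
    clean magnitude to the mixed magnitude, clipped to [0, 1].
    """
    ratio = clean_mag / (mixed_mag + eps)   # eps avoids division by zero
    ratio = np.clip(ratio, 0.0, 1.0)
    return np.around(ratio, 0)              # same rounding call as in the comment

# Toy 2x2 "spectrogram" magnitudes (made-up numbers, not real audio).
clean_mag = np.array([[1.0, 0.1], [0.2, 2.0]])
mixed_mag = np.array([[1.1, 1.0], [1.0, 2.1]])

mask = oracle_binary_mask(clean_mag, mixed_mag)
separated_mag = mask * mixed_mag            # keep only bins dominated by the clean source
print(mask)
```

In this sketch the mask cannot be computed without `clean_mag`, which is the concern the comment raises. In mask-based separation generally, a mask of this kind serves as a reference or training target, and a mask for unseen mixtures has to be estimated by other means, such as a trained model.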

AEC: Kalman filter, post error matrix update

I have a doubt about the post error matrix update formula "Rmu = (IL - K @ X.T) * Rm". When I derive this formula, I get Rmu = (IL - K @ X.T) * Rm * (IL - K @ X.T)' + V (near noise). Can you help me?

opened by jeffery-work 0
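
The two forms in the comment above can be compared numerically. The sketch below is not the repository's code, and the mapping of the comment's symbols is an assumption: `P` stands for `Rm`, `x` for `X`, and a scalar variance `v` for the noise term. It uses the standard Joseph-form update, in which the noise enters as `v * K @ K.T`, and checks that this form agrees with the short form `(I - K @ x.T) @ P` when `K` is the optimal Kalman gain, and differs for any other gain.

```python
import numpy as np

def short_update(P, K, x):
    """Short covariance update, valid when K is the optimal Kalman gain."""
    I = np.eye(P.shape[0])
    return (I - K @ x.T) @ P

def joseph_update(P, K, x, v):
    """Joseph-form covariance update, valid for any gain K."""
    I = np.eye(P.shape[0])
    A = I - K @ x.T
    return A @ P @ A.T + v * (K @ K.T)

rng = np.random.default_rng(0)
L = 4                                   # filter length (illustrative value)
M = rng.standard_normal((L, L))
P = M @ M.T + np.eye(L)                 # symmetric positive-definite prior covariance
x = rng.standard_normal((L, 1))         # input (regressor) vector
v = 0.1                                 # assumed scalar observation-noise variance

s = (x.T @ P @ x).item() + v            # innovation variance
K_opt = P @ x / s                       # optimal Kalman gain, shape (L, 1)
K_sub = 0.5 * K_opt                     # deliberately suboptimal gain

same_for_optimal = np.allclose(short_update(P, K_opt, x),
                               joseph_update(P, K_opt, x, v))
same_for_suboptimal = np.allclose(short_update(P, K_sub, x),
                                  joseph_update(P, K_sub, x, v))
print(same_for_optimal, same_for_suboptimal)
```

Under these assumptions the two expressions coincide for the optimal gain because `(I - K @ x.T) @ P @ x` equals `v * K`, so the extra terms of the Joseph form cancel. The longer form is mainly useful when the gain is not optimal or when numerical symmetry of the covariance matters.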

Owner

Ryuk

Speech Algorithms

GitHub

This library provides common speech features for ASR including MFCCs and filterbank energies.

python_speech_features This library provides common speech features for ASR including MFCCs and filterbank energies. If you are not sure what MFCCs ar

2.2k Jan 4, 2023

SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/

SpeechPy Official Project Documentation Table of Contents Documentation Which Python versions are supported Citation How to Install? Local Installatio

870 Dec 27, 2022

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Project DeepSpeech DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Spee

20.8k Jan 3, 2023

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Summary Pyroomacoustics is a software package aimed at the rapid development and testing of audio array processing algorithms. The content of the pack

1k Jan 9, 2023

Speech Algorithms Collections

498 Jan 6, 2023

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Conferencing Speech Challenge

Simple, hackable offline speech to text - using the VOSK-API.

Voicefixer aims to restore human speech regardless of how severely it is degraded.

Some utilities for automatic speech recognition
