Speech Enhancement Generative Adversarial Network Based on Asymmetric AutoEncoder

Nitin

Last update: Nov 17, 2022

Related tags

Overview

ASEGAN: Speech Enhancement Generative Adversarial Network Based on Asymmetric AutoEncoder

中文版简介
 Readme with English Version

介绍

基于SEGAN模型的改进版本，使用自主设计的非对称自编码结构替换原有的全卷积结构，使得模型在保持原有性能的条件下更加轻量化。
（本模型未在任何期刊发布）

软件架构

安装教程

安装必要库
Anaconda
cuda
cudnn
创建python环境
conda create -n ASEGAN python=3.8
conda activate ASEGAN
安装必要环境
pip install -r requirements.txt
或者conda install --yes --file requirements.txt

使用说明

使用前检查config/config.yaml中参数配置是否正确，数据集文件结构参考data文件夹中结构

数据预处理
python data_preprocess.py
模型训练
python train.py
模型测试
python test.py

预训练模型下载

百度网盘，提取码6793
GoogleDrive
预训练模型命名方式为‘数据集-训练周期-数据量.pkl’

You might also like...

A Flow-based Generative Network for Speech Synthesis

WaveGlow: a Flow-based Generative Network for Speech Synthesis Ryan Prenger, Rafael Valle, and Bryan Catanzaro In our recent paper, we propose WaveGlo

2k Dec 26, 2022

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

WaveGlow A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis Quick Start: Install requirements: pip install

204 Jul 14, 2022

PyTorch implementation of "A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

FullSubNet This Git repository for the official PyTorch implementation of "A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech E

357 Jan 4, 2023

Python codes for Lite Audio-Visual Speech Enhancement.

Lite Audio-Visual Speech Enhancement (Interspeech 2020) Introduction This is the PyTorch implementation of Lite Audio-Visual Speech Enhancement (LAVSE

85 Dec 1, 2022

The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"

SF-Net for fullband SE This is the repo of the manuscript "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Ban

36 Dec 2, 2022

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

HiFi++ : a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement This is the unofficial implementation of Vocoder part of

118 Dec 29, 2022

EDCNN: Edge enhancement-based Densely Connected Network with Compound Loss for Low-Dose CT Denoising

EDCNN: Edge enhancement-based Densely Connected Network with Compound Loss for Low-Dose CT Denoising By Tengfei Liang, Yi Jin, Yidong Li, Tao Wang. Th

115 Jan 5, 2023

House-GAN++: Generative Adversarial Layout Refinement Network towards Intelligent Computational Agent for Professional Architects

House-GAN++ Code and instructions for our paper: House-GAN++: Generative Adversarial Layout Refinement Network towards Intelligent Computational Agent

122 Dec 28, 2022

pytorch implementation for Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network arXiv:1609.04802

PyTorch SRResNet Implementation of Paper: "Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network"(https://arxiv.org/abs

436 Jan 9, 2023

Owner

Nitin

GitHub

CoSMA: Convolutional Semi-Regular Mesh Autoencoder. From Paper "Mesh Convolutional Autoencoder for Semi-Regular Meshes of Different Sizes"

Mesh Convolutional Autoencoder for Semi-Regular Meshes of Different Sizes Implementation of CoSMA: Convolutional Semi-Regular Mesh Autoencoder arXiv p

10 Oct 11, 2022

Speech Enhancement Generative Adversarial Network Based on Asymmetric AutoEncoder

Related tags

Overview

ASEGAN: Speech Enhancement Generative Adversarial Network Based on Asymmetric AutoEncoder

介绍

软件架构

安装教程

使用说明

预训练模型下载

You might also like...

A Flow-based Generative Network for Speech Synthesis

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

PyTorch implementation of "A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Python codes for Lite Audio-Visual Speech Enhancement.

The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

EDCNN: Edge enhancement-based Densely Connected Network with Compound Loss for Low-Dose CT Denoising

House-GAN++: Generative Adversarial Layout Refinement Network towards Intelligent Computational Agent for Professional Architects

pytorch implementation for Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network arXiv:1609.04802

Owner

Nitin

CoSMA: Convolutional Semi-Regular Mesh Autoencoder. From Paper "Mesh Convolutional Autoencoder for Semi-Regular Meshes of Different Sizes"

Adversarial Color Enhancement: Generating Unrestricted Adversarial Images by Optimizing a Color Filter

TCNN Temporal convolutional neural network for real-time speech enhancement in the time domain

Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" by pytorch

A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) based on Deep Filtering.

(SIGIR2020) “Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback’’

Asymmetric Bilateral Motion Estimation for Video Frame Interpolation, ICCV2021

Code of Adverse Weather Image Translation with Asymmetric and Uncertainty aware GAN

PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613)

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis