This is the source code for the experiments related to the paper Unsupervised Audio Source Separation Using Differentiable Parametric Source Models

Last update: Oct 19, 2022

Overview

Unsupervised Audio Source Separation Using Differentiable Parametric Source Models

This is the source code for the experiments related to the paper Unsupervised Audio Source Separation Using Differentiable Parametric Source Models.

It contains a re-implementation of parts of the DDSP library in PyTorch. We added a differentiable all-pole filter which can be parameterized by line spectral frequencies or reflection coefficients.
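To make the reflection-coefficient parameterization more concrete, here is a minimal pure-Python sketch of the standard step-up (Levinson) recursion. It turns reflection coefficients into the denominator coefficients of an all-pole filter and then runs that filter sample by sample. This is not the repository's PyTorch implementation: the function names `reflection_to_allpole` and `allpole_filter` and the sign convention A(z) = 1 + a_1 z^-1 + ... + a_p z^-p are assumptions made for illustration only.

```python
def reflection_to_allpole(reflection):
    """Convert reflection coefficients k_1..k_p into [1, a_1, ..., a_p].

    Uses the step-up recursion. Each |k| < 1 guarantees a stable filter
    1 / A(z), which is why this parameterization is convenient when the
    coefficients are predicted by a network.
    """
    a = [1.0]
    for k in reflection:
        if not -1.0 < k < 1.0:
            raise ValueError("reflection coefficients must lie in (-1, 1)")
        prev = a + [0.0]  # pad to the new order
        last = len(prev) - 1
        # a_new[i] = a_prev[i] + k * a_prev[order - i]
        a = [prev[i] + k * prev[last - i] for i in range(len(prev))]
    return a


def allpole_filter(x, a):
    """Apply y[n] = x[n] - sum_{i>=1} a[i] * y[n - i] (hypothetical helper)."""
    y = []
    for n, sample in enumerate(x):
        acc = sample
        for i in range(1, len(a)):
            if n - i >= 0:
                acc -= a[i] * y[n - i]
        y.append(acc)
    return y


if __name__ == "__main__":
    coeffs = reflection_to_allpole([0.5, -0.3])
    impulse = [1.0] + [0.0] * 7
    print(coeffs)
    print(allpole_filter(impulse, coeffs))
```

In a differentiable setting the same recursion would be written with tensor operations, and the raw network outputs would typically be squashed (for example with `tanh`) so that every reflection coefficient stays inside (-1, 1).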

Please cite the paper if you use parts of this code in your work.

Links

🔊 Audio examples

📄 Paper

Requirements

The following packages are required:

pytorch==1.6.0
matplotlib==3.3.1
python-sounddevice==0.4.0
scipy==1.5.2
torchaudio==0.6.0
tqdm==4.49.0
pysoundfile==0.10.3
librosa==0.8.0
scikit-learn==0.23.2
resampy==0.2.2
pandas==1.2.3
tensorboard==2.3.0
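
Since the versions are pinned exactly, it can help to check an existing environment against the list before training. The following is a minimal sketch using only the standard library. It assumes that some names in the list are conda package names, so the `CONDA_TO_PIP` mapping to installed distribution names is a guess for illustration and not part of the repository. The helper names `parse_pins` and `check_pins` are likewise hypothetical.

```python
from importlib import metadata

# Pins copied from the list above (duplicates removed).
PINS = """
pytorch==1.6.0
matplotlib==3.3.1
python-sounddevice==0.4.0
scipy==1.5.2
torchaudio==0.6.0
tqdm==4.49.0
pysoundfile==0.10.3
librosa==0.8.0
scikit-learn==0.23.2
tensorboard==2.3.0
resampy==0.2.2
pandas==1.2.3
"""

# Assumed mapping from conda package names to installed distribution names.
CONDA_TO_PIP = {
    "pytorch": "torch",
    "python-sounddevice": "sounddevice",
    "pysoundfile": "soundfile",
}


def parse_pins(text):
    """Parse 'name==version' lines into a {name: version} dict."""
    pins = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        name, _, version = line.partition("==")
        pins[name.strip()] = version.strip()
    return pins


def check_pins(pins, get_version=metadata.version):
    """Return {name: (wanted, found)} for every missing or mismatched package."""
    problems = {}
    for name, wanted in pins.items():
        dist = CONDA_TO_PIP.get(name, name)
        try:
            found = get_version(dist)
        except metadata.PackageNotFoundError:
            found = None  # package is not installed
        if found != wanted:
            problems[name] = (wanted, found)
    return problems


if __name__ == "__main__":
    for name, (wanted, found) in check_pins(parse_pins(PINS)).items():
        print(f"{name}: wanted {wanted}, found {found}")
```

An empty result means every pinned package was found at the listed version. Local build suffixes in an installed version string (such as a CUDA tag) would be reported as a mismatch by this simple comparison.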

Training

python train.py -c config.txt

python train_u_nets.py -c unet_config.txt

Evaluation

python eval.py --tag 'TAG' --f0-from-mix --test-set 'CSD'
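
To evaluate several trained models in a row, the command above can be wrapped in a small script. The sketch below only uses the flags shown in that command (`--tag`, `--f0-from-mix`, `--test-set`). The function names, the default dry-run behaviour, and the assumption that `--f0-from-mix` is an optional switch that may be omitted are illustrative and not taken from the repository.

```python
import shlex
import subprocess
import sys


def build_eval_command(tag, test_set="CSD", f0_from_mix=True):
    """Build the argument list for one eval.py run (hypothetical helper)."""
    cmd = [sys.executable, "eval.py", "--tag", tag]
    if f0_from_mix:
        # Assumed to be a boolean switch, as in the command above.
        cmd.append("--f0-from-mix")
    cmd += ["--test-set", test_set]
    return cmd


def run_evaluations(tags, dry_run=True, **kwargs):
    """Print, and optionally execute, one evaluation command per tag."""
    commands = [build_eval_command(tag, **kwargs) for tag in tags]
    for cmd in commands:
        print(shlex.join(cmd))
        if not dry_run:
            # check=True stops at the first failing evaluation.
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    # 'TAG_A' and 'TAG_B' are placeholders for the tags of trained models.
    run_evaluations(["TAG_A", "TAG_B"], dry_run=True)
```

With `dry_run=True` the script only prints the commands, which makes it easy to confirm the tags before starting a long evaluation. Run it from the repository root so that `eval.py` is found.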

Acknowledgment

This project has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No. 765068.

Copyright

Comments

Evaluations Out-Of-The-Box

Starting from the base source code on a clean installation, I made it possible to start running evaluations quickly.

Changelist:
- extended the main README (clarified the installation steps, added more links)
- added directions on where to put the audio files
- added the f0 mixtures calculated with the multi-pitch estimator
- added the energy file
- added the US-F and the US-S models
- commented out unnecessary lines that asked for CREPE files in data.py
- removed an outdated ddsp.synthetic_data import
- added tqdm for evaluations (this can be enabled with the --show-progress argument)
- added a Python file to run multiple evaluations on multiple models

opened by liam-kelley

Request to release the pretrained model

Hello, how are you?

I would like to know whether it is possible to release the pre-trained model, along with instructions on how to run inference on my own music.

Another observation: it is not possible to view the audio examples under "Audio examples".

Thank you very much for the hard work in creating this network!

opened by lucasbr15
