Classifying audio using Wavelet transform and deep learning

Aditya Dutt

Last update: Nov 29, 2022

Related tags

Deep Learning audio machine-learning deep-neural-networks deep-learning mnist batch-normalization classification wavelet speaker-recognition acoustics wavelet-transform dilated-convolution morlet-wavelet

Overview

Audio Classification using Wavelet Transform and Deep Learning

A step-by-step tutorial to classify audio signals using continuous wavelet transform (CWT) as features.

Steps to use this repository:
- Create a virtual environment by using the command: virtualenv venv
- Activate the environment: source venv/bin/activate
- Install the requirements.txt file by typing: pip install -r requirements.txt
- Extract the recordings.zip file
Files Description
- recordings.zip: The contains recordings from the Free Spoken Digit Dataset (FSDD). You can also find this data here.
- training_raw_audio.npz: We are only classifying 3 speakers here: george, jackson, and lucas. All the training data from these 3 speakers is in this numpy zip file.
- testing_raw_audio.npz: We are only classifying 3 speakers here: george, jackson, and lucas. All the testing data from these 3 speakers is in this numpy zip file.
- requirements.txt: It contains the required libraries.

classification_report

You might also like...

A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way :chestnut:

Squirrel Core Share, load, and transform data in a collaborative, flexible, and efficient way What is Squirrel? Squirrel is a Python library that enab

249 Dec 7, 2022

We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances to transformations applied to both the audio and video streams.

Multi-Modal Self-Supervision using GDT and StiCa This is an official pytorch implementation of papers: Multi-modal Self-Supervision from Generalized D

42 Dec 9, 2022

MINIROCKET: A Very Fast (Almost) Deterministic Transform for Time Series Classification

187 Dec 26, 2022

DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation This project hosts the code for implementing the DCT-MASK algorithms

57 Nov 27, 2022

Simple Python application to transform Serial data into OSC messages

SerialToOSC-Bridge Simple Python application to transform Serial data into OSC messages. The current purpose is to be a compatibility layer between ha

Division of Applied Acoustics at Chalmers University of Technology

3 Jun 3, 2021

Fast Scattering Transform with CuPy/PyTorch

Announcement 11/18 This package is no longer supported. We have now released kymatio: http://www.kymat.io/ , https://github.com/kymatio/kymatio which

289 Dec 7, 2022

Fast Neural Style for Image Style Transform by Pytorch

FastNeuralStyle by Pytorch Fast Neural Style for Image Style Transform by Pytorch This is famous Fast Neural Style of Paper Perceptual Losses for Real

81 Sep 3, 2022

Python tools for 3D face: 3DMM, Mesh processing(transform, camera, light, render), 3D face representations.

face3d: Python tools for processing 3D face Introduction This project implements some basic functions related to 3D faces. You can use this to process

2.3k Dec 30, 2022

OpenCV, MediaPipe Pose Estimation, Affine Transform for Icon Overlay

Yoga Pose Identification and Icon Matching Project Goal Detect yoga poses performed by a user and overlay a corresponding icon image. Running the main

1 Dec 3, 2021

Owner

Aditya Dutt

ML PhD Researcher

GitHub

Sdf sparse conv - Deep Learning on SDF for Classifying Brain Biomarkers

Deep Learning on SDF for Classifying Brain Biomarkers To reproduce the results f

1 Jan 25, 2022

Selective Wavelet Attention Learning for Single Image Deraining

SWAL Code for Paper "Selective Wavelet Attention Learning for Single Image Deraining" Prerequisites Python 3 PyTorch Models We provide the models trai

9 Jun 17, 2022

Rafael Project- Classifying rockets to different types using data science algorithms.

Rocket-Classify Rafael Project- Classifying rockets to different types using data science algorithms. In this project we received data base with data

5 Sep 18, 2021

A PyTorch implementation of "Graph Wavelet Neural Network" (ICLR 2019)

Graph Wavelet Neural Network ⠀⠀ A PyTorch implementation of Graph Wavelet Neural Network (ICLR 2019). Abstract We present graph wavelet neural network

490 Dec 16, 2022

PyTorch implementation of the wavelet analysis from Torrence & Compo

Continuous Wavelet Transforms in PyTorch This is a PyTorch implementation for the wavelet analysis outlined in Torrence and Compo (BAMS, 1998). The co

262 Dec 21, 2022

Matplotlib Image labeller for classifying images

mpl-image-labeller Use Matplotlib to label images for classification. Works anywhere Matplotlib does - from the notebook to a standalone gui! For more

5 Sep 24, 2022

An implementation of quantum convolutional neural network with MindQuantum. Huawei, classifying MNIST dataset

关于实现的一点说明山东大学 2020级苏博南 www.subonan.com 文件说明 tools.py 这里面主要有两个函数： resize(a, lenb) 这其实是我找同学写的一个小算法hhh。给出一个$28\times 28$的方阵a，返回一个$lenb\times lenb$的方阵。因

2 Aug 29, 2022

Style transfer, deep learning, feature transform

10.9k Jan 2, 2023

Deep Learning: Architectures & Methods Project: Deep Learning for Audio Super-Resolution

Deep Learning: Architectures & Methods Project: Deep Learning for Audio Super-Resolution Figure: Example visualization of the method and baseline as a

16 Dec 23, 2022

BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation

BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation This is a demo implementation of BYOL for Audio (BYOL-A), a self-sup

160 Jan 4, 2023

Classifying audio using Wavelet transform and deep learning

Related tags

Overview

Audio Classification using Wavelet Transform and Deep Learning

Steps to use this repository:

Files Description

You might also like...

A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way :chestnut:

We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances to transformations applied to both the audio and video streams.

MINIROCKET: A Very Fast (Almost) Deterministic Transform for Time Series Classification

DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

Simple Python application to transform Serial data into OSC messages

Fast Scattering Transform with CuPy/PyTorch

Fast Neural Style for Image Style Transform by Pytorch

Python tools for 3D face: 3DMM, Mesh processing(transform, camera, light, render), 3D face representations.

OpenCV, MediaPipe Pose Estimation, Affine Transform for Icon Overlay

Owner

Aditya Dutt

Sdf sparse conv - Deep Learning on SDF for Classifying Brain Biomarkers

Selective Wavelet Attention Learning for Single Image Deraining

Rafael Project- Classifying rockets to different types using data science algorithms.

A PyTorch implementation of "Graph Wavelet Neural Network" (ICLR 2019)

PyTorch implementation of the wavelet analysis from Torrence & Compo

Matplotlib Image labeller for classifying images

An implementation of quantum convolutional neural network with MindQuantum. Huawei, classifying MNIST dataset

Style transfer, deep learning, feature transform

Deep Learning: Architectures & Methods Project: Deep Learning for Audio Super-Resolution

BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation