Music Classification: Beyond Supervised Learning, Towards Real-world Applications

Last update: Dec 15, 2022

Related tags

Deep Learning tutorial

Overview

Music Classification: Beyond Supervised Learning, Towards Real-world Applications

About the book

This is a web book written for a tutorial session of the 22nd International Society for Music Information Retrieval Conference, Nov 8-12, 2021, in an online format. The ISMIR conference is the world’s leading research forum on processing, searching, organising and accessing music-related data.

Motivation

Lower the barrier: As deep learning emerges, music classification research has entered a new phase, and many data-driven approaches have been proposed to solve the problem. However, researchers sometimes use jargon in various ways. Also, some implementation details and evaluation methods are ambiguously described in the papers, blocking access to the information without personal contact. These are tremendous obstacles when new researchers want to dive into this fascinating research area. Through this book, we would like to lower the barrier for newcomers and reduce miscommunication between researchers by sharing the secrets.

Cope with data issue: Another issue that we are facing under the deep learning era is the exhaustion of labeled data. Labeling musical attributes requires strong domain knowledge and a significant amount of time for listening; hence expensive. Because of this, deep learning researchers started actively utilizing large-scale unlabeled data. This book introduces the recent advances in semi- and self-supervised learning that enables music classification models to step further beyond supervised learning.

Narrow the gap: Music classification has been applied to solve real-world problems successfully. However, some important procedures and considerations for real-world applications are rarely discussed as research topics. In this book, based on the various industry experiences of the authors, we try our best to raise the awareness of these questions and provide answers and perspectives. We hope this helps academia and industries harmonize better together.

About the authors

Minz Won is a Ph.D candidate at the Music Technology Group (MTG) of Universitat Pompeu Fabra in Barcelona, Spain. His research focus is music representation learning. Along with his academic career, he has put his knowledge into practice with industry internships at Kakao Corp., Naver Corp., Pandora, Adobe, and he recently joined ByteDance as a research scientist. He contributed to the winning entry in the WWW 2018 Challenge: Learning to Recognize Musical Genre.

Janne Spijkervet graduated from the University of Amsterdam in 2021 with her Master's thesis titled "Contrastive Learning of Musical Representations". The paper with the same title was published in 2020 on self-supervised learning on raw audio in music tagging. She has started at ByteDance as a research scientist (2020 - present), developing generative models for music creation. She is also a songwriter and music producer, and explores the design and use of machine learning technology in her music.

Keunwoo Choi is a senior research scientist at ByteDance, developing machine learning products for music recommendation and discovery. He received a Ph.D degree from Queen Mary University of London (c4dm) in 2018. As a researcher, he also has been working at Spotify (2018 - 2020) and several other music companies as well as open-source projects such as Kapre, librosa, and torchaudio. He also writes some music.

Citing this book

@book{musicclassification:book,
	Author = {Minz Won, Janne Spijkervet, and Keunwoo Choi},
	Month = Nov.,
	Publisher = {https://music-classification.github.io/tutorial},
	Title = {Music Classification: Beyond Supervised Learning, Towards Real-world Applications},
	Year = 2021,
	Url = {https://music-classification.github.io/tutorial}
}

Accommodating supervised learning algorithms for the historical prices of the world's favorite cryptocurrency and boosting it through LightGBM.

1 Nov 27, 2021

Official Tensorflow implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection"

M-LSD: Towards Light-weight and Real-time Line Segment Detection Official Tensorflow implementation of "M-LSD: Towards Light-weight and Real-time Line

357 Jan 4, 2023

Pytorch implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection"

M-LSD: Towards Light-weight and Real-time Line Segment Detection Pytorch implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Det

123 Jan 4, 2023

This repository contains the source code for the paper "DONeRF: Towards Real-Time Rendering of Compact Neural Radiance Fields using Depth Oracle Networks",

DONeRF: Towards Real-Time Rendering of Compact Neural Radiance Fields using Depth Oracle Networks Project Page | Video | Presentation | Paper | Data L

281 Dec 22, 2022

Implementation for the paper 'YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs'

YOLO-ReT This is the original implementation of the paper: YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs. Prakhar Ganesh, Ya

69 Oct 19, 2022

Official implementation of "Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets" (CVPR2021)

Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets This is the official implementation of "Towards Good Pract

52 Nov 22, 2022

TransPrompt - Towards an Automatic Transferable Prompting Framework for Few-shot Text Classification

Comments

Import Error about library"torchaudio_augmentations" in colab

Hi, first I would like to say thank you very much for the tutorial, it's really great. ：）

I'm trying to implement the code from the tutorial into colab, but it seems that in “Audio Data Augmentations” subsection, the import encounters problems with cuda version compatibility, I'm not quite sure what's going on, could you please take a look at it?

Here is my colab note: https://colab.research.google.com/drive/1Df2-yf9-tSnYUo8juIhfLTvhSto5Bk7F?authuser=2#scrollTo=GezaZlg7kqgx&line=5&uniqifier=1

opened by ghost 1

Music Classification: Beyond Supervised Learning, Towards Real-world Applications

Related tags

Overview

Music Classification: Beyond Supervised Learning, Towards Real-world Applications

About the book

Motivation

About the authors

Citing this book

You might also like...

Accommodating supervised learning algorithms for the historical prices of the world's favorite cryptocurrency and boosting it through LightGBM.

Official Tensorflow implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection"

Pytorch implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection"

This repository contains the source code for the paper "DONeRF: Towards Real-Time Rendering of Compact Neural Radiance Fields using Depth Oracle Networks",

Implementation for the paper 'YOLO-ReT: Towards High Accuracy Real-time Object Detection on Edge GPUs'

Official implementation of "Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets" (CVPR2021)

TransPrompt - Towards an Automatic Transferable Prompting Framework for Few-shot Text Classification

Real-world Anomaly Detection in Surveillance Videos- pytorch Re-implementation

The first dataset on shadow generation for the foreground object in real-world scenes.

Comments

Import Error about library"torchaudio_augmentations" in colab

Owner

Real-ESRGAN: Training Real-World Blind Super-Resolution with Pure Synthetic Data

A real world application of a Recurrent Neural Network on a binary classification of time series data

E2e music remastering system - End-to-end Music Remastering System Using Self-supervised and Adversarial Training

Towards Open-World Feature Extrapolation: An Inductive Graph Learning Approach

Learning Generative Models of Textured 3D Meshes from Real-World Images, ICCV 2021

Official codebase for Legged Robots that Keep on Learning: Fine-Tuning Locomotion Policies in the Real World

A Real-World Benchmark for Reinforcement Learning based Recommender System

[CVPR2021] The source code for our paper 《Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning》.

Implementation for paper "Towards the Generalization of Contrastive Self-Supervised Learning"

PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos "