A deep learning model for style-specific music generation.

Overview

DeepJ: A model for style-specific music generation

https://arxiv.org/abs/1801.00887

Abstract

Recent advances in deep neural networks have enabled algorithms to compose music that is comparable to music composed by humans. However, few algorithms allow the user to generate music with tunable parameters. The ability to tune properties of generated music will yield more practical benefits for aiding artists, filmmakers, and composers in their creative tasks. In this paper, we introduce DeepJ - an end-to-end generative model that is capable of composing music conditioned on a specific mixture of composer styles. Our innovations include methods to learn musical style and music dynamics. We use our model to demonstrate a simple technique for controlling the style of generated music as a proof of concept. Evaluation of our model using human raters shows that we have improved over the Biaxial LSTM approach.
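
The paper conditions generation on a mixture of composer styles. As a hedged illustration only (not DeepJ's actual API), the sketch below shows how such a mixture can be expressed as a normalized weight vector; the four era names follow the parameters the deepj.ai demo passes (see the issues below), and the helper function is hypothetical.

    # Illustrative sketch, not DeepJ's code: a style mixture as a normalized weight vector.
    import numpy as np

    STYLES = ['Baroque', 'Classical', 'Romantic', 'Modern']  # assumed era names (from the web demo)

    def style_mixture(**weights):
        """Build a normalized style-conditioning vector from per-style weights."""
        vec = np.array([weights.get(name, 0.0) for name in STYLES], dtype=np.float32)
        total = vec.sum()
        return vec / total if total > 0 else vec

    print(style_mixture(Classical=0.7, Romantic=0.3))  # -> [0.  0.7 0.3 0. ]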

Requirements

  • Python 3.5

Clone Python MIDI (https://github.com/vishnubob/python-midi), cd into python-midi, then install it with python3 setup.py install.

Then, install other dependencies of this project.

pip install -r requirements.txt

The dataset is not provided in this repository. To train a custom model, you will need to include a MIDI dataset in the data/ folder.
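
Based on the configuration one user shares in the issues below, the dataset appears to be organized as data/<genre>/<artist>/*.mid and registered as nested lists of folder paths (typically in a constants/config module). The snippet below is a hypothetical illustration of that layout, not an official specification:

    # Hypothetical example of how style folders appear to be registered.
    genre = ['jazz', 'classical']

    styles = [
        ['data/jazz/CharlieParker', 'data/jazz/JJJohnson'],    # one folder per artist,
        ['data/classical/beethoven', 'data/classical/holst'],  # each containing .mid files
    ]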

Usage

To train a new model, run the following command:

python train.py

To generate music, run the following command:

python generate.py

Use the help command to see CLI arguments:

python generate.py --help
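
The exact flags depend on the version of generate.py you have. Based on the help output and error messages quoted in the issues below, --bars controls how many bars to generate and --styles appears to take one or more integer indices into the configured style list (not style names), so a typical invocation would look roughly like:

python generate.py --bars 8 --styles 0 1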

Comments
  • TypeError: __int__ returned non-int (type NoneType)

    After installing all requirements (python-midi and requirements.txt) I get this error message when executing generate.py:

    Using TensorFlow backend.
    Traceback (most recent call last):
      File "generate.py", line 153, in <module>
        main()
      File "generate.py", line 142, in main
        models = build_or_load()
      File "/Users/frederikriedel/Developer/DeepJ/util.py", line 15, in build_or_load
        models = build_models()
      File "/Users/frederikriedel/Developer/DeepJ/model.py", line 149, in build_models
        notes_out = naxis(time_out, chosen, style)
      File "/Users/frederikriedel/Developer/DeepJ/model.py", line 111, in f
        dense_layer_cache[l] = Dense(int(x.get_shape()[3]))
    TypeError: __int__ returned non-int (type NoneType)
    

    Do you maybe know what I'm missing here?

    opened by frogg 17
  • write midi file error

    Writing file out/samples/output_0.mid
    Traceback (most recent call last):
      File "generate.py", line 153, in <module>
        main()
      File "generate.py", line 150, in main
        write_file('output', generate(models, args.bars, styles))
      File "generate.py", line 134, in write_file
        midi.write_midifile(fpath, mf)
      File "/usr/local/python3/lib/python3.5/site-packages/midi/fileio.py", line 169, in write_midifile
        return write_midifile(out,pattern)
      File "/usr/local/python3/lib/python3.5/site-packages/midi/fileio.py", line 171, in write_midifile
        return writer.write(pattern)
      File "/usr/local/python3/lib/python3.5/site-packages/midi/fileio.py", line 105, in write
        self.write_track(track)
      File "/usr/local/python3/lib/python3.5/site-packages/midi/fileio.py", line 122, in write_track
        buf.extend(self.encode_midi_event(event))
      File "/usr/local/python3/lib/python3.5/site-packages/midi/fileio.py", line 161, in encode_midi_event
        ret.extend(event.data)
    ValueError: byte must be in range(0, 256)
    
    opened by dannywu19910524 4
  • Is it normal for the loss to increase a lot while training?

    Epoch 60/1000
    10954/10954 [==============================] - 659s - loss: 0.0475
    Epoch 61/1000
    10954/10954 [==============================] - 658s - loss: 0.0474
    Epoch 62/1000
    10954/10954 [==============================] - 658s - loss: 0.0473
    Epoch 63/1000
    10954/10954 [==============================] - 659s - loss: 0.0471
    Epoch 64/1000
    10954/10954 [==============================] - 658s - loss: 0.0472
    Epoch 65/1000
    10954/10954 [==============================] - 658s - loss: 0.0612
    Epoch 66/1000
    10954/10954 [==============================] - 659s - loss: 0.0958
    Epoch 67/1000
    10954/10954 [==============================] - 657s - loss: 0.0877
    

    You can see that around epoch 66 the loss increased a lot. Does that mean hours of training were wasted? Sorry, I'm a beginner at this... :(
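
    Not an official answer, but a common Keras safeguard sketch for this situation: checkpoint the best weights so a temporary loss spike does not cost the progress from earlier epochs. The file path below is just an example.

    # Common safeguard sketch (not DeepJ-specific): keep the best weights seen so far.
    from keras.callbacks import ModelCheckpoint

    checkpoint = ModelCheckpoint('out/model_best.h5', monitor='loss',
                                 save_best_only=True, save_weights_only=True)
    # model.fit(..., callbacks=[checkpoint])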

    opened by DefinitlyEvil 4
  • How to write the command line to generate music of different styles?

    The genres and styles I set are like this:

    genre = [ 'jazz', 'classical', 'hip_hop' ]

    styles = [
        ['data/jazz/CharlieParker', 'data/jazz/DavidLiebman', 'data/jazz/JJJohnson'],
        ['data/classical/beethoven', 'data/classical/holst', 'data/classical/stravinsky', 'data/classical/sullivan'],
        ['data/hip_hop/50_cent', 'data/hip_hop/ja_rule', 'data/hip_hop/pitbull', 'data/hip_hop/will_smith'],
    ]

    How do I write the command line to generate music of different styles?

    I used "python generate.py -h" to get the optional arguments; it says: --styles STYLES [STYLES ...] Styles to mix together

    But I can't figure out the specific usage of the style control:

    I have tried "python generate.py --styles classical" and "python generate.py --styles classical[beethoven holst]", both showing "error: argument --styles: invalid int value: 'beethoven'".

    opened by JucyCherry 3
  • What should the training data folder look like?

    Should the data/ folder contain only MIDI files? I encountered an error when I trained the model with a data folder containing only MIDI files, so I wonder if I have misunderstood how the training data should be organized. I'd appreciate it if you could help me with this. (ฅ´ω`ฅ)

    opened by JucyCherry 3
  • Unable to execute with any TF/Keras combination (wt/wo GPU)

    Hello,

    I've tried running this model with TF 1.6 / 1.4 / 1.2 (with and without GPU) and Keras 2.0, and I get the stack trace below [Ubuntu 16.04, Python 3.6, data folder with examples] when I execute the train.py script:

    Unable to load model from file.
    Loading data
    Training
    Traceback (most recent call last):
      File "train.py", line 32, in <module>
        main()
      File "train.py", line 16, in main
        train(models)
      File "train.py", line 29, in train
        models[0].fit(train_data, train_labels, epochs=1000, callbacks=cbs, batch_size=BATCH_SIZE)
      File "/usr/local/lib/python3.6/dist-packages/keras/engine/training.py", line 1405, in fit
        batch_size=batch_size)
      File "/usr/local/lib/python3.6/dist-packages/keras/engine/training.py", line 1295, in _standardize_user_data
        exception_prefix='model input')
      File "/usr/local/lib/python3.6/dist-packages/keras/engine/training.py", line 121, in _standardize_input_data
        str(array.shape))
    ValueError: Error when checking model input: expected input_1 to have 4 dimensions, but got array with shape (0, 1)

    I tried downgrading Keras to 1.x, but the code uses Keras 2.0 syntax. Can you please look into it?

    opened by slave2sync 3
  • build_or_load(): TypeError: __int__ returned non-int (type NoneType)

    build_or_load()

    Traceback (most recent call last):
      File "util.py", line 36, in <module>
        build_or_load()
      File "util.py", line 15, in build_or_load
        models = build_models()
      File "/Users/chenqi/workspace/DeepJ/model.py", line 148, in build_models
        notes_out = naxis(time_out, chosen, style)
      File "/Users/chenqi/workspace/DeepJ/model.py", line 110, in f
        dense_layer_cache[l] = Dense(int(x.get_shape()[3]))
    TypeError: __int__ returned non-int (type NoneType)


    shift_chosen = Lambda(lambda x: tf.pad(x[:, :, :-1, :], [[0, 0], [0, 0], [1, 0], [0, 0]]))(chosen)
    # shift_chosen: shape=(?, 128, 48, 3) 
    shift_chosen = Reshape((time_steps, NUM_NOTES, -1))(shift_chosen)
    # shift_chosen: shape=(?, 128, 48, ?)
    x = Concatenate(axis=3)([x, shift_chosen])
    # x: shape=(?, 128, 48, ?)
    

    then int(x.get_shape()[3]) throws TypeError: __int__ returned non-int (type NoneType)
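
    For reference, this failure mode is easy to reproduce in isolation: once a tensor's static shape contains None (as it does for shift_chosen's last dimension after the Reshape above), int() on that dimension raises exactly this error, so the Dense layer's unit count has to come from a statically known dimension or a constant. A minimal TF 1.x sketch, not the repository's code:

    import tensorflow as tf

    known = tf.placeholder(tf.float32, shape=(None, 128, 48, 3))
    print(int(known.get_shape()[3]))   # 3 -- the last dimension is statically known

    unknown = tf.placeholder(tf.float32, shape=(None, 128, 48, None))
    # int(unknown.get_shape()[3])      # TypeError: __int__ returned non-int (type NoneType)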

    opened by chenqi1990 3
  • python-midi in python3?

    On an Ubuntu 16.04 64-bit box with an NVIDIA Tesla P4, I can install and run python-midi 0.2.3 under Python 2, but I cannot do the same with Python 3, because of "No module named container".

    opened by wuyongyi 2
  • Infinite loading

    The website https://deepj.ai/ shows the loading indicator for far too long.

    Console shows errors:

    Failed to load resource: the server responded with a status of 522 ()
    https://server.deepj.ai/stream.mp3?length=1000&seed=0&Baroque=0.021820007785793072&Classical=0.8729091205455646&Romantic=0.5977987382883878&Modern=0.49897429867853504
    
    Failed to load https://server.deepj.ai/stream.mp3?length=1000&seed=0&Baroque=0.021820007785793072&Classical=0.8729091205455646&Romantic=0.5977987382883878&Modern=0.49897429867853504: No 'Access-Control-Allow-Origin' header is present on the requested resource. Origin 'https://deepj.ai' is therefore not allowed access. The response had HTTP status code 522.
    
    opened by DzyubSpirit 1
  • Site is forcing HTTPS while the API call goes to HTTP, causing an error and no sound

    on the console: GET http://18.222.90.74/stream.mp3?length=1000&seed=0&Baroque=0.0982352937586819&Classical=0.7525334207246455&Romantic=0.8340969462751091&Modern=0.8574476342870909 net::ERR_CONNECTION_REFUSED

    opened by gilamran 1
  • Help needed : Getting error TypeError: __int__ returned non-int (type NoneType) when calling train.py

    I am on Mac OS X 10.3.1.

    Using TensorFlow backend.
    Traceback (most recent call last):
      File "train.py", line 32, in <module>
        main()
      File "train.py", line 15, in main
        models = build_or_load()
      File "/Users/swarnaananthan/storycircles/ai_music/DeepJ/util.py", line 15, in build_or_load
        models = build_models()
      File "/Users/swarnaananthan/storycircles/ai_music/DeepJ/model.py", line 149, in build_models
        notes_out = naxis(time_out, chosen, style)
      File "/Users/swarnaananthan/storycircles/ai_music/DeepJ/model.py", line 111, in f
        dense_layer_cache[l] = Dense(int(x.get_shape()[3]))
    TypeError: __int__ returned non-int (type NoneType)

    Please help me in fixing the issue.

    opened by swarna-a-26 1
  • Unable to load model from file.

    Hi, when I use the python generate.py command, it always says 'Unable to load model from file.', even though the provided model is there and the file path is right, so why can't it load? Also, although it can't load the model, it still generates one or two MIDI files and then starts to throw an error. The error message is like this:

    Writing file out\samples\output_0.mid
    Traceback (most recent call last):
      File "generate.py", line 153, in <module>
        main()
      File "generate.py", line 150, in main
        write_file('output', generate(models, args.bars, styles))
      File "generate.py", line 134, in write_file
        midi.write_midifile(fpath, mf)
      File "D:\anaconda\anaconda\lib\site-packages\midi-0.2.3-py3.7.egg\midi\fileio.py", line 162, in write_midifile
      File "D:\anaconda\anaconda\lib\site-packages\midi-0.2.3-py3.7.egg\midi\fileio.py", line 110, in write
      File "D:\anaconda\anaconda\lib\site-packages\midi-0.2.3-py3.7.egg\midi\fileio.py", line 124, in write_track
      File "D:\anaconda\anaconda\lib\site-packages\midi-0.2.3-py3.7.egg\midi\fileio.py", line 152, in encode_midi_event
    ValueError: byte must be in range(0, 256)

    Other people have the same error, but it seems no one knows how to solve it. So my questions are: 1. How can I load the model correctly? 2. If the model wasn't loaded, why can it still generate some MIDI results, and how are they generated? 3. How can I fix this ValueError? Thank you!
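
    On the ValueError specifically, one hedged workaround sketch (not the repository's code): if the out-of-range bytes are note velocities produced during generation, clamping them into the 0-127 range the MIDI spec allows, before calling midi.write_midifile, avoids the crash. python-midi's NoteEvent exposes a velocity property, so:

    import midi

    def clamp_velocities(pattern):
        # Clamp note velocities into the valid MIDI range before writing.
        for track in pattern:
            for event in track:
                if isinstance(event, midi.NoteEvent):
                    event.velocity = max(0, min(127, event.velocity))
        return pattern

    # e.g. midi.write_midifile(fpath, clamp_velocities(mf))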

    opened by KDcx35 2
  • Summary of the versions to be used

    Can anybody please summarize the versions of all the different libraries we have to use for this? I have been trying to run it for the past few hours, but an error always comes up. Currently I am stuck at "Failed to load the native TensorFlow runtime".

    opened by shreyasv11 0
  • Could not create cudnn handle: CUDNN_STATUS_NOT_INITIALIZED

    After installing the requirements, I encountered this error (shown in the attached screenshot) while running generate.py:

    TensorFlow == 1.15.0

    TensorFlow-gpu == 1.13.1

    Keras == 2.0.0

    ubuntu = 18.04

    NVIDIA-SMI 410.48
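
    A possible direction rather than a confirmed fix: having both tensorflow 1.15.0 and tensorflow-gpu 1.13.1 installed at the same time is itself a common source of trouble, and CUDNN_STATUS_NOT_INITIALIZED frequently comes from TensorFlow reserving all GPU memory before cuDNN can initialize. A small TF 1.x sketch that enables on-demand GPU memory growth for the Keras session:

    import tensorflow as tf
    from keras import backend as K

    # Let TensorFlow allocate GPU memory on demand instead of all at once.
    config = tf.ConfigProto()
    config.gpu_options.allow_growth = True
    K.set_session(tf.Session(config=config))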

    opened by MahdiKoochali 0
  • midi_util.py can't generate the replay matrix correctly; the replay matrix is always all zeros.

    Hi, I'm trying to use midi_util.py to extract the play, replay, and volume matrices from the piano-midi.de dataset, but I find that the replay matrix is always an all-zeros array. How can I deal with this?

    opened by CSMT201986 0
  • Missing music module in distribution.py

    Hi, I was trying to install the music package using conda but couldn't find it. Is the music module written by you, or is it a built-in Python module? I am using the TensorFlow version of DeepJ.

    opened by syedrafee 1
Owner
Henry Mao
CTO @ Altum Inc. Entrepreneur, AI Researcher