A collection of python scripts for extracting and analyzing acoustics from audio files.

Tim

Last update: Dec 26, 2022

Related tags

Audio pyAcoustics

Overview

pyAcoustics

https://img.shields.io/badge/license-MIT-blue.svg?

A collection of python scripts for extracting and analyzing acoustics from audio files.

Contents

1 Common Use Cases
2 Major revisions
3 Features as they are added
4 Requirements
5 Installation
6 Example usage
7 Citing LMEDS
8 Acknowledgements

2 Major revisions

Ver 1.0 (June 7, 2015)

first public release.

3 Features as they are added

Mask speech with speech shaped noise (March 21, 2016)

Find syllable nuclei/estimate speech rate using Uwe Reichel's matlab code (July 29, 2015)

Find the valley bottom between peaks (July 7th, 2015)

4 Requirements

Many of the individual features require different packages. If you aren't using those packages then you don't need to install the dependencies.

pyacoustics.intensity_and_pitch.praat_pi requires praat

pyacoustics.intensity_and_pitch.get_f0 requires the ESPS getF0 function as implemented by Snack although I recall having difficulty installing it.

pyacoustics.speech_rate/dictionary_estimate.py requires my library psyle

pyacoustics.signals.data_fitting.py requires SciPy, NumPy, and scikit-learn

My praatIO library is used extensively and can be downloaded here

5 Installation

If you on Windows, you can use the installer found here (check that it is up to date though) Windows installer

PyAcoustics is on pypi and can be installed or upgraded from the command-line shell with pip like so:

python -m pip install pyacoustics --upgrade

Otherwise, to manually install, after downloading the source from github, from a command-line shell, navigate to the directory containing setup.py and type:

python setup.py install

If python is not in your path, you'll need to enter the full path e.g.:

C:\Python36\python.exe setup.py install

6 Example usage

See the example folders for a few real-world examples using this library.

examples/split_audio_on_silence.py

Detects the presence of speech in a recording based on acoustic intensity. Everything louder than some threshold specified by the user is considered speech.
examples/split_audio_on_tone.py

Detects the presence of pure tones in a recording. One can use this to automatically segment stimuli. Beeps can be played while the speech is being recorded and then later this tool can automatically segment the speech, based on the presence of those tones.

Also detects speech using a pitch analysis. Most syllables contain some voicing, so a stream of modulating pitch values suggests that someone is speaking. This aspect is not extensively tested but it works well for the example files.
examples/estimate_speech_rate.py

Calculates the speech rate through a matlab script written by Uwe Reichel that estimates the location of syllable boundaries.

7 Citing LMEDS

PyAcoustics is general purpose coding and doesn't need to be cited but if you would like to, it can be cited like so:

Tim Mahrt. PyAcoustics. https://github.com/timmahrt/pyAcoustics, 2016.

PyAcoustics is an ongoing collection of code with contributions from a number of projects worked on over several years. Development of various aspects of PyAcoustics was possible thanks to NSF grant IIS 07-03624 to Jennifer Cole and Mark Hasegawa-Johnson, NSF grant BCS 12-51343 to Jennifer Cole, José Hualde, and Caroline Smith, and NSF grant IBSS SMA 14-16791 to Jennifer Cole, Nancy McElwain, and Daniel Berry.

Comments

Getting error while running "split_audio_on_tone.py"

Wenn I try to run the script "examples/split_audio_on_tone.py" with the wav-data attached in the folder "files" I get this error:

Traceback (most recent call last):
  File "/Users/tamaki/Desktop/pyAcoustics/examples/split_audio_on_tone.py", line 72, in <module>
    audiosplitOnTone(_dataPath, _fn, _pitchPath, _tgPath, _wavOutputPath,
  File "/Users/tamaki/Desktop/pyAcoustics/examples/split_audio_on_tone.py", line 52, in audiosplitOnTone
    split_on_tone.extractSubwavs(timeDict, inputPath, fn, subwavPath)
  File "/Users/tamaki/miniconda3/envs/Praktikum/lib/python3.9/site-packages/pyacoustics/speech_detection/split_on_tone.py", line 83, in extractSubwavs
    audio_scripts.extractSubwav(join(path, fn),
  File "/Users/tamaki/miniconda3/envs/Praktikum/lib/python3.9/site-packages/pyacoustics/signals/audio_scripts.py", line 231, in extractSubwav
    audioFrames = getSubwav(fn, startT, endT, singleChannelFlag)
  File "/Users/tamaki/miniconda3/envs/Praktikum/lib/python3.9/site-packages/pyacoustics/signals/audio_scripts.py", line 210, in getSubwav
    audiofile.setpos(int(framerate * startT))
  File "/Users/tamaki/miniconda3/envs/Praktikum/lib/python3.9/wave.py", line 229, in setpos
    raise Error('position not in range')
wave.Error: position not in range

What am I doing wrong? I am on macOS Catalina 10.15.7 and using python 3.9.0. I also tried it with other versions and It didn't work. Thank you in advance for your answer.

opened by otamabon1015 2

speech segmentation without create file
i want to analyze every segment of speech without creating file...

i try to make a segmentation of speech with yout script.

from pyacoustics.signals import audio_scripts as ascr wavfile = 'D:/Temp/speech/test.wav' duration = ascr.getSoundFileDuration(wavfile) splitwav = ascr.getSubwav(wavfile, 0, duration, True)

the next step should be get the data for every segment. can you help me to do that?
opened by wahyubram82 2
where is praat_pi.py?

README mentions pyacoustics.intensity_and_pitch.praat_pi.getPraatPitchAndIntensity(), but there is no praat_pi.py file in that folder. Does someone need to check it in?

opened by bhomass 2
The code contains invalid names

openTextGrid: https://github.com/timmahrt/pyAcoustics/blob/master/examples/estimate_speech_rate.py#L100 openTextgrid: https://github.com/timmahrt/praatIO/blob/master/praatio/tgio.py#L1404

opened by acc-to-learn 1
Pyacoustics v2
This PR does the following:

[x] standardizes the tests

[x] formats files using black

[x] drops support for python 2.7

[x] upgrades to praatio 5.0 (breaking api changes)

[ ] ?
opened by timmahrt 0

Owner

Tim

I write tools for working with speech data.

GitHub

cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python

audioread Decode audio files using whichever backend is available. The library currently supports: Gstreamer via PyGObject. Core Audio on Mac OS X via

359 Feb 15, 2021

Audio spatialization over WebRTC and JACK Audio Connection Kit

Audio spatialization over WebRTC Spatify provides a framework for building multichannel installations using WebRTC.

34 Jun 29, 2022

Audio augmentations library for PyTorch for audio in the time-domain

Audio augmentations library for PyTorch for audio in the time-domain, with support for stochastic data augmentations as used often in self-supervised / contrastive learning.

166 Jan 8, 2023

praudio provides audio preprocessing framework for Deep Learning audio applications

praudio provides objects and a script for performing complex preprocessing operations on entire audio datasets with one command.

105 Dec 26, 2022

Automatically move or copy files based on metadata associated with the files. For example, file your photos based on EXIF metadata or use MP3 tags to file your music files.

14 Nov 2, 2022

Python I/O for STEM audio files

stempeg = stems + ffmpeg Python package to read and write STEM audio files. Technically, stems are audio containers that combine multiple audio stream

72 Dec 23, 2022

Using python to generate a bat script of repetitive lines of code that differ in some way but can sort out a group of audio files according to their common names

Batch Sorting Using python to generate a bat script of repetitive lines of code that differ in some way but can sort out a group of audio files accord

1 Oct 29, 2021

A collection of python scripts for extracting and analyzing acoustics from audio files.

Related tags

Overview

pyAcoustics

1 Common Use Cases

2 Major revisions

3 Features as they are added

4 Requirements

5 Installation

6 Example usage

7 Citing LMEDS

8 Acknowledgements

Comments

Getting error while running "split_audio_on_tone.py"

speech segmentation without create file

where is praat_pi.py?

The code contains invalid names

Pyacoustics v2

Owner

Tim

cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python

Audio spatialization over WebRTC and JACK Audio Connection Kit

Audio augmentations library for PyTorch for audio in the time-domain

praudio provides audio preprocessing framework for Deep Learning audio applications

Automatically move or copy files based on metadata associated with the files. For example, file your photos based on EXIF metadata or use MP3 tags to file your music files.

Python I/O for STEM audio files

Using python to generate a bat script of repetitive lines of code that differ in some way but can sort out a group of audio files according to their common names

This bot can stream audio or video files and urls in telegram voice chats

Carnatic Notes Predictor for audio files

C++ library for audio and music analysis, description and synthesis, including Python bindings

An app made in Python using the PyTube and Tkinter libraries to download videos and MP3 audio.

Audio fingerprinting and recognition in Python

Python library for audio and music analysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Python audio and music signal processing library

A Python 3 script for capturing and recording a SDR stream to a WAV file (or serving it to a HTTP audio stream).

Scalable audio processing framework written in Python with a RESTful API

Python module for handling audio metadata