Sound, Machine Learning, and Deep Generative Models: A Practical Introduction


The slides are online -> https://ktatar.github.io/2026-02_linneaus_audio

Kıvanç Tatar, Associate Professor in Interactive AI


Basics of Sound Concepts

  • Waveform: dynamics, transients, envelopes
  • Frequency: pitch, overtones, timbre components
  • Time–frequency transforms: spectrograms, mel spectrograms
  • Energy and dynamics over time (envelope, loudness curves etc.)

McFee, Brian, Colin Raffel, Dawen Liang, Daniel PW Ellis, Matt McVicar, Eric Battenberg, and Oriol Nieto. "librosa: Audio and music signal analysis in python." In Proceedings of the 14th python in science conference, pp. 18-25. 2015.
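
The time–frequency transforms in the list above can be sketched without any audio library. The following is a minimal numpy-only magnitude spectrogram (librosa's `stft` and `feature.melspectrogram` do this properly, including mel filtering); the 440 Hz test tone and the frame/hop sizes are illustrative choices, not fixed conventions:

```python
import numpy as np

def stft_magnitude(signal, frame_len=1024, hop=256):
    """Magnitude spectrogram via a Hann-windowed short-time Fourier transform."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    # rfft keeps only the non-negative frequencies of a real signal
    return np.abs(np.fft.rfft(frames, axis=1))  # shape: (n_frames, frame_len//2 + 1)

# A one-second 440 Hz sine at a 22050 Hz sampling rate
sr = 22050
t = np.arange(sr) / sr
spec = stft_magnitude(np.sin(2 * np.pi * 440 * t))

# The strongest frequency bin should sit within one bin of 440 Hz
peak_bin = spec.mean(axis=0).argmax()
```

Each row of `spec` is one time frame; converting the linear frequency axis to the mel scale is what turns this into a mel spectrogram.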


Waveforms and Their Features
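
Two waveform-domain features mentioned earlier, the loudness envelope and a crude pitch/noisiness proxy, can be sketched per frame as RMS energy and zero-crossing rate. The synthetic two-part signal below is an assumption for illustration:

```python
import numpy as np

def frame_features(signal, frame_len=512, hop=256):
    """Per-frame RMS energy (a loudness envelope) and zero-crossing rate."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    rms, zcr = [], []
    for i in range(n_frames):
        frame = signal[i * hop : i * hop + frame_len]
        rms.append(np.sqrt(np.mean(frame ** 2)))
        # fraction of consecutive samples whose sign changes
        zcr.append(np.mean(np.abs(np.diff(np.sign(frame))) > 0))
    return np.array(rms), np.array(zcr)

sr = 22050
t = np.arange(sr) / sr
quiet_low = 0.1 * np.sin(2 * np.pi * 110 * t)   # quiet, low-pitched second
loud_high = 0.9 * np.sin(2 * np.pi * 1760 * t)  # loud, high-pitched second
rms, zcr = frame_features(np.concatenate([quiet_low, loud_high]))
# Both curves should step upward at the halfway point
```

librosa provides the same ideas as `feature.rms` and `feature.zero_crossing_rate`.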


Timbre and Its Features

FFT-based
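
A representative FFT-based timbre feature is the spectral centroid, the magnitude-weighted mean frequency, often read as "brightness". A minimal sketch (librosa's `feature.spectral_centroid` computes this over a whole spectrogram); the two test tones are illustrative assumptions:

```python
import numpy as np

def spectral_centroid(frame, sr):
    """Magnitude-weighted mean frequency of one frame: a rough brightness measure."""
    mag = np.abs(np.fft.rfft(frame * np.hanning(len(frame))))
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sr)
    return np.sum(freqs * mag) / np.sum(mag)

sr = 22050
t = np.arange(2048) / sr
dark = np.sin(2 * np.pi * 220 * t)
bright = np.sin(2 * np.pi * 220 * t) + np.sin(2 * np.pi * 3520 * t)
# Adding a strong high partial pulls the centroid upward
```

Other FFT-based descriptors (spectral bandwidth, rolloff, flatness) follow the same pattern: a scalar statistic over the magnitude spectrum.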


Wavelet-based Timbre Features

Schörkhuber, Christian, and Anssi Klapuri. "Constant-Q transform toolbox for music processing." In Proceedings of the 7th Sound and Music Computing Conference (SMC 2010), p. 8. Barcelona, Spain, 2010.
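
The constant-Q transform cited above uses geometrically spaced frequency bins with a constant ratio of center frequency to bandwidth, so the analysis window shrinks as frequency grows. A single-frame numpy sketch of the idea (real toolboxes such as librosa's `cqt` use efficient kernels and hop along time); the fmin and bin-count choices are illustrative:

```python
import numpy as np

def cqt_frame(signal, sr, fmin=55.0, bins_per_octave=12, n_bins=48):
    """One constant-Q spectrum: window length is inversely proportional
    to each bin's center frequency, keeping Q = f / bandwidth constant."""
    Q = 1.0 / (2 ** (1.0 / bins_per_octave) - 1)
    out = np.zeros(n_bins)
    for k in range(n_bins):
        fk = fmin * 2 ** (k / bins_per_octave)      # geometric spacing
        N = int(round(Q * sr / fk))                 # shorter window for higher fk
        n = np.arange(N)
        kernel = np.hanning(N) * np.exp(-2j * np.pi * fk * n / sr) / N
        out[k] = np.abs(np.sum(signal[:N] * kernel))
    return out

sr = 22050
t = np.arange(sr) / sr
spec = cqt_frame(np.sin(2 * np.pi * 220 * t), sr)
# 220 Hz is exactly two octaves above fmin=55 Hz, i.e. bin 24
```

The geometric spacing matches musical pitch (one bin per semitone here), which is why CQT-style features suit music better than a linear-frequency FFT.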


Music-Specific Features
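
A classic music-specific feature is the chroma (pitch-class profile): FFT magnitudes folded onto the 12 semitone classes, discarding octave information. A minimal numpy sketch (librosa's `feature.chroma_stft` is the practical version); the C4 reference frequency and the single A4 test tone are illustrative assumptions:

```python
import numpy as np

def chroma(frame, sr, fref=261.63):
    """12-bin pitch-class profile; fref is the frequency of C4, so index 0 = C."""
    mag = np.abs(np.fft.rfft(frame * np.hanning(len(frame))))
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sr)
    out = np.zeros(12)
    for f, m in zip(freqs[1:], mag[1:]):          # skip the DC bin
        pitch_class = int(round(12 * np.log2(f / fref))) % 12
        out[pitch_class] += m
    return out / out.max()

sr = 22050
t = np.arange(4096) / sr
a4 = np.sin(2 * np.pi * 440 * t)
# A is 9 semitones above C, so the profile should peak at index 9
```

Chroma underlies chord and key estimation precisely because it is invariant to octave and largely to timbre.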


Machine Learning for Sound and Music Computing


Importance of Feature Representations in Machine Learning


I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning. MIT Press, 2016.


Dimensionality Reduction
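
The most common starting point is PCA: project feature vectors onto the directions of largest variance. A numpy sketch via the SVD (in practice `sklearn.decomposition.PCA` does this, plus centering and explained-variance bookkeeping); the synthetic 10-D data lying near a 2-D plane is an illustrative assumption:

```python
import numpy as np

def pca_reduce(X, n_components=2):
    """Project rows of X onto their top principal components via SVD."""
    Xc = X - X.mean(axis=0)                       # center the features
    U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:n_components].T               # coordinates in the top subspace

rng = np.random.default_rng(0)
# 200 points that live near a 2-D plane inside a 10-D feature space
latent = rng.normal(size=(200, 2))
X = latent @ rng.normal(size=(2, 10)) + 0.01 * rng.normal(size=(200, 10))
Z = pca_reduce(X)  # shape (200, 2); captures almost all the variance
```

For audio, the rows of `X` would typically be per-frame feature vectors (MFCCs, spectral descriptors), and `Z` a 2-D map you can plot or navigate.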


Machine Learning

  • Supervised Learning
  • Unsupervised Learning

Detailed resources: https://www.ibm.com/think/machine-learning#605511093


Supervised Learning - Regression


Image source: https://python.plainenglish.io/understanding-multiple-linear-regression-in-machine-learning-58e981ce7747
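
The multiple linear regression shown in the figure can be sketched in a few lines with numpy's least-squares solver; the synthetic data and the true weights are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(100, 3))                     # 3 input features, 100 samples
true_w = np.array([2.0, -1.0, 0.5])
y = X @ true_w + 0.05 * rng.normal(size=100)      # targets with a little noise

# Append a column of ones so the last coefficient acts as the bias term
w, *_ = np.linalg.lstsq(np.c_[X, np.ones(100)], y, rcond=None)
# w[:3] should recover true_w closely despite the noise
```

Regression maps continuous inputs to continuous outputs, which is exactly the setting of the interactive-audio mapping example on the next slide.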


Supervised Learning - Regression for Interactive Audio

https://www.youtube.com/watch?v=dPV-gCqy9j4


Supervised Learning - Classification


Image source: https://medium.com/data-science/image-classification-in-10-minutes-with-mnist-dataset-54c35b77a38d
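
Classification assigns each input a discrete label. One of the simplest possible classifiers, a nearest-centroid rule, fits on a slide; the two synthetic 2-D blobs stand in for e.g. feature vectors of two sound classes:

```python
import numpy as np

def nearest_centroid(X_train, y_train, X_test):
    """Label each test point with the class whose mean is closest."""
    classes = np.unique(y_train)
    centroids = np.stack([X_train[y_train == c].mean(axis=0) for c in classes])
    # pairwise distances: (n_test, n_classes)
    dists = np.linalg.norm(X_test[:, None, :] - centroids[None, :, :], axis=2)
    return classes[dists.argmin(axis=1)]

rng = np.random.default_rng(2)
# Two well-separated 2-D blobs, 50 points each
X = np.concatenate([rng.normal(0, 1, (50, 2)), rng.normal(5, 1, (50, 2))])
y = np.array([0] * 50 + [1] * 50)
pred = nearest_centroid(X, y, X)   # near-perfect on separable blobs
```

Real image or audio classifiers (like the MNIST network in the figure) replace the centroid rule with a learned model, but the input/label contract is the same.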


Supervised Learning - Audio Classification

Mediapipe: https://ai.google.dev/edge/mediapipe/solutions/audio/audio_classifier


Unsupervised Learning - Clustering

https://colah.github.io/posts/2014-10-Visualizing-MNIST/
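
Clustering groups unlabeled points by proximity. A compact numpy sketch of k-means (Lloyd's algorithm) with a simple farthest-point initialization; the two synthetic blobs are illustrative assumptions:

```python
import numpy as np

def kmeans(X, k, n_iter=10):
    """Lloyd's algorithm: alternate nearest-centroid assignment
    and centroid update, after a farthest-point initialization."""
    centroids = [X[0]]
    for _ in range(k - 1):
        d = np.min([np.linalg.norm(X - c, axis=1) for c in centroids], axis=0)
        centroids.append(X[int(d.argmax())])      # pick the point farthest from all
    centroids = np.stack(centroids)
    for _ in range(n_iter):
        labels = np.linalg.norm(X[:, None] - centroids[None], axis=2).argmin(axis=1)
        centroids = np.stack([X[labels == c].mean(axis=0) for c in range(k)])
    return labels, centroids

rng = np.random.default_rng(3)
X = np.concatenate([rng.normal(0, 0.5, (60, 2)), rng.normal(4, 0.5, (60, 2))])
labels, centroids = kmeans(X, 2)   # the two blobs get two different labels
```

No labels were used anywhere, which is what makes this unsupervised.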


Unsupervised Learning - Clustering for Audio
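
Applied to audio, the same idea clusters short frames by their feature vectors. A sketch using two deliberately simple features (RMS and zero-crossing rate) and a minimal two-means loop; real systems would use richer features such as MFCCs, and the two synthetic tones are illustrative:

```python
import numpy as np

sr, frame = 22050, 512
t = np.arange(sr) / sr
# One quiet low tone followed by one loud high tone
audio = np.concatenate([0.2 * np.sin(2 * np.pi * 110 * t),
                        0.9 * np.sin(2 * np.pi * 2000 * t)])
frames = audio[: len(audio) // frame * frame].reshape(-1, frame)

# Feature vector per frame: [RMS energy, zero-crossing rate], standardized
feats = np.c_[np.sqrt((frames ** 2).mean(axis=1)),
              (np.abs(np.diff(np.sign(frames), axis=1)) > 0).mean(axis=1)]
feats = (feats - feats.mean(axis=0)) / feats.std(axis=0)

# Two-means with a deterministic far-apart initialization (first and last frame)
cents = feats[[0, -1]]
for _ in range(10):
    lab = np.linalg.norm(feats[:, None] - cents[None], axis=2).argmin(axis=1)
    cents = np.stack([feats[lab == c].mean(axis=0) for c in (0, 1)])
# Frames from the two tones should land in different clusters
```

This frame-level clustering is the basic move behind corpus exploration and concatenative-synthesis tools.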


Deep Learning

Two great books to go deeper:
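
Before going deeper, the core mechanics, forward pass, loss gradient, backpropagation, fit in a few lines of numpy. A minimal two-layer network trained on XOR (the classic problem a single linear layer cannot solve); the layer sizes, learning rate, and iteration count are illustrative choices:

```python
import numpy as np

rng = np.random.default_rng(0)
X = np.array([[0., 0.], [0., 1.], [1., 0.], [1., 1.]])
y = np.array([[0.], [1.], [1.], [0.]])

# One hidden layer of 8 tanh units, sigmoid output, cross-entropy loss
W1 = rng.normal(0, 1, (2, 8)); b1 = np.zeros(8)
W2 = rng.normal(0, 1, (8, 1)); b2 = np.zeros(1)
lr = 0.5
for _ in range(5000):
    h = np.tanh(X @ W1 + b1)                    # forward: hidden activations
    p = 1 / (1 + np.exp(-(h @ W2 + b2)))        # forward: output probability
    g_out = (p - y) / len(X)                    # backward: dLoss/dlogit
    g_h = (g_out @ W2.T) * (1 - h ** 2)         # backprop through tanh
    W2 -= lr * h.T @ g_out; b2 -= lr * g_out.sum(axis=0)
    W1 -= lr * X.T @ g_h;   b1 -= lr * g_h.sum(axis=0)

pred = 1 / (1 + np.exp(-(np.tanh(X @ W1 + b1) @ W2 + b2)))
# After training, pred should be close to [0, 1, 1, 0]
```

Frameworks such as PyTorch automate exactly these gradient computations, which is what makes the deep architectures on the following slides practical.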


Deep Learning - Fun Visualizations


DL-based Feature Representations


DL-based Feature Representations - Audio Specific


Summary

  • Features matter.
  • The context determines the appropriate machine learning approach.
  • Several ML algorithms can often be applied to the same problem.
  • Simple ML approaches are often as applicable as more complex ones.

Thank you!

Feel free to reach out -> tatar@chalmers.se
