Feature extraction
Spectral features
chroma_stft ([y, sr, S, norm, n_fft, …]) |
Compute a chromagram from a waveform or power spectrogram. |
chroma_cqt ([y, sr, C, hop_length, fmin, …]) |
Constant-Q chromagram |
melspectrogram ([y, sr, S, n_fft, hop_length]) |
Compute a Mel-scaled power spectrogram. |
mfcc ([y, sr, S, n_mfcc]) |
Mel-frequency cepstral coefficients |
rmse ([y, S, n_fft, hop_length]) |
Compute root-mean-square (RMS) energy for each frame. |
spectral_centroid ([y, sr, S, n_fft, …]) |
Compute the spectral centroid. |
spectral_bandwidth ([y, sr, S, n_fft, …]) |
Compute the p’th-order spectral bandwidth. |
spectral_contrast ([y, sr, S, n_fft, …]) |
Compute spectral contrast [R11] |
spectral_rolloff ([y, sr, S, n_fft, …]) |
Compute the roll-off frequency. |
poly_features ([y, sr, S, n_fft, hop_length, …]) |
Get coefficients of fitting an nth-order polynomial to the columns of a spectrogram. |
tonnetz ([y, sr, chroma]) |
Compute the tonal centroid features (tonnetz), following the method of [R12]. |
zero_crossing_rate (y[, frame_length, …]) |
Compute the zero-crossing rate of an audio time series. |
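As a rough illustration of what a few of the frame-wise features above measure, the following NumPy-only sketch computes RMS energy, zero-crossing rate, and spectral centroid for a synthetic 440 Hz tone. The helper names (`frame_signal`, `rms_sketch`, `zcr_sketch`, `centroid_sketch`), the Hann window, and the frame and hop sizes are assumptions made for this example. It is not the library's implementation, and it omits padding, centering, and the precomputed-spectrogram (`S`) input path.

```python
import numpy as np

def frame_signal(y, frame_length=2048, hop_length=512):
    """Slice a 1-D signal into overlapping frames, shape (n_frames, frame_length)."""
    n_frames = 1 + (len(y) - frame_length) // hop_length
    starts = hop_length * np.arange(n_frames)[:, None]
    return y[starts + np.arange(frame_length)[None, :]]

def rms_sketch(frames):
    """Root-mean-square energy of each frame."""
    return np.sqrt(np.mean(frames ** 2, axis=1))

def zcr_sketch(frames):
    """Fraction of adjacent sample pairs in each frame that change sign."""
    signs = np.signbit(frames)
    return np.mean(signs[:, 1:] != signs[:, :-1], axis=1)

def centroid_sketch(frames, sr):
    """Magnitude-weighted mean frequency of each Hann-windowed frame."""
    mag = np.abs(np.fft.rfft(frames * np.hanning(frames.shape[1]), axis=1))
    freqs = np.fft.rfftfreq(frames.shape[1], d=1.0 / sr)
    # Guard against division by zero on silent frames.
    return (mag * freqs).sum(axis=1) / np.maximum(mag.sum(axis=1), 1e-10)

# One second of a 440 Hz sine at half amplitude (synthetic test signal).
sr = 22050
t = np.arange(sr) / sr
y = 0.5 * np.sin(2 * np.pi * 440.0 * t)

frames = frame_signal(y)
rms = rms_sketch(frames)
zcr = zcr_sketch(frames)
centroid = centroid_sketch(frames, sr)
```

For a pure tone, each frame's RMS should sit near the amplitude divided by the square root of two, the zero-crossing rate near twice the tone frequency divided by the sampling rate, and the centroid near the tone frequency. Note that this sketch returns one value per frame as a 1-D array, which may differ from the array layout the library functions return.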
Rhythm features
tempogram ([y, sr, onset_envelope, …]) |
Compute the tempogram: local autocorrelation of the onset strength envelope. |
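The idea of "local autocorrelation of the onset strength envelope" can be sketched in plain NumPy as below. This is a simplified illustration under stated assumptions, not the library's implementation: the function name `local_autocorrelation`, the Hann window, the zero padding, and the per-frame normalization by the zero-lag value are all choices made for this example, and the onset envelope is a synthetic impulse train rather than one computed from audio.

```python
import numpy as np

def local_autocorrelation(envelope, win_length=64):
    """Autocorrelate a sliding, Hann-windowed segment of `envelope`.

    Returns an array of shape (win_length, len(envelope)): one column of
    autocorrelation values (lags 0 .. win_length - 1) per frame.
    """
    pad = win_length // 2
    padded = np.pad(envelope, (pad, pad), mode="constant")
    window = np.hanning(win_length)
    out = np.zeros((win_length, len(envelope)))
    for t in range(len(envelope)):
        seg = padded[t:t + win_length] * window
        # Keep the non-negative lags of the full autocorrelation.
        ac = np.correlate(seg, seg, mode="full")[win_length - 1:]
        # Normalize so that lag 0 equals 1 (skip silent segments).
        out[:, t] = ac / ac[0] if ac[0] > 0 else ac
    return out

# Synthetic onset envelope: one impulse every 8 frames.
envelope = np.zeros(256)
envelope[::8] = 1.0

tg = local_autocorrelation(envelope)
# Strongest non-zero lag (searched below 16 frames) for the middle frame.
best_lag = int(np.argmax(tg[1:16, 128])) + 1
```

With a strictly periodic envelope like this one, the strongest non-zero lag in each column should match the impulse spacing, which is what makes the representation useful for reading off local tempo.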
Feature manipulation
delta (data[, width, order, axis, trim]) |
Compute delta features: local estimate of the derivative of the input data along the selected axis. |
stack_memory (data[, n_steps, delay]) |
Short-term history embedding: vertically concatenate a data vector or matrix with delayed copies of itself. |
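The two manipulations above can be sketched in NumPy as follows. Both functions are hypothetical stand-ins written for this example: `delta_sketch` uses plain central differences via `np.gradient` in place of whatever smoothing filter the library applies, and `stack_memory_sketch` assumes a non-negative delay and zero padding for frames that have no history yet.

```python
import numpy as np

def delta_sketch(data, axis=-1):
    """Central-difference estimate of the derivative along `axis`."""
    return np.gradient(data, axis=axis)

def stack_memory_sketch(data, n_steps=2, delay=1):
    """Stack `data` on top of `n_steps - 1` delayed copies of itself.

    Assumes delay >= 0; positions with no history are left as zeros.
    """
    data = np.atleast_2d(data)
    d, t = data.shape
    history = np.zeros((d * n_steps, t), dtype=data.dtype)
    for step in range(n_steps):
        shift = step * delay
        if shift >= t:
            continue  # delay exceeds the signal length; leave zeros
        history[step * d:(step + 1) * d, shift:] = data[:, :t - shift]
    return history

x = np.array([[0.0, 1.0, 4.0, 9.0, 16.0]])
d = delta_sketch(x)
# d is [[1., 2., 4., 6., 7.]]

h = stack_memory_sketch(np.arange(5.0), n_steps=3, delay=1)
# h is
# [[0., 1., 2., 3., 4.],
#  [0., 0., 1., 2., 3.],
#  [0., 0., 0., 1., 2.]]
```

Each column of `h` holds the current frame followed by the frames one and two steps earlier, so a downstream model sees a short window of context without needing recurrence.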