Preprocessor classes and plugins

This module contains base classes for audio clip processors, feature extractors and instance processors, as well as a number of plugins for each of these.

Base classes

InstanceProcessor(config)

An instance processor.

AudioClipProcessor(config)

Processes raw audio data.

FeatureExtractor(config)

Extracts features from instances.

Plugins

audioset

Audioset feature extractors

encodec

Encodec feature extractor.

fairseq

Fairseq processor.

huggingface

Processor using HuggingFace models.

keras_apps

Feature extractor using Keras applications.

kmeans

Kmeans vector quantiser.

opensmile

OpenSMILE feature extraction.

openxbow

OpenXBOW extractor

phonemize

Phonemize text using the phonemizer library.

resample

Audio resampling using resampy.

spectrogram

Spectrogram extraction.

speechbrain

Processing using SpeechBrain models.

vad_trim

Voice activity detection (VAD) trimming.