Data Preparation

Convert audio files and annotations into training-ready format by splitting audio into clips, matching annotations, and creating positive/negative class labels.

$> # Prepare the project's training data
$> koogu-prepare my_project.config

The preprocessing step will:

Pre-process audio files by applying necessary filtering and splitting into fixed-duration segments

Match annotations to segments based on temporal overlap

Label each segment as positive or negative for each class

Save prepared segments and annotaions in a format optimized for training

Parameters

Positional arguments

<CONFIG FILE>

Path to config file.

Input control

--filetypes EXTN [EXTN ...]

Audio file types to restrict processing to. Can specify multiple types separated by whitespaces. By default, will process all discovered files with the following extensions: [.wav, .WAV, .flac, .aif, .mp3]. When train_audio_annotations_map is available, this option is only considered where audio sources are directories.

Process control

--threads NUM

Number of threads to spawn for parallel execution.

Default: as many CPUs

Logging

--log LOGFILE

If specified, logging will be written out to this file instead of the default.

Default: PROJECT-LOGS-DIR/prepare.log

--loglevel LEVEL

Logging level. Choices: CRITICAL, ERROR, WARNING, INFO, DEBUG.

Default: INFO