Audio Toolkit

A Python toolkit for performing various audio processing tasks, such as:

Removing silent audio files from a folder.
Trimming silence from a single large audio file.
Slicing audio files into smaller segments.
Trimming audio to the first 30 seconds.

Features

Batch Remove Silent Files:
- Scan a folder of audio files and move files below a specified volume threshold to a subfolder.
- Script: batch_remove_silent_files.py.
- Usage :
```
python scripts/batch_remove_silent_files.py /path/to/input/folder --threshold 0.02
```
Trim Silence From Large Audio:
- Remove silent segments from a single large audio file, generating a continuous output.
- Script: remove_silence_from_audio.py.
- Usage :
```
python scripts/remove_silent_from_audio.py input.wav output.wav --silence_thresh -40 --min_silence_len 300 --padding 200
```
Slice Audio Into Segments:
- Slice a large audio file into smaller segments based on specified time intervals or silence thresholds.
- Script: slice_audio.py.
- Usage :
```
python scripts/slice_audio.py /path/to/audio/file --slice_duration_ms 2000 --fade_duration_ms 50
```
Trim Audio to First $n$ Seconds:
- Extract the first $n$ seconds of an audio file and save it as a new file.
- Script: trim_audio.py.
- Usage :
```
python scripts/trim_audio.py input.wav output.wav
```

Installation

Clone the repository:

git clone https://github.com/yourusername/audio-toolkit.git
cd audio-toolkit

Create a virtual environment:

python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```

Additional notes

How Loudness is Measured

RMS (Root Mean Square):

The most common way to measure loudness is to calculate the RMS of the audio signal. It computes the average power of the waveform over its duration. RMS gives a single value representing the "overall" loudness of the file.

The formula for calculating RMS is:

$$ \text{RMS} = \sqrt{\frac{1}{N} \sum_{i=1}^{N} x[i]^2} $$

Where:

$x[i]$ is the audio amplitude at sample $i$,
$N$ is the total number of samples.

Limitation: RMS averages the loudness over the entire file. If 1.9 seconds are silent and the last 0.1 seconds is loud, the loud portion will affect the RMS but may not be representative of the whole file's "perceived" silence.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
scripts		scripts
utils		utils
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Audio Toolkit

Features

Installation

Additional notes

How Loudness is Measured

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Audio Toolkit

Features

Installation

Additional notes

How Loudness is Measured

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages