The Best 341 Audio Libraries

341 Components & Libraries

Sortby

Audio Repositories

A series of Jupyter notebooks and python files which stream audio from a microphone using pyaudio, then processes it.

A live audio spectrum analyzer using pyAudio, matplotlib and scipy. A series of Jupyter notebooks and python files which stream audio from a microphone using pyaudio.

cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python

Decode audio files using whichever backend is available. The library currently supports: Use the library like so: Additional values are available as fields on the audio file object: Audioread support…

Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS

Still, these are the commands for Linux: You can build the image with: After getting your VAD token (see next sections) run: The "volume" stuff will allow you not to re-download the huggingface model…

:sound: spafe: Simplified Python Audio Features Extraction

Simplified Python Audio Features Extraction spafe requires: if you want to use the visualization module/ functions of spafe, you will need to install: Once you have the Dependencies installed, use on…

Novoic's audio feature extraction library

Install using pip Alternatively, Given a set of components and an optional set of statistics to apply to the time-varying components, extract them using Python. Assume the following directory structu…

A toolkit for symbolic music generation

MusPy is an open source Python library for symbolic music generation. It provides essential tools for developing a music generation system, including dataset management, data I/O, data preprocessing …

SamplerBox is a sampler musical instrument based on RaspberryPi.

SamplerBox works with the RaspberryPi's built-in soundcard, but it is recommended to use a USB DAC (PCM2704 USB DAC for less than 10€ on eBay is fine) for better sound quality. Install the required…

Swing Music is a beautiful, self-hosted music player for your local audio files. Like a cooler Spotify ... but bring your own music.

Swing Music is a beautiful, self-hosted music player for your local audio files. Like a cooler Spotify ... but bring your own music. Just run the app and enjoy your music library in a web browser. F…

Understanding emotions from audio files using neural networks and multiple datasets.

This project presents a deep learning classifier able to predict the emotions of a human speaker encoded in an audio file. The classifier is trained using 2 different datasets, RAVDESS and TESS, and …

Apply audio effects such as reverb and EQ directly to audio files or NumPy ndarrays.

Contact me if you'd like this package to persist, and/or if you'd like to take over ownership of this repo. This is a lightweight Python wrapper for SoX - Sound eXchange. Supported effects range from…

scalable audio processing framework and server written in Python

TimeSide is a Python framework enabling low and high level audio analysis, imaging, transcoding, streaming and labelling. Its high-level API is designed to enable complex processing on very large dat…

Requests based captcha solver for Arkose Labs using speech recognition

This captcha solver will only work on sites that support audio challenges, I am not responsible for your actions using this script. This script was made for educational purposes.

scalable audio processing framework and server written in Python

Command line C++ and Python VSTi Host library with MFCC, FFT, RMS and audio extraction and .wav writing.

Here are some quick gifs demonstrating a miniscule amount of the availble features, go towards the bottom of the README to see the full API: Next, get the boost headers. Now just open the Xcode proje…

Telegram Bot for downloading MP3 rips of tracks/sets from SoundCloud, Bandcamp, YouTube with tags and artwork.

Telegram Bot for downloading MP3 rips of tracks/sets from SoundCloud, Bandcamp, YouTube with tags and artwork. 2 scdlbot is standing on the shoulders of giants: Download or copy configuration file sa…

The Shazam-similar app, that identify the song using audio fingerprints & spectrum analysis and Fast Fourier transform

To remove a specific song & related hash from db

Python library for working with Music Information Retrieval datasets

This library provides tools for working with common MIR datasets, including tools for: To install, simply run: There are two ways of citing mirdata: If you are using the library for your work, please…

Python Core Audio Windows Library

Python Core Audio Windows Library, working for both Python2 and Python3. Latest stable release: Development branch: System requirements:

Scalable audio processing framework written in Python with a RESTful API

TimeSide is a python framework enabling low and high level audio analysis, imaging, transcoding, streaming and labelling. Its high-level API is designed to enable complex processing on very large dat…

Qualitative data analysis for text, images, audio, video. Cross platform. Python 3.10 or newer and PyQt6.

QualCoder is a qualitative data analysis application written in Python. Text files can be typed in manually or loaded from txt, odt, docx, html, htm, md, epub, and PDF files. Images, video, and audi…

Automatically synchronize subtitles with audio using machine learning

Automatic speed and shift correction Supports all reasonably encoded SRT files in any language Should work with any language in the audio (only tested with a few though) Quality-of-fit metric for che…

Python interface for forced audio alignment using HTK and SoX

Scripts for alignment of laboratory speech production data Please you use this tool; we would appreciate if you cited the following paper: Gorman, Kyle, Jonathan Howell and Michael Wagner. 2011. Pros…

A Python package for time series augmentation

Prerequisites: Python 3.5 or later. A first-time user may start with two examples: Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change. P…

Python bindings for Chromaprint acoustic fingerprinting and the Acoustid Web service

This library works with Python 2 (2.7+, possibly also 2.6) and Python 3 (3.3+). Multiple artists are joined by join phrases, as displayed on the web page. A new (pure-Python) function compares two Ch…

Segment an audio file and obtain utterance alignments. (Python package)

CTC segmentation can be used to find utterance alignments within large audio files. The CTC segmentation package is not standalone, as it needs a neural network with CTC output. It is integrated in t…

A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting features from and making manipulations on audio files given hierarchical time-aligned transcriptions (utterance > word > syllable > phone, etc).

Praat uses a file format called textgrids, which are time aligned speech transcripts. This library isn't just a data struct for reading and writing textgrids--many utilities are provided to make it e…

Audio visualization for LED strips in real-time with web interface on a raspberry pi.

Run the following command in your terminal: The script also accepts some options: Also, check out the tutorial video I created for the installation: Thank you for the digital signal processing and so…

🎵 Python sound notifications made easy

Calling any of the above functions will play a sound. Note that the sounds are played in asynchronous processes, and are thus non-blocking. Each function should take around 2ms to execute, regardless…

Automated feature extraction in Python

Pliers is a Python package for automated extraction of features from multimodal stimuli. It provides a unified, standardized interface to dozens of different feature extraction tools and services--in…

Fast and simple music and audio analysis using RNN in Python 🕵️‍♀️ 🥁

Analyze a WAV audio file - or an MP3 file - Tested on Python 3.6 or later Given an audio file, AudioOwl generates an objects with many useful information about your file 💪. Returns a numpy array that…