Home / Audio
341 Components & Libraries
Sortby
A live audio spectrum analyzer using pyAudio, matplotlib and scipy. A series of Jupyter notebooks and python files which stream audio from a microphone using pyaudio.
Decode audio files using whichever backend is available. The library currently supports: Use the library like so: Additional values are available as fields on the audio file object: Audioread support…
Still, these are the commands for Linux: You can build the image with: After getting your VAD token (see next sections) run: The "volume" stuff will allow you not to re-download the huggingface model…
Simplified Python Audio Features Extraction spafe requires: if you want to use the visualization module/ functions of spafe, you will need to install: Once you have the Dependencies installed, use on…
Install using pip Alternatively, Given a set of components and an optional set of statistics to apply to the time-varying components, extract them using Python. Assume the following directory structu…
MusPy is an open source Python library for symbolic music generation. It provides essential tools for developing a music generation system, including dataset management, data I/O, data preprocessing …
SamplerBox works with the RaspberryPi's built-in soundcard, but it is recommended to use a USB DAC (PCM2704 USB DAC for less than 10€ on eBay is fine) for better sound quality. Install the required…
Swing Music is a beautiful, self-hosted music player for your local audio files. Like a cooler Spotify ... but bring your own music. Just run the app and enjoy your music library in a web browser. F…
This project presents a deep learning classifier able to predict the emotions of a human speaker encoded in an audio file. The classifier is trained using 2 different datasets, RAVDESS and TESS, and …
Contact me if you'd like this package to persist, and/or if you'd like to take over ownership of this repo. This is a lightweight Python wrapper for SoX - Sound eXchange. Supported effects range from…
TimeSide is a Python framework enabling low and high level audio analysis, imaging, transcoding, streaming and labelling. Its high-level API is designed to enable complex processing on very large dat…
This captcha solver will only work on sites that support audio challenges, I am not responsible for your actions using this script. This script was made for educational purposes.
Here are some quick gifs demonstrating a miniscule amount of the availble features, go towards the bottom of the README to see the full API: Next, get the boost headers. Now just open the Xcode proje…
Telegram Bot for downloading MP3 rips of tracks/sets from SoundCloud, Bandcamp, YouTube with tags and artwork. 2 scdlbot is standing on the shoulders of giants: Download or copy configuration file sa…
To remove a specific song & related hash from db
This library provides tools for working with common MIR datasets, including tools for: To install, simply run: There are two ways of citing mirdata: If you are using the library for your work, please…
Python Core Audio Windows Library, working for both Python2 and Python3. Latest stable release: Development branch: System requirements:
TimeSide is a python framework enabling low and high level audio analysis, imaging, transcoding, streaming and labelling. Its high-level API is designed to enable complex processing on very large dat…
QualCoder is a qualitative data analysis application written in Python. Text files can be typed in manually or loaded from txt, odt, docx, html, htm, md, epub, and PDF files. Images, video, and audi…
Automatic speed and shift correction Supports all reasonably encoded SRT files in any language Should work with any language in the audio (only tested with a few though) Quality-of-fit metric for che…
Scripts for alignment of laboratory speech production data Please you use this tool; we would appreciate if you cited the following paper: Gorman, Kyle, Jonathan Howell and Michael Wagner. 2011. Pros…
Prerequisites: Python 3.5 or later. A first-time user may start with two examples: Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change. P…
This library works with Python 2 (2.7+, possibly also 2.6) and Python 3 (3.3+). Multiple artists are joined by join phrases, as displayed on the web page. A new (pure-Python) function compares two Ch…
CTC segmentation can be used to find utterance alignments within large audio files. The CTC segmentation package is not standalone, as it needs a neural network with CTC output. It is integrated in t…
Praat uses a file format called textgrids, which are time aligned speech transcripts. This library isn't just a data struct for reading and writing textgrids--many utilities are provided to make it e…
Run the following command in your terminal: The script also accepts some options: Also, check out the tutorial video I created for the installation: Thank you for the digital signal processing and so…
Calling any of the above functions will play a sound. Note that the sounds are played in asynchronous processes, and are thus non-blocking. Each function should take around 2ms to execute, regardless…
Pliers is a Python package for automated extraction of features from multimodal stimuli. It provides a unified, standardized interface to dozens of different feature extraction tools and services--in…
Analyze a WAV audio file - or an MP3 file - Tested on Python 3.6 or later Given an audio file, AudioOwl generates an objects with many useful information about your file 💪. Returns a numpy array that…
Subscribe to our newsletter