Home / Audio
341 Components & Libraries
Sortby
The widget can then be initialized on a file upload form the following way: For further information, please refer to the following guides: The File Upload plugin is regularly tested with the latest b…
As developers, we spend our days with code. The site you're reading this on is mostly modules, packages, libraries, frameworks, and the like. But users see applications. When building our own applica…
Library for performing speech recognition, with support for several engines and APIs, online and offline. Speech recognition engine/API support: Project links: First, make sure you have all the requi…
A python package for music and audio analysis. The latest stable release is available on PyPI, and you can install it by saying To build librosa from source, say Then, to install librosa, say If all …
Dejavu can memorize audio by listening to it once and fingerprinting it. Then by playing a song and recording microphone input or reading from disk, Dejavu attempts to match the audio against the fin…
pyAudioAnalysis is a Python library covering a wide range of audio analysis tasks. Through pyAudioAnalysis you can: pyAudioAnalysis provides easy-to-call wrappers to execute audio analysis tasks. Eg,…
For more examples, see: To cite via BibTeX:
aubio is a library to label music and sounds. It listens to audio signals and attempts to detect events. For instance, when a drum is hit, at which frequency is a note, or at what tempo is a rhythmic…
Provide a compatible audio file and basic-pitch will generate a MIDI file, complete with pitch bends. Basic pitch is instrument-agnostic and supports polyphonic instruments, so you can freely enjoy t…
Essentia is an open-source C++ library for audio analysis and audio-based music information retrieval released under the Affero GPLv3 license. It contains an extensive collection of reusable algorith…
Real-time LED strip music visualization using Python and the ESP8266 or Raspberry Pi. The repository includes everything needed to build an LED strip music visualizer (excluding hardware): To build a…
This synchronization map can be output to file in several formats, depending on its application: The generic OS-independent procedure is simple: No copy rights were harmed in the making of this proje…
If you find this package useful, please cite as: This is a utility library that downloads and prepares public datasets. We do not host or distribute these datasets, vouch for their quality or fairnes…
MediaCMS is a modern, fully featured open source video and media CMS. It is developed to meet the needs of modern web platforms for viewing and sharing media. It can be used to build a small to mediu…
This is what the system tray menu looks like: Check these images: There are two ways of installing this application: If you are using homebrew, it is possible to install the binary as follows: Addit…
You can use multiple dimensional feature combinations, select different deep learning networks training, study various tasks in the audio field such as Classification, Separation, MIR etc. In the tim…
SoLoud is an easy to use, free, portable c/c++ audio engine for games. Zlib/LibPng licensed. Portable. Easy.
The aim of this repository is to create a comprehensive, curated list of python software/tools related and used for scientific research in audio/music applications. I will keep some pull requests ope…
Polymath makes it effortless to combine elements from different songs to create unique new compositions: Simply grab a beat from a Funkadelic track, a bassline from a Tito Puente piece, and fitting h…
Mutagen is a Python module to handle audio metadata. It supports ASF, FLAC, MP4, Monkey's Audio, MP3, Musepack, Ogg Opus, Ogg FLAC, Ogg Speex, Ogg Theora, Ogg Vorbis, True Audio, WavPack, OptimFROG, …
The most recent version of noisereduce comprises two algorithms: If you use this code in your research, please cite it:
Madmom is an audio signal processing library written in Python with a strong focus on music information retrieval (MIR) tasks. Possible acronyms are: The package has two licenses, one for source code…
The idea behind creating this project was to build a machine learning model that could detect emotions from the speech we have with each other all the time. Nowadays personalization is something that…
Fansly Downloader is the go-to app for all your bulk media downloading needs. Download photos, videos, audio or any other media from Fansly, this powerful tool has got you covered! Say goodbye to the…
Differences from the previous major version: Looking for the perfect BPM or key for a new EDM track? A completely free open-source web service from the author of Matchering. If our package saved your…
OpenShot Video Library (libopenshot) is a free, open-source C++ library dedicated to delivering high quality video editing, animation, and playback solutions to the world. OpenShot now supports exper…
A utility for batch-normalizing audio using ffmpeg. This program normalizes media files to a certain loudness level using the EBU R128 loudness normalization procedure. It can also perform RMS-based …
If you use this code or part of it, please cite us! Feb, 16 2019: Even though the code can be easily adapted to any speech dataset, in the following part of the documentation we provide an example ba…
For detailed information, please check the commit history. It's recommended to create an alias for a convenient usage: Essentially, what this does is to map the /home/worker/.config/whipper and ${PWD…
Subscribe to our newsletter