Python implementation of the "Shazam" algorithm

lukemcraig Last update: May 19, 2022

AudioSearch

Python implementation of "An Industrial-Strength Audio Search Algorithm"

Created for my term paper in CS 5110 - Design and Analysis of Algorithms, Fall 2018.

One of the things I look for most in a term paper is its simplicity in explaining complex ideas. This year, the Best Term Paper Award goes to Luke Craig for his paper titled "Robust Audio Fingerprinting Using Combinatorial Hashing and Temporal Offsets". Many congratulations to him.
-- Dr. Raghuveer Mohan

Some neat figures in my paper:

Usage:

positional arguments:

d the root directory of the library of mp3s to insert or test

optional arguments:

--insert to insert into the database instead of testing
--plot whether to plot the algorithm
-processes p the number of processes to use during insertion
-noise n noise type (White or Pub)

Dependencies:

numpy
scipy
pandas
matplotlib (only for plotting)
librosa (only for loading audio)
mutagen (for parsing mp3 metadata)
pymongo (only if MongoDB is used for the database)

A conda environment with these packages can be built automatically from audiosearchminenv.yml

conda env create -f audiosearchminenv.yml

Tags:

Owner

lukemcraig

GitHub Repository https://github.com/lukemcraig

Django and Wagtail based blogging / podcasting app

After switching to Wagtail, the documentation has to be updated. Stay tuned 😄. Although switching to Wagtail was a big step, there is still a lot to do. Things that are on the roadmap:

A small library for playing audio files in python, with essential playback functionality.

A small library for playing audio files in python. Provides file format independent methods for loading audio files, playing, pausing, resuming, stopping, seeking, getting the current playback positi…

An audio filter bank implementation in Python, contains ERB and linear filter banks

Contains implementations of an Equal Rectangular Bandwidth (ERB) filter bank and a linear filter bank. Filter banks are a very useful time-frequency analysis tool. A nice alternative to your standard…

A library for reading and, in the future, writing audio metadata. https://audio-metadata.readthedocs.io/

Clean and understandable code, nice API, and good UX (user experience) are the focal points of audio-metadata. One or more of these things I feel are lacking from already existing alternatives enough…

This project demonstrates the use of Alexa Audio Player for skills, using the ASK Python SDK

This project demonstrates the use of Alexa Audio Player for skills using ASK Python SDK. Multiple-streams folder contains an example skill to play multiple, pre-recorded audio streams, such as a basi…

Video to audio converter microservices application in Python

Converting mp4 videos to mp3 in a microservices architecture. Before you begin, ensure that the following prerequisites are met: Follow these steps to deploy your microservice application: In the "C…

A bot for music streaming to TeamTalk Servers.

A media streaming bot for TeamTalk. You can also run the bot in a Docker container. First of all, You should build an image from the provided Dockerfile: Note: The first run could take some time. The…

Pythonic access to audio files

Pythonic libsndfile wrapper to read and write audio files. Python dependencies are managed by the setup.py script. But still there are a couple of binary dependencies. In Debian/Ubuntu, you can insta…

Experimenting with Python and librosa to do Audio Event Detection

Identify when a sound effect is played multiple times in an audio file (e.g. an MP3). Otherwise known as Audio Event Detection. The output will end with something like: If you want this data to be pu…

🎚️ Simple Matchering 2.0 Command Line Application

And then: If our script saved your time or money, you may: