AcoustID

Last updated
AcoustID
Acoustid logo.svg
DeveloperLukáš Lalinský
Typeaudio identification service
Pricing modelfree for non-commercial use
Website acoustid.org

AcoustID is a webservice for the identification of music recordings based on the Chromaprint acoustic fingerprint algorithm. It can identify entire songs but not short snippets. [1]

Contents

By 2017, the free service had 34 million "fingerprints" in-store and every day acquired between 15 and 20 thousand new entries and answered around five million search queries. AcoustID is integrated into the audio file metadata editors Picard, Jaikoz [2] and Puddletag, for example. [3] [4]

Chromaprint

In October 2009 MusicIP was acquired by AmpliFIND. [5] Some time after the acquisition, the MusicDNS service began having intermittent problems. Since the future of the free identification service was uncertain, a replacement for it was sought. The Chromaprint acoustic fingerprinting algorithm, the basis for AcoustID identification service, was started in February 2010 by a long-time MusicBrainz contributor Lukáš Lalinský. [6] The oldest entry in the DB is from 8 Oct 2010. [7]

While AcoustID and Chromaprint are not officially MusicBrainz projects, they are closely tied with each other and both are open source. Chromaprint works by analyzing the first two minutes of a track, detecting the strength in each of 12 pitch classes, storing these 8 times per second. Additional post-processing is then applied to compress this fingerprint while retaining patterns. [8] The AcoustID search server then searches from the database of fingerprints by similarity and returns the AcoustID identifier along with MusicBrainz recording identifiers if known.

Since 2013 Chromaprint is the only fingerprint supported by MusicBrainz. [9]

Fingerprint ID

The fingerprint IDs are 8-digit and conform to /[1-9][0-9]{7}/. E.g.

AcoustID "track"

Groups of Chromaprints are given a UUID and can be reached via https://acoustid.org/track/<uuid>, e.g. https://acoustid.org/track/a64cc174-c77c-47ee-ac1b-78015270dfe6.

The underlying chromaprints can be reached via fingerprint IDs, e.g.

ID                                             Length  Sources https://acoustid.org/fingerprint/11799567 3:35 255 https://acoustid.org/fingerprint/41547743 3:36 152 https://acoustid.org/fingerprint/21463426 3:38 81

The linked MusicBrainz "recordings" can contain music of different performers, e.g.

Related Research Articles

<span class="mw-page-title-main">MusicBrainz</span> Online music metadata database

MusicBrainz is a MetaBrainz project that aims to create a collaborative music database that is similar to the freedb project. MusicBrainz was founded in response to the restrictions placed on the Compact Disc Database (CDDB), a database for software applications to look up audio CD information on the Internet. MusicBrainz has expanded its goals to reach beyond a CD metadata storehouse to become a structured online database for music.

<span class="mw-page-title-main">Identifier</span> Name that identifies a unique entity

An identifier is a name that identifies either a unique object or a unique class of objects, where the "object" or class may be an idea, person, physical countable object, or physical noncountable substance. The abbreviation ID often refers to identity, identification, or an identifier. An identifier may be a word, number, letter, symbol, or any combination of those.

<span class="mw-page-title-main">Universally unique identifier</span> Label used for information in computer systems

A Universally Unique Identifier (UUID) is a 128-bit label used to uniquely identify objects in computer systems. The term Globally Unique Identifier (GUID) is also used, mostly in Microsoft systems.

<span class="mw-page-title-main">Electronic identification</span> Digital proof of identity

An electronic identification ("eID") is a digital solution for proof of identity of citizens or organizations. They can be used to view to access benefits or services provided by government authorities, banks or other companies, for mobile payments, etc. Apart from online authentication and login, many electronic identity services also give users the option to sign electronic documents with a digital signature.

A pitch detection algorithm (PDA) is an algorithm designed to estimate the pitch or fundamental frequency of a quasiperiodic or oscillating signal, usually a digital recording of speech or a musical note or tone. This can be done in the time domain, the frequency domain, or both.

Catalogue Service for the Web (CSW), sometimes seen as Catalogue Service - Web, is a standard for exposing a catalogue of geospatial records in XML on the Internet. The catalogue is made up of records that describe geospatial data, geospatial services, and related resources.

<span class="mw-page-title-main">Tag editor</span> Software for editing the metadata of media files

A tag editor is an app that can add, edit, or remove embedded metadata on multimedia file formats. Content creators, such as musicians, photographers, podcasters, and video producers, may need to properly label and manage their creations, adding such details as title, creator, date of creation, and copyright notice.

AmpliFIND is an acoustic fingerprinting service and a software development kit developed by the US company MusicIP.

<span class="mw-page-title-main">Jaikoz</span> Java tagging program

Jaikoz is a Java program used for editing and mass tagging music file tags.

MicroID is a decentralized identity protocol. It was originally developed in 2005 by Jeremie Miller. A MicroID is a simple identifier comprising a hashed communication/identity URI and claimed URL. Together, the two elements create a hash that can be claimed by third-party services.

<span class="mw-page-title-main">Gracenote</span> American metadata company

Gracenote, Inc. is a company and service that provides music, video, and sports metadata and automatic content recognition (ACR) technologies to entertainment services and companies worldwide. Formerly CDDB, Gracenote maintains and licenses an Internet-accessible database containing information about the contents of audio compact discs and vinyl records. From 2008 to 2014, it was owned by Sony, later sold to Tribune Media, and has been owned since 2017 by Nielsen Holdings. In 2019, Nielsen Holdings announced plans to split into two separate publicly traded companies, Nielsen Global Connect and Nielsen Global Media. In October 2022, Nielsen Holdings, including the Gracenote subsidiary was acquired by a private equity consortium.

An audio search engine is a web-based search engine which crawls the web for audio content. The information can consist of web pages, images, audio files, or another type of document. Various techniques exist for research on these engines.

<span class="mw-page-title-main">International Standard Name Identifier</span> 16 digit identifier for people and organisations

The International Standard Name Identifier (ISNI) is an identifier system for uniquely identifying the public identities of contributors to media content such as books, television programmes, and newspaper articles. Such an identifier consists of 16 digits. It can optionally be displayed as divided into four blocks.

An acoustic fingerprint is a condensed digital summary, a digital fingerprint, deterministically generated from an audio signal, that can be used to identify an audio sample or quickly locate similar items in a music database.

The Entertainment Identifier Registry, or EIDR, is a global unique identifier system for a broad array of audiovisual objects, including motion pictures, television, and radio programs. The identification system resolves an identifier to a metadata record that is associated with top-level titles, edits, DVDs, encodings, clips, and mashups. EIDR also provides identifiers for video service providers, such as broadcast and cable networks.

<span class="mw-page-title-main">Puddletag</span> Tag editor for Unix-like operating systems

Puddletag is a graphical audio file metadata editor ("tagger") for Unix-like operating systems.

<span class="mw-page-title-main">Kid3</span> Tag editor

Kid3 is an open-source cross-platform audio tag editor for many audio file formats. It supports DSF, MP3, Ogg, FLAC, MPC, MPEG-4 (mp4/m4a/m4b), AAC, Opus, SPX, TrueAudio, APE, WavPack, WMA, WAV, AIFF, tracker modules.

Search by sound is the retrieval of information based on audio input. There are a handful of applications, specifically for mobile devices that utilize search by sound. Shazam, Soundhound, Axwave, ACRCloud and others have seen considerable success by using a simple algorithm to match an acoustic fingerprint to a song in a library. These applications take a sample clip of a song, or a user-generated melody and check a music library/music database to see where the clip matches with the song. From there, song information will be queried and displayed to the user.

Automatic content recognition (ACR) is a technology used to identify content played on a media device or presented within a media file. Devices with ACR can allow for the collection of content consumption information automatically at the screen or speaker level itself, without any user-based input or search efforts. This information may be collected for purposes such as personalized advertising, content recommendations, or sale to customer data aggregators.

SecondHandSongs is a collaborative website that maintains a global database of mainly cover versions of original works. It also contains information about adaptations and samples. The website allows performers and volunteer curators to add songs and update their metadata. It includes links to freely accessible recordings of the covers, and external identifiers for those works and performances in other databases.

References

  1. Frequently Asked Questions AcoustID
  2. Henry, Alan (2014-04-10). "The Best Apps For Automatically Cleaning Up Your Music Library". Lifehacker Australia. Retrieved 2019-02-24.
  3. Walsh, Terry (2017-04-28). "How to master your music metadata (Part 1)". TechHive. Retrieved 2019-02-24.
  4. Liane M. Dubowy (2012-08-24). "Audio-Tag-Editor Puddletag mit AcoustID-Support". Heise Online (in German). Retrieved 2019-02-23.
  5. AmpliFIND Music Services: News Archived 2013-09-21 at the Wayback Machine
  6. "Introducing Chromaprint – Lukáš Lalinský". Oxygene.sk. 2010-07-24. Retrieved 2018-04-10.
  7. "Five years of AcoustID - Lukáš Lalinský". oxygene.sk.
  8. Jang, Dalwon; Yoo, Chang D; Lee, Sunil; Kim, Sungwoong; Kalker, Ton (2011-01-18). "How does Chromaprint work? – Lukáš Lalinský". IEEE Transactions on Information Forensics and Security. 4 (4): 995–1004. doi:10.1109/TIFS.2009.2034452. S2CID   1502596 . Retrieved 2018-04-10.
  9. "Fingerprinting - MusicBrainz Wiki".

Further reading