List of CBIR engines

This is a list of publicly available content-based image retrieval (CBIR) engines. These image search engines look at the content (pixels) of images in order to return results that match a particular query.
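As a rough illustration of what "looking at the content (pixels)" can mean, the following is a minimal sketch of one classic approach: describe each image by a quantized color histogram and rank candidates by histogram intersection. It is not the method of any engine listed here; the function names, the 4-bins-per-channel quantization, and the toy pixel data are all assumptions made for the example.

```python
from collections import Counter


def color_histogram(pixels, bins_per_channel=4):
    """Return a normalized, quantized RGB histogram.

    `pixels` is an iterable of (r, g, b) tuples with values in 0-255.
    """
    step = 256 // bins_per_channel
    counts = Counter((r // step, g // step, b // step) for r, g, b in pixels)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {bin_key: n / total for bin_key, n in counts.items()}


def histogram_intersection(h1, h2):
    """Similarity in [0, 1]; 1.0 means the two histograms are identical."""
    return sum(min(weight, h2.get(bin_key, 0.0)) for bin_key, weight in h1.items())


def rank_by_similarity(query_pixels, index):
    """Rank the images in `index` (name -> pixels) from most to least similar."""
    query_hist = color_histogram(query_pixels)
    scored = [
        (histogram_intersection(query_hist, color_histogram(pixels)), name)
        for name, pixels in index.items()
    ]
    scored.sort(key=lambda item: item[0], reverse=True)
    return [name for _, name in scored]


# Hypothetical toy "images": flat lists of RGB pixels.
index = {
    "mostly_red": [(250, 10, 10)] * 9 + [(10, 10, 250)],
    "mostly_blue": [(10, 10, 250)] * 9 + [(250, 10, 10)],
}
query = [(240, 20, 20)] * 10  # a reddish query image
ranked = rank_by_similarity(query, index)
```

Here `ranked` places `mostly_red` ahead of `mostly_blue`, because the query shares far more histogram mass with the red image. Real engines typically use richer descriptors (texture, local features, learned embeddings) and an index structure rather than a linear scan.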

Commercial CBIR search engines

| Name | Description | External Image Query | Metadata Query | Index Size (Estimate, Millions of Images) | Organization Type | License (Open/Closed) |
|---|---|---|---|---|---|---|
| Pixolution | CBIR search engine, by pixolution | No | No | 32M | Private Company | Closed |
| Picalike | CBIR engine for Mobile and eCommerce | No | No (additional filters can be added) | | Private Company | Closed |
| Elastic Vision | Smart image searcher with content-based clustering in a visual network. | No | No | | Private Company | Closed |
| Google Image Search | Google's CBIR system with reverse image search | Yes | Yes | | Public Company | Google Custom Search API access |
| Yandex Image Search | Yandex CBIR system | Yes | Yes | 10000M | Public Company | Closed |
| Baidu Image Search | Baidu's CBIR system | Yes | Yes | 1000M | Public Company | Closed |
| ID My Pill | Automatic prescription pill identification (CBIR) | Yes | No | | Private Company | Open (via API) |
| Imense Image Search Portal | CBIR search engine, by Imense. | No | Yes | 3M | Private Company | Closed |
| Imprezzeo Image Search | CBIR search engine, by Imprezzeo. | No | Yes | | Private Company | Closed |
| Incogna Image Search | CBIR search engine, by Incogna Inc. | No | Yes | 100M | Private Company | Closed |
| Chic Engine | Visual fashion search engine (CBIR) | Yes | No | | Private Company | Closed |
| MiPai similarity search engine | Online similarity search engine | Yes | Yes | 100M | Individual | Closed |
| Piximilar | Demo engine, developed by Idee Inc. | No | No | 3M | Private Company | Closed |
| Empora | Product comparison & shopping using CBIR for product images. Previously known as Pixsta | No | Yes | 0.5M | Private Company | Closed |
| Shopachu | Shopping & fashion CBIR engine, by Incogna Inc. | No | Yes | 1M | Private Company | Closed |
| TinEye | CBIR site for finding variations of web images, by Idee Inc. | Yes | No | 24200M | Private Company | Closed |
| PicScout | CBIR service tracks image usage across the web. | Yes | Yes | 270M | Private Company (Getty Images) | Open (via API) |
| Galaxy | CBIR engine for finding product/catalogue/video frames, by Odd Concepts Inc. | Yes | Yes | 35M | Private Company | Closed |
| eBay Image Search | Image Search for eBay Fashion | No | Yes | 20M | Public Company | Closed |
| LykDat | LykDat Fashion Search Engine | Yes | No | 13M | Private Company | Closed |
| IMMENSELAB | CBIR search engine by KBKGROUP. | Yes | No | 10M | Private Company | Closed |
| Macroglossa Visual Search | CBIR visual search engine | Yes | No | | Private Company | Closed |
| NoClone | PC image search engine and classification based on content | Yes (a set) | No | | Private Company | Closed |
| Querbie | General purpose CBIR visual search engine | Yes | Yes | 20M | Private Company | Closed |
| Infringement.Report | CBIR visual search engine | Yes | Yes | | Private Company | Open (via API) |

CBIR research projects/demos/open source projects

| Name | Description | External Image Query | Metadata Query | Index Size (Estimate, Millions of Images) | Organization Type | License (Open/Closed) |
|---|---|---|---|---|---|---|
| akiwi | akiwi is a semi-automatic image keywording tool using CBIR techniques. It was developed by HTW Berlin / pixolution GmbH. | Yes | Yes | 15M | University | Closed |
| ALIPR | Developed by Penn State University researchers. | Yes | Yes | | University | Closed |
| Anaktisi | This web solution implements a new family of CBIR descriptors. These descriptors combine color and texture information in one histogram and are suitable for accurately retrieving images. | Yes | No | 0.225M | University | Open |
| BRISC | BRISC is a recursive acronym for BRISC Really IS Cool, and is (conveniently enough) also an anagram of Content-Based Image Retrieval System. | Yes | No | | University | GPL |
| digiKam | Extensive photo management application built on top of KDE libraries. It provides, besides many other features, reverse searches for images in the local collection, detection of duplicates and a fuzzy search by drawings. | Yes | Yes | Desktop-based | KDE | GPL |
| Caliph & Emir | Creation and retrieval of images based on MPEG-7. | Yes | No | Desktop-based | University | GPL |
| FIRE | Open source query by visual example CBIR system. Developed at RWTH Aachen University. FIRE is a research system developed with extensibility in mind and can easily be combined with textual information retrieval systems. | No | No | | University | Open |
| GNU Image Finding Tool | Query by example image search system. | Yes | No | Desktop-based | GNU | GPL |
| ISSBP | Similar Image Search by Imense plugin for Adobe Bridge, free beta. | Yes | Yes | free-beta limited to 4k images | Private Company | Closed |
| img(Rummager) | Image retrieval engine (freeware application). | Yes | No | Desktop-based | Individual | Closed |
| imgSeek | Photo collection manager and viewer with content-based search and many other features. | Yes | No | | Individual | GPL |
| IKONA | Generic CBIR system - INRIA - IMEDIA | Yes | Yes | | University | Closed |
| IOSB | Image retrieval demonstration software of Fraunhofer IOSB (Germany) | Yes | No | Desktop-based | Research Institute | Closed |
| LIRE | Java GPL library for content-based image retrieval based on Lucene, including multiple low-level global and local features and different indexing strategies including bag of visual words and hashing. | Yes | Yes | | University | GPL |
| Lucignolo | Image similarity search engine using only the native full-text search engine Lucene. | Yes | Yes | 106M | Research Institute | Closed |
| Luigi | Large Histopathological Image Retrieval System developed at University of Tokyo | Yes | No | 0.3M | University | Closed |
| MIFile | Image similarity search engine based on MI File (Metric Inverted File) developed at ISTI-CNR. Source code of the MI File. | No | No | 100M | Research Institute | Open |
| MUVIS | CBIR system at TUT (Tampere University of Technology). | Yes | No | Desktop-based | University | Closed |
| Pastec | C++ LGPL index and search engine for near-duplicate image retrieval that uses bag of visual words with ORB features. | Yes | Yes | | Private Company | LGPL |
| PIBE | An adaptive image browsing system that provides users with an intuitive, easy-to-use, structured view of an image collection and complements it with ideas from the field of adaptable content-based similarity search. A hierarchical view of images (the Browsing Tree) that can be customized according to user preferences is provided. | Yes | No | | University | Closed |
| PicsLikeThat | Image search using visual similarity search and sorting combined with a recommender system. (Cooperation of pixolution GmbH, fotolia and HTW Berlin) | No | No | 12M | University | Closed |
| PIRIA | CBIR tool developed at CEA-LIST, LVIC (Vision and Content Engineering Laboratory). | Yes | Yes | 1000M | University | Closed |
| Pixcavator | Similar image search based on topological image analysis | Yes | No | Desktop-based | Private Company | Closed |
| QuickLook | Visual information retrieval system with relevance feedback | No | Yes | | University | Closed |
| RETIN | Interactive image retrieval system - CNRS - ETIS Lab., MIDI Team | No | No | | University | Closed |
| Retrievr | Search and explore in a selection of Flickr images by drawing a rough sketch or uploading an image. | No | No | | University | Closed |
| SHIATSU | A system for automatic video tagging based on shot boundary detection and hierarchical annotation processes. The tagging phase assigns semantic concepts to both shot sequences and whole videos by exploiting visual features extracted from key frames. | Yes | Yes | | University | Closed |
| SIMBA | Demo of system by the Albert-Ludwigs-Universität Freiburg (Germany), Inst. for Pattern Recognition and Image Processing | Yes | No | 0.002M | University | Closed |
| TagProp | Demonstration of the image annotation tool TagProp at ICCV 2009 for the image sets Corel 5k, ESP Game, IAPR TC-12 and MIR Flickr. | No | Yes | | Institute | Closed |
| VIRaL | Visual Image Retrieval and Localization: a visual search engine that, given a query image, retrieves photos depicting the same object or scene under varying viewpoint or lighting conditions. Using Flickr photos of urban scenes, it automatically estimates where a picture was taken, suggests tags, identifies known landmarks or points of interest, and links to relevant Wikipedia articles. It currently supports 39 cities around the world. | Yes | Yes | 2.221M | University | Closed |
| Windsurf | A general framework for efficiently processing content-based image queries, with particular emphasis on the region-based paradigm. It provides an environment where different alternatives of the paradigm can be implemented, allowing such implementations to be compared on a fair basis in terms of both effectiveness and efficiency. | Yes | No | | University | Open but not free |
