Phetch

Phetch is a game with a purpose intended to label images on the Internet with descriptive captions suitable for assisting visually impaired readers. Approximately 75% of the images on the web do not have proper alt text, making them inaccessible through screen readers.[citation needed] Phetch's approach is to label images externally to the web page, rather than depending on the page author to write proper alt text for each image. Instead of paying people to do the mundane task of labeling images, Phetch aims to be a fun game that produces such descriptions as a side effect of play.

Phetch was created by Luis von Ahn and Shiry Ginosar of Carnegie Mellon University following the pattern set by the earlier ESP game.

Phetch is played by three to five people. One is designated the describer, and the rest are seekers. The describer is shown an image and describes it to the seekers, who use an Internet image search engine to try to find the image being described. [1] The first seeker to find the image gains points and becomes the describer for the next round. The describer is also rewarded for a successful outcome.
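
The round structure described above can be sketched in a few lines of Python. This is only an illustration of the rules as stated here, not the actual Phetch implementation: the `Player` class, the `play_round` function, the point values, and the rule that the describer keeps the role after an unsuccessful round are all assumptions introduced for the example.

```python
from dataclasses import dataclass


@dataclass
class Player:
    name: str
    score: int = 0


def play_round(describer, seekers, description, guesses, target_image,
               seeker_points=10, describer_points=5):
    """Play one round and return (next_describer, collected_caption).

    `guesses` is a list of (seeker_name, image_id) pairs in the order
    they were submitted. The point values are hypothetical; the source
    only says that both the winning seeker and the describer are rewarded.
    """
    by_name = {p.name: p for p in seekers}
    for seeker_name, image_id in guesses:
        if image_id == target_image:
            winner = by_name[seeker_name]
            winner.score += seeker_points       # first seeker to find the image
            describer.score += describer_points  # describer shares in the success
            # The description is the useful by-product of the round.
            return winner, (target_image, description)
    # Assumption: if nobody finds the image, the describer keeps the role
    # and no caption is collected.
    return describer, None


if __name__ == "__main__":
    alice, bob, carol = Player("alice"), Player("bob"), Player("carol")
    next_describer, caption = play_round(
        alice, [bob, carol], "a white dog catching a red frisbee",
        [("bob", "img_7"), ("carol", "img_3"), ("bob", "img_3")],
        target_image="img_3",
    )
    print(next_describer.name, caption)
```

In this sketch the winning seeker is returned as the next describer, mirroring the role rotation described above, and the `(image, description)` pair stands in for the caption the game is designed to collect.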

The data produced as a side effect of playing the game are the describers' descriptions of the images. The Phetch paper describes a proposed system for serving these descriptions from a centralized server. [2]

The output of the game was later used to improve image search engines, [3] and the game itself was proposed as a mechanism for testing interactive search interfaces. [4]

In late 2008, public access to Phetch was discontinued when the ESP game was moved to the gwap.com domain. Peekaboom was also discontinued in late 2008.

Notes

  1. Marks, Paul: "Gamers help the blind get the picture", New Scientist, Daily News, 16 May 2006.
  2. von Ahn, Luis; Ginosar, Shiry; Kedia, Mihir; Blum, Manuel: "Improving Accessibility of the Web with a Computer Game", ACM CHI Notes 2006.
  3. von Ahn, Luis; Ginosar, Shiry; Kedia, Mihir; Blum, Manuel: "Improving Image Search with Phetch", Proceedings, International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2007.
  4. Ginosar, Shiry: "Human Computation for HCIR Evaluation", Proceedings, HCIR 2007, pp. 40-42.
