Type of business | Private |
---|---|
Type of site | Search engine |
Available in | English |
Founded | 2006 |
Headquarters | Redwood City, California |
Key people | Jaideep Singh, Co-founder/CEO; Jay Bhatti, Co-founder/VP product; Hongche Liu, Chief Information Architect |
URL | www.spock.com |
Registration | optional |
Launched | 2006 |
Current status | active |
Spock is a vertical search engine, or entity search engine, for people in the US. The name "Spock" is explained with a backronym: "single point of contact (by) keyword." [1] Founded in 2006 by Jay Bhatti and Jaideep Singh, it has "indexed over 250 million people representing over 1.5 billion data records." [2] These records come from publicly available sources, including Wikipedia, IMDb, ESPN, LinkedIn, Hi5, MySpace, Friendster, Facebook, YouTube, Flickr, Twitter, corporate biographies, university faculty and staff pages, real estate agent sites, and school alumni and member directory pages. The company maintains that "30% of all Internet searches are people-related". [3]
As entity resolution is the main algorithmic hurdle of their indexing endeavour, Spock has issued and awarded the Spock Challenge Prize. The winning entry combines various machine learning algorithms. [4]
Spock opened its service to public beta on August 8, 2007. [5]
Google Search is a search engine provided by Google. Handling over 3.5 billion searches per day, it has a 92% share of the global search engine market. It is also the most-visited website in the world.
In computer science, a search algorithm is an algorithm which solves a search problem. Search algorithms work to retrieve information stored within some data structure, or calculated in the search space of a problem domain, either with discrete or continuous values.
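As a small illustration of a search over discrete values stored in a data structure, the following sketch implements binary search on a sorted list. The function name `binary_search` and the sample data are illustrative assumptions, not taken from any system described here.

```python
from typing import Optional, Sequence


def binary_search(items: Sequence[int], target: int) -> Optional[int]:
    """Return the index of target in the sorted sequence items, or None if absent."""
    lo, hi = 0, len(items) - 1
    while lo <= hi:
        mid = (lo + hi) // 2
        if items[mid] == target:
            return mid
        if items[mid] < target:
            lo = mid + 1   # target can only be in the upper half
        else:
            hi = mid - 1   # target can only be in the lower half
    return None


# Hypothetical sample data: the input must already be sorted.
sorted_values = [1, 3, 5, 7, 9]
position = binary_search(sorted_values, 7)
```

Each step halves the remaining range, so the lookup takes a number of comparisons logarithmic in the length of the list.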
A Web crawler, sometimes called a spider or spiderbot and often shortened to crawler, is an Internet bot that systematically browses the World Wide Web, typically operated by search engines for the purpose of Web indexing.
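The systematic browsing described above is often organized as a breadth-first traversal of links from a seed page. The sketch below simulates this on an in-memory link graph instead of fetching real pages; the `crawl` function, the `link_graph` mapping, and the example.com URLs are all hypothetical, and a real crawler would also need fetching, parsing, politeness delays, and robots.txt handling.

```python
from collections import deque
from typing import Dict, List


def crawl(link_graph: Dict[str, List[str]], seed: str, max_pages: int = 100) -> List[str]:
    """Visit pages breadth-first from seed and return them in visit order."""
    seen = {seed}              # URLs already queued, so no page is visited twice
    frontier = deque([seed])   # URLs waiting to be visited
    order: List[str] = []
    while frontier and len(order) < max_pages:
        url = frontier.popleft()
        order.append(url)
        for link in link_graph.get(url, []):
            if link not in seen:
                seen.add(link)
                frontier.append(link)
    return order


# Hypothetical link graph standing in for fetched pages and their outgoing links.
pages = {
    "https://example.com/": ["https://example.com/a", "https://example.com/b"],
    "https://example.com/a": ["https://example.com/b", "https://example.com/c"],
    "https://example.com/b": ["https://example.com/"],
    "https://example.com/c": [],
}
visit_order = crawl(pages, "https://example.com/")
```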
Sergey Mikhaylovich Brin is an American business magnate, computer scientist and Internet entrepreneur. Together with Larry Page, he co-founded Google. Brin was the president of Google's parent company, Alphabet Inc., until stepping down from the role on December 3, 2019. He and Page remain at Alphabet as co-founders, controlling shareholders, board members, and employees. As of November 2021, Brin is the 6th-richest person in the world, with an estimated net worth of $127.3 billion.
CiteSeerX is a public search engine and digital library for scientific and academic papers, primarily in the fields of computer and information science. CiteSeer is considered a predecessor of academic search tools such as Google Scholar and Microsoft Academic Search. CiteSeer-like engines and archives usually only harvest documents from publicly available websites and do not crawl publisher websites. For this reason, authors whose documents are freely available are more likely to be represented in the index.
Search engine optimization (SEO) is the process of improving the quality and quantity of website traffic to a website or a web page from search engines. SEO targets unpaid traffic rather than direct traffic or paid traffic. Unpaid traffic may originate from different kinds of searches, including image search, video search, academic search, news search, and industry-specific vertical search engines.
Record linkage is the task of finding records in a data set that refer to the same entity across different data sources. Record linkage is necessary when joining different data sets based on entities that may or may not share a common identifier, which may be due to differences in record shape, storage location, or curator style or preference. A data set that has undergone RL-oriented reconciliation may be referred to as being cross-linked. Record linkage is referred to as data linkage in many jurisdictions, but the two are the same process.
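When two sources share no common identifier, one simple approach is to normalize a field such as a name and link records whose normalized values are similar enough. The following is a minimal sketch of that idea, not Spock's method or the winning Spock Challenge entry; the record layout, the `link_records` helper, and the 0.85 threshold are assumptions chosen for illustration.

```python
from difflib import SequenceMatcher
from itertools import product
from typing import Dict, List, Tuple


def normalize(name: str) -> str:
    """Lowercase, replace punctuation with spaces, and sort tokens so word order is ignored."""
    cleaned = "".join(ch if ch.isalnum() or ch.isspace() else " " for ch in name.lower())
    return " ".join(sorted(cleaned.split()))


def similarity(a: str, b: str) -> float:
    """Similarity of two names after normalization, between 0.0 and 1.0."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def link_records(left: List[Dict[str, str]], right: List[Dict[str, str]],
                 threshold: float = 0.85) -> List[Tuple[str, str]]:
    """Return (left_id, right_id) pairs whose names score at or above threshold."""
    return [(l["id"], r["id"])
            for l, r in product(left, right)
            if similarity(l["name"], r["name"]) >= threshold]


# Hypothetical records from two sources with different naming conventions.
source_a = [{"id": "a1", "name": "Lovelace, Ada"}, {"id": "a2", "name": "Alan Turing"}]
source_b = [{"id": "b1", "name": "Ada Lovelace"}, {"id": "b2", "name": "Grace Hopper"}]
matches = link_records(source_a, source_b)
```

Comparing every pair is quadratic in the number of records, so larger data sets usually restrict comparisons to candidate groups first; that step is omitted here for brevity.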
Google Desktop was a computer program with desktop search capabilities, created by Google for Linux, Apple Mac OS X, and Microsoft Windows systems. It allowed text searches of a user's email messages, computer files, music, photos, chats, and Web pages viewed, and it could display "Google Gadgets" on the user's desktop in a Sidebar.
Google Scholar is a freely accessible web search engine that indexes the full text or metadata of scholarly literature across an array of publishing formats and disciplines. Released in beta in November 2004, the Google Scholar index includes most peer-reviewed online academic journals and books, conference papers, theses and dissertations, preprints, abstracts, technical reports, and other scholarly literature, including court opinions and patents. Google Scholar uses a web crawler, or web robot, to identify files for inclusion in the search results. For content to be indexed in Google Scholar, it must meet certain specified criteria. An earlier statistical estimate published in PLOS ONE, using a mark and recapture method, put coverage at approximately 80–90% of all articles published in English, with an estimate of 100 million. This estimate also determined how many documents were freely available on the internet.
A search engine is a software system that is designed to carry out web searches. It searches the World Wide Web in a systematic way for particular information specified in a textual web search query. The search results are generally presented in a line of results, often referred to as search engine results pages (SERPs). The information may be a mix of links to web pages, images, videos, infographics, articles, research papers, and other types of files. Some search engines also mine data available in databases or open directories. Unlike web directories, which are maintained only by human editors, search engines also maintain real-time information by running an algorithm on a web crawler. Internet content that is not capable of being searched by a web search engine is generally described as the deep web.
Microsoft Bing is a web search engine owned and operated by Microsoft. The service has its origins in Microsoft's previous search engines: MSN Search, Windows Live Search and later Live Search. Bing provides a variety of search services, including web, video, image and map search products. It is developed using ASP.NET.
Google Cloud Bigtable is a compressed, high-performance, proprietary data storage system built on Google File System, Chubby Lock Service, SSTable and a few other Google technologies. On May 6, 2015, a public version of Bigtable was made available as a service. Bigtable also underlies Google Cloud Datastore, which is available as a part of the Google Cloud Platform.
A search engine is an information retrieval software program that discovers, crawls, transforms and stores information for retrieval and presentation in response to user queries.
Search engine indexing is the collecting, parsing, and storing of data to facilitate fast and accurate information retrieval. Index design incorporates interdisciplinary concepts from linguistics, cognitive psychology, mathematics, informatics, and computer science. An alternate name for the process, in the context of search engines designed to find web pages on the Internet, is web indexing, or simply indexing.
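A common structure for this kind of retrieval is an inverted index, which maps each term to the documents that contain it. The sketch below collects, parses, and stores a few documents and then answers a query that requires every term to match; the function names and sample documents are hypothetical, and production indexes add ranking, stemming, and compression that are left out here.

```python
import re
from collections import defaultdict
from typing import Dict, List, Set


def tokenize(text: str) -> List[str]:
    """Split text into lowercase alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs: Dict[int, str]) -> Dict[str, Set[int]]:
    """Map each token to the set of document ids that contain it."""
    index: Dict[str, Set[int]] = defaultdict(set)
    for doc_id, text in docs.items():
        for token in tokenize(text):
            index[token].add(doc_id)
    return index


def search(index: Dict[str, Set[int]], query: str) -> Set[int]:
    """Return ids of documents containing every token in the query."""
    tokens = tokenize(query)
    if not tokens:
        return set()
    result = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        result &= index.get(token, set())
    return result


# Hypothetical documents keyed by id.
documents = {
    1: "Web crawler indexes pages",
    2: "Search engine indexing stores data",
    3: "Crawler and search engine",
}
index = build_index(documents)
hits = search(index, "search engine")
```

Looking up a term is a dictionary access, so query time depends on the number of matching documents and not on the size of the whole collection.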
Intelius, Inc. is a public records business headquartered in Seattle, Washington, United States. It provides information services, including people and property search, background checks and reverse phone lookup. Users also have the ability to perform reverse address lookups to find people using Intelius’ services and an address. Intelius, founded by former InfoSpace executives, was started in 2003.
Enterprise search is the practice of making content from multiple enterprise-type sources, such as databases and intranets, searchable to a defined audience.
GenieKnows Inc. was a privately owned vertical search engine company based in Halifax, Nova Scotia. It was started by Rami Hamodah, who also started SwiftlyLabs.com and Salesboom.com. Like many internet search engines, its revenue model centered on an online advertising platform and B2B transactions. It focused on a set of niche search markets, or verticals, including health search, video games search, and local business directory search.
Munax was a Swedish company that developed a Large Hyper-Parallel Execution (LHPE) search engine system, Munax XE. Munax XE was an all-content search engine that powered nationwide and worldwide public search engines with page, document, audio, video, image, software, and email search. Other customers included vertical search engines and mobile operators.
Yebol was a vertical "decision" search engine that had developed a knowledge-based, semantic search platform. Based in San Jose, California, Yebol described its algorithms as combining artificial intelligence with human intelligence. They automatically clustered and categorized search results, web sites, pages and content, which Yebol presented in a visually indexed format meant to align more closely with the searcher's initial intent. Yebol used association, ranking and clustering algorithms to analyze related keywords or web pages. Yebol presented as one of its goals the creation of a unique "homepage look" for every possible search term.
Volunia was a web search engine created by Massimo Marchiori. It was launched in beta, for registered power users only, on February 6, 2012. Volunia, dubbed "the search engine of the future", was speculated to be based on Hyper Search technology. On June 8, 2012, Marchiori announced in an open letter that he had been excluded from the project. Six days later, on June 14, 2012, the site went live; it ceased to operate in February 2014.