Original author(s) | Alan Emtage |
---|---|
Developer(s) | Bunyip Information Systems, Inc. |
Initial release | 10 September 1990 [1] |
Final release | 3.5 / 1996 |
Written in | C |
Operating system | Solaris, AIX |
Type | Web search engine |
Website | bunyip.com/products/archie/ (original product page, archived) |
Archie is a tool for indexing FTP archives, allowing users to more easily identify specific files. It is considered the first Internet search engine. [2] The original implementation was written in 1990 by Alan Emtage, then a postgraduate student at McGill University in Montreal, Canada. [3] [4] [5] [6] Archie was superseded by other, more sophisticated search engines, including Jughead and Veronica, which were search engines for the Gopher protocol. These were in turn superseded by search engines like Yahoo! in 1995 and Google in 1998. Work on Archie ceased in the late 1990s. A legacy Archie server was maintained for historic purposes in Poland, at the Interdisciplinary Centre for Mathematical and Computational Modelling at the University of Warsaw, until 2023.
With assistance from the University of Warsaw, a new Archie server was created and opened for public access at The Serial Port, a web-based computer museum, on 11 May 2024. [7] [8]
Archie first appeared in 1986, while Emtage was the systems manager at the McGill University School of Computer Science. His predecessor had attempted to persuade the institution to connect to the Internet, but the high cost (roughly $35,000 per year for a sluggish link to Boston) had made it difficult to convince the appropriate parties that the investment was worthwhile. [9]
The name derives from the word "archive" without the 'v'. Emtage has said that contrary to popular belief, there was no association with the Archie Comics. [10] Despite this, other early Internet search technologies such as Jughead and Veronica were named after characters from the comics. Anarchie, one of the earliest graphical FTP clients, was named for its ability to perform Archie searches.
The earliest versions of Archie would simply search a list of public anonymous File Transfer Protocol (FTP) sites using the Telnet protocol and create index files available via FTP. To view the contents of a file, it first had to be downloaded. The indexes were updated on a regular basis by requesting a listing from each site (each was contacted roughly once a month, so as not to waste too many resources of the remote servers). These listings were stored in local files to be searched using the Unix grep command.
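The listing-and-grep approach described above can be illustrated with a minimal Python sketch. It is not Archie's actual code: the host names, the sample listings, the `LISTINGS` dictionary standing in for the local listing files, and the helper names `needs_refresh` and `grep_listings` are all assumptions made for illustration.

```python
import re
from datetime import datetime, timedelta

# Hypothetical stand-in for the local listing files: one text blob per FTP site.
LISTINGS = {
    "ftp.example.org": "pub/gnu/emacs-18.55.tar.Z\npub/gnu/gcc-1.37.tar.Z\n",
    "ftp.example.net": "mirrors/x11/xarchie-1.0.tar.Z\npub/README\n",
}

# Assumed refresh policy: contact each site roughly once a month.
REFRESH_INTERVAL = timedelta(days=30)


def needs_refresh(last_fetched, now):
    """Return True when a site's stored listing is at least a month old."""
    return now - last_fetched >= REFRESH_INTERVAL


def grep_listings(pattern, listings):
    """Scan every stored listing line by line, as grep would, and
    return (host, line) pairs for the lines that match the pattern."""
    regex = re.compile(pattern)
    hits = []
    for host in sorted(listings):
        for line in listings[host].splitlines():
            if regex.search(line):
                hits.append((host, line))
    return hits
```

Calling `grep_listings(r"emacs", LISTINGS)` returns `[("ftp.example.org", "pub/gnu/emacs-18.55.tar.Z")]`. Every query rescans every listing, which may help explain why more efficient back-ends were developed later.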
The developers populated the engine's servers with databases of anonymous FTP host directories. [11] Because these directory listings were loaded into a searchable database of FTP sites, the database could be used to find specific file titles. [12] Archie neither recognized natural language requests nor indexed the content inside the files, so users had to know the title of the file they wanted. The ability to index the content inside the files was later introduced by Gopher.
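The practical effect of indexing file titles rather than file contents can be shown with a short Python sketch. It is an illustration under stated assumptions, not a reconstruction of Archie's database: the `RECORDS` sample data and the functions `build_name_index` and `find_by_name` are hypothetical.

```python
import posixpath
from collections import defaultdict

# Hypothetical (host, path) records harvested from anonymous FTP listings.
RECORDS = [
    ("ftp.example.org", "/pub/gnu/emacs-18.55.tar.Z"),
    ("ftp.example.net", "/mirrors/gnu/emacs-18.55.tar.Z"),
    ("ftp.example.org", "/pub/docs/editors.txt"),
]


def build_name_index(records):
    """Map each lower-cased file name to every (host, path) where it appears.
    Only names are indexed; file contents are never read."""
    index = defaultdict(list)
    for host, path in records:
        name = posixpath.basename(path).lower()
        index[name].append((host, path))
    return dict(index)


def find_by_name(index, fragment):
    """Return all locations whose file name contains the given fragment."""
    fragment = fragment.lower()
    results = []
    for name in sorted(index):
        if fragment in name:
            results.extend(index[name])
    return results
```

With `index = build_name_index(RECORDS)`, the query `find_by_name(index, "EMACS")` returns both copies of the archive, while a natural-language query such as `find_by_name(index, "text editor")` returns an empty list, because no file name contains that phrase.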
Emtage and Bill Heelan wrote a script allowing people to log in and search the collected information using the Telnet protocol at the host "archie.mcgill.ca" [132.206.2.3]. [13] Later, more efficient front- and back-ends were developed, and the system spread from a local tool to a network-wide resource and a popular service available from multiple sites around the Internet. The collected data was exchanged between neighbouring Archie servers. The servers could be accessed in multiple ways: using a local client (such as archie or xarchie); telnetting to a server directly; sending queries by electronic mail; [14] and later via a World Wide Web interface. At the peak of its popularity, the Archie search engine accounted for 50% of Montreal Internet traffic. [15]
In 1992, Emtage and J. Peter Deutsch, with some financial help from McGill University, formed Bunyip Information Systems, which offered a licensed commercial version of the Archie search engine that was used by millions of people worldwide. Heelan joined them at Bunyip soon after, where he, together with Bibi Ali and Sandro Mazzucato, significantly updated the Archie database and indexed web pages. Work on the search engine ceased in the late 1990s, and the company dissolved in 2003. [16]