Archive Team

Last updated
Archive Team logo Archive Team logo.png
Archive Team logo

Archive Team is a group dedicated to digital preservation and web archiving that was co-founded by Jason Scott in 2009. [1] [2]

Contents

Its primary focus is the copying and preservation of content housed by at-risk online services. Some of its projects include the partial preservation of GeoCities, [3] [4] Yahoo! Video, Google Video, Splinder, Friendster, FortuneCity, [5] [6] [7] [8] [9] [10] [11] [12] [ excessive citations ] TwitPic, [13] SoundCloud, [14] and the "Aaron Swartz Memorial JSTOR Liberator". [15] Archive Team also archives URL shortener services [16] and wikis [17] on a regular basis.

According to Jason Scott, "Archive Team was started out of anger and a feeling of powerlessness, this feeling that we were letting companies decide for us what was going to survive and what was going to die." [18] Scott continues, "it's not our job to figure out what's valuable, to figure out what's meaningful. We work by three virtues: rage, paranoia and kleptomania." [19]

Warrior/Tracker system

Archive Team is composed of a loose community of independent contributors/users.[ citation needed ] Their archival process makes use of a "Warrior", a virtual machine environment. Individuals use the Warrior in their desktop environments use to download content without requiring technical expertise. Tasks are allocated by a centrally-managed Tracker that networks with and allocates items to Warriors. The tracker also monitors user upload activity and displays a leader board. [20]

Projects

There are several projects currently running:

As of 9 June 2023, the largest project on ArchiveTeam is Reddit, with over 2.82 petabytes archived. [33]

See also

Related Research Articles

<span class="mw-page-title-main">PHP</span> Scripting language created in 1994

PHP is a general-purpose scripting language geared towards web development. It was originally created by Danish-Canadian programmer Rasmus Lerdorf in 1993 and released in 1995. The PHP reference implementation is now produced by the PHP Group. PHP was originally an abbreviation of Personal Home Page, but it now stands for the recursive initialism PHP: Hypertext Preprocessor.

<span class="mw-page-title-main">FreeDOS</span> Open source clone of MS-DOS

FreeDOS is a free software operating system for IBM PC compatible computers. It intends to provide a complete MS-DOS-compatible environment for running legacy software and supporting embedded systems.

robots.txt is the filename used for implementing the Robots Exclusion Protocol, a standard used by websites to indicate to visiting web crawlers and other web robots which portions of the website they are allowed to visit.

<span class="mw-page-title-main">Wikisource</span> Free online library on a wiki

Wikisource is an online digital library of free-content textual sources on a wiki, operated by the Wikimedia Foundation. Wikisource is the name of the project as a whole and the name for each instance of that project ; multiple Wikisources make up the overall project of Wikisource. The project's aim is to host all forms of free text, in many languages, and translations. Originally conceived as an archive to store useful or important historical texts, it has expanded to become a general-content library. The project officially began on November 24, 2003, under the name Project Sourceberg, a play on the famous Project Gutenberg. The name Wikisource was adopted later that year and it received its own domain name.

This article outlines the general features commonly found in various Internet forum software packages. It highlights major features that the manager of a forum might want and should expect to be commonly available in different forum software. These comparisons do not include remotely hosted services which use their own proprietary software, rather than offering a package for download which webmasters can host by themselves.

<span class="mw-page-title-main">Markdown</span> Plain text markup language

Markdown is a lightweight markup language for creating formatted text using a plain-text editor. John Gruber created Markdown in 2004 as a markup language that is easy to read in its source code form. Markdown is widely used for blogging and instant messaging, and also used elsewhere in online forums, collaborative software, documentation pages, and readme files.

<span class="mw-page-title-main">Progressive enhancement</span> Web design strategy putting emphasis web content first

Progressive enhancement is a strategy in web design that puts emphasis on web content first, allowing everyone to access the basic content and functionality of a web page, whilst users with additional browser features or faster Internet access receive the enhanced version instead. This strategy speeds up loading and facilitates crawling by web search engines, as text on a page is loaded immediately through the HTML source code rather than having to wait for JavaScript to initiate and load the content subsequently, meaning content ready for consumption "out of the box" is served immediately, and not behind additional layers.

Notable issue tracking systems, including bug tracking systems, help desk and service desk issue tracking systems, as well as asset management systems, include the following. The comparison includes client-server application, distributed and hosted systems.

<span class="mw-page-title-main">Wikiversity</span> Wikimedia wiki for learning materials that anyone can edit

Wikiversity is a Wikimedia Foundation project that supports learning communities, their learning materials, and resulting activities. It differs from Wikipedia in that it offers tutorials and other materials for the fostering of learning, rather than an encyclopedia. It is available in many languages.

<span class="mw-page-title-main">Imgur</span> American online image hosting service

Imgur is an American online image sharing and image hosting service with a focus on social gossip that was founded by Alan Schaaf in 2009. The service has hosted viral images and memes, particularly those posted on Reddit.

<span class="mw-page-title-main">Curse LLC</span> Network of gaming websites

Curse was a gaming company that managed the video game mod host CurseForge, wiki host Gamepedia, and the Curse Network of gaming community websites.

References

  1. Scott, Jason (January 6, 2009). "Team Archive is GO". ASCII by Jason Scott. Archived from the original on 2016-11-02. Retrieved December 30, 2016.
  2. "Revision history of "Main Page"". Archive Team. Archived from the original on 2016-12-31. Retrieved December 30, 2016.
  3. Gilbertson, Scott (2010-11-01). "Geocities Lives On as Massive Torrent Download". Wired. Archived from the original on 2012-04-25.
  4. Modine, Austin (2009-04-28). "Web 0.2 archivists save Geocities from deletion". The Register. Archived from the original on 2012-05-03.
  5. Sullivan, Mark (2012-04-13). "The 'Archive Team' Rescues User Content From Doomed Sites". PC World. Archived from the original on 2012-04-20.
  6. Schwartz, Matt (January 2012). "Fire in the Library". Technology Review. Archived from the original on 2012-01-24.
  7. Garfield, Bob; Scott, Jason (2012-03-23). "The Archive Team". OnTheMedia. Archived from the original on 2012-04-27. Retrieved 2012-04-19.
  8. Masnick, Mike (2012-04-12). "Historic Archive Of Websites From The January 18th SOPA Blackout". Techdirt. Archived from the original on 2012-04-15.
  9. Scott, Jason (2012-03-06). "Click: The Archive Team - Jason Scott talks about his mission to salvage our digital heritage". BBC. Archived from the original on 2015-04-03.
  10. Morton, Simon; Scott, Jason (2012-03-03). "The Archive Team". RadioNZ. Archived from the original on 2012-04-21.
  11. Misener, Dan (2011-04-29). "Full Interview: Jason Scott on online video and digital heritage". CBC. Archived from the original on 2012-10-26.
  12. Paul-Choudhury, Sumit (May 6, 2011). "Amateur heroes of online heritage". New Scientist. Archived from the original on April 2, 2015. Retrieved March 9, 2015.
  13. "TwitPic - Archiveteam". Archived from the original on 2014-09-09. Retrieved 2014-09-17.
  14. "Archive Team promises to back up SoundCloud amid worries of a shutdown". 2017-07-18. Archived from the original on 2018-10-21. Retrieved 2018-11-28.
  15. "Aaron Swartz Memorial JSTOR Liberator sets public domain academic articles free". 2013-01-15. Archived from the original on 2018-03-23. Retrieved 2018-11-28.
  16. "url shortening was a fucking awful idea". URLTE.AM. Archived from the original on 2011-06-11.
  17. WikiTeam Archived 2016-02-10 at the Wayback Machine - We archive wikis, from Wikipedia to tiniest wikis
  18. "Open Source Bridge 2012 Keynote - Jason Scott". YouTube . Archived from the original on 2017-09-14. Retrieved 2018-11-28.
  19. "Open Source Bridge 2012 Keynote - Jason Scott". YouTube . Archived from the original on 2017-09-14. Retrieved 2018-11-28.
  20. Ogden, Jessica (October 21, 2021). ""Everything on the internet can be saved": Archive Team, Tumblr and the cultural significance of web archiving". Internet Histories. 6 (1–2): 113–132. doi: 10.1080/24701475.2021.1985835 . hdl: 1983/daef55ca-1fb1-4d91-a820-244bf24fe2b7 . S2CID   239510759.
  21. "Imgur Terms of Service Update". Imgur Help. Archived from the original on 31 May 2023. Retrieved 9 June 2023.
  22. "Blogger - Archiveteam". wiki.archiveteam.org. Retrieved 2024-01-02.
  23. KeyserSosa (2023-04-18). "An Update Regarding Reddit's API". r/reddit. Archived from the original on 2023-06-08. Retrieved 2023-06-09.
  24. ".ua - Archiveteam". wiki.archiveteam.org. Archived from the original on 2023-03-23. Retrieved 2023-06-09.
  25. "Telegram - Archiveteam". wiki.archiveteam.org. Archived from the original on 2023-05-29. Retrieved 2023-06-09.
  26. "GitHub - Archiveteam". wiki.archiveteam.org. Archived from the original on 2023-05-27. Retrieved 2023-06-09.
  27. "MediaFire - Archiveteam". wiki.archiveteam.org. Retrieved 2024-01-02.
  28. "Coronavirus - Archiveteam". wiki.archiveteam.org. Archived from the original on 2023-06-09. Retrieved 2023-06-09.
  29. "YouTube - Archiveteam". wiki.archiveteam.org. Retrieved 2024-01-02.
  30. "WikiTeam - Archiveteam". wiki.archiveteam.org. Retrieved 2024-01-02.
  31. "URLTeam - Archiveteam". wiki.archiveteam.org. Retrieved 2024-01-02.
  32. "URLs - Archiveteam". wiki.archiveteam.org. Retrieved 2024-01-02.
  33. "Reddit tracker Dashboard". tracker.archiveteam.org. Archived from the original on 2023-05-23. Retrieved 2023-06-09.