Archive Team

Last updated
Archive Team logo Archive Team logo.png
Archive Team logo

Archive Team is a group dedicated to digital preservation and web archiving that was co-founded by Jason Scott in 2009. [1] [2]

Contents

Its primary focus is the copying and preservation of content housed by at-risk online services. Some of its projects include the partial preservation of GeoCities, [3] [4] Yahoo! Video, Google Video, Splinder, Friendster, FortuneCity, [a] TwitPic, [5] SoundCloud, [6] Veoh, and the "Aaron Swartz Memorial JSTOR Liberator". [7] Archive Team also archives URL shortener services [8] and wikis [9] on a regular basis.

According to Jason Scott, "Archive Team was started out of anger and a feeling of powerlessness, this feeling that we were letting companies decide for us what was going to survive and what was going to die." [10] Scott continues, "it's not our job to figure out what's valuable, to figure out what's meaningful. We work by three virtues: rage, paranoia, and kleptomania." [11]

Warrior/Tracker system

Archive Team is composed of a loose community of independent contributors/users.[ citation needed ] Their archival process makes use of a "Warrior", a virtual machine environment. Individuals use the Warrior in their desktop environments to download content without requiring technical expertise. Tasks are allocated by a centrally-managed Tracker that networks with and allocates items to Warriors. The tracker also monitors user upload activity and displays a leader board. [12]

Projects

There are several projects currently running:

As of 18 June 2024, the largest project on ArchiveTeam is Reddit, with over 3 petabytes archived. [25] [b]

See also

Notes

  1. Sources covering Archive Team projects: [c] [d] [e] [f] [g] [h] [i] [j]
  2. The tracker uses units with binary prefixes e.g. pebibytes (1024 GB) instead of petabytes (1000 GB)

Related Research Articles

<span class="mw-page-title-main">PHP</span> Scripting language created in 1994

PHP is a general-purpose scripting language geared towards web development. It was originally created by Danish-Canadian programmer Rasmus Lerdorf in 1993 and released in 1995. The PHP reference implementation is now produced by the PHP Group. PHP was originally an abbreviation of Personal Home Page, but it now stands for the recursive acronym PHP: Hypertext Preprocessor.

robots.txt Internet protocol

robots.txt is the filename used for implementing the Robots Exclusion Protocol, a standard used by websites to indicate to visiting web crawlers and other web robots which portions of the website they are allowed to visit.

<i>The Battle for Wesnoth</i> Free and open source turn-based strategy video game

The Battle for Wesnoth is a free and open-source turn-based strategy video game with a high fantasy setting, designed by Australian-American developer David White and first released in June 2003. In Wesnoth, the player controls a particular faction/race and attempts to build a powerful army by controlling villages and defeating enemies for experience. The game is loosely based on the Sega Genesis games Master of Monsters and Warsong.

This article outlines the general features commonly found in various Internet forum software packages. It highlights major features that the manager of a forum might want and should expect to be commonly available in different forum software. These comparisons do not include remotely hosted services which use their own proprietary software, rather than offering a package for download which webmasters can host by themselves.

<span class="mw-page-title-main">Image sharing</span> Publishing or transfer of photos online

Image sharing, or photo sharing, is the publishing or transfer of digital photos online. Image sharing websites offer services such as uploading, hosting, managing and sharing of photos. This function is provided through both websites and applications that facilitate the upload and display of images. The term can also be loosely applied to the use of online photo galleries that are set up and managed by individual users, including photoblogs. Sharing means that other users can view but not necessarily download images, and users can select different copyright options for their images.

<span class="mw-page-title-main">Markdown</span> Plain text markup language

Markdown is a lightweight markup language for creating formatted text using a plain-text editor. John Gruber created Markdown in 2004 as an easy-to-read markup language. Markdown is widely used for blogging and instant messaging, and also used elsewhere in online forums, collaborative software, documentation pages, and readme files.

<span class="mw-page-title-main">Progressive enhancement</span> Web design strategy putting emphasis web content first

Progressive enhancement is a strategy in web design that puts emphasis on web content first, allowing everyone to access the basic content and functionality of a web page, whilst users with additional browser features or faster Internet access receive the enhanced version instead. This strategy speeds up loading and facilitates crawling by web search engines, as text on a page is loaded immediately through the HTML source code rather than having to wait for JavaScript to initiate and load the content subsequently, meaning content ready for consumption "out of the box" is served immediately, and not behind additional layers.

Notable issue tracking systems, including bug tracking systems, help desk and service desk issue tracking systems, as well as asset management systems, include the following. The comparison includes client-server application, distributed and hosted systems.

<span class="mw-page-title-main">Vuze</span> BitTorrent client

Vuze is a BitTorrent client used to transfer files via the BitTorrent protocol. Vuze is written in Java, and uses the Azureus Engine. In addition to downloading data linked to .torrent files, Azureus allows users to view, publish and share original DVD and HD quality video content. Content is presented through channels and categories containing TV shows, music videos, movies, video games, series and others.

<span class="mw-page-title-main">TuneIn</span> American audio streaming service

TuneIn is a global audio streaming service providing news, radio, sports, music, and podcasts to over 75 million monthly active users.

<span class="mw-page-title-main">Deletionpedia</span> Website collecting deleted Wikipedia articles

Deletionpedia was an online archive wiki containing articles deleted from the English Wikipedia. Its version of each article included a header with more information about the deletion such as whether a speedy deletion occurred, where the deletion discussion about the article can be found and which editor deleted the article. The original Deletionpedia operated from February to September 2008. The site was restarted under new management in December 2013.

<span class="mw-page-title-main">Imgur</span> American online image hosting service

Imgur is an American online image sharing and image hosting service with a focus on social gossip that was founded by Alan Schaaf in 2009. The service has hosted viral images and memes, particularly those posted on Reddit.

<span class="mw-page-title-main">Curse LLC</span> Network of gaming websites

Curse was a gaming company that managed the video game mod host CurseForge, wiki host Gamepedia, and the Curse Network of gaming community websites.

References

  1. Scott, Jason (January 6, 2009). "Team Archive is GO". ASCII by Jason Scott. Archived from the original on 2016-11-02. Retrieved December 30, 2016.
  2. "Revision history of "Main Page"". Archive Team. Archived from the original on 2016-12-31. Retrieved December 30, 2016.
  3. Gilbertson, Scott (2010-11-01). "Geocities Lives On as Massive Torrent Download". Wired. Archived from the original on 2012-04-25.
  4. Modine, Austin (2009-04-28). "Web 0.2 archivists save Geocities from deletion". The Register. Archived from the original on 2012-05-03.
  5. "TwitPic - Archiveteam". Archived from the original on 2014-09-09. Retrieved 2014-09-17.
  6. Deahl, Dani (2017-07-18). "Archive Team promises to back up SoundCloud amid worries of a shutdown". Archived from the original on 2018-10-21. Retrieved 2018-11-28.
  7. Farivar, Cyrus (2013-01-15). "Aaron Swartz Memorial JSTOR Liberator sets public domain academic articles free". Archived from the original on 2018-03-23. Retrieved 2018-11-28.
  8. "url shortening was a fucking awful idea". URLTE.AM. Archived from the original on 2011-06-11.
  9. WikiTeam Archived 2016-02-10 at the Wayback Machine
  10. "Open Source Bridge 2012 Keynote - Jason Scott". YouTube . 28 June 2012. Archived from the original on 2017-09-14. Retrieved 2018-11-28.
  11. "Open Source Bridge 2012 Keynote - Jason Scott". YouTube . 28 June 2012. Archived from the original on 2017-09-14. Retrieved 2018-11-28.
  12. Ogden, Jessica (October 21, 2021). ""Everything on the internet can be saved": Archive Team, Tumblr and the cultural significance of web archiving". Internet Histories. 6 (1–2): 113–132. doi: 10.1080/24701475.2021.1985835 . hdl: 1983/daef55ca-1fb1-4d91-a820-244bf24fe2b7 . S2CID   239510759.
  13. "Imgur Terms of Service Update". Imgur Help. Archived from the original on 31 May 2023. Retrieved 9 June 2023.
  14. "Blogger - Archiveteam". wiki.archiveteam.org. Retrieved 2024-01-02.
  15. Slowe, Christopher (2023-04-18). "An Update Regarding Reddit's API". reddit.com. Archived from the original on 2024-06-18. Retrieved 2023-06-09.
  16. ".ua - Archiveteam". wiki.archiveteam.org. Archived from the original on 2023-03-23. Retrieved 2023-06-09.
  17. "Telegram - Archiveteam". wiki.archiveteam.org. Archived from the original on 2023-05-29. Retrieved 2023-06-09.
  18. "GitHub - Archiveteam". wiki.archiveteam.org. Archived from the original on 2023-05-27. Retrieved 2023-06-09.
  19. "MediaFire - Archiveteam". wiki.archiveteam.org. Retrieved 2024-01-02.
  20. "Coronavirus - Archiveteam". wiki.archiveteam.org. Archived from the original on 2023-06-09. Retrieved 2023-06-09.
  21. "YouTube - Archiveteam". wiki.archiveteam.org. Retrieved 2024-01-02.
  22. "WikiTeam - Archiveteam". wiki.archiveteam.org. Retrieved 2024-01-02.
  23. "URLTeam - Archiveteam". wiki.archiveteam.org. Retrieved 2024-01-02.
  24. "URLs - Archiveteam". wiki.archiveteam.org. Retrieved 2024-01-02.
  25. "Reddit tracker Dashboard". tracker.archiveteam.org. Archived from the original on 2023-05-23. Retrieved 2023-06-09.
  26. Paul-Choudhury, Sumit (May 6, 2011). "Amateur heroes of online heritage". New Scientist. Archived from the original on April 2, 2015. Retrieved March 9, 2015.
  27. Garfield, Bob; Scott, Jason (2012-03-23). "The Archive Team". OnTheMedia. Archived from the original on 2012-04-27. Retrieved 2012-04-19.
  28. Masnick, Mike (2012-04-12). "Historic Archive Of Websites From The January 18th SOPA Blackout". Techdirt. Archived from the original on 2012-04-15.
  29. Misener, Dan (2011-04-29). "Full Interview: Jason Scott on online video and digital heritage". CBC. Archived from the original on 2012-10-26.
  30. Morton, Simon; Scott, Jason (2012-03-03). "The Archive Team". RadioNZ. Archived from the original on 2012-04-21.
  31. Schwartz, Matt (January 2012). "Fire in the Library". Technology Review. Archived from the original on 2012-01-24.
  32. Scott, Jason (2012-03-06). "Click: The Archive Team - Jason Scott talks about his mission to salvage our digital heritage". BBC. Archived from the original on 2015-04-03.
  33. Sullivan, Mark (2012-04-13). "The 'Archive Team' Rescues User Content From Doomed Sites". PC World. Archived from the original on 2012-04-20.