DocumentCloud

DocumentCloud
Company type Nonprofit
Industry
  • Journalism
  • Publishing
  • Software
Founded 2009
Headquarters United States
Number of employees 5 [2]
Parent Investigative Reporters and Editors
Website documentcloud.org

DocumentCloud is an open-source software-as-a-service platform that allows users to upload, analyze, annotate, collaborate on and publish primary source documents. Since its launch in 2009, it has been used primarily by journalists to find information in the documents they gather while reporting and, in the interest of transparency, to publish those documents. As of May 2023, DocumentCloud users had uploaded more than 5 million documents. [3] Many of them are accessible via a public search portal.

DocumentCloud's development has led to the creation of several notable open-source projects, including Backbone.js, [4] [5] Jammit and Underscore.js. [6] [7] The majority of funding for DocumentCloud has come from grants by the Knight Foundation.

History

In 2009, journalists Scott Klein and Eric Umansky of ProPublica and Aron Pilhofer of The New York Times received a Knight News Challenge grant for initial development of the platform. [8] [9] [10] [11] The first version of DocumentCloud was built by the New York Times digital team, with Elliott Malkin and Sascha Mombartz working on design and Jeremy Ashkenas, Ben Koski and Jake Harris on development. [12] Jeremy Ashkenas joined as lead developer, and DocumentCloud was incorporated as a nonprofit organization. By September 2009, two dozen media outlets, including The Washington Post, The New York Times and the Chicago Tribune, had signed on as beta testers. [13]

A public beta was announced [14] at the 2010 NICAR conference of Investigative Reporters and Editors (IRE), and within a year contributing news organizations had uploaded 1 million pages. [15]

In 2011, DocumentCloud received a second Knight News Challenge grant, [16] dissolved its own nonprofit entity, and merged with the nonprofit Investigative Reporters and Editors. [17] [18] Since then, IRE has assumed primary responsibility for maintenance and development of the platform as well as managing its grant funding.

DocumentCloud received a third Knight grant in summer 2014, with primary goals including improved platform stability, new features, and developing a plan for financial sustainability. [19] Since its start, DocumentCloud accounts have been free to journalism organizations, but the organization has announced it will be implementing a pay model. [20]

On June 11, 2018, DocumentCloud and MuckRock announced they would be merging. [21]

Open-source projects

In addition to the platform itself, development of DocumentCloud has led to the creation of several open-source projects, including Backbone.js, Jammit and Underscore.js.

References

  1. "Who We Are". DocumentCloud. Retrieved 31 July 2018.
  2. "Who We Are". DocumentCloud. Retrieved 12 October 2015.
  3. "DocumentCloud". www.documentcloud.org. Retrieved 15 May 2023.
  4. "Backbone.js". Retrieved 16 October 2015.
  5. Ashkenas, Jeremy (13 October 2010). "Code Drop: Backbone.js". DocumentCloud Blog. Archived from the original on 28 October 2015. Retrieved 15 October 2015.
  6. "Underscore.js". Retrieved 16 October 2015.
  7. Ashkenas, Jeremy (28 October 2009). "Underscore.js: Our Second Open-Source Release". DocumentCloud Blog. Archived from the original on 28 October 2015. Retrieved 15 October 2015.
  8. "Knight News Challenge: DocumentCloud". Knight Foundation. Retrieved 15 October 2015.
  9. Seward, Zachary M. (17 June 2009). "Knight News Challenge: A grant to DocumentCloud promises a data boost for investigative journalism". Nieman Lab. Retrieved 16 October 2015.
  10. Kirkpatrick, Marshall (17 June 2009). "DocumentCloud Gets Funding to Create Research Memory Bank in the Sky". ReadWrite. Archived from the original on 12 January 2012. Retrieved 16 October 2015.
  11. Reagan, Gillian (18 June 2009). "Times, ProPublica Journos Get $719,500 for DocumentCloud". Observer. Retrieved 16 October 2015.
  12. McLean, Alan (27 March 2010). "A New View: Introducing Doc Viewer 2.0". New York Times. Retrieved 15 November 2023.
  13. Seward, Zachary M. (24 September 2009). "DocumentCloud adds impressive list of investigative-journalism outfits". Nieman Lab. Retrieved 16 October 2015.
  14. Townend, Judith (6 January 2010). "DocumentCloud aims to release a public beta in March 2010". Journalism.co.uk. Retrieved 16 October 2015.
  15. Ashkenas, Jeremy (28 February 2011). "A Million Pages". DocumentCloud blog. Archived from the original on 28 October 2015. Retrieved 16 October 2015.
  16. "Knight News Challenge: DocumentCloud Reader Annotations". Knight Foundation. 23 June 2011. Retrieved 16 October 2015.
  17. Sonderman, Jeff (9 June 2011). "IRE takes over DocumentCloud as Knight funding expires". Poynter. Archived from the original on 17 October 2014. Retrieved 15 October 2015.
  18. Bracken, John (9 June 2011). "News Challenge Success Story Finds a Home". Knight Foundation. Archived from the original on 23 October 2015. Retrieved 16 October 2015.
  19. "DocumentCloud, an annotation tool for journalists, to improve features and become a standard in newsrooms with $1.4 million from Knight Foundation". Knight Foundation. 27 June 2014. Archived from the original on 23 September 2015. Retrieved 16 October 2015.
  20. DeBarros, Anthony (10 August 2015). "A Summer Day's Worth of Updates". DocumentCloud Blog. Archived from the original on 28 October 2015. Retrieved 16 October 2015.
  21. Morisy, Michael; Pilhofer, Aron (11 June 2018). "MuckRock and DocumentCloud merge to build tools for a more informed society". Muckrock.org. MuckRock. Retrieved 11 June 2018. "We are thrilled to announce that DocumentCloud and MuckRock are merging."
