Abstract Wikipedia

Last updated

Abstract Wikipedia
Developer(s) Wikimedia Foundation
Website meta.wikimedia.org/wiki/Abstract_Wikipedia   OOjs UI icon edit-ltr-progressive.svg

Abstract Wikipedia is an in-development project of the Wikimedia Foundation. It aims to use Wikifunctions to create a language-independent version of Wikipedia using its structured data. [1] First conceived in 2020 (with a precursor proposal in 2013), Abstract Wikipedia has been under active development ever since, with the related project of Wikifunctions launched successfully in 2023. Nevertheless, the project has proved controversial. As envisioned, Abstract Wikipedia would consist of "Constructors" (templates for abstract statements), "Content" (the abstract statements themselves), and "Renderers" (which would automatically translate abstract statements into natural language).

Contents

History

Conception (2013–2020)

On 7 August 2013, Denny Vrandečić, the co-founder of Wikidata, suggested "an extension of the template system" where template calls would expand into content based on the language of the user. [2] For example, a template call such as {{F12:Q64|Q5519|Q183}} could be variously expanded by Template:F12/en into "Berlin is the capital of Germany.", and by Template:F12/de into "Berlin ist die Hauptstadt Deutschlands." [2] This has been viewed as a predecessor of Abstract Wikipedia proper. [3]

Vrandečić proposed it again in a Google working paper in April 2020, [4] formally proposed in May 2020 (as Wikilambda). It was approved by the Wikimedia Foundation Board of Trustees in July 2020 as Abstract Wikipedia. [5] [6] [7]

Development (2020–present)

In April 2021, Vrandečić published an overview of the system in the computer science journal Communications of the ACM . [8]

The Abstract Wikipedia team at a 2022 offsite in Switzerland Google.org fellowship offsite dinner.jpg
The Abstract Wikipedia team at a 2022 offsite in Switzerland

In January 2023, The Signpost reported on the slow progress of the Abstract Wikipedia project. [9] According to an evaluation by four Google Fellows working on the project, it was at a "substantial risk of failure" due to its poor technical plan. [9] The Google Fellows recommended that Abstract Wikipedia be decoupled from Wikifunctions, that Wikifunctions refine MediaWiki's support for programming in Lua rather than having a completely new language, and that Abstract Wikipedia converge on a unified approach to natural language generation (NLG) that builds on open source software if possible. [9]

The Wikimedia Foundation staff responded to this report by completely rejecting the idea that Abstract Wikipedia and Wikifunctions could be separated, and accusing the Google Fellows of making "fallacies and false comparisons". [9] The Wikimedia Foundation also stated that using existing NLG pipelines like Grammatical Framework could not support certain languages such as the Niger–Congo B languages, and would also "replicate the trends of an imperialist English-focused Western-thinking industry.". [9]

On 26 July 2023, Wikifunctions officially launched to the general public. [10]

Design

Technical components

A diagram of the Abstract Wikipedia technical plan Components of Multilingual Wikipedia.png
A diagram of the Abstract Wikipedia technical plan

The Abstract Wikipedia project would consist of three main components: [11]

  1. Constructors, which enable abstract statements. The Abstract Wikipedia team prefers that these are hosted in Wikifunctions.
  2. Content, which consists of abstract calls to Constructors, with values for each slot. These are preferably hosted in Wikidata.
  3. Renderers (one per language), which convert the abstract Content into text in that particular language. These are, like Constructors, also preferably hosted in Wikifunctions.

Each version of Wikipedia, once Abstract Wikipedia is deployed, could choose between three options: [12]

  1. Implicit integration with Abstract Wikipedia. There would be a special page called Special:Abstract that would display content automatically translated from Abstract Wikipedia into the local language. This content would be linkable and searchable. Furthermore, a new magic word LINK_TO_Q would be added in order to enable linking to Abstract Wikipedia content.
  2. Explicit integration with Abstract Wikipedia. In this scenario, to create a new article, the editor would add a sitelink on Wikidata to a not-yet-existing page. This would create a "virtual article" in mainspace that would be pre-populated with content from Abstract Wikipedia automatically translated into the local language. This "virtual article" would have a URL similar to that of a real article, and would also be linkable and searchable just like a real article.
  3. No integration with Abstract Wikipedia.

Example

As a preliminary example, content from Abstract Wikipedia could look like: [13]

Article(   content: [     Instantiation(       instance: San Francisco (Q62),       class: Object_with_modifier_and_of(         object: center,         modifier: And_modifier(           conjuncts: [cultural, commercial, financial]         ),         of: Northern California (Q1066807)       )     ),     Ranking(       subject: San Francisco (Q62),       rank: 4,       object:  City  (Q515),       by:  Population size  (Q1613416),       local_constraint: California (Q99),       after: [ Los Angeles  ( Q65 ),  San Diego  ( Q16552 ),  San Jose  ( Q16553 )]     )   ] )

This would translate into English as "San Francisco is the cultural, commercial, and financial center of Northern California. It is the fourth-most populous city in California, after Los Angeles, San Diego and San Jose."

Related Research Articles

<span class="mw-page-title-main">History of Wikipedia</span>

Wikipedia, a free-content online encyclopedia written and maintained by a community of volunteers, began with its first edit on 15 January 2001, two days after the domain was registered. It grew out of Nupedia, a more structured free encyclopedia, as a way to allow easier and faster drafting of articles and translations.

<span class="mw-page-title-main">Wiktionary</span> Multilingual online dictionary

Wiktionary is a multilingual, web-based project to create a free content dictionary of terms in all natural languages and in a number of artificial languages. These entries may contain definitions, images for illustration, pronunciations, etymologies, inflections, usage examples, quotations, related terms, and translations of terms into other languages, among other features. It is collaboratively edited via a wiki. Its name is a portmanteau of the words wiki and dictionary. It is available in 192 languages and in Simple English. Like its sister project Wikipedia, Wiktionary is run by the Wikimedia Foundation, and is written collaboratively by volunteers, dubbed "Wiktionarians". Its wiki software, MediaWiki, allows almost anyone with access to the website to create and edit entries.

<span class="mw-page-title-main">MediaWiki</span> Free and open-source wiki software

MediaWiki is free and open-source wiki software originally developed by Magnus Manske for use on Wikipedia on January 25, 2002, and further improved by Lee Daniel Crocker, after which it has been coordinated by the Wikimedia Foundation. It powers most websites hosted by the Foundation including Wikipedia, Wiktionary, Wikimedia Commons, Wikiquote, Meta-Wiki and Wikidata, which define a large part of the set requirements for the software.

<span class="mw-page-title-main">English Wikipedia</span> English-language edition of Wikipedia

The English Wikipedia is the primary English-language edition of Wikipedia, an online encyclopedia. It was created by Jimmy Wales and Larry Sanger on January 15, 2001, as Wikipedia's first edition.

<span class="mw-page-title-main">Wikibooks</span> Free resource library of books hosted by the Wikimedia Foundation and edited by volunteers

Wikibooks is a wiki-based Wikimedia project hosted by the Wikimedia Foundation for the creation of free content digital textbooks and annotated texts that anyone can edit.

<span class="mw-page-title-main">Wikisource</span> Free online library on a wiki

Wikisource is an online digital library of free-content textual sources on a wiki, operated by the Wikimedia Foundation. Wikisource is the name of the project as a whole and the name for each instance of that project ; multiple Wikisources make up the overall project of Wikisource. The project's aim is to host all forms of free text, in many languages, and translations. Originally conceived as an archive to store useful or important historical texts, it has expanded to become a general-content library. The project officially began on November 24, 2003, under the name Project Sourceberg, a play on the famous Project Gutenberg. The name Wikisource was adopted later that year and it received its own domain name.

<span class="mw-page-title-main">Norwegian Wikipedia</span> Two of the Norwegian-language editions of Wikipedia

There are two Norwegian language editions of Wikipedia: one for articles written in Bokmål or Riksmål, and one for articles written in Nynorsk or Høgnorsk. There are currently 619,713 articles on the Norwegian Wikipedia edition in Bokmål/Riksmål, and 168,096 articles on the Nynorsk edition.

<span class="mw-page-title-main">Wikimedia movement</span> Group of global contributors to Wikimedia projects

The Wikimedia movement is the global community of contributors to the Wikimedia projects, including Wikipedia. This community directly builds and administers these projects with the commitment of achieving this using open standards and software.

<span class="mw-page-title-main">Semantic MediaWiki</span> Software for creating, managing and sharing structured data in MediaWiki

Semantic MediaWiki (SMW) is an extension to MediaWiki that allows for annotating semantic data within wiki pages, thus turning a wiki that incorporates the extension into a semantic wiki. Data that has been encoded can be used in semantic searches, used for aggregation of pages, displayed in formats like maps, calendars and graphs, and exported to the outside world via formats like RDF and CSV.

<span class="mw-page-title-main">Bengali Wikipedia</span> Edition of the free-content encyclopedia in Bengali language

The Bengali Wikipedia or Bangla Wikipedia is the Bengali language edition of Wikipedia, the free online encyclopedia. Launched on 27 January 2004, it surpassed 10,000 articles in October 2006, becoming the second South-Asian language to do so. On 25 December 2020, the site achieved the milestone of 100k articles. As of 11 December 2023, the Bengali Wikipedia has 144,429 articles. Though it joined later compared to top wikipedias, it ranks 5th in terms of article depth among 318 active wikipedias by language.

<span class="mw-page-title-main">Wikivoyage</span> Free travel guide that anyone can edit

Wikivoyage is a free web-based travel guide for travel destinations and travel topics written by volunteer authors. It is a sister project of Wikipedia and supported and hosted by the same non-profit Wikimedia Foundation (WMF). Wikivoyage has been called the "Wikipedia of travel guides".

<span class="mw-page-title-main">Tamil Wikipedia</span> Tamil language edition of Wikipedia

The Tamil Wikipedia is the Tamil language edition of Wikipedia established in September 2003, run by the Wikimedia Foundation. The Tamil Wikipedia is the second largest Wikipedia among Indian languages and the 60th largest Wikipedia by article count(As of 11 December 2023). It is also the first and only Wikipedia of Dravidian origin to possess more than 150,000+ articles. The project is one of the leading Wikipedia among other South Asian language Wikipedia's in various quality matrices. It has 160,884 articles and 227,151 registered users as of December 2023. It crossed 100,000 articles in May 2017.

<span class="mw-page-title-main">Wikimedia Foundation</span> American charitable organization

The Wikimedia Foundation, Inc. (WMF) is an American 501(c)(3) nonprofit organization headquartered in San Francisco, California, and registered as a charitable foundation under local laws. It is best known as the host platform for Wikipedia, the largest crowdsourced online encyclopedia in the world, but also hosts other related projects and MediaWiki, a wiki software.

<span class="mw-page-title-main">Wikitravel</span> Collaborative wiki travel website

Wikitravel is a web-based collaborative travel guide based on the wiki format and owned by Internet Brands. It was most active from 2003 through 2012, when most of its editing community left and brought their contributions to the nonprofit Wikivoyage guide.

<span class="mw-page-title-main">Sanskrit Wikipedia</span> Sanskrit edition of Wikipedia

Sanskrit Wikipedia is the Sanskrit edition of Wikipedia, a free, web-based, collaborative, multilingual encyclopedia project supported by the non-profit Wikimedia Foundation. Its five thousand articles have been written collaboratively by volunteers around the world, with major concentration of contributors in India and Nepal.

<span class="mw-page-title-main">Odia Wikipedia</span>

The Odia Wikipedia is the Odia edition of Wikipedia. It is a free, web-based, collaborative encyclopedia project supported by the non-profit Wikimedia Foundation. The project was started by Suneet Samaetha in June 2002 and reached 1,000 articles in May 2011. This is one of the first four Indic Wikipedias started in 2002, among over 20 Indic language Wikipedias. The first edit on Odia Wikipedia occurred on 3 June 2002.

<span class="mw-page-title-main">Wikidata</span> Free knowledge database project

Wikidata is a collaboratively edited multilingual knowledge graph hosted by the Wikimedia Foundation. It is a common source of open data that Wikimedia projects such as Wikipedia, and anyone else, can use under the CC0 public domain license. Wikidata is a wiki powered by the software MediaWiki, including its extension for semi-structured data, the Wikibase.

<span class="mw-page-title-main">Cebuano Wikipedia</span> Cebuano-language edition of Wikipedia

The Cebuano Wikipedia is the Cebuano-language edition of Wikipedia, the free online encyclopedia. Despite being the second-largest Wikipedia in numbers of articles, it has a small community of only 161 active users; nearly all of the 6,122,276 articles were initially created through automatic programs, most notably Sverker Johansson's Lsjbot.

<span class="mw-page-title-main">Wikifunctions</span> Wikimedia open library of reusable code

Wikifunctions is a collaboratively edited catalog of computer functions to enable the creation, modification, and reuse of source code. It is closely related to Abstract Wikipedia, an extension of Wikidata to create a language-independent version of Wikipedia using its structured data. Provisionally named Wikilambda, the definitive name of Wikifunctions was announced on 22 December 2020 following a naming contest. Wikifunctions is the first Wikimedia project to launch since Wikidata in 2012. After three years of development, Wikifunctions officially launched in July 2023.

<span class="mw-page-title-main">Denny Vrandečić</span> Croatian computer scientist

Zdenko "Denny" Vrandečić is a Croatian computer scientist. He was a co-developer of Semantic MediaWiki and Wikidata, the lead developer of the Wikifunctions project, and an employee of the Wikimedia Foundation as a Head of Special Projects, Structured Content. He published modules for the German role-playing game The Dark Eye.

References

  1. Hill, Paul (13 April 2020). "Wikidata founder floats idea for balanced multilingual Wikipedia". Neowin . Retrieved 2 July 2020.
  2. 1 2 Vrandečić, Denny (7 August 2013). "A proposal towards a multilingual Wikipedia - Meta". meta.wikimedia.org. Retrieved 14 August 2023.
  3. "Abstract Wikipedia/Historic proposal - Meta". meta.wikimedia.org. Retrieved 14 August 2023.
  4. Vrandečić, Denny (8 April 2020). "Architecture for a multilingual Wikipedia". arXiv: 2004.04733 [cs.CY].
  5. Maher, Katherine. "Abstract Wikipedia/June 2020 announcement - Meta". meta.wikimedia.org.
  6. ""Abstract Wikipedia": Neues Projekt soll Wissen in alle Sprachen übersetzen". RedaktionsNetzwerk Deutschland (in German). 6 July 2020. Retrieved 6 July 2020.
  7. Rixecker, Kim (6 July 2020). "Abstract Wikipedia: Wie das Online-Lexikon eines seiner größten Probleme lösen will". t3n Magazine (in German). Retrieved 6 July 2020.
  8. Vrandečić, Denny (April 2021). "Building a Multilingual Wikipedia". Communications of the ACM . 64 (4): 38–41. doi:10.1145/3425778. ISSN   0001-0782. Wikidata   Q106143058.
  9. 1 2 3 4 5 Bayer, Tilman (1 January 2023). "Wikimedia Foundation's Abstract Wikipedia project "at substantial risk of failure"". The Signpost . Retrieved 14 August 2023.
  10. Vrandečic, Denny (26 July 2023). "Abstract Wikipedia/Updates/2023-07-26 - Meta". meta.wikimedia.org. Retrieved 22 September 2023.
  11. "Abstract Wikipedia/Architecture - Meta". meta.wikimedia.org. Retrieved 14 August 2023.
  12. "Abstract Wikipedia/Components - Meta". meta.wikimedia.org. Retrieved 14 August 2023.
  13. "Abstract Wikipedia/Examples - Meta". meta.wikimedia.org. Retrieved 14 August 2023.