Vega and Vega-Lite visualisation grammars

Developer(s) Jeffrey Heer, Arvind Satyanarayan, Dominik Moritz, Kanit Wongsuphasawat, and community
Initial release 2 April 2013
Stable release 5.25.0 / 27 April 2023 [1]
Written in JavaScript
Type Data visualization, JavaScript library
License BSD
Website vega.github.io

Vega and Vega-Lite are visualization tools that implement a grammar of graphics, similar to ggplot2. Their grammars extend Leland Wilkinson's Grammar of Graphics [2] by adding a novel grammar of interactivity to assist in the exploration of complex datasets.

Vega acts as a low-level language suited to explanatory figures (the same use case as D3.js), while Vega-Lite is a higher-level language suited to rapidly exploring data. [3] Vega is used in the back end of several data visualization systems, for example Voyager. [4] [5] Chart specifications are written in JSON and rendered in a browser or exported to vector or bitmap images. Bindings for Vega-Lite have been written in several programming languages, for example the Python package Altair, [6] to make it easier to use. The grammars and associated tools are open-source projects led by the University of Washington Interactive Data Lab and released under a BSD 3-clause license. [7]
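As an illustration of a chart specification written in JSON, the following sketch builds a small Vega-Lite bar chart description as a plain Python dictionary and serializes it with the standard library. The field names `category` and `amount`, the sample rows, and the helper `bar_chart_spec` are hypothetical and chosen only for this example. The schema URL assumes version 5 of Vega-Lite. The sketch only produces the JSON text; rendering it would still require a Vega-Lite runtime, for example in a browser.

```python
import json


def bar_chart_spec(rows):
    """Return a minimal Vega-Lite bar chart specification as a dict.

    `rows` is a list of records with the (hypothetical) fields
    "category" and "amount".
    """
    return {
        # Assumed schema version; adjust to the Vega-Lite release in use.
        "$schema": "https://vega.github.io/schema/vega-lite/v5.json",
        # Inline data: each record becomes one bar.
        "data": {"values": rows},
        # The mark is the geometric shape used to draw each record.
        "mark": "bar",
        # Encodings map data fields to visual channels.
        "encoding": {
            "x": {"field": "category", "type": "nominal"},
            "y": {"field": "amount", "type": "quantitative"},
        },
    }


# Made-up sample data for illustration only.
rows = [
    {"category": "A", "amount": 28},
    {"category": "B", "amount": 55},
    {"category": "C", "amount": 43},
]

spec = bar_chart_spec(rows)
spec_json = json.dumps(spec, indent=2)
print(spec_json)
```

The printed JSON can be pasted into a tool that accepts Vega-Lite specifications. Bindings such as Altair generate specifications of this general shape from higher-level calls, so users do not have to write the JSON by hand.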

References

  1. "vega Releases". GitHub.
  2. Wilkinson, Leland (1999). The Grammar of Graphics. New York: Springer. ISBN 9780387987743.
  3. Satyanarayan, Arvind; Moritz, Dominik; Wongsuphasawat, Kanit; Heer, Jeffrey (2017). "Vega-Lite: A Grammar of Interactive Graphics". IEEE Transactions on Visualization and Computer Graphics. 23 (1): 341–350. doi:10.1109/TVCG.2016.2599030. PMID 27875150. S2CID 206805969.
  4. Wongsuphasawat, Kanit; Moritz, Dominik; Anand, Anushka; MacKinlay, Jock; Howe, Bill; Heer, Jeffrey (2016). "Voyager: Exploratory Analysis via Faceted Browsing of Visualization Recommendations". IEEE Transactions on Visualization and Computer Graphics. 22 (1): 649–658. doi:10.1109/TVCG.2015.2467191. PMID 26390469. S2CID 2366653.
  5. Wongsuphasawat, Kanit; Qu, Zening; Moritz, Dominik; Chang, Riley; Ouk, Felix; Anand, Anushka; MacKinlay, Jock; Howe, Bill; Heer, Jeffrey (2017). "Voyager 2". Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems. pp. 2648–2659. doi:10.1145/3025453.3025768. ISBN 9781450346559. S2CID 14999239.
  6. Vanderplas, Jacob; Granger, Brian; Heer, Jeffrey; Moritz, Dominik; Wongsuphasawat, Kanit; Satyanarayan, Arvind; Lees, Eitan; Timofeev, Ilia; Welsh, Ben; Sievert, Scott (2018). "Altair: Interactive Statistical Visualizations for Python". Journal of Open Source Software. 3 (32): 1057. Bibcode:2018JOSS....3.1057V. doi:10.21105/joss.01057.
  7. "Vega: A Visualization Grammar". Vega.