Rainbow box

Last updated
Example of rainbow boxes displaying the 8 properties of the 20 amino acids (from the Taylor's classification) Rainbow boxes displaying the properties of amino acids.png
Example of rainbow boxes displaying the 8 properties of the 20 amino acids (from the Taylor's classification)
Euler diagram displaying amino acids properties, equivalent to the rainbow boxes above Amino Acids Venn Diagram.png
Euler diagram displaying amino acids properties, equivalent to the rainbow boxes above

A rainbow box is a data visualization technique for overlapping sets. [1] Rainbow boxes are an alternative to Euler and Venn diagrams.

The first figure shows rainbow boxes displaying the properties of amino acids. In rainbow boxes, the elements are shown in columns (e.g. the amino acids), and each set is represented by a rectangular box (e.g. the groups of amino acids sharing a given property). Each box covers the columns corresponding to the elements belonging to its set. When the elements of a given set are not displayed in contiguous columns, a “hole” is present in the box of the set (in the amino acids example on the right, the "Polar" box has three holes). The column order is computed using a heuristic optimization algorithm, in order to minimize the number of holes.

A specific color is associated with each column, ranging across the spectrum, and a box's color is the mean of the colors of the columns it covers. Boxes are stacked vertically, the largest boxes being at the bottom. Custom colors may also be defined, depending on the data.

Rainbow boxes were initially presented at the International Conference Information Visualisation (iV) in 2016, [2] and then published in 2017 the Journal of Visual Language and Computing. [3] A user study involving 78 students in biology compared rainbow boxes to Euler diagram for the visualization of amino acids properties. [4] Results showed that rainbow boxes lead to significantly fewer errors and lower response times. A majority of students also preferred rainbow boxes.

Related Research Articles

<span class="mw-page-title-main">Venn diagram</span> Diagram that shows all possible logical relations between a collection of sets

A Venn diagram is a widely used diagram style that shows the logical relation between sets, popularized by John Venn (1834–1923) in the 1880s. The diagrams are used to teach elementary set theory, and to illustrate simple set relationships in probability, logic, statistics, linguistics and computer science. A Venn diagram uses simple closed curves drawn on a plane to represent sets. Very often, these curves are circles or ellipses.

<span class="mw-page-title-main">Scientific visualization</span> Interdisciplinary branch of science concerned with presenting scientific data visually

Scientific visualization is an interdisciplinary branch of science concerned with the visualization of scientific phenomena. It is also considered a subset of computer graphics, a branch of computer science. The purpose of scientific visualization is to graphically illustrate scientific data to enable scientists to understand, illustrate, and glean insight from their data. Research into how people read and misread various types of visualizations is helping to determine what types and features of visualizations are most understandable and effective in conveying information.

<span class="mw-page-title-main">Visualization (graphics)</span> Set of techniques for creating images, diagrams, or animations to communicate a message

Visualization or visualisation is any technique for creating images, diagrams, or animations to communicate a message. Visualization through visual imagery has been an effective way to communicate both abstract and concrete ideas since the dawn of humanity. from history include cave paintings, Egyptian hieroglyphs, Greek geometry, and Leonardo da Vinci's revolutionary methods of technical drawing for engineering and scientific purposes.

<span class="mw-page-title-main">OLAP cube</span> Multidimensional data array organized for rapid analysis

An OLAP cube is a multi-dimensional array of data. Online analytical processing (OLAP) is a computer-based technique of analyzing data to look for insights. The term cube here refers to a multi-dimensional dataset, which is also sometimes called a hypercube if the number of dimensions is greater than three.

<span class="mw-page-title-main">Euler diagram</span> Graphical set representation involving overlapping circles

An Euler diagram is a diagrammatic means of representing sets and their relationships. They are particularly useful for explaining complex hierarchies and overlapping definitions. They are similar to another set diagramming technique, Venn diagrams. Unlike Venn diagrams, which show all possible relations between different sets, the Euler diagram shows only relevant relationships.

<span class="mw-page-title-main">Orange (software)</span>

Orange is an open-source data visualization, machine learning and data mining toolkit. It features a visual programming front-end for explorative qualitative data analysis and interactive data visualization.

<span class="mw-page-title-main">CPK coloring</span> Colour convention for differentiating atoms

In chemistry, the CPK coloring is a popular color convention for distinguishing atoms of different chemical elements in molecular models.

<span class="mw-page-title-main">Data and information visualization</span> Visual representation of data

Data and information visualization is the practice of designing and creating easy-to-communicate and easy-to-understand graphic or visual representations of a large amount of complex quantitative and qualitative data and information with the help of static, dynamic or interactive visual items. Typically based on data and information collected from a certain domain of expertise, these visualizations are intended for a broader audience to help them visually explore and discover, quickly understand, interpret and gain important insights into otherwise difficult-to-identify structures, relationships, correlations, local and global patterns, trends, variations, constancy, clusters, outliers and unusual groupings within data. When intended for the general public to convey a concise version of known, specific information in a clear and engaging manner, it is typically called information graphics.

<span class="mw-page-title-main">Heat map</span> Data visualization technique

A heat map is a 2-dimensional data visualization technique that represents the magnitude of individual values within a dataset as a color. The variation in color may be by hue or intensity.

Sequential pattern mining is a topic of data mining concerned with finding statistically relevant patterns between data examples where the values are delivered in a sequence. It is usually presumed that the values are discrete, and thus time series mining is closely related, but usually considered a different activity. Sequential pattern mining is a special case of structured data mining.

<span class="mw-page-title-main">Hyperbolic tree</span> Mathematical tree in the hyperbolic plane

A hyperbolic tree is an information visualization and graph drawing method inspired by hyperbolic geometry.

<span class="mw-page-title-main">Diagrammatic reasoning</span>

Diagrammatic reasoning is reasoning by means of visual representations. The study of diagrammatic reasoning is about the understanding of concepts and ideas, visualized with the use of diagrams and imagery instead of by linguistic or algebraic means.

<span class="mw-page-title-main">Vis5D</span> Scientific visualization tool

Vis5D is a 3D visualization system used primarily for animated 3D visualization of weather simulations. It was the first system to produce fully interactive animated 3D displays of time-dynamic volumetric data sets and the first open source 3D visualization system. It is GNU GPL licensed.

<span class="mw-page-title-main">Mathematical diagram</span> Visual representation of a mathematical relationship

Mathematical diagrams, such as charts and graphs, are mainly designed to convey mathematical relationships—for example, comparisons over time.

Lattice Miner is a formal concept analysis software tool for the construction, visualization and manipulation of concept lattices. It allows the generation of formal concepts and association rules as well as the transformation of formal contexts via apposition, subposition, reduction and object/attribute generalization, and the manipulation of concept lattices via approximation, projection and selection. Lattice Miner allows also the drawing of nested line diagrams.

Interactive Visual Analysis (IVA) is a set of techniques for combining the computational power of computers with the perceptive and cognitive capabilities of humans, in order to extract knowledge from large and complex datasets. The techniques rely heavily on user interaction and the human visual system, and exist in the intersection between visual analytics and big data. It is a branch of data visualization. IVA is a suitable technique for analyzing high-dimensional data that has a large number of data points, where simple graphing and non-interactive techniques give an insufficient understanding of the information.


Computer Atlas of Surface Topography of Proteins (CASTp) aims to provide comprehensive and detailed quantitative characterization of topographic features of protein, is now updated to version 3.0. Since its release in 2006, the CASTp server has ~ 45 000 visits and fulfills ~ 33 000 calculation requests annually. CASTp has been proven as a confident tool for a wide range of researches, including investigations of signaling receptors, discoveries of cancer therapeutics, understanding of mechanism of drug actions, studies of immune disorder diseases, analysis of protein–nanoparticle interactions, inference of protein functions and development of high-throughput computational tools. This server is maintained by Jie Liang's lab in University of Illinois at Chicago.

<span class="mw-page-title-main">NodeXL</span> Network analysis and visualization package for Microsoft Excel

NodeXL is a network analysis and visualization software package for Microsoft Excel 2007/2010/2013/2016. The package is similar to other network visualization tools such as Pajek, UCINet, and Gephi. It is widely applied in ring, mapping of vertex and edge, and customizable visual attributes and tags. NodeXL enables researchers to undertake social network analysis work metrics such as centrality, degree, and clustering, as well as monitor relational data and describe the overall relational network structure. When applied to Twitter data analysis, it showed the total network of all users participating in public discussion and its internal structure through data mining. It allows social Network analysis (SNA) to emphasize the relationships rather than the isolated individuals or organizations, allowing interested parties to investigate the two-way dialogue between organizations and the public. SNA also provides a flexible measurement system and parameter selection to confirm the influential nodes in the network, such as in-degree and out-degree centrality. The software contains network visualization, social network analysis features, access to social media network data importers, advanced network metrics, and automation.

Visual computing is a generic term for all computer science disciplines dealing with images and 3D models, such as computer graphics, image processing, visualization, computer vision, virtual and augmented reality and video processing. Visual computing also includes aspects of pattern recognition, human computer interaction, machine learning and digital libraries. The core challenges are the acquisition, processing, analysis and rendering of visual information. Application areas include industrial quality control, medical image processing and visualization, surveying, robotics, multimedia systems, virtual heritage, special effects in movies and television, and computer games.

<span class="mw-page-title-main">UpSet Plot</span> Data visualization method

UpSet plots are a data visualization method for showing set data with more than three intersecting sets. UpSet shows intersections in a matrix, with the rows of the matrix corresponding to the sets, and the columns to the intersections between these sets. The size of the sets and of the intersections are shown as bar charts.

References

  1. Alsallakh, B; Micallef, L; Aigner, W; Hauser, H; Miksch, S; Rodgers, P (2016). "The State-of-the-Art of Set Visualization". Computer Graphics Forum. 35 (1): 234-260.
  2. Lamy, JB; Berthelot, H; Capron, C; Favre, M (2016). Rainbow boxes: a technique for visualizing overlapping sets and an application to the comparison of drugs properties. Lisboa, Portugal: Proceedings of the International Conference Information Visualisation (iV) 2016.
  3. Lamy, JB; Berthelot, H; Favre, M (2017). "Rainbow boxes: a new technique for overlapping set visualization and two applications in the biomedical domain" (PDF). Journal of Visual Language and Computing. 43: 71–82.
  4. Lamy, JB (2018). A new diagram for amino acids: User study comparing rainbow boxes to Venn/Euler diagram (PDF). Salerno, Italy: Proceedings of the International Conference Information Visualisation (iV) 2018.