Infographics (a clipped compound of "information" and "graphics") are graphic visual representations of information, data, or knowledge intended to present information quickly and clearly. [1] [2] They can improve cognition by using graphics to enhance the human visual system's ability to see patterns and trends. [3] [4] Similar pursuits are information visualization, data visualization, statistical graphics, information design, and information architecture. [2] In recent years infographics have evolved into a medium for mass communication, and are therefore designed with fewer assumptions about the reader's knowledge base than other types of visualization. [5] Isotypes are an early example of infographics conveying information quickly and easily to the masses. [6]
Infographics have been around for many years, and the recent increase in the number of easy-to-use, free tools has made their creation available to a large segment of the population. Social media sites such as Facebook and Twitter have also allowed individual infographics to spread among many people around the world. Infographics are widely used in the age of short attention span.[ citation needed ]
In newspapers, infographics are commonly used to show the weather, as well as maps, site plans, and graphs for summaries of data. Some books are almost entirely made up of information graphics, such as David Macaulay's The Way Things Work. The Snapshots in USA Today are also an example of simple infographics used to convey news and current events. [7]
Modern maps, especially route maps for transit systems, use infographic techniques to integrate a variety of information, such as the conceptual layout of the transit network, transfer points, and local landmarks. Public transportation maps, such as those for the Washington Metro and the London Underground map, are well-known infographics. Public places such as transit terminals usually have some sort of integrated "signage system" with standardized icons and stylized maps.
In his 1983 "landmark book" The Visual Display of Quantitative Information, Edward Tufte defines "graphical displays" in the following passage:
Graphical displays should
- show the data
- induce the viewer to think about the substance rather than about methodology, graphic design, the technology of graphic production, or something else
- avoid distorting what the data has to say
- present many numbers in a small space
- make large data sets coherent
- encourage the eye to compare different pieces of data
- reveal the data at several levels of detail, from a broad overview to the fine structure
- serve a reasonably clear purpose: description, exploration, tabulation, or decoration
- be closely integrated with the statistical and verbal descriptions of a data set.
Graphics reveal data. Indeed graphics can be more precise and revealing than conventional statistical computations. [8]
In 1626, Christoph Scheiner published the Rosa Ursina sive Sol, a book that revealed his research about the rotation of the sun. Infographics appeared in the form of illustrations demonstrating the Sun's rotation patterns. [9]
In 1786, William Playfair, an engineer and political economist, published the first data graphs in his book The Commercial and Political Atlas. To represent the economy of 18th century England, Playfair used statistical graphs, bar charts, line graphs, area charts, and histograms. In his work, Statistical Breviary, he is credited with introducing the first pie chart. [10] [11] [12]
Around 1820, modern geography was established by Carl Ritter. [13] His maps included shared frames, agreed map legends, scales, repeatability, and fidelity. Such a map can be considered a "supersign" which combines sign systems—as defined by Charles Sanders Peirce—consisting of symbols, icons, indexes as representations. [14] Other examples can be seen in the works of geographers Ritter and Alexander von Humboldt. [15]
In 1857, English nurse Florence Nightingale used information graphics to persuade Queen Victoria to improve conditions in military hospitals. The principal one she used was the Coxcomb chart, a combination of stacked bar and pie charts, depicting the number and causes of deaths during each month of the Crimean War.
1869 saw the release of an influential information graphic on the subject of Napoleon's disastrous march on Moscow. The graphic's creator, Charles Joseph Minard, captured four different changing variables that contributed to Napoleon's downfall in a single two-dimensional image: the army's direction as it traveled, the locations the troops passed through, the size of the army as troops died from hunger and wounds, and the freezing temperatures they experienced.
James Joseph Sylvester introduced the term "graph" in 1878 in the scientific magazine Nature and published a set of diagrams showing the relationship between chemical bonds and mathematical properties. [16] These were also some of the first mathematical graphs.
In 1900, the African-American historian, sociologist, writer, and Black rights activist W.E.B. Du Bois presented data visualizations at the Exposition Universelle in Paris, France. In addition to curating 500 photographs of the lives of Black Americans, Du Bois and his Atlanta University team of students and scholars created 60 handmade data visualizations [17] to document the ways Black Americans were being denied access to education, housing, employment, and household wealth. [18]
The Cologne Progressives developed an aesthetic approach to art that focused on communicating information. [19] Gerd Arntz, Peter Alma and Augustin Tschinkel, all participants in this movement, were recruited by Otto Neurath for the Gesellschafts- und Wirtschaftsmuseum, where they developed the Vienna Method from 1926 to 1934. Here simple images were used to represent data in a structured way. Following the victory of Austrofascism in the Austrian Civil War, the team moved to the Netherlands, where they continued their work, rebranding it Isotypes (International System of Typographic Picture Education). The method was also applied by IZOSTAT (ИЗОСТАТ) in the Soviet Union.
In 1942 Isidore Isou published the Lettrist manifesto, a document covering art, culture, poetry, film, and political theory. The included works, also called metagraphics and hypergraphics, are a synthesis of writing and visual art.
In 1958 Stephen Toulmin proposed a graphical argument model called the Toulmin Model of Argumentation. The diagram contained six interrelated components used for analyzing arguments and was considered Toulmin's most influential work, particularly in the fields of rhetoric, communication, and computer science. The model became influential in argumentation theory and its applications.
In 1972 and 1973, respectively, the Pioneer 10 and Pioneer 11 spacecraft included on their vessels the Pioneer Plaques, a pair of gold-anodized aluminum plaques, each featuring a pictorial message. The pictorial messages included nude male and female figures as well as symbols that were intended to provide information about the origin of the spacecraft. The images were designed by Carl Sagan and Frank Drake and were unique in that their graphical meanings were to be understandable to extraterrestrial beings, who would have no conception of human language.
A pioneer in data visualization, Edward Tufte, wrote a series of books (Visual Explanations, The Visual Display of Quantitative Information, and Envisioning Information) on the subject of information graphics. [20] [21] [22] Referred to by The New York Times as the "da Vinci of Data", Tufte began to give day-long lectures and workshops on the subject of infographics starting in 1993. As of 2012, Tufte still gives these lectures. [23] To Tufte, good data visualizations represent every data point accurately and enable a viewer to see trends and patterns in the data. Tufte's contribution to the field of data visualization and infographics is considered immense, and his design principles can be seen in many websites, magazines, and newspapers today. [24]
The infographics created by Peter Sullivan for The Sunday Times in the 1970s, 1980s, and 1990s were some of the key factors in encouraging newspapers to use more infographics. Sullivan is also one of the few authors who have written about information graphics in newspapers. Likewise, the staff artists at USA Today, the United States newspaper that debuted in 1982, established the goal of using graphics to make information easier to comprehend. However, the paper has received criticism for oversimplifying news stories and for creating infographics that some find emphasize entertainment over content and data. Tufte coined the term chartjunk to refer to graphics that are visually appealing to the point of losing the information contained within them.
With vector graphics and raster graphics becoming ubiquitous in computing in the 21st century, data visualizations have been applied to commonly used computer systems, including desktop publishing and Geographic Information Systems (GIS).
Closely related to the field of information graphics is information design, which is the creation of infographics.
By the year 2000, Adobe Flash-based animations on the Internet had made use of many key practices in creating infographics in order to create a variety of products and games.
Likewise, television began to incorporate infographics into the viewers' experiences in the early 2000s. One example of infographics usage in television and in pop culture is the 2002 music video by the Norwegian musicians of Röyksopp, for their song "Remind Me." The video was composed entirely of animated infographics. [25] Similarly, in 2004, a television commercial for the French nuclear technology company Areva used animated infographics as an advertising tactic. Both of these videos and the attention they received have conveyed to other fields the potential value of using information graphics to describe complex information efficiently.
With the rise of alternatives to Adobe Flash, such as HTML 5 and CSS3, infographics are now created in a variety of media with a number of software tools. [26]
The field of journalism has also incorporated and applied information graphics to news stories. For stories that intend to include text, images, and graphics, the system called the maestro concept allows entire newsrooms to collaborate and organize a story to successfully incorporate all components. Across many newsrooms, this teamwork-integrated system is applied to improve time management. The maestro system is designed to improve the presentation of stories for busy readers of media. Many news-based websites have also used interactive information graphics in which the user can extract information on a subject as they explore the graphic.
Many businesses use infographics as a medium for communicating with and attracting potential customers. [27] Information graphics are a form of content marketing [28] and have become a tool for internet marketers and companies to create content that others will link to, thus possibly boosting a company's reputation and online presence. [29]
Religious denominations have also started using infographics. For example, The Church of Jesus Christ of Latter-day Saints has made numerous infographics to help people learn about their faith, missionaries, temples, lay ministry, and family history efforts. [30]
Infographics are finding a home in the classroom as well. Courses that teach students to create their own infographics using a variety of tools may encourage engagement in the classroom and may lead to a better understanding of the concepts they are mapping onto the graphics.[ citation needed ]
With the popularity of social media, infographics have become popular, often as static images or simple web interfaces, covering any number of topics. Such infographics are often shared between users of social networks such as Facebook, Twitter, Pinterest, Google+ and Reddit. The hashtag #infographic was tweeted 56,765 times in March 2012 and at its peak 3,365 times in a span of 24 hours.[ citation needed ]
The three parts of all infographics are the visual, the content, and the knowledge. [31] The visual consists of colors and graphics. There are two different types of graphics: theme and reference. Theme graphics are included in all infographics and represent the underlying visual representation of the data. Reference graphics are generally icons that can be used to point to certain data, although they are not always found in infographics. Statistics and facts usually serve as the content for infographics and can be obtained from any number of sources, including census data and news reports. One of the most important aspects of infographics is that they contain some sort of insight into the data that they are presenting; this is the knowledge. [31]
Infographics are effective because of their visual element. Humans receive input from all five of their senses (sight, touch, hearing, smell, taste), but they receive significantly more information from vision than from any of the other four. [32] Fifty percent of the human brain is dedicated to visual functions, and images are processed faster than text. The brain processes pictures all at once, but processes text in a linear fashion, meaning it takes much longer to obtain information from text. [2] Entire business processes or industry sectors can be made relevant to a new audience through a guidance design technique that leads the eye. The page may link to a complete report, but the infographic primes the reader, making the subject matter more accessible. [33] Online trends, such as the increasingly short attention span of Internet users, have also contributed to the increasing popularity and effectiveness of infographics. [ citation needed ]
When designing the visual aspect of an infographic, a number of considerations must be made to optimize the effectiveness of the visualization. The six components of visual encoding are spatial, marks, connection, enclosure, retinal properties, and temporal encoding. [4] Each of these can be utilized in its own way to represent relationships between different types of data. However, studies have shown that spatial position is the most effective way to represent numerical data and leads to the fastest and easiest understanding by viewers. [3] Therefore, the designers often spatially represent the most important relationship being depicted in an infographic.
There are also three basic provisions of communication that need to be assessed when designing an infographic – appeal, comprehension, and retention. [34] "Appeal" is the idea that communication needs to engage its audience. Comprehension implies that the viewer should be able to easily understand the information that is presented to them. And finally, "retention" means that the viewer should remember the data presented by the infographic. The order of importance of these provisions depends on the purpose of the infographic. If the infographic is meant to convey information in an unbiased way, such as in the domains of academia or science, comprehension should be considered first, then retention, and finally, appeal. However, if the infographic is being used for commercial purposes, then appeal becomes most important, followed by retention and comprehension. When infographics are being used for editorial purposes, such as in a newspaper, the appeal is again most important but is followed first by comprehension and then retention. [34]
However, the appeal and the retention can in practice be put together with the aid of a comprehensible layout design. Recently, as an attempt to study the effect of the layout of an infographic on the comprehension of the viewers, a new Neural Network-based cognitive load estimation method was applied on different types of common layouts for the infographic design. [35] When the varieties of factors listed above are taken into consideration when designing infographics, they can be a highly efficient and effective way to convey large amounts of information in a visual manner.
Data visualizations are often used in infographics and may make up the entire infographic. There are many types of visualizations that can be used to represent the same set of data. Therefore, it is crucial to identify the appropriate visualization for the data set and infographic by taking into consideration graphical features such as position, size, shape, and color. There are primarily five types of visualization categories – time-series data, statistical distributions, maps, hierarchies, and networking. [3]
Time-series data is one of the most common forms of data visualization. It documents sets of values over time. Examples of graphics in this category include index charts, stacked graphs, small multiples, and horizon graphs. Index charts are ideal to use when raw values are less important than relative changes. An index chart is an interactive line chart that shows percentage changes for a collection of time-series data relative to a selected index point. For example, stock investors could use one because they are less concerned with the specific price and more concerned with the rate of growth. Stacked graphs are area charts that are stacked on top of each other and depict aggregate patterns. They allow viewers to see overall patterns and individual patterns. However, they do not support negative numbers and make it difficult to accurately interpret trends. An alternative to stacked graphs is small multiples. Instead of stacking each area chart, each series is shown individually so the overall trends of each sector are more easily interpreted. Horizon graphs are a space-efficient method to increase the data density of a time series while preserving resolution. [3]
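The index-chart calculation described above can be sketched in a few lines of Python. This is only an illustration: the function name `index_chart` and the two price series are hypothetical, not taken from any particular charting tool.

```python
def index_chart(series, index_point):
    """Return the percentage change of every value relative to series[index_point]."""
    base = series[index_point]
    if base == 0:
        raise ValueError("the value at the index point must be non-zero")
    # Multiply before dividing to keep the arithmetic as exact as possible.
    return [(value - base) * 100.0 / base for value in series]


# Hypothetical prices for two stocks at three points in time.
prices = {"A": [50.0, 55.0, 60.0], "B": [200.0, 210.0, 190.0]}

# Re-express both series relative to the first observation (index point 0).
indexed = {name: index_chart(values, 0) for name, values in prices.items()}
```

Although stock B trades at four times the price of stock A, the indexed series put both on the same percentage scale, which is what makes their growth rates comparable. Moving the index point only changes `index_point`.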
Statistical distributions reveal trends based on how numbers are distributed. Common examples include histograms and box-and-whisker plots, which convey statistical features such as mean, median, and outliers. In addition to these common infographics, alternatives include stem-and-leaf plots, Q–Q plots, scatter plot matrices (SPLOM) and parallel coordinates. For assessing a collection of numbers and focusing on frequency distribution, stem-and-leaf plots can be helpful. The numbers are binned based on the first significant digit, and within each stack binned again based on the second significant digit. On the other hand, Q–Q plots compare two probability distributions by graphing quantiles against each other. This allows the viewer to see if the plot values are similar and if the two are linearly related. SPLOM is a technique that represents the relationships among multiple variables. It uses multiple scatter plots to represent a pairwise relation among variables. Another statistical distribution approach to visualize multivariate data is parallel coordinates. Rather than graphing every pair of variables in two dimensions, the data is repeatedly plotted on a parallel axis, and corresponding points are then connected with a line. The advantage of parallel coordinates is that they are relatively compact, allowing many variables to be shown simultaneously. [3]
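As a rough illustration of the binning behind a stem-and-leaf plot, the following Python sketch assumes non-negative two-digit integers, so that the tens digit acts as the stem and the units digit as the leaf. The function names and the sample values are hypothetical.

```python
from collections import defaultdict


def stem_and_leaf(values):
    """Bin non-negative integers by stem (tens and above) and leaf (units digit)."""
    groups = defaultdict(list)
    for value in sorted(values):
        stem, leaf = divmod(value, 10)
        groups[stem].append(leaf)
    return dict(groups)


def render(groups):
    """Format the bins as text, one stem per row."""
    rows = []
    for stem in sorted(groups):
        leaves = " ".join(str(leaf) for leaf in groups[stem])
        rows.append(f"{stem:>2} | {leaves}")
    return "\n".join(rows)


# Hypothetical sample data.
data = [27, 12, 34, 21, 15, 29]
groups = stem_and_leaf(data)
print(render(groups))
```

Each row keeps the individual values visible while the row lengths show the frequency distribution, much like a histogram turned on its side.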
Maps are a natural way to represent geographical data. Time and space can be depicted through the use of flow maps. Line strokes are used with various widths and colors to help encode information. Choropleth maps, which encode data through color and geographical region, are also commonly used. Graduated symbol maps are another method to represent geographical data. They are an alternative to choropleth map and use symbols, such as pie charts for each area, over a map. This map allows for more dimensions to be represented using various shapes, sizes, and colors. Cartograms, on the other hand, completely distort the shape of a region and directly encode a data variable. Instead of using a geographic map, regions are redrawn proportionally to the data. For example, each region can be represented by a circle and the size/color is directly proportional to other information, such as population size. [3]
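When regions are drawn as circles sized by a data variable, one common convention (an assumption here, not something the text above prescribes) is to make the circle's area, rather than its radius, proportional to the value. A minimal Python sketch with hypothetical region names and population figures:

```python
import math


def circle_radii(values, max_radius=40.0):
    """Scale radii so that each circle's AREA is proportional to its value.

    The largest value gets max_radius; the others shrink with the square root
    of their share, because area grows with the square of the radius.
    """
    largest = max(values.values())
    return {name: max_radius * math.sqrt(v / largest) for name, v in values.items()}


# Hypothetical populations for three regions.
population = {"North": 4_000_000, "South": 1_000_000, "East": 250_000}
radii = circle_radii(population)
```

Here a region with a quarter of the population gets half the radius. Scaling the radius directly would instead give it a circle with one sixteenth of the area, which visually exaggerates the difference.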
Many data sets, such as spatial entities of countries or common structures for governments, can be organized into natural hierarchies. Node-link diagrams, adjacency diagrams, and enclosure diagrams are all types of infographics that effectively communicate hierarchical data. Node-link diagrams are a popular method due to their tidy and space-efficient results. A node-link diagram is similar to a tree, where each node branches off into multiple sub-sections. An alternative is the adjacency diagram, a space-filling variant of the node-link diagram. Instead of drawing a link between levels of the hierarchy, nodes are drawn as solid areas with sub-sections inside each section. This method allows size to be represented more easily than in node-link diagrams. Enclosure diagrams are also a space-filling visualization method. However, they use containment rather than adjacency to represent the hierarchy. As in the adjacency diagram, the size of the node is easily represented in this model. [3]
Network visualization explores relationships, such as friendships and cliques. Three common types are force-directed layouts, arc diagrams, and matrix views. Force-directed layouts are a common and intuitive approach to network layout. In this system, nodes are similar to charged particles, which repel each other. Links are used to pull related nodes together. Arc diagrams are one-dimensional layouts of nodes with circular arcs linking the nodes. When used properly, with a good ordering of nodes, cliques and bridges are easily identified in this layout. Alternatively, mathematicians and computer scientists more often use matrix views. Each cell (x, y) in the matrix corresponds to the link between node x and node y. By using color and saturation instead of text, values associated with the links can be perceived rapidly. While this method makes it hard to follow paths between nodes, there are no line crossings, which in a large and highly connected network can quickly make a node-link drawing too cluttered. [3]
While all of these visualizations can be effectively used on their own, many modern infographics combine multiple types into one graphic, along with other features, such as illustrations and text. Some modern infographics do not even contain data visualization, and instead are simply colorful and succinct ways to present knowledge. Fifty-three percent of the 30 most-viewed infographics on the infographic sharing site visual.ly did not contain actual data. [37]
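To make the adjacency-diagram idea concrete, the sketch below computes a simple "icicle"-style layout in Python: each node receives a horizontal extent proportional to its size, and children subdivide the extent of their parent one level down. The tree structure, the key names (`name`, `size`, `children`), and the function names are all hypothetical choices for illustration.

```python
def subtree_size(node):
    """Size of a node: its own 'size' if it is a leaf, else the sum of its children."""
    children = node.get("children", [])
    if not children:
        return node["size"]
    return sum(subtree_size(child) for child in children)


def icicle_layout(node, x0=0.0, x1=1.0, depth=0):
    """Return (name, depth, x0, x1) rectangles for an adjacency-style layout."""
    rects = [(node["name"], depth, x0, x1)]
    total = subtree_size(node)
    cursor = x0
    for child in node.get("children", []):
        # Each child gets a share of the parent's width proportional to its size.
        share = (x1 - x0) * subtree_size(child) / total
        rects.extend(icicle_layout(child, cursor, cursor + share, depth + 1))
        cursor += share
    return rects


# Hypothetical hierarchy with leaf sizes.
tree = {
    "name": "root",
    "children": [
        {"name": "A", "children": [{"name": "a1", "size": 1}, {"name": "a2", "size": 1}]},
        {"name": "B", "size": 2},
    ],
}
rects = icicle_layout(tree)
```

Drawing each tuple as a rectangle at row `depth` spanning `x0` to `x1` gives the space-filling picture: the hierarchy is read from which rectangles sit beneath which, and size is read from width. The sketch recomputes subtree sizes repeatedly for brevity, which a real layout would cache.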
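The data structure behind a matrix view can be sketched as follows in Python, assuming an undirected network described by a list of weighted links. The node names, weights, and the function name `adjacency_matrix` are hypothetical.

```python
def adjacency_matrix(nodes, links):
    """Build a symmetric matrix whose cell [i][j] holds the weight of the link i-j."""
    index = {name: i for i, name in enumerate(nodes)}
    matrix = [[0] * len(nodes) for _ in nodes]
    for source, target, weight in links:
        i, j = index[source], index[target]
        matrix[i][j] = weight
        matrix[j][i] = weight  # undirected: mirror the value across the diagonal
    return matrix


# Hypothetical friendship network; weights could be, say, messages exchanged.
nodes = ["Ann", "Bob", "Cy"]
links = [("Ann", "Bob", 3), ("Bob", "Cy", 1)]
matrix = adjacency_matrix(nodes, links)
```

A matrix view would then color each cell by its value, with zero meaning "no link". Because every possible link has its own cell, nothing overlaps however dense the network becomes, although the row and column ordering strongly affects how visible clusters are.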
Comparison infographics are a type of visual representation that focuses on comparing and contrasting different elements, such as products, services, options, or features. These infographics are designed to help viewers make informed decisions by presenting information in a clear and concise manner. Comparison infographics can be highly effective in simplifying complex data and highlighting key differences between multiple items.
Infographics can be created by hand using simple everyday tools such as graph paper, pencils, markers, and rulers. However, today they are more often created using computer software, which is often both faster and easier. They can be created with general illustration software.
Diagrams can be manually created and drawn using software, which can be downloaded for the desktop or used online. Templates can be used to get users started on their diagrams. Additionally, the software allows users to collaborate on diagrams in real time over the Internet.
There are also numerous tools to create very specific types of visualizations, such as creating a visualization based on embedded data in the photos on a user's smartphone. Users can create an infographic of their resume or a "picture of their digital life." [38]
A chart is a graphical representation for data visualization, in which "the data is represented by symbols, such as bars in a bar chart, lines in a line chart, or slices in a pie chart". A chart can represent tabular numeric data, functions, or some kinds of qualitative structure, and can convey different kinds of information.
Information design is the practice of presenting information in a way that fosters an efficient and effective understanding of the information. The term has come to be used for a specific area of graphic design related to displaying information effectively, rather than just attractively or for artistic expression. Information design is closely related to the field of data visualization and is often taught as part of graphic design courses. The broad applications of information design along with its close connections to other fields of design and communication practices have created some overlap in the definitions of communication design, data visualization, and information architecture.
Edward Rolf Tufte, sometimes known as "ET", is an American statistician and professor emeritus of political science, statistics, and computer science at Yale University. He is noted for his writings on information design and as a pioneer in the field of data visualization.
Graphics are visual images or designs on some surface, such as a wall, canvas, screen, paper, or stone, to inform, illustrate, or entertain. In contemporary usage, it includes a pictorial representation of data, as in design and manufacture, in typesetting and the graphic arts, and in educational and recreational software. Images that are generated by a computer are called computer graphics.
A small multiple is a series of similar graphs or charts using the same scale and axes, allowing them to be easily compared. It uses multiple views to show different partitions of a dataset. The term was popularized by Edward Tufte.
A diagram is a symbolic representation of information using visualization techniques. Diagrams have been used since prehistoric times on walls of caves, but became more prevalent during the Enlightenment. Sometimes, the technique uses a three-dimensional visualization which is then projected onto a two-dimensional surface. The word graph is sometimes used as a synonym for diagram.
Visualization, also known as Graphics Visualization, is any technique for creating images, diagrams, or animations to communicate a message. Visualization through visual imagery has been an effective way to communicate both abstract and concrete ideas since the dawn of humanity. Examples from history include cave paintings, Egyptian hieroglyphs, Greek geometry, and Leonardo da Vinci's revolutionary methods of technical drawing for engineering purposes that actively involve scientific requirements.
A pie chart is a circular statistical graphic which is divided into slices to illustrate numerical proportion. In a pie chart, the arc length of each slice is proportional to the quantity it represents. While it is named for its resemblance to a pie which has been sliced, there are variations on the way it can be presented. The earliest known pie chart is generally credited to William Playfair's Statistical Breviary of 1801.
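The proportionality described above amounts to a short calculation: each slice's central angle (and hence its arc length) is its share of the total multiplied by a full turn. A minimal Python sketch with hypothetical budget figures:

```python
def slice_angles(values):
    """Return the central angle in degrees of each slice of a pie chart."""
    total = sum(values.values())
    if total <= 0:
        raise ValueError("a pie chart needs a positive total")
    return {name: value * 360.0 / total for name, value in values.items()}


# Hypothetical monthly budget shares.
budget = {"rent": 50, "food": 30, "other": 20}
angles = slice_angles(budget)
```

Here "rent" makes up half of the total and so spans half of the circle. The angles always sum to 360 degrees, which is why a pie chart can only show parts of a single whole.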
Chartjunk consists of all visual elements in charts and graphs that are not necessary to comprehend the information represented on the graph, or that distract the viewer from this information.
Data and information visualization is the practice of designing and creating easy-to-communicate and easy-to-understand graphic or visual representations of a large amount of complex quantitative and qualitative data and information with the help of static, dynamic or interactive visual items. Typically based on data and information collected from a certain domain of expertise, these visualizations are intended for a broader audience to help them visually explore and discover, quickly understand, interpret and gain important insights into otherwise difficult-to-identify structures, relationships, correlations, local and global patterns, trends, variations, constancy, clusters, outliers and unusual groupings within data. When intended for the general public to convey a concise version of known, specific information in a clear and engaging manner, it is typically called information graphics.
Charles Joseph Minard was a French civil engineer recognized for his significant contribution in the field of information graphics in civil engineering and statistics. Minard was, among other things, noted for his representation of numerical data on geographic maps, especially his flow maps.
Chernoff faces, invented by applied mathematician, statistician, and physicist Herman Chernoff in 1973, display multivariate data in the shape of a human face. The individual parts, such as eyes, ears, mouth, and nose represent values of the variables by their shape, size, placement, and orientation. The idea behind using faces is that humans easily recognize faces and notice small changes without difficulty. Chernoff faces handle each variable differently. Because the features of the faces vary in perceived importance, the way in which variables are mapped to the features should be carefully chosen.
Diagrammatic reasoning is reasoning by means of visual representations. The study of diagrammatic reasoning is about the understanding of concepts and ideas, visualized with the use of diagrams and imagery instead of by linguistic or algebraic means.
Statistical graphics, also known as statistical graphical techniques, are graphics used in the field of statistics for data visualization.
A radial tree, or radial map, is a method of displaying a tree structure in a way that expands outwards, radially. It is one of many ways to visually display a tree, with examples dating back to the early 20th century. In use, it is a type of information graphic.
A motion chart is a dynamic bubble chart which allows efficient and interactive exploration and visualization of longitudinal multivariate data. Motion charts provide mechanisms for mapping ordinal, nominal and quantitative variables onto time, 2D coordinate axes, size, colors, glyphs and appearance characteristics, which facilitate the interactive display of multidimensional and temporal data.
In statistics, a misleading graph, also known as a distorted graph, is a graph that misrepresents data, constituting a misuse of statistics and with the result that an incorrect conclusion may be derived from it.
Howard Gray Funkhouser was an American mathematician, historian and associate professor of mathematics at the Washington and Lee University, and later at the Phillips Exeter Academy, particularly known for his early work on the history of graphical methods.
Graphical perception is the human capacity for visually interpreting information on graphs and charts. Both quantitative and qualitative information can be said to be encoded into the image, and the human capacity to interpret it is sometimes called decoding. The importance of human graphical perception, what we discern easily versus what our brains have more difficulty decoding, is fundamental to good statistical graphics design, where clarity, transparency, accuracy and precision in data display and interpretation are essential for understanding the translation of data in a graph to clarify and interpret the science.
Map layout, also called map composition or (cartographic) page layout, is the part of cartographic design that involves assembling various map elements on a page. This may include the map image itself, along with titles, legends, scale indicators, inset maps, and other elements. It follows principles similar to page layout in graphic design, such as balance, gestalt, and visual hierarchy. The term map composition is also used for the assembling of features and symbols within the map image itself, which can cause some confusion; these two processes share a few common design principles but are distinct procedures in practice. Similar principles of layout design apply to maps produced in a variety of media, from large format wall maps to illustrations in books to interactive web maps, although each medium has unique constraints and opportunities.