Dianne Cook | |
---|---|
Born | Dianne Helen Cook |
Alma mater | University of New England (BSc, DipEd) Rutgers University (MSc, PhD) |
Occupation | Professor of econometrics & business statistics |
Known for | GGobi |
Scientific career | |
Fields | Statistical graphics Data science Multivariate data [1] |
Institutions | Iowa State University (1993–2015) Monash University (2015–present) |
Thesis | Grand Tour and Projection Pursuit (1993) |
Doctoral advisor | Andreas Buja Javier Cabrera [2] |
Doctoral students | Yihui Xie [2] [3] |
Website | www |
Dianne Helen Cook is an Australian statistician, the editor of the Journal of Computational and Graphical Statistics , [4] and an expert on the visualization of high-dimensional data. [5] She is Professor of Business Analytics in the Department of Econometrics and Business Statistics at Monash University [5] and professor emeritus of statistics at Iowa State University. [1] [6] The emeritus status was chosen so that she could continue to supervise graduate students at Iowa State after moving to Australia. [7] [8] [9]
Dianne Helen Cook [10] grew up in Wauchope, New South Wales as an athletic farm girl, the first woman to play on her local (men's) cricket team. She studied statistics at University of New England (Australia), [4] where she earned a BSc and Dip.Ed. in 1982. [10] She received her MS in 1990 and her PhD in 1993 from Rutgers University; her dissertation, supervised jointly by Andreas Buja and Javier Cabrera, was Grand Tour and Projection Pursuit. [10] [2]
Cook joined the Iowa State University faculty in 1993, and remained there until her move to Monash University in 2015. [10] At Iowa State, her students have included Hadley Wickham and Yihui Xie. [2]
She is one of the developers of GGobi, and with Deborah F. Swayne, she is the author of Interactive and Dynamic Graphics for Data Analysis: With R and GGobi (Springer, 2007). [11]
She is a Fellow of the American Statistical Association. [5] She was editor of the Journal of Computational and Graphical Statistics from 2016 to 2018. [12]
In statistics, exploratory data analysis (EDA) is an approach of analyzing data sets to summarize their main characteristics, often using statistical graphics and other data visualization methods. A statistical model can be used or not, but primarily EDA is for seeing what the data can tell us beyond the formal modeling and thereby contrasts traditional hypothesis testing. Exploratory data analysis has been promoted by John Tukey since 1970 to encourage statisticians to explore the data, and possibly formulate hypotheses that could lead to new data collection and experiments. EDA is different from initial data analysis (IDA), which focuses more narrowly on checking assumptions required for model fitting and hypothesis testing, and handling missing values and making transformations of variables as needed. EDA encompasses IDA.
Visual Molecular Dynamics (VMD) is a molecular modelling and visualization computer program. VMD is developed mainly as a tool to view and analyze the results of molecular dynamics simulations. It also includes tools for working with volumetric data, sequence data, and arbitrary graphics objects. Molecular scenes can be exported to external rendering tools such as POV-Ray, RenderMan, Tachyon, Virtual Reality Modeling Language (VRML), and many others. Users can run their own Tcl and Python scripts within VMD as it includes embedded Tcl and Python interpreters. VMD runs on Unix, Apple Mac macOS, and Microsoft Windows. VMD is available to non-commercial users under a distribution-specific license which permits both use of the program and modification of its source code, at no charge.
Parallel rendering is the application of parallel programming to the computational domain of computer graphics. Rendering graphics can require massive computational resources for complex scenes that arise in scientific visualization, medical visualization, CAD applications, and virtual reality. Recent research has also suggested that parallel rendering can be applied to mobile gaming to decrease power consumption and increase graphical fidelity. Rendering is an embarrassingly parallel workload in multiple domains and thus has been the subject of much research.
JMP is a suite of computer programs for statistical analysis and machine learning developed by JMP, a subsidiary of SAS Institute. The program was launched in 1989 to take advantage of the graphical user interface introduced by the Macintosh operating systems. It has since been significantly rewritten and made available for the Windows operating system.
XLispStat is a statistical scientific package based on the XLISP language.
The Statistics Online Computational Resource (SOCR) is an online multi-institutional research and education organization. SOCR designs, validates and broadly shares a suite of online tools for statistical computing, and interactive materials for hands-on learning and teaching concepts in data science, statistical analysis and probability theory. The SOCR resources are platform agnostic based on HTML, XML and Java, and all materials, tools and services are freely available over the Internet.
Data and information visualization is the practice of designing and creating easy-to-communicate and easy-to-understand graphic or visual representations of a large amount of complex quantitative and qualitative data and information with the help of static, dynamic or interactive visual items. Typically based on data and information collected from a certain domain of expertise, these visualizations are intended for a broader audience to help them visually explore and discover, quickly understand, interpret and gain important insights into otherwise difficult-to-identify structures, relationships, correlations, local and global patterns, trends, variations, constancy, clusters, outliers and unusual groupings within data. When intended for the general public to convey a concise version of known, specific information in a clear and engaging manner, it is typically called information graphics.
GGobi is a free statistical software tool for interactive data visualization. GGobi allows extensive exploration of the data with Interactive dynamic graphics. It is also a tool for looking at multivariate data. R can be used in sync with GGobi. The GGobi software can be embedded as a library in other programs and program packages using an application programming interface (API) or as an add-on to existing languages and scripting environments, e.g., with the R command line or from a Perl or Python scripts. GGobi prides itself on its ability to link multiple graphs together.
Michael Louis Friendly is an American-Canadian psychologist, Professor of Psychology at York University in Ontario, Canada, and director of its Statistical Consulting Service, especially known for his contributions to graphical methods for categorical and multivariate data, and on the history of data and information visualisation.
Leland Wilkinson was an American statistician and computer scientist at H2O.ai and Adjunct Professor of Computer Science at University of Illinois at Chicago. Wilkinson developed the SYSTAT statistical package in the early 1980s, sold it to SPSS in 1995, and worked at SPSS for 10 years recruiting and managing the visualization team. He left SPSS in 2008 and became Executive VP of SYSTAT Software Inc. in Chicago. He then served as the VP of Data Visualization at Skytree, Inc and VP of Statistics at Tableau Software before joining H2O.ai. His research focused on scientific visualization and statistical graphics. In these communities he was well known for his book The Grammar of Graphics, which was the foundation for the R package ggplot2.
A motion chart is a dynamic bubble chart which allows efficient and interactive exploration and visualization of longitudinal multivariate data. Motion charts provide mechanisms for mapping ordinal, nominal and quantitative variables onto time, 2D coordinate axes, size, colors, glyphs and appearance characteristics, which facilitate the interactive display of multidimensional and temporal data.
ggplot2 is an open-source data visualization package for the statistical programming language R. Created by Hadley Wickham in 2005, ggplot2 is an implementation of Leland Wilkinson's Grammar of Graphics—a general scheme for data visualization which breaks up graphs into semantic components such as scales and layers. ggplot2 can serve as a replacement for the base graphics in R and contains a number of defaults for web and print display of common scales. Since 2005, ggplot2 has grown in use to become one of the most popular R packages.
RStudio IDE is an integrated development environment for R, a programming language for statistical computing and graphics. It is available in two formats: RStudio Desktop is a regular desktop application while RStudio Server runs on a remote server and allows accessing RStudio using a web browser. The RStudio IDE is a product of Posit PBC.
Hadley Alexander Wickham is a New Zealand statistician known for his work on open-source software for the R statistical programming environment. He is the chief scientist at Posit PBC and an adjunct professor of statistics at the University of Auckland, Stanford University, and Rice University. His work includes the data visualisation system ggplot2 and the tidyverse, a collection of R packages for data science based on the concept of tidy data.
Yihui Xie is a Chinese statistician, data scientist and software engineer who formerly worked for Posit PBC. He is the principal author of the open-source software package Knitr for data analysis in the R programming language, and has also written the book Dynamic Documents with R and knitr.
Heike Hofmann is a statistician and Professor in the Department of Statistics at Iowa State University.
Deborah F. Swayne is an American statistician who worked for AT&T Labs and chaired the Section on Statistical Graphics of the American Statistical Association. She is known for her work as coauthor of GGobi, a software tool for interactive data visualization, and is president of the GGobi Foundation. She retired in 2016.
Andreas Buja is a Swiss statistician and professor of statistics. He is the Liem Sioe Liong/First Pacific Company professor in the Statistics department of The Wharton School at the University of Pennsylvania in Philadelphia, United States. Buja joined Center for Computational Mathematics (CCM) as a Senior Research Scientist in January 2020.
Luke Tierney is an American statistician and computer scientist. A fellow of the Institute of Mathematical Statistics since 1988 and of the American Statistical Association since 1991, Tierney is currently a professor of statistics at the University of Iowa. Through his past work on programming languages such as R and Lisp, Tierney now holds a position on the developing team known as the R Core.