Parallel analysis

Last updated

Parallel analysis, also known as Horn's parallel analysis, is a statistical method used to determine the number of components to keep in a principal component analysis or factors to keep in an exploratory factor analysis. It is named after psychologist John L. Horn, who created the method, publishing it in the journal Psychometrika in 1965. [1] The method compares the eigenvalues generated from the data matrix to the eigenvalues generated from a Monte-Carlo simulated matrix created from random data of the same size. [2]

Contents

Evaluation and comparison with alternatives

Parallel analysis is regarded as one of the more accurate methods for determining the number of factors or components to retain. In particular, unlike early approaches to dimensionality estimation (such as examining scree plots), has the virtue of an objective decision criterion. [3] Since its original publication, multiple variations of parallel analysis have been proposed. [4] [5] Other methods of determining the number of factors or components to retain in an analysis include the scree plot, Kaiser rule, or Velicer's MAP test. [6]

Anton Formann provided both theoretical and empirical evidence that parallel analysis's application might not be appropriate in many cases since its performance is influenced by sample size, item discrimination, and type of correlation coefficient. [7]

Implementation

Parallel analysis has been implemented in JASP, SPSS, SAS, STATA, and MATLAB [8] [9] [10] and in multiple packages for the R programming language, including the psych [11] [12] multicon, [13] hornpa, [14] and paran packages. [15] [16]

See also

Related Research Articles

Multivariate statistics is a subdivision of statistics encompassing the simultaneous observation and analysis of more than one outcome variable, i.e., multivariate random variables. Multivariate statistics concerns understanding the different aims and background of each of the different forms of multivariate analysis, and how they relate to each other. The practical application of multivariate statistics to a particular problem may involve several types of univariate and multivariate analyses in order to understand the relationships between variables and their relevance to the problem being studied.

Psychological statistics is application of formulas, theorems, numbers and laws to psychology. Statistical methods for psychology include development and application statistical theory and methods for modeling psychological data. These methods include psychometrics, factor analysis, experimental designs, and Bayesian statistics. The article also discusses journals in the same field.

Psychometrics is a field of study within psychology concerned with the theory and technique of measurement. Psychometrics generally covers specialized fields within psychology and education devoted to testing, measurement, assessment, and related activities. Psychometrics is concerned with the objective measurement of latent constructs that cannot be directly observed. Examples of latent constructs include intelligence, introversion, mental disorders, and educational achievement. The levels of individuals on nonobservable latent variables are inferred through mathematical modeling based on what is observed from individuals' responses to items on tests and scales.

<span class="mw-page-title-main">Principal component analysis</span> Method of data analysis

Principal component analysis (PCA) is a linear dimensionality reduction technique with applications in exploratory data analysis, visualization and data preprocessing.

Factor analysis is a statistical method used to describe variability among observed, correlated variables in terms of a potentially lower number of unobserved variables called factors. For example, it is possible that variations in six observed variables mainly reflect the variations in two unobserved (underlying) variables. Factor analysis searches for such joint variations in response to unobserved latent variables. The observed variables are modelled as linear combinations of the potential factors plus "error" terms, hence factor analysis can be thought of as a special case of errors-in-variables models.

Cronbach's alpha, also known as tau-equivalent reliability or coefficient alpha, is a reliability coefficient and a measure of the internal consistency of tests and measures. It was named after the American psychologist Lee Cronbach.

In statistics and research, internal consistency is typically a measure based on the correlations between different items on the same test. It measures whether several items that propose to measure the same general construct produce similar scores. For example, if a respondent expressed agreement with the statements "I like to ride bicycles" and "I've enjoyed riding bicycles in the past", and disagreement with the statement "I hate bicycles", this would be indicative of good internal consistency of the test.

Norman Cliff is an American psychologist. He received his Ph.D. from Princeton in psychometrics in 1957. After research positions in the US Public Health Service and at Educational Testing Service he joined the University of Southern California in 1962. He has had a number of research interests, including quantification of cognitive processes, scaling and measurement theory, computer-interactive psychological measurement, multivariate statistics, and ordinal methods. One of his major contributions to psychometrics was the method for rotation of canonical components. Asserting that much of psychological data have only ordinal justification, Cliff also published various papers and a book on ordinal methods for research. On the one hand this included extensions to the established ordinal methods for correlating data. However, on the other hand, Cliff also suggested that there are viable and robust ordinal alternatives to mean comparisons. He introduced a measure of proportional difference between two sets of data often referred to as Cliff's delta. He has been president of the Psychometric Society and of the Society for Multivariate Experimental Psychology. Now an Emeritus Professor, he lives in New Mexico.

William Roger Revelle is a psychology professor at Northwestern University working in personality psychology. Revelle studies the biological basis of personality and motivation, psychometric theory, the structure of daily mood, and models of attention and memory.

Latent growth modeling is a statistical technique used in the structural equation modeling (SEM) framework to estimate growth trajectories. It is a longitudinal analysis technique to estimate growth over a period of time. It is widely used in the field of psychology, behavioral science, education and social science. It is also called latent growth curve analysis. The latent growth model was derived from theories of SEM. General purpose SEM software, such as OpenMx, lavaan, AMOS, Mplus, LISREL, or EQS among others may be used to estimate growth trajectories.

<span class="mw-page-title-main">Scree plot</span> Diagnostic plot

In multivariate statistics, a scree plot is a line plot of the eigenvalues of factors or principal components in an analysis. The scree plot is used to determine the number of factors to retain in an exploratory factor analysis (FA) or principal components to keep in a principal component analysis (PCA). The procedure of finding statistically significant factors or components using a scree plot is also known as a scree test. Raymond B. Cattell introduced the scree plot in 1966.

In statistics, confirmatory factor analysis (CFA) is a special form of factor analysis, most commonly used in social science research. It is used to test whether measures of a construct are consistent with a researcher's understanding of the nature of that construct. As such, the objective of confirmatory factor analysis is to test whether the data fit a hypothesized measurement model. This hypothesized model is based on theory and/or previous analytic research. CFA was first developed by Jöreskog (1969) and has built upon and replaced older methods of analyzing construct validity such as the MTMM Matrix as described in Campbell & Fiske (1959).

Mauchly's sphericity test or Mauchly's W is a statistical test used to validate a repeated measures analysis of variance (ANOVA). It was developed in 1940 by John Mauchly.

<i>Psychometrika</i> Academic journal

Psychometrika is the official journal of the Psychometric Society, a professional body dedicated to psychometrics and quantitative psychology. The journal focuses on quantitative methods for the measurement and evaluation of human behavior, including statistical methods and other mathematical techniques. Past editors include Marion Richardson, Dorothy Adkins, Norman Cliff, and Willem J. Heiser. According to Journal Citation Reports, the journal had an impact factor of 1.959 in 2019.

<span class="mw-page-title-main">Exploratory factor analysis</span> Statistical method in psychology

In multivariate statistics, exploratory factor analysis (EFA) is a statistical method used to uncover the underlying structure of a relatively large set of variables. EFA is a technique within factor analysis whose overarching goal is to identify the underlying relationships between measured variables. It is commonly used by researchers when developing a scale and serves to identify a set of latent constructs underlying a battery of measured variables. It should be used when the researcher has no a priori hypothesis about factors or patterns of measured variables. Measured variables are any one of several attributes of people that may be observed and measured. Examples of measured variables could be the physical height, weight, and pulse rate of a human being. Usually, researchers would have a large number of measured variables, which are assumed to be related to a smaller number of "unobserved" factors. Researchers must carefully consider the number of measured variables to include in the analysis. EFA procedures are more accurate when each factor is represented by multiple measured variables in the analysis.

<span class="mw-page-title-main">Anton Formann</span> Austrian research psychologist, statistician and psychometrician

Anton K. Formann was an Austrian research psychologist, statistician, and psychometrician. He is renowned for his contributions to item response theory, latent class analysis, the measurement of change, mixture models, categorical data analysis, and quantitative methods for research synthesis (meta-analysis).

Test construction strategies are the various ways that items in a psychological measure are created and decided upon. They are most often associated with personality tests but can also be applied to other psychological constructs such as mood or psychopathology. There are three commonly used general strategies: inductive, deductive, and empirical. Scales created today will often incorporate elements of all three methods.

<i>The Vectors of Mind</i> Book published by psychologist Louis Leon Thurstone

The Vectors of Mind is a book published by American psychologist Louis Leon Thurstone in 1935 that summarized Thurstone's methodology for multiple factor analysis.

In statistical models applied to psychometrics, congeneric reliability a single-administration test score reliability coefficient, commonly referred to as composite reliability, construct reliability, and coefficient omega. is a structural equation model (SEM)-based reliability coefficients and is obtained from on a unidimensional model. is the second most commonly used reliability factor after tau-equivalent reliability(; also known as Cronbach's alpha), and is often recommended as its alternative.

The Kaiser–Meyer–Olkin (KMO) test is a statistical measure to determine how suited data is for factor analysis. The test measures sampling adequacy for each variable in the model and the complete model. The statistic is a measure of the proportion of variance among variables that might be common variance. The higher the proportion, the higher the KMO-value, the more suited the data is to factor analysis.

References

  1. Horn, John L. (June 1965). "A rationale and test for the number of factors in factor analysis". Psychometrika. 30 (2): 179–185. doi:10.1007/bf02289447. PMID   14306381. S2CID   19663974.
  2. Mike Allen (11 April 2017). The SAGE Encyclopedia of Communication Research Methods. SAGE Publications. p. 518. ISBN   978-1-4833-8142-8.
  3. Zwick, William R.; Velicer, Wayne F. (1986). "Comparison of five rules for determining the number of components to retain". Psychological Bulletin. 99 (3): 432–442. doi:10.1037/0033-2909.99.3.432.
  4. Glorfeld, Louis W. (2 July 2016). "An Improvement on Horn's Parallel Analysis Methodology for Selecting the Correct Number of Factors to Retain". Educational and Psychological Measurement. 55 (3): 377–393. doi:10.1177/0013164495055003002. S2CID   123508406.
  5. Crawford, Aaron V.; Green, Samuel B.; Levy, Roy; Lo, Wen-Juo; Scott, Lietta; Svetina, Dubravka; Thompson, Marilyn S. (September 2010). "Evaluation of Parallel Analysis Methods for Determining the Number of Factors". Educational and Psychological Measurement. 70 (6): 885–901. doi:10.1177/0013164410379332. S2CID   63269411.
  6. Velicer, W.F. (1976). "Determining the number of components from the matrix of partial correlations". Psychometrika. 41 (3): 321–327. doi:10.1007/bf02293557. S2CID   122907389.
  7. Tran, U. S.; Formann, A. K. (2009). "Performance of parallel analysis in retrieving unidimensionality in the presence of binary data". Educational and Psychological Measurement. 69: 50–61. doi:10.1177/0013164408318761. S2CID   143051337.
  8. Hayton, James C.; Allen, David G.; Scarpello, Vida (29 June 2016). "Factor Retention Decisions in Exploratory Factor Analysis: a Tutorial on Parallel Analysis". Organizational Research Methods. 7 (2): 191–205. doi:10.1177/1094428104263675. S2CID   61286653.
  9. O'Connor, Brian. "Programs for Number of Components and Factors". people.ok.ubc.ca.
  10. O’connor, Brian P. (September 2000). "SPSS and SAS programs for determining the number of components using parallel analysis and Velicer's MAP test". Behavior Research Methods, Instruments, & Computers. 32 (3): 396–402. doi: 10.3758/BF03200807 . PMID   11029811.
  11. Revelle, William (2007). "Determining the number of factors: the example of the NEO-PI-R" (PDF).{{cite journal}}: Cite journal requires |journal= (help)
  12. Revelle, William (8 January 2020). "psych: Procedures for Psychological, Psychometric, and PersonalityResearch".
  13. Sherman, Ryne A. (2 February 2015). "multicon: Multivariate Constructs".
  14. Huang, Francis (3 March 2015). "hornpa: Horn's (1965) Test to Determine the Number of Components/Factors".
  15. Dinno, Alexis. "Gently Clarifying the Application of Horn's Parallel Analysis to Principal Component Analysis Versus Factor Analysis" (PDF).{{cite journal}}: Cite journal requires |journal= (help)
  16. Dinno, Alexis (14 October 2018). "paran: Horn's Test of Principal Components/Factors".{{cite journal}}: Cite journal requires |journal= (help)