Cross-lagged panel model

Last updated

The cross-lagged panel model is a type of discrete time structural equation model used to analyze panel data in which two or more variables are repeatedly measured at two or more different time points. This model aims to estimate the directional effects that one variable has on another at different points in time. [1] [2] This model was first introduced in 1963 by Donald T. Campbell and refined during the 1970s by David A. Kenny. [3] Kenny has described it as follows: "Two variables, X and Y, are measured at two times, 1 and 2, resulting in four measures, X1, Y1, X2, and Y2. With these four measures, there are six possible relations among them – two synchronous or cross‐sectional relations (see cross‐sectional design) (between X1 and Y1 and between X2 and Y2), two stability relations (between X1 and X2 and between Y1 and Y2), and two cross‐lagged relations (between X1 and Y2 and between Y1 and X2)." [4] Though this approach is commonly believed to be a valid technique to identify causal relationships from panel data, its use for this purpose has been criticized, as it depends on certain assumptions, such as synchronicity and stationarity, that may not be valid. [5] [6] [7]

Related Research Articles

Simultaneous equations models are a type of statistical model in which the dependent variables are functions of other dependent variables, rather than just independent variables. This means some of the explanatory variables are jointly determined with the dependent variable, which in economics usually is the consequence of some underlying equilibrium mechanism. Take the typical supply and demand model: whilst typically one would determine the quantity supplied and demanded to be a function of the price set by the market, it is also possible for the reverse to be true, where producers observe the quantity that consumers demand and then set the price.

<span class="mw-page-title-main">Canonical correlation</span> Way of inferring information from cross-covariance matrices

In statistics, canonical-correlation analysis (CCA), also called canonical variates analysis, is a way of inferring information from cross-covariance matrices. If we have two vectors X = (X1, ..., Xn) and Y = (Y1, ..., Ym) of random variables, and there are correlations among the variables, then canonical-correlation analysis will find linear combinations of X and Y which have maximum correlation with each other. T. R. Knapp notes that "virtually all of the commonly encountered parametric tests of significance can be treated as special cases of canonical-correlation analysis, which is the general procedure for investigating the relationships between two sets of variables." The method was first introduced by Harold Hotelling in 1936, although in the context of angles between flats the mathematical concept was published by Jordan in 1875.

The general linear model or general multivariate regression model is a compact way of simultaneously writing several multiple linear regression models. In that sense it is not a separate statistical linear model. The various multiple linear regression models may be compactly written as

<span class="mw-page-title-main">David C. Geary</span> American evolutionary psychologist

David Cyril Geary is a United States cognitive developmental and evolutionary psychologist with interests in mathematical learning and sex differences. He is currently a Curators’ Professor and Thomas Jefferson Fellow in the Department of Psychological Sciences and Interdisciplinary Neuroscience Program at the University of Missouri in Columbia, Missouri.

<span class="mw-page-title-main">Structural equation modeling</span> Form of causal modeling that fit networks of constructs to data

Structural equation modeling (SEM) is a label for a diverse set of methods used by scientists in both experimental and observational research across the sciences, business, and other fields. It is used most in the social and behavioral sciences. A definition of SEM is difficult without reference to highly technical language, but a good starting place is the name itself.

Panel (data) analysis is a statistical method, widely used in social science, epidemiology, and econometrics to analyze two-dimensional panel data. The data are usually collected over time and over the same individuals and then a regression is run over these two dimensions. Multidimensional analysis is an econometric method in which data are collected over more than two dimensions.

Vector autoregression (VAR) is a statistical model used to capture the relationship between multiple quantities as they change over time. VAR is a type of stochastic process model. VAR models generalize the single-variable (univariate) autoregressive model by allowing for multivariate time series. VAR models are often used in economics and the natural sciences.

In statistics and econometrics, panel data and longitudinal data are both multi-dimensional data involving measurements over time. Panel data is a subset of longitudinal data where observations are for the same subjects each time.

In statistics, unit-weighted regression is a simplified and robust version of multiple regression analysis where only the intercept term is estimated. That is, it fits a model

Multilevel models are statistical models of parameters that vary at more than one level. An example could be a model of student performance that contains measures for individual students as well as measures for classrooms within which the students are grouped. These models can be seen as generalizations of linear models, although they can also extend to non-linear models. These models became much more popular after sufficient computing power and software became available.

<span class="mw-page-title-main">Mediation (statistics)</span> Statistical model

In statistics, a mediation model seeks to identify and explain the mechanism or process that underlies an observed relationship between an independent variable and a dependent variable via the inclusion of a third hypothetical variable, known as a mediator variable. Rather than a direct causal relationship between the independent variable and the dependent variable, a mediation model proposes that the independent variable influences the mediator variable, which in turn influences the dependent variable. Thus, the mediator variable serves to clarify the nature of the relationship between the independent and dependent variables.

<span class="mw-page-title-main">Diana Baumrind</span> Clinical and developmental psychologist

Diana Blumberg Baumrind was a clinical and developmental psychologist known for her research on parenting styles and for her critique of the use of deception in psychological research.

In statistics, confirmatory factor analysis (CFA) is a special form of factor analysis, most commonly used in social science research. It is used to test whether measures of a construct are consistent with a researcher's understanding of the nature of that construct. As such, the objective of confirmatory factor analysis is to test whether the data fit a hypothesized measurement model. This hypothesized model is based on theory and/or previous analytic research. CFA was first developed by Jöreskog (1969) and has built upon and replaced older methods of analyzing construct validity such as the MTMM Matrix as described in Campbell & Fiske (1959).

In statistics and econometrics, a distributed lag model is a model for time series data in which a regression equation is used to predict current values of a dependent variable based on both the current values of an explanatory variable and the lagged values of this explanatory variable.

Positive affectivity (PA) is a human characteristic that describes how much people experience positive affects ; and as a consequence how they interact with others and with their surroundings.

In statistics and regression analysis, moderation occurs when the relationship between two variables depends on a third variable. The third variable is referred to as the moderator variable or simply the moderator. The effect of a moderating variable is characterized statistically as an interaction; that is, a categorical or continuous variable that is associated with the direction and/or magnitude of the relation between dependent and independent variables. Specifically within a correlational analysis framework, a moderator is a third variable that affects the zero-order correlation between two other variables, or the value of the slope of the dependent variable on the independent variable. In analysis of variance (ANOVA) terms, a basic moderator effect can be represented as an interaction between a focal independent variable and a factor that specifies the appropriate conditions for its operation.

Impression formation in social psychology refers to the processes by which different pieces of knowledge about another are combined into a global or summary impression. Social psychologist Solomon Asch is credited with the seminal research on impression formation and conducted research on how individuals integrate information about personality traits. Two major theories have been proposed to explain how this process of integration takes place. The Gestalt approach views the formation of a general impression as the sum of several interrelated impressions. As an individual seeks to form a coherent and meaningful impression of another individual, previous impressions significantly influence the interpretation of subsequent information. In contrast to the Gestalt approach, the cognitive algebra approach asserts that individuals' experiences are combined with previous evaluations to form a constantly changing impression of a person. A related area to impression formation is the study of person perception, making dispositional attributions, and then adjusting those inferences based on the information available.

In econometrics, the Arellano–Bond estimator is a generalized method of moments estimator used to estimate dynamic models of panel data. It was proposed in 1991 by Manuel Arellano and Stephen Bond, based on the earlier work by Alok Bhargava and John Denis Sargan in 1983, for addressing certain endogeneity problems. The GMM-SYS estimator is a system that contains both the levels and the first difference equations. It provides an alternative to the standard first difference GMM estimator.

<span class="mw-page-title-main">Ellen Hamaker</span> Dutch-American psychologist, and statistician

Ellen Louise "E.L." Hamaker is a Dutch-American psychologist, and statistician. Since 2018 she has been a full professor at Utrecht University, holding the chair Longitudinal Data Analysis at the Department of Methodology and Statistics. Her work focuses on the development of statistical models for the analysis of intensive longitudinal data in psychology, mainly within the frameworks of structural equation modeling and time series analysis.

Relationship science is an interdisciplinary field dedicated to the scientific study of interpersonal relationship processes. Due to its interdisciplinary nature, relationship science is made-up of researchers of various professional backgrounds within psychology and outside of psychology, but most researchers who identify with the field are psychologists by training. Additionally, the field's emphasis has historically been close and intimate relationships, which includes predominantly dating & married couples, parent-child relationships, and friendships & social networks, but some also study less salient social relationships such as colleagues and acquaintances.

References

  1. Kuiper, Rebecca M.; Ryan, Oisín (2018-09-03). "Drawing Conclusions from Cross-Lagged Relationships: Re-Considering the Role of the Time-Interval". Structural Equation Modeling . 25 (5): 809–823. doi: 10.1080/10705511.2018.1431046 . ISSN   1070-5511.
  2. "Cross-Lagged Panel Analysis". The SAGE Encyclopedia of Communication Research Methods. 2455 Teller Road, Thousand Oaks, California, 91320: SAGE Publications, Inc. 2017. doi:10.4135/9781483381411.n117. ISBN   978-1-4833-8143-5.{{cite encyclopedia}}: CS1 maint: location (link)
  3. Berry, Daniel; Willoughby, Michael T. (July 2017). "On the Practical Interpretability of Cross-Lagged Panel Models: Rethinking a Developmental Workhorse". Child Development . 88 (4): 1186–1206. doi:10.1111/cdev.12660. PMID   27878996.
  4. Kenny, David A. (2014-09-29). "Cross-Lagged Panel Design". Wiley StatsRef: Statistics Reference Online. Chichester, UK: John Wiley & Sons, Ltd. pp. stat06464. doi:10.1002/9781118445112.stat06464. ISBN   978-1-118-44511-2.
  5. Ellen, Hamaker; Rebecca, Kuiper; Raoul, Grasman (March 2015). "A Critique of the Cross-Lagged Panel Model" (PDF). Psychological Methods . 20 (1): 102–116. doi:10.1037/a0038889. PMID   25822208.
  6. Mund, Marcus; Nestler, Steffen (September 2019). "Beyond the Cross-Lagged Panel Model: Next-generation statistical tools for analyzing interdependencies across the life course". Advances in Life Course Research . 41: 100249. doi:10.1016/j.alcr.2018.10.002. S2CID   150324087.
  7. Kenny, David A. (1975). "Cross-lagged panel correlation: A test for spuriousness". Psychological Bulletin . 82 (6): 887–903. doi:10.1037/0033-2909.82.6.887. ISSN   0033-2909.