Qualitative comparative analysis

In statistics, qualitative comparative analysis (QCA) is a data analysis technique based on set theory that examines the relationship of conditions to an outcome. QCA describes the relationship in terms of necessary conditions and sufficient conditions. [1] The technique was originally developed by Charles Ragin in 1987 [2] to study data sets that are too small for linear regression analysis but large for cross-case analysis. [3]

Summary of technique

In the case of categorical variables, QCA begins by listing and counting all types of cases which occur, where each type of case is defined by its unique combination of values of its independent and dependent variables. For instance, if there were four categorical variables of interest, {A,B,C,D}, and A and B were dichotomous (could take on two values), C could take on five values, and D could take on three, then there would be 2 × 2 × 5 × 3 = 60 possible types of observations determined by the possible combinations of variable values, not all of which would necessarily occur in real life. By counting the number of observations that exist for each of the 60 unique combinations of variable values, QCA can determine which descriptive inferences or implications are empirically supported by a data set. Thus, the input to QCA is a data set of any size, from small-N to large-N, and the output of QCA is a set of descriptive inferences or implications the data supports.
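The counting step can be sketched in a few lines of Python. This is a minimal illustration rather than a reference implementation: the value labels in `DOMAINS`, the helper names, and the four observations are all made up for the example; only the sizes of the value sets (2, 2, 5 and 3) come from the text above.

```python
from collections import Counter
from itertools import product

# Hypothetical value sets: A and B dichotomous, C with five values, D with three.
DOMAINS = {
    "A": (0, 1),
    "B": (0, 1),
    "C": (1, 2, 3, 4, 5),
    "D": ("low", "mid", "high"),
}


def possible_case_types(domains):
    """Return every combination of variable values as a tuple."""
    return list(product(*domains.values()))


def count_case_types(observations, domains):
    """Count how many observations fall into each case type."""
    names = list(domains)
    return Counter(tuple(obs[name] for name in names) for obs in observations)


# Made-up observations, for illustration only.
observations = [
    {"A": 1, "B": 1, "C": 2, "D": "high"},
    {"A": 1, "B": 1, "C": 4, "D": "high"},
    {"A": 0, "B": 1, "C": 2, "D": "low"},
    {"A": 1, "B": 1, "C": 2, "D": "high"},
]

case_types = possible_case_types(DOMAINS)
counts = count_case_types(observations, DOMAINS)
# Case types that are logically possible but never observed in the data.
unobserved = [t for t in case_types if counts[t] == 0]

print(len(case_types))             # 60
print(counts[(1, 1, 2, "high")])   # 2
print(len(unobserved))             # 57
```

Most of the 60 possible case types are unobserved in this toy data set, which mirrors the point that not every combination need occur in real life.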

In QCA's next step, inferential logic or Boolean algebra is used to simplify or reduce the number of inferences to the minimum set of inferences supported by the data. This reduced set of inferences is termed the "prime implicants" by QCA adherents. For instance, if the presence of conditions A and B is always associated with the presence of a particular value of D, regardless of the observed value of C, then the value that C takes is irrelevant. Thus, all five inferences involving A and B and any of the five values of C may be replaced by the single descriptive inference "(A and B) implies the particular value of D".
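A single reduction step of this kind can be sketched as follows. The sketch handles only the example above (dropping C when it makes no difference); it is not a full Boolean minimization algorithm. The inference table and the function name are hypothetical. As a simplifying assumption, C is dropped only when all five of its values were actually observed with the same value of D.

```python
C_VALUES = (1, 2, 3, 4, 5)

# Hypothetical inferences from the counting step:
# (A, B, C) -> observed value of D.
inferences = {(1, 1, c): "high" for c in C_VALUES}
inferences[(0, 1, 2)] = "low"
inferences[(0, 1, 3)] = "high"


def drop_irrelevant_c(inferences, c_values):
    """Merge inferences that agree on D for every value of C.

    Keys of the result are (A, B, C), with C replaced by None
    wherever C was found to be irrelevant.
    """
    reduced = {}
    ab_pairs = {(a, b) for (a, b, _c) in inferences}
    for a, b in sorted(ab_pairs):
        outcomes = [inferences.get((a, b, c)) for c in c_values]
        if None not in outcomes and len(set(outcomes)) == 1:
            # Same D for every value of C: one inference replaces five.
            reduced[(a, b, None)] = outcomes[0]
        else:
            # C matters (or was not fully observed): keep the originals.
            for c in c_values:
                if (a, b, c) in inferences:
                    reduced[(a, b, c)] = inferences[(a, b, c)]
    return reduced


reduced = drop_irrelevant_c(inferences, C_VALUES)
for key, d_value in sorted(reduced.items(), key=str):
    print(key, "->", d_value)
```

Here the five inferences with A = 1 and B = 1 collapse into the single entry `(1, 1, None)`, while the two inferences with A = 0 and B = 1 are kept because D differs across values of C.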

To establish that the prime implicants or descriptive inferences derived from the data by the QCA method are causal requires establishing the existence of a causal mechanism using another method such as process tracing, formal logic, intervening variables, or established multidisciplinary knowledge. [4] The method is used in social science and is based on the binary logic of Boolean algebra, and attempts to ensure that all possible combinations of variables that can be made across the cases under investigation are considered.

Motivation

The technique of listing case types by potential variable combinations assists with case selection by making investigators aware of all possible case types that would need to be investigated, at a minimum, if they exist, in order to test a certain hypothesis or to derive new inferences from an existing data set. In situations where the available observations constitute the entire population of cases, this method alleviates the small N problem by allowing inferences to be drawn by evaluating and comparing the number of cases exhibiting each combination of variables. The small N problem arises when the number of units of analysis (e.g. countries) available is inherently limited. For example, a study where countries are the unit of analysis is limited in that there are only a limited number of countries in the world (fewer than 200), fewer than necessary for some (probabilistic) statistical techniques. According to Ragin, maximizing the number of comparisons that can be made across the cases under investigation makes causal inferences possible. [5] This technique allows the identification of multiple causal pathways and interaction effects that may not be detectable via statistical analysis, which typically requires its data set to conform to one model. Thus, it is the first step in identifying subsets of a data set conforming to a particular causal pathway based on the combinations of covariates, prior to quantitative statistical analyses testing conformance to a model; and it helps qualitative researchers to correctly limit the scope of claimed findings to the type of observations they analyze.

Criticism

As this is a logical (deterministic) and not a statistical (probabilistic) technique, with "crisp-set" QCA (csQCA), the original application of QCA, variables can only have two values, which is problematic as the researcher has to determine the values of each variable. For example, GDP per capita has to be divided by the researcher into two categories (e.g. low = 0 and high = 1). But as this variable is essentially continuous, the division will always be arbitrary. A second, related problem is that the technique does not allow an assessment of the effect of the relative strengths of the independent variables (as they can only have two values). [5] Ragin, and other scholars such as Lasse Cronqvist, have tried to deal with these issues by developing new tools that extend QCA, such as multi-value QCA (mvQCA) and fuzzy set QCA (fsQCA). Multi-value QCA is simply QCA applied to observations having categorical variables with more than two values; crisp-set QCA can be considered a special case of multi-value QCA. [6]
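The arbitrariness of the cut-off can be made concrete with a small sketch. The GDP figures, the case labels and the two cut-offs below are invented for illustration, and `dichotomize` is a hypothetical helper, not part of any QCA package.

```python
# Hypothetical GDP per capita figures (US dollars), for illustration only.
gdp_per_capita = {"P": 4_000, "Q": 11_000, "R": 14_000, "S": 38_000}


def dichotomize(values, cutoff):
    """Code each case as 1 (high) if its value is at least the cutoff, else 0 (low)."""
    return {case: int(value >= cutoff) for case, value in values.items()}


coding_10k = dichotomize(gdp_per_capita, 10_000)
coding_15k = dichotomize(gdp_per_capita, 15_000)

# Cases whose crisp-set coding depends on which cut-off the researcher picked.
changed = sorted(c for c in gdp_per_capita if coding_10k[c] != coding_15k[c])

print(coding_10k)  # {'P': 0, 'Q': 1, 'R': 1, 'S': 1}
print(coding_15k)  # {'P': 0, 'Q': 0, 'R': 0, 'S': 1}
print(changed)     # ['Q', 'R']
```

Two of the four cases switch between "low" and "high" depending on the threshold, so any inference involving this variable can change with the researcher's choice.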

Statistical methodologists have argued that QCA's strong assumptions render its findings both fragile and prone to type I error. Simon Hug argues that deterministic hypotheses and error-free measures are exceedingly rare in social science and uses Monte Carlo simulations to demonstrate the fragility of QCA results if either assumption is violated. [7] Chris Krogslund, Donghyun Danny Choi, and Mathias Poertner further demonstrate that QCA results are highly sensitive to minor parametric and model-susceptibility changes and are vulnerable to type I error. [8] Bear F. Braumoeller further explores the vulnerability of the QCA family of techniques to both type I error and multiple inference. [9] Braumoeller also offers a formal test of the null hypothesis and demonstrates that even very convincing QCA findings may be the result of chance. [10]

Response to criticisms

QCA can be performed probabilistically or deterministically with observations of categorical variables. For instance, a descriptive inference or implication is supported deterministically by the absence of any counterexample cases; i.e., if a researcher claims that condition X implies condition Y, then, deterministically, there must not exist any counterexample cases having condition X but not condition Y. However, if the researcher wants to claim that condition X is a probabilistic 'predictor' of condition Y in another, similar set of cases, then the proportion of cases conforming to the inference, out of all cases having that same combination of conditions, can be required to meet a threshold value of, for example, 80% or higher. For each prime implicant that QCA outputs via its logical inference reduction process, the "coverage" (the percentage of all observations that exhibit that implication or inference) and the "consistency" (the percentage of observations with that combination of variables that have that particular value of the dependent variable or outcome) are calculated and reported, and can be used as indicators of the strength of such an exploratory probabilistic inference. In real-life complex societal processes, QCA enables the identification of multiple sets of conditions that are consistently associated with a particular output value in order to explore for causal predictors.
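The two measures can be computed as in the following sketch, which follows the definitions as worded in the paragraph above (QCA software may define coverage differently, for example relative to the cases showing the outcome). The function name, the variable names X and Y, and the 20 observations are assumptions made for the example.

```python
def consistency_and_coverage(observations, conditions, outcome):
    """Return (consistency, coverage) for the inference `conditions -> outcome`.

    conditions: dict mapping variable name -> required value
    outcome:    (variable name, value) pair
    """
    out_name, out_value = outcome
    # Cases exhibiting the combination of conditions.
    matching = [
        obs for obs in observations
        if all(obs[name] == value for name, value in conditions.items())
    ]
    # Of those, the cases that also show the outcome.
    conforming = [obs for obs in matching if obs[out_name] == out_value]
    consistency = len(conforming) / len(matching) if matching else float("nan")
    coverage = len(conforming) / len(observations) if observations else float("nan")
    return consistency, coverage


# Made-up data: 20 cases in total, 10 of them with X = 1.
observations = (
    [{"X": 1, "Y": 1}] * 8
    + [{"X": 1, "Y": 0}] * 2
    + [{"X": 0, "Y": 1}] * 4
    + [{"X": 0, "Y": 0}] * 6
)

consistency, coverage = consistency_and_coverage(observations, {"X": 1}, ("Y", 1))
meets_threshold = consistency >= 0.8  # the example threshold from the text

print(consistency, coverage, meets_threshold)
```

With two counterexamples among the ten cases having X = 1, the inference "X implies Y" fails deterministically but meets the 80% threshold for a probabilistic claim.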

Fuzzy set QCA aims to handle variables, such as GDP per capita, where the number of categories (for example, decimal values of monetary units) becomes too large to use mvQCA, or cases where uncertainty, ambiguity, or measurement error in the classification of a case needs to be acknowledged. [11]

Fields of use

QCA is now used in many fields beyond political science, for which Ragin first developed the method. [12] Today the method has been used in:

References

  1. Ragin, Charles C. (2014). The comparative method: moving beyond qualitative and quantitative strategies: with a new introduction. Oakland, California. ISBN 978-0-520-95735-0. OCLC 881322765.
  2. Ragin, Charles C. (1987). The Comparative Method: Moving Beyond Qualitative and Quantitative Strategies. University of California Press. ISBN 9780520058347.
  3. Mattke, Jens; Maier, Christian; Weitzel, Tim; Thatcher, Jason Bennett (2021-01-01). "Qualitative comparative analysis in the information systems discipline: a literature review and methodological recommendations". Internet Research. 31 (5): 1493–1517. doi:10.1108/INTR-09-2020-0529. ISSN 1066-2243. S2CID 235510041.
  4. qualitative comparative analysis - History Of qualitative comparative analysis | Encyclopedia.com: Dictionary Of Sociology
  5. J. Goldthorpe, "Current issues in comparative macrosociology" in Comparative social research, 16, 1997, pp. 1–26.
  6. Rihoux, Benoît (2006), "Qualitative Comparative Analysis (QCA) and Related Systematic Comparative Methods: Recent Advances and Remaining Challenges for Social Science Research", International Sociology, 21 (5): 679, doi:10.1177/0268580906067836, S2CID 14390758
  7. Hug, Simon (2013-04-01). "Qualitative Comparative Analysis: How Inductive Use and Measurement Error Lead to Problematic Inference". Political Analysis. 21 (2): 252–265. doi:10.1093/pan/mps061. ISSN 1047-1987.
  8. Krogslund, Chris; Choi, Donghyun Danny; Poertner, Mathias (2015-01-01). "Fuzzy Sets on Shaky Ground: Parameter Sensitivity and Confirmation Bias in fsQCA". Political Analysis. 23 (1): 21–41. doi:10.1093/pan/mpu016. ISSN 1047-1987.
  9. Braumoeller, Bear F. (2015-07-25). "Guarding Against False Positives in Qualitative Comparative Analysis". Political Analysis. 23 (4): 471–487. doi:10.1093/pan/mpv017. ISSN 1047-1987.
  10. Braumoeller, Bear (2015-05-19). "QCAfalsePositive: Tests for Type I Error in Qualitative Comparative Analysis (QCA)". Comprehensive R Archive Network. R Project. Retrieved 2015-08-26.
  11. Rihoux, Benoît (2013), "QCA, 25 Years after "The Comparative Method": Mapping, Challenges, and Innovations--Mini-Symposium", Political Research Quarterly, 66: 167–235, doi:10.1177/1065912912468269
  12. Roig-Tierno, Norat; Gonzalez-Cruz, Tomas F.; Llopis-Martinez, Jordi (2017-01-01). "An overview of qualitative comparative analysis: A bibliometric analysis". Journal of Innovation & Knowledge. 2 (1): 15–23. doi:10.1016/j.jik.2016.12.002. hdl:10419/190708. ISSN 2444-569X.
  13. Misangyi, Vilmos F.; Greckhamer, Thomas; Furnari, Santi; Fiss, Peer C.; Crilly, Donal; Aguilera, Ruth (2017-01-01). "Embracing Causal Complexity: The Emergence of a Neo-Configurational Perspective". Journal of Management. 43 (1): 255–282. doi:10.1177/0149206316679252. ISSN 0149-2063. S2CID 64878752.
  14. Romme, A.G.L. (1995), Self-organizing Processes in Top Management Teams: A Boolean Comparative Approach. Journal of Business Research 34 (1): 11-34.
  15. Kask and Linton (2013) Business mating: when startups get it right http://www.tandfonline.com/doi/abs/10.1080/08276331.2013.876765#.U0UIwvl_t8E
  16. Lee, Jae-Nam; Park, YoungKi; Straub, Detmar; Koo, Yunmo (2019-12-01). "Holistic Archetypes of IT Outsourcing Strategy: A Contingency Fit and Configurational Approach". Management Information Systems Quarterly. 43 (4): 1201–1225. ISSN 0276-7783.
  17. Invernizzi, Diletta Colette; Locatelli, Giorgio; Brookes, Naomi; Davis, Allison (2020-11-01). "Qualitative comparative analysis as a method for project studies: The case of energy infrastructure". Renewable and Sustainable Energy Reviews. 133: 110314. doi:10.1016/j.rser.2020.110314. hdl:11311/1204820. ISSN 1364-0321. S2CID 224852514. Archived from the original on 14 September 2020.
  18. Olya, Hossein G. T.; Akhshik, Arash (23 January 2018). "Tackling the Complexity of the Pro-environmental Behavior Intentions of Visitors to Turtle Sites". Journal of Travel Research. 58 (2): 313–332. doi:10.1177/0047287517751676. S2CID 158334028.
  19. Sukhov, Alexandre; Sihvonen, Antti; Olsson, Lars E.; Magnusson, Peter R. (2018-11-28). "That Makes Sense to Me: Openness to Change and Sensemaking in Idea Screening". International Journal of Innovation Management. 22 (8): 1840009. doi:10.1142/s1363919618400091. ISSN 1363-9196.
  20. Aşkun, Volkan; Çi̇zel, Rabia; Çi̇zel, Beykan (2021-08-01). "Ülkelerin İnovasyon Düzeyinin Sosyal Sermaye, Ekonomik Değer Algısı ve Politik Kültürle Karmaşık İlişkisi: Bulanık Küme Nitel Karşılaştırmalı Analiz". Eskişehir Osmangazi Üniversitesi İktisadi ve İdari Bilimler Dergisi (in Turkish). 16 (2): 317–340. doi:10.17153/oguiibf.895910. ISSN 1306-6730. S2CID 238790554.
  21. Linton, Gabriel; Kask, Johan (2017). "Configurations of entrepreneurial orientation and competitive strategy for high performance". Journal of Business Research. 70: 168–176. doi:10.1016/j.jbusres.2016.08.022. ISSN 0148-2963.
  22. Stevenson 2013. "Does Technology have an Impact on Learning? A Fuzzy Set Analysis of Historical Data on the Role of Digital Repertoires in Shaping the Outcomes of Classroom Pedagogy." Computers & Education 69 (0): 148-58.
  23. Basurto, X. (2013), "Linking Multi-Level Governance to Local Common-Pool Resource Theory using Fuzzy-Set Qualitative Comparative Analysis: Insights from Twenty Years of Biodiversity Conservation in Costa Rica." Global Environmental Change 23 (3): 573-87.
  24. Blackman, T. (2013), "Exploring Explanations for Local Reductions in Teenage Pregnancy Rates in England: An Approach Using Qualitative Comparative Analysis." Social Policy and Society 12 (1): 61-72.
  25. Johansson, Tobias; Kask, Johan (2017). "Configurations of business strategy and marketing channels for e-commerce and traditional retail formats: A Qualitative Comparison Analysis (QCA) in sporting goods retailing". Journal of Retailing and Consumer Services. 34: 326–333. doi:10.1016/j.jretconser.2016.07.009. ISSN 0969-6989.
  26. Olya, Hossein GT; Mehran, Javaneh (2017-06-01). "Modelling tourism expenditure using complexity theory". Journal of Business Research. 75: 147–158. doi:10.1016/j.jbusres.2017.02.015.
  27. Olya, Hossein G. T.; Altinay, Levent (2016). "Asymmetric modeling of intention to purchase tourism weather insurance and loyalty". Journal of Business Research. 69 (8): 2791–2800. doi:10.1016/j.jbusres.2015.11.015.
  28. Olya, Hossein; Gavilyan, Yaqub (2017). "Configurational Models to Predict Residents' Support for Tourism Development". Journal of Travel Research. 56 (7): 893–912. doi:10.1177/0047287516667850. S2CID 157763812.
  29. Çi̇zel, Beykan; Kirtil, İsmail Gökay; Çi̇zel, Rabia; Aşkun, Volkan (2021-12-01). "Covid-19 Pandemisi Sürecinde Seyahatten Kaçınma Niyetinin Karmaşıklık Bakış Açısı ile Analizi". Anatolia: Turizm Araştırmaları Dergisi (in Turkish). 32 (2): 212–223. doi:10.17123/atad.896981. ISSN 1300-4220. S2CID 239749559.
  30. Bara C. Incentives and opportunities: A complexity-oriented explanation of violent ethnic conflict. Journal of Peace Research. 2014;51(6):696-710. doi:10.1177/0022343314534458
  31. Binder M. Paths to intervention: What explains the UN’s selective response to humanitarian crises? Journal of Peace Research. 2015;52(6):712-726. doi:10.1177/0022343315585847
  32. Schneider, C.Q., Maerz, S.F. Legitimation, cooptation, and repression and the survival of electoral autocracies. Z Vgl Polit Wiss 11, 213–235 (2017). https://doi.org/10.1007/s12286-017-0332-2
