Generalized randomized block design

Last updated

In randomized statistical experiments, generalized randomized block designs (GRBDs) are used to study the interaction between blocks and treatments. For a GRBD, each treatment is replicated at least two times in each block; this replication allows the estimation and testing of an interaction term in the linear model (without making parametric assumptions about a normal distribution for the error). [1]

Contents

Univariate response

GRBDs versus RCBDs: Replication and interaction

Like a randomized complete block design (RCBD), a GRBD is randomized. Within each block, treatments are randomly assigned to experimental units: this randomization is also independent between blocks. In a (classic) RCBD, however, there is no replication of treatments within blocks. [2]

Two-way linear model: Blocks and treatments

The experimental design guides the formulation of an appropriate linear model. Without replication, the (classic) RCBD has a two-way linear-model with treatment- and block-effects but without a block-treatment interaction. Without replicates, this two-way linear-model that may be estimated and tested without making parametric assumptions (by using the randomization distribution, without using a normal distribution for the error). [3] In the RCBD, the block-treatment interaction cannot be estimated using the randomization distribution; a fortiori there exists no "valid" (i.e. randomization-based) test for the block-treatment interaction in the analysis of variance (anova) of the RCBD. [4]

The distinction between RCBDs and GRBDs has been ignored by some authors, and the ignorance regarding the GRCBD has been criticized by statisticians like Oscar Kempthorne and Sidney Addelman. [5] The GRBD has the advantage that replication allows block-treatment interaction to be studied. [6]

GRBDs when block-treatment interaction lacks interest

However, if block-treatment interaction is known to be negligible, then the experimental protocol may specify that the interaction terms be assumed to be zero and that their degrees of freedom be used for the error term. [7] GRBD designs for models without interaction terms offer more degrees of freedom for testing treatment-effects than do RCBs with more blocks: An experimenter wanting to increase power may use a GRBD rather than RCB with additional blocks, when extra blocks-effects would lack genuine interest.

Multivariate analysis

The GRBD has a real-number response. For vector responses, multivariate analysis considers similar two-way models with main effects and with interactions or errors. Without replicates, error terms are confounded with interaction, and only error is estimated. With replicates, interaction can be tested with the multivariate analysis of variance and coefficients in the linear model can be estimated without bias and with minimum variance (by using the least-squares method). [8] [9]

Functional models for block-treatment interactions: Testing known forms of interaction

Non-replicated experiments are used by knowledgeable experimentalists when replications have prohibitive costs. When the block-design lacks replicates, interactions have been modeled. For example, Tukey's F-test for interaction (non-additivity) has been motivated by the multiplicative model of Mandel (1961); this model assumes that all treatment-block interactions are proportion to the product of the mean treatment-effect and the mean block-effect, where the proportionality constant is identical for all treatment-block combinations. Tukey's test is valid when Mandel's multiplicative model holds and when the errors independently follow a normal distribution.

Tukey's F-statistic for testing interaction has a distribution based on the randomized assignment of treatments to experimental units. When Mandel's multiplicative model holds, the F-statistics randomization distribution is closely approximated by the distribution of the F-statistic assuming a normal distribution for the error, according to the 1975 paper of Robinson. [10]

The rejection of multiplicative interaction need not imply the rejection of non-multiplicative interaction, because there are many forms of interaction. [11] [12]

Generalizing earlier models for Tukey's test are the “bundle-of-straight lines” model of Mandel (1959) [13] and the functional model of Milliken and Graybill (1970), which assumes that the interaction is a known function of the block and treatment main-effects. Other methods and heuristics for block-treatment interaction in unreplicated studies are surveyed in the monograph Milliken & Johnson (1989).

See also

Notes

    • Wilk, page 79.
    • Lentner and Biship, page 223.
    • Addelman (1969) page 35.
    • Hinkelmann and Kempthorne, page 314, for example; c.f. page 312.
    • Wilk, page 79.
    • Addelman (1969) page 35.
    • Hinkelmann and Kempthorne, page 314.
    • Lentner and Bishop, page 223.
    • Wilk, page 79.
    • Addelman (1969) page 35.
    • Lentner and Bishop, page 223.
    A more detailed treatment occurs in Chapter 9.7 in Hinkelmann and Kempthorne. (Hinkelmann and Kempthorne do discuss block-treatment interaction for more complicated blocking structures, like crossed-blocking factors in Chapter 9.6, and for forms of "non-additivity" that may be removed by transformations).
  1. Wilk, Addelman, Hinkelmann and Kempthorne.
    • Complaints about the neglect of GRBDs in the literature and ignorance among practitioners are stated by Addelman (1969) page 35.
    • Wilk, page 79.
    • Addelman (1969) page 35.
    • Lentner and Bishop, page 223.
    • Addelman (1970) page 1104.
    If the scientists do not know that the block-treatment interaction is zero, Addelman requires that the generalized randomized block design be used, because otherwise the block-treatment interaction and the error are confounded. In this situation, where scientists are uncertain whether the block-treatment interaction is zero, Hinkelmann and Kempthorne recommend that the generalized randomized block design be used "if at all possible" (page 312).
  2. Johnson & Wichern (2002 , p. 312, “Multivariate two-way fixed-effects model with interaction”, in “6.6 Two-way multivariate analysis of variance”, p. 307–317)
  3. Mardia, Kent & Bibby (1979 , p. 352, “Tests for interactions”, in 12.7 Two-way classification, p. 350-356)
  4. Hinkelmann & Kempthorne (2008 , p. 305)
  5. Milliken & Johnson (1989 , 1.6 Tukey's single degree-of-freedom test for nonadditivity, pp. 7-8)
  6. Lentner & Bishop (1993 , p. 214, in 6.8 Nonadditivity of blocks and treatments, pp. 213–216)
  7. Milliken & Johnson (1989 , 1.8 Mandel's bundle-of-straight lines model, pp. 17-29)

Related Research Articles

Analysis of variance (ANOVA) is a collection of statistical models and their associated estimation procedures used to analyze the differences among means. ANOVA was developed by the statistician Ronald Fisher. ANOVA is based on the law of total variance, where the observed variance in a particular variable is partitioned into components attributable to different sources of variation. In its simplest form, ANOVA provides a statistical test of whether two or more population means are equal, and therefore generalizes the t-test beyond two means. In other words, the ANOVA is used to test the difference between two or more means.

<span class="mw-page-title-main">Design of experiments</span> Design of tasks

The design of experiments, also known as experiment design or experimental design, is the design of any task that aims to describe and explain the variation of information under conditions that are hypothesized to reflect the variation. The term is generally associated with experiments in which the design introduces conditions that directly affect the variation, but may also refer to the design of quasi-experiments, in which natural conditions that influence the variation are selected for observation.

<span class="mw-page-title-main">Statistical inference</span> Process of using data analysis

Statistical inference is the process of using data analysis to infer properties of an underlying distribution of probability. Inferential statistical analysis infers properties of a population, for example by testing hypotheses and deriving estimates. It is assumed that the observed data set is sampled from a larger population.

The following outline is provided as an overview of and topical guide to statistics:

<span class="mw-page-title-main">Experiment</span> Scientific procedure performed to validate a hypothesis

An experiment is a procedure carried out to support or refute a hypothesis, or determine the efficacy or likelihood of something previously untried. Experiments provide insight into cause-and-effect by demonstrating what outcome occurs when a particular factor is manipulated. Experiments vary greatly in goal and scale but always rely on repeatable procedure and logical analysis of the results. There also exist natural experimental studies.

<span class="mw-page-title-main">Interaction (statistics)</span> Statistical term

In statistics, an interaction may arise when considering the relationship among three or more variables, and describes a situation in which the effect of one causal variable on an outcome depends on the state of a second causal variable. Although commonly thought of in terms of causal relationships, the concept of an interaction can also describe non-causal associations. Interactions are often considered in the context of regression analyses or factorial experiments.

In the statistical theory of the design of experiments, blocking is the arranging of experimental units that are similar to one another in groups (blocks) based on one or more variables. These variables are chosen carefully to minimize the impact of their variability on the observed outcomes. There are different ways that blocking can be implemented, resulting in different confounding effects. However, the different methods share the same purpose: to control variability introduced by specific factors that could influence the outcome of an experiment. The roots of blocking originated from the statistician, Ronald Fisher, following his development of ANOVA.

Random assignment or random placement is an experimental technique for assigning human participants or animal subjects to different groups in an experiment using randomization, such as by a chance procedure or a random number generator. This ensures that each participant or subject has an equal chance of being placed in any group. Random assignment of participants helps to ensure that any differences between and within the groups are not systematic at the outset of the experiment. Thus, any differences between groups recorded at the end of the experiment can be more confidently attributed to the experimental procedures or treatment.

<span class="mw-page-title-main">Randomized experiment</span> Experiment using randomness in some aspect, usually to aid in removal of bias

In science, randomized experiments are the experiments that allow the greatest reliability and validity of statistical estimates of treatment effects. Randomization-based inference is especially important in experimental design and in survey sampling.

<span class="mw-page-title-main">Pseudoreplication</span>

Pseudoreplication has many definitions. Pseudoreplication was originally defined in 1984 by Stuart H. Hurlbert as the use of inferential statistics to test for treatment effects with data from experiments where either treatments are not replicated or replicates are not statistically independent. Subsequently, Millar and Anderson identified it as a special case of inadequate specification of random factors where both random and fixed factors are present. It is sometimes narrowly interpreted as an inflation of the number of samples or replicates which are not statistically independent. This definition omits the confounding of unit and treatment effects in a misspecified F-ratio. In practice, incorrect F-ratios for statistical tests of fixed effects often arise from a default F-ratio that is formed over the error rather the mixed term.

Repeated measures design is a research design that involves multiple measures of the same variable taken on the same or matched subjects either under different conditions or over two or more time periods. For instance, repeated measurements are collected in a longitudinal study in which change over time is assessed.

Multivariate analysis of covariance (MANCOVA) is an extension of analysis of covariance (ANCOVA) methods to cover cases where there is more than one dependent variable and where the control of concomitant continuous independent variables – covariates – is required. The most prominent benefit of the MANCOVA design over the simple MANOVA is the 'factoring out' of noise or error that has been introduced by the covariant. A commonly used multivariate version of the ANOVA F-statistic is Wilks' Lambda (Λ), which represents the ratio between the error variance and the effect variance.

In statistics, restricted randomization occurs in the design of experiments and in particular in the context of randomized experiments and randomized controlled trials. Restricted randomization allows intuitively poor allocations of treatments to experimental units to be avoided, while retaining the theoretical benefits of randomization. For example, in a clinical trial of a new proposed treatment of obesity compared to a control, an experimenter would want to avoid outcomes of the randomization in which the new treatment was allocated only to the heaviest patients.

In the design of experiments, completely randomized designs are for studying the effects of one primary factor without the need to take other nuisance variables into account. This article describes completely randomized designs that have one primary factor. The experiment compares the values of a response variable based on the different levels of that primary factor. For completely randomized designs, the levels of the primary factor are randomly assigned to the experimental units.

A glossary of terms used in experimental research.

In statistics, a mixed-design analysis of variance model, also known as a split-plot ANOVA, is used to test for differences between two or more independent groups whilst subjecting participants to repeated measures. Thus, in a mixed-design ANOVA model, one factor is a between-subjects variable and the other is a within-subjects variable. Thus, overall, the model is a type of mixed-effects model.

Oscar Kempthorne was a British statistician and geneticist known for his research on randomization-analysis and the design of experiments, which had wide influence on research in agriculture, genetics, and other areas of science.

In statistics, the two-way analysis of variance (ANOVA) is an extension of the one-way ANOVA that examines the influence of two different categorical independent variables on one continuous dependent variable. The two-way ANOVA not only aims at assessing the main effect of each independent variable but also if there is any interaction between them.

References