CUSUM

CUSUM chart
Originally proposed by: E. S. Page

Process observations
Rational subgroup size: n = 1
Measurement type: Cumulative sum of a quality characteristic
Quality characteristic type: Variables data
Underlying distribution: Normal distribution

Performance
Size of shift to detect: ≤ 1.5σ

Process variation chart
Not applicable

Process mean chart
Center line: The target value, T, of the quality characteristic
Upper control limit: $S_i^+ = \max\left[0,\ S_{i-1}^+ + \bar{x}_i - (T + K)\right]$
Lower control limit: $S_i^- = \max\left[0,\ S_{i-1}^- + (T - K) - \bar{x}_i\right]$
Plotted statistic: $S_i = \sum_{j=1}^{i} (\bar{x}_j - T)$
(the standard tabular CUSUM recursions, with reference value K; an out-of-control signal is raised when $S_i^+$ or $S_i^-$ exceeds the decision interval H)

In statistical quality control, the CUSUM (or cumulative sum control chart) is a sequential analysis technique developed by E. S. Page of the University of Cambridge. It is typically used for monitoring change detection. [1] CUSUM was announced in Biometrika in 1954, a few years after the publication of Wald's sequential probability ratio test (SPRT). [2]


E. S. Page referred to a "quality number" $\theta$, by which he meant a parameter of the probability distribution; for example, the mean. He devised CUSUM as a method to determine changes in it, and proposed a criterion for deciding when to take corrective action. When the CUSUM method is applied to changes in mean, it can be used for step detection of a time series.

A few years later, George Alfred Barnard developed a visualization method, the V-mask chart, to detect both increases and decreases in $\theta$. [3]

Method

As its name implies, CUSUM involves the calculation of a cumulative sum (which is what makes it "sequential"). Samples from a process $x_n$ are assigned weights $\omega_n$, and summed as follows:

$$S_0 = 0$$
$$S_{n+1} = \max\left(0,\ S_n + x_n - \omega_n\right)$$

When the value of $S$ exceeds a certain threshold value, a change has been detected. The above formula detects only changes in the positive direction. When negative changes need to be found as well, the min operation should be used instead of the max operation, and a change is detected when the value of $S$ falls below the (negative) value of the threshold.
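
As a minimal illustration of this recursion, here is a sketch in Python; the names are illustrative rather than taken from Page's paper: `w` plays the role of the weights $\omega_n$ (a constant drift here) and `threshold` is the decision threshold.

    def cusum(samples, w, threshold):
        """Two-sided CUSUM: return the index of the first detected change, or None."""
        s_hi, s_lo = 0.0, 0.0
        for i, x in enumerate(samples):
            s_hi = max(0.0, s_hi + x - w)  # Page's recursion: S = max(0, S + x - w)
            s_lo = min(0.0, s_lo + x + w)  # mirrored min recursion for negative shifts
            if s_hi > threshold or s_lo < -threshold:
                return i
        return None

    # Example: mean-zero noise followed by an upward shift of about one unit.
    data = [0.1, -0.2, 0.0, 0.3, -0.1, 1.2, 0.9, 1.1, 1.3, 1.0]
    print(cusum(data, w=0.5, threshold=2.0))  # prints 8, inside the shifted segment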

Page did not explicitly say that $\omega$ represents the likelihood function, but this is common usage.

Note that this differs from the SPRT by always using zero as the lower "holding barrier" rather than a separate negative lower barrier. [1] Also, CUSUM does not require the use of the likelihood function.

As a means of assessing CUSUM's performance, Page defined the average run length (A.R.L.) metric, "the expected number of articles sampled before action is taken". He further wrote: [2]

When the quality of the output is satisfactory the A.R.L. is a measure of the expense incurred by the scheme when it gives false alarms, i.e., Type I errors (Neyman & Pearson, 1936 [4] ). On the other hand, for constant poor quality the A.R.L. measures the delay and thus the amount of scrap produced before the rectifying action is taken, i.e., Type II errors.
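
The A.R.L. rarely has a simple closed form, so in practice it is often estimated by simulation. The following is a minimal Monte Carlo sketch, with illustrative names and standard normal in-control observations assumed, that estimates the in-control A.R.L. of the one-sided scheme above, i.e. the expected number of samples before a false alarm:

    import random

    def run_length(w, threshold, max_steps=100_000):
        """Samples drawn until a one-sided CUSUM on N(0, 1) noise raises a (false) alarm."""
        s = 0.0
        for n in range(1, max_steps + 1):
            s = max(0.0, s + random.gauss(0.0, 1.0) - w)
            if s > threshold:
                return n
        return max_steps  # censored run

    def estimate_arl(w, threshold, trials=1_000):
        return sum(run_length(w, threshold) for _ in range(trials)) / trials

    print(estimate_arl(w=0.5, threshold=4.0))  # larger w or threshold -> longer in-control A.R.L.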

Example

The following example shows 20 observations of a process with a mean of 0 and a standard deviation of 0.5.

From the $z_i$ column, it can be seen that the process never deviates by 3 standard deviations ($|z_i| < 3$), so simply alerting on a high deviation will not detect a failure, whereas CUSUM shows that the $S_{hi}$ value exceeds 4 at the 17th observation.

Column: Description
$x_i$: The observations of the process, with an expected mean of 0 and an expected standard deviation of 0.5
$z_i$: The normalized observations, i.e. centered around the mean and scaled by the standard deviation: $z_i = (x_i - \mu)/\sigma$
$S_{hi}$: The high CUSUM value, detecting a positive anomaly: $S_{hi} = \max\left(0,\ S_{h,i-1} + z_i - k\right)$
$S_{lo}$: The low CUSUM value, detecting a negative anomaly: $S_{lo} = \max\left(0,\ S_{l,i-1} - z_i - k\right)$

where $k$ is a critical level parameter (tunable, like the threshold $T$) that is used to adjust the sensitivity of change detection: a larger $k$ makes CUSUM less sensitive to the change, and vice versa.
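
A short sketch of how the table's columns can be computed (the observations below are illustrative, not the article's 20 values; $\mu = 0$, $\sigma = 0.5$, and $k$ as defined above):

    def cusum_columns(xs, mu=0.0, sigma=0.5, k=0.5):
        """Compute the z, S_hi and S_lo columns for a sequence of observations."""
        rows, s_hi, s_lo = [], 0.0, 0.0
        for x in xs:
            z = (x - mu) / sigma           # normalized observation
            s_hi = max(0.0, s_hi + z - k)  # high CUSUM, positive anomalies
            s_lo = max(0.0, s_lo - z - k)  # low CUSUM, negative anomalies
            rows.append((x, z, s_hi, s_lo))
        return rows

    for x, z, s_hi, s_lo in cusum_columns([0.2, -0.1, 0.6, 0.7, 0.8]):
        print(f"{x:5.2f} {z:6.2f} {s_hi:6.2f} {s_lo:6.2f}")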

[Figures: example CUSUM charts, cusum1.png and cusum2.png]

Variants

Cumulative observed-minus-expected plots [1] are a related method.
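
As a sketch of that idea (an illustrative example; in practice the expected values come from a risk model or a target rate), the running sum of observed minus expected outcomes is plotted directly, without resetting at zero:

    from itertools import accumulate

    def observed_minus_expected(observed, expected):
        """Cumulative observed-minus-expected series: C_n = sum of (o_i - e_i) for i <= n."""
        return list(accumulate(o - e for o, e in zip(observed, expected)))

    # Example: binary outcomes against a constant expected rate of 0.1.
    print(observed_minus_expected([0, 0, 1, 1, 1], [0.1] * 5))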

Related Research Articles

<span class="mw-page-title-main">Probability distribution</span> Mathematical function for the probability a given outcome occurs in an experiment

In probability theory and statistics, a probability distribution is the mathematical function that gives the probabilities of occurrence of different possible outcomes for an experiment. It is a mathematical description of a random phenomenon in terms of its sample space and the probabilities of events.

<span class="mw-page-title-main">Random variable</span> Variable representing a random phenomenon

A random variable is a mathematical formalization of a quantity or object which depends on random events. The term 'random variable' can be misleading as its mathematical definition is not actually random nor a variable, but rather it is a function from possible outcomes in a sample space to a measurable space, often to the real numbers.

<span class="mw-page-title-main">Spearman's rank correlation coefficient</span> Nonparametric measure of rank correlation

In statistics, Spearman's rank correlation coefficient or Spearman's ρ, named after Charles Spearman and often denoted by the Greek letter (rho) or as , is a nonparametric measure of rank correlation. It assesses how well the relationship between two variables can be described using a monotonic function.

<span class="mw-page-title-main">Control chart</span> Process control tool to determine if a manufacturing process is in a state of control

Control charts are graphical plots used in production control to determine whether quality and manufacturing processes are being controlled under stable conditions. The hourly status is arranged on the graph, and the occurrence of abnormalities is judged based on the presence of data that differs from the conventional trend or deviates from the control limit line. Control charts are classified into Shewhart individuals control chart and CUSUM(CUsUM)(or cumulative sum control chart)(ISO 7870-4).

In signal processing, a finite impulse response (FIR) filter is a filter whose impulse response is of finite duration, because it settles to zero in finite time. This is in contrast to infinite impulse response (IIR) filters, which may have internal feedback and may continue to respond indefinitely.

Importance sampling is a Monte Carlo method for evaluating properties of a particular distribution, while only having samples generated from a different distribution than the distribution of interest. Its introduction in statistics is generally attributed to a paper by Teun Kloek and Herman K. van Dijk in 1978, but its precursors can be found in statistical physics as early as 1949. Importance sampling is also related to umbrella sampling in computational physics. Depending on the application, the term may refer to the process of sampling from this alternative distribution, the process of inference, or both.

<span class="mw-page-title-main">Otsu's method</span> In computer vision and image processing

In computer vision and image processing, Otsu's method, named after Nobuyuki Otsu, is used to perform automatic image thresholding. In the simplest form, the algorithm returns a single intensity threshold that separate pixels into two classes, foreground and background. This threshold is determined by minimizing intra-class intensity variance, or equivalently, by maximizing inter-class variance. Otsu's method is a one-dimensional discrete analogue of Fisher's Discriminant Analysis, is related to Jenks optimization method, and is equivalent to a globally optimal k-means performed on the intensity histogram. The extension to multi-level thresholding was described in the original paper, and computationally efficient implementations have since been proposed.

Robust statistics are statistics with good performance for data drawn from a wide range of probability distributions, especially for distributions that are not normal. Robust statistical methods have been developed for many common problems, such as estimating location, scale, and regression parameters. One motivation is to produce statistical methods that are not unduly affected by outliers. Another motivation is to provide methods with good performance when there are small departures from a parametric distribution. For example, robust methods work well for mixtures of two normal distributions with different standard deviations; under this model, non-robust methods like a t-test work poorly.

<span class="mw-page-title-main">Scoring rule</span> Measure for evaluating probabilistic forecasts

In decision theory, a scoring rule provides a summary measure for the evaluation of probabilistic predictions or forecasts. It is applicable to tasks in which predictions assign probabilities to events, i.e. one issues a probability distribution as prediction. This includes probabilistic classification of a set of mutually exclusive outcomes or classes.

In statistics the Cramér–von Mises criterion is a criterion used for judging the goodness of fit of a cumulative distribution function compared to a given empirical distribution function , or for comparing two empirical distributions. It is also used as a part of other algorithms, such as minimum distance estimation. It is defined as

Fiducial inference is one of a number of different types of statistical inference. These are rules, intended for general application, by which conclusions can be drawn from samples of data. In modern statistical practice, attempts to work with fiducial inference have fallen out of fashion in favour of frequentist inference, Bayesian inference and decision theory. However, fiducial inference is important in the history of statistics since its development led to the parallel development of concepts and tools in theoretical statistics that are widely used. Some current research in statistical methodology is either explicitly linked to fiducial inference or is closely connected to it.

The sequential probability ratio test (SPRT) is a specific sequential hypothesis test, developed by Abraham Wald and later proven to be optimal by Wald and Jacob Wolfowitz. Neyman and Pearson's 1933 result inspired Wald to reformulate it as a sequential analysis problem. The Neyman-Pearson lemma, by contrast, offers a rule of thumb for when all the data is collected.

The topological derivative is, conceptually, a derivative of a shape functional with respect to infinitesimal changes in its topology, such as adding an infinitesimal hole or crack. When used in higher dimensions than one, the term topological gradient is also used to name the first-order term of the topological asymptotic expansion, dealing only with infinitesimal singular domain perturbations. It has applications in shape optimization, topology optimization, image processing and mechanical modeling.

The combination of quality control and genetic algorithms led to novel solutions of complex quality control design and optimization problems. Quality is the degree to which a set of inherent characteristics of an entity fulfils a need or expectation that is stated, general implied or obligatory. ISO 9000 defines quality control as "A part of quality management focused on fulfilling quality requirements". Genetic algorithms are search algorithms, based on the mechanics of natural selection and natural genetics.

<span class="mw-page-title-main">Skew normal distribution</span> Probability distribution

In probability theory and statistics, the skew normal distribution is a continuous probability distribution that generalises the normal distribution to allow for non-zero skewness.

<span class="mw-page-title-main">German tank problem</span> Mathematical problem

In the statistical theory of estimation, the German tank problem consists of estimating the maximum of a discrete uniform distribution from sampling without replacement. In simple terms, suppose there exists an unknown number of items which are sequentially numbered from 1 to N. A random sample of these items is taken and their sequence numbers observed; the problem is to estimate N from these observed numbers.

<span class="mw-page-title-main">Generalized chi-squared distribution</span>

In probability theory and statistics, the generalized chi-squared distribution is the distribution of a quadratic form of a multinormal variable, or a linear combination of different normal variables and squares of normal variables. Equivalently, it is also a linear sum of independent noncentral chi-square variables and a normal variable. There are several other such generalizations for which the same term is sometimes used; some of them are special cases of the family discussed here, for example the gamma distribution.

<span class="mw-page-title-main">Step detection</span>

In statistics and signal processing, step detection is the process of finding abrupt changes in the mean level of a time series or signal. It is usually considered as a special case of the statistical method known as change detection or change point detection. Often, the step is small and the time series is corrupted by some kind of noise, and this makes the problem challenging because the step may be hidden by the noise. Therefore, statistical and/or signal processing algorithms are often required.

<span class="mw-page-title-main">Foreground detection</span>

Foreground detection is one of the major tasks in the field of computer vision and image processing whose aim is to detect changes in image sequences. Background subtraction is any technique which allows an image's foreground to be extracted for further processing.

<span class="mw-page-title-main">Event detection for WSN</span>

Wireless sensor networks (WSN) are a spatially distributed network of autonomous sensors used for monitoring an environment. Energy cost is a major limitation for WSN requiring the need for energy efficient networks and processing. One of major energy costs in WSN is the energy spent on communication between nodes and it is sometimes desirable to only send data to a gateway node when an event of interest is triggered at a sensor. Sensors will then only open communication during a probable event, saving on communication costs. Fields interested in this type of network include surveillance, home automation, disaster relief, traffic control, health care and more.

References

  1. 1 2 3 Grigg; Farewell, VT; Spiegelhalter, DJ; et al. (2003). "The Use of Risk-Adjusted CUSUM and RSPRT Charts for Monitoring in Medical Contexts". Statistical Methods in Medical Research. 12 (2): 147–170. doi:10.1177/096228020301200205. PMID   12665208.
  2. 1 2 Page, E. S. (June 1954). "Continuous Inspection Scheme". Biometrika. 41 (1/2): 100–115. doi:10.1093/biomet/41.1-2.100. hdl: 10338.dmlcz/135207 . JSTOR   2333009.
  3. Barnard, G.A. (1959). "Control charts and stochastic processes". Journal of the Royal Statistical Society . B (Methodological) (21, number 2): 239–71. JSTOR   2983801.
  4. "Sufficient statistics and uniformly most powerful tests of statistical hypotheses". Statistical Research Memoirs . I: 113–137.
