Heavy traffic approximation

Last updated

In queueing theory, a discipline within the mathematical theory of probability, a heavy traffic approximation (sometimes heavy traffic limit theorem [1] or diffusion approximation) is the matching of a queueing model with a diffusion process under some limiting conditions on the model's parameters. The first such result was published by John Kingman who showed that when the utilisation parameter of an M/M/1 queue is near 1 a scaled version of the queue length process can be accurately approximated by a reflected Brownian motion. [2]

Contents

Heavy traffic condition

Heavy traffic approximations are typically stated for the process X(t) describing the number of customers in the system at time t. They are arrived at by considering the model under the limiting values of some model parameters and therefore for the result to be finite the model must be rescaled by a factor n, denoted [3] :490

and the limit of this process is considered as n  ∞.

There are three classes of regime under which such approximations are generally considered.

  1. The number of servers is fixed and the traffic intensity (utilization) is increased to 1 (from below). The queue length approximation is a reflected Brownian motion. [4] [5] [6]
  2. Traffic intensity is fixed and the number of servers and arrival rate are increased to infinity. Here the queue length limit converges to the normal distribution. [7] [8] [9]
  3. A quantity β is fixed where
with ρ representing the traffic intensity and s the number of servers. Traffic intensity and the number of servers are increased to infinity and the limiting process is a hybrid of the above results. This case, first published by Halfin and Whitt is often known as the Halfin–Whitt regime [1] [10] [11] or quality-and-efficiency-driven (QED) regime. [12]

Results for a G/G/1 queue

Theorem 1. [13] Consider a sequence of G/G/1 queues indexed by .
For queue let denote the random inter-arrival time, denote the random service time; let denote the traffic intensity with and ; let denote the waiting time in queue for a customer in steady state; Let and

Suppose that , , and . then

provided that:

(a)

(b) for some , and are both less than some constant for all .

Heuristic argument

Let be the difference between the nth service time and the nth inter-arrival time; Let be the waiting time in queue of the nth customer;

Then by definition:

After recursive calculation, we have:

Let , with are i.i.d; Define and ;

Then we have

we get by taking limit over .

Thus the waiting time in queue of the nth customer is the supremum of a random walk with a negative drift.

Random walk can be approximated by a Brownian motion when the jump sizes approach 0 and the times between the jump approach 0.

We have and has independent and stationary increments. When the traffic intensity approaches 1 and tends to , we have after replaced with continuous value according to functional central limit theorem. [14] :110 Thus the waiting time in queue of the th customer can be approximated by the supremum of a Brownian motion with a negative drift.

Theorem 2. [15] :130 Let be a Brownian motion with drift and standard deviation starting at the origin, and let

if

otherwise

Conclusion

under heavy traffic condition

Thus, the heavy traffic limit theorem (Theorem 1) is heuristically argued. Formal proofs usually follow a different approach which involve characteristic functions. [4] [16]

Example

Consider an M/G/1 queue with arrival rate , the mean of the service time , and the variance of the service time . What is average waiting time in queue in the steady state?

The exact average waiting time in queue in steady state is given by:

The corresponding heavy traffic approximation:

The relative error of the heavy traffic approximation:

Thus when , we have :

Related Research Articles

Queueing theory Mathematical study of waiting lines, or queues

Queueing theory is the mathematical study of waiting lines, or queues. A queueing model is constructed so that queue lengths and waiting time can be predicted. Queueing theory is generally considered a branch of operations research because the results are often used when making business decisions about the resources needed to provide a service.

In the theory of stochastic processes, the Karhunen–Loève theorem, also known as the Kosambi–Karhunen–Loève theorem is a representation of a stochastic process as an infinite linear combination of orthogonal functions, analogous to a Fourier series representation of a function on a bounded interval. The transformation is also known as Hotelling transform and eigenvector transform, and is closely related to principal component analysis (PCA) technique widely used in image processing and in data analysis in many fields.

Yule–Simon distribution

In probability and statistics, the Yule–Simon distribution is a discrete probability distribution named after Udny Yule and Herbert A. Simon. Simon originally called it the Yule distribution.

Variational Bayesian methods are a family of techniques for approximating intractable integrals arising in Bayesian inference and machine learning. They are typically used in complex statistical models consisting of observed variables as well as unknown parameters and latent variables, with various sorts of relationships among the three types of random variables, as might be described by a graphical model. As typical in Bayesian inference, the parameters and latent variables are grouped together as "unobserved variables". Variational Bayesian methods are primarily used for two purposes:

  1. To provide an analytical approximation to the posterior probability of the unobserved variables, in order to do statistical inference over these variables.
  2. To derive a lower bound for the marginal likelihood of the observed data. This is typically used for performing model selection, the general idea being that a higher marginal likelihood for a given model indicates a better fit of the data by that model and hence a greater probability that the model in question was the one that generated the data.

The birth–death process is a special case of continuous-time Markov process where the state transitions are of only two types: "births", which increase the state variable by one and "deaths", which decrease the state by one. The model's name comes from a common application, the use of such models to represent the current size of a population where the transitions are literal births and deaths. Birth–death processes have many applications in demography, queueing theory, performance engineering, epidemiology, biology and other areas. They may be used, for example, to study the evolution of bacteria, the number of people with a disease within a population, or the number of customers in line at the supermarket.

Fresnel diffraction Diffraction

In optics, the Fresnel diffraction equation for near-field diffraction is an approximation of the Kirchhoff–Fresnel diffraction that can be applied to the propagation of waves in the near field. It is used to calculate the diffraction pattern created by waves passing through an aperture or around an object, when viewed from relatively close to the object. In contrast the diffraction pattern in the far field region is given by the Fraunhofer diffraction equation.

In queueing theory, a discipline within the mathematical theory of probability, a Jackson network is a class of queueing network where the equilibrium distribution is particularly simple to compute as the network has a product-form solution. It was the first significant development in the theory of networks of queues, and generalising and applying the ideas of the theorem to search for similar product-form solutions in other networks has been the subject of much research, including ideas used in the development of the Internet. The networks were first identified by James R. Jackson and his paper was re-printed in the journal Management Science’s ‘Ten Most Influential Titles of Management Sciences First Fifty Years.’

A ratio distribution is a probability distribution constructed as the distribution of the ratio of random variables having two other known distributions. Given two random variables X and Y, the distribution of the random variable Z that is formed as the ratio Z = X/Y is a ratio distribution.

In queueing theory, a discipline within the mathematical theory of probability, a G-network is an open network of G-queues first introduced by Erol Gelenbe as a model for queueing systems with specific control functions, such as traffic re-routing or traffic destruction, as well as a model for neural networks. A G-queue is a network of queues with several types of novel and useful customers:

In queueing theory, a discipline within the mathematical theory of probability, the Pollaczek–Khinchine formula states a relationship between the queue length and service time distribution Laplace transforms for an M/G/1 queue. The term is also used to refer to the relationships between the mean queue length and mean waiting/service time in such a model.

M/M/1 queue Queue with Markov (Poisson) arrival process, exponential service time distribution and one server

In queueing theory, a discipline within the mathematical theory of probability, an M/M/1 queue represents the queue length in a system having a single server, where arrivals are determined by a Poisson process and job service times have an exponential distribution. The model name is written in Kendall's notation. The model is the most elementary of queueing models and an attractive object of study as closed-form expressions can be obtained for many metrics of interest in this model. An extension of this model with more than one server is the M/M/c queue.

In queueing theory, a discipline within the mathematical theory of probability, Kingman's formula also known as the VUT equation, is an approximation for the mean waiting time in a G/G/1 queue. The formula is the product of three terms which depend on utilization (U), variability (V) and service time (T). It was first published by John Kingman in his 1961 paper The single server queue in heavy traffic. It is known to be generally very accurate, especially for a system operating close to saturation.

In queueing theory, a discipline within the mathematical theory of probability, the M/M/c queue is a multi-server queueing model. In Kendall's notation it describes a system where arrivals form a single queue and are governed by a Poisson process, there are c servers, and job service times are exponentially distributed. It is a generalisation of the M/M/1 queue which considers only a single server. The model with infinitely many servers is the M/M/∞ queue.

Vibration of plates

The vibration of plates is a special case of the more general problem of mechanical vibrations. The equations governing the motion of plates are simpler than those for general three-dimensional objects because one of the dimensions of a plate is much smaller than the other two. This suggests that a two-dimensional plate theory will give an excellent approximation to the actual three-dimensional motion of a plate-like object, and indeed that is found to be true.

In optics, the Fraunhofer diffraction equation is used to model the diffraction of waves when the diffraction pattern is viewed at a long distance from the diffracting object, and also when it is viewed at the focal plane of an imaging lens.

In queueing theory, a discipline within the mathematical theory of probability, an M/G/1 queue is a queue model where arrivals are Markovian, service times have a General distribution and there is a single server. The model name is written in Kendall's notation, and is an extension of the M/M/1 queue, where service times must be exponentially distributed. The classic application of the M/G/1 queue is to model performance of a fixed head hard disk.

In queueing theory, a discipline within the mathematical theory of probability, a fluid queue is a mathematical model used to describe the fluid level in a reservoir subject to randomly determined periods of filling and emptying. The term dam theory was used in earlier literature for these models. The model has been used to approximate discrete models, model the spread of wildfires, in ruin theory and to model high speed data networks. The model applies the leaky bucket algorithm to a stochastic source.

In queueing theory, a discipline within the mathematical theory of probability, the M/M/∞ queue is a multi-server queueing model where every arrival experiences immediate service and does not wait. In Kendall's notation it describes a system where arrivals are governed by a Poisson process, there are infinitely many servers, so jobs do not need to wait for a server. Each job has an exponentially distributed service time. It is a limit of the M/M/c queue model where the number of servers c becomes very large.

In queueing theory, a discipline within the mathematical theory of probability, an M/D/1 queue represents the queue length in a system having a single server, where arrivals are determined by a Poisson process and job service times are fixed (deterministic). The model name is written in Kendall's notation. Agner Krarup Erlang first published on this model in 1909, starting the subject of queueing theory. An extension of this model with more than one server is the M/D/c queue.

Maximal entropy random walk (MERW) is a popular type of biased random walk on a graph, in which transition probabilities are chosen accordingly to the principle of maximum entropy, which says that the probability distribution which best represents the current state of knowledge is the one with largest entropy. While standard random walk chooses for every vertex uniform probability distribution among its outgoing edges, locally maximizing entropy rate, MERW maximizes it globally by assuming uniform probability distribution among all paths in a given graph.

References

  1. 1 2 Halfin, S.; Whitt, W. (1981). "Heavy-Traffic Limits for Queues with Many Exponential Servers" (PDF). Operations Research. 29 (3): 567. doi:10.1287/opre.29.3.567.
  2. Kingman, J. F. C.; Atiyah (October 1961). "The single server queue in heavy traffic". Mathematical Proceedings of the Cambridge Philosophical Society . 57 (4): 902. Bibcode:1961PCPS...57..902K. doi:10.1017/S0305004100036094. JSTOR   2984229.
  3. Gautam, Natarajan (2012). Analysis of Queues: Methods and Applications. CRC Press. ISBN   9781439806586.
  4. 1 2 Kingman, J. F. C. (1962). "On Queues in Heavy Traffic". Journal of the Royal Statistical Society. Series B (Methodological). 24 (2): 383–392. doi:10.1111/j.2517-6161.1962.tb00465.x. JSTOR   2984229.
  5. Iglehart, Donald L.; Ward, Whitt (1970). "Multiple Channel Queues in Heavy Traffic. II: Sequences, Networks, and Batches" (PDF). Advances in Applied Probability. 2 (2): 355–369. doi:10.2307/1426324. JSTOR   1426324 . Retrieved 30 Nov 2012.
  6. Köllerström, Julian (1974). "Heavy Traffic Theory for Queues with Several Servers. I". Journal of Applied Probability. 11 (3): 544–552. doi:10.2307/3212698. JSTOR   3212698.
  7. Iglehart, Donald L. (1965). "Limiting Diffusion Approximations for the Many Server Queue and the Repairman Problem". Journal of Applied Probability. 2 (2): 429–441. doi:10.2307/3212203. JSTOR   3212203.
  8. Borovkov, A. A. (1967). "On limit laws for service processes in multi-channel systems". Siberian Mathematical Journal. 8 (5): 746–763. doi:10.1007/BF01040651.
  9. Iglehart, Donald L. (1973). "Weak Convergence in Queueing Theory". Advances in Applied Probability. 5 (3): 570–594. doi:10.2307/1425835. JSTOR   1425835.
  10. Puhalskii, A. A.; Reiman, M. I. (2000). "The multiclass GI/PH/N queue in the Halfin-Whitt regime". Advances in Applied Probability. 32 (2): 564. doi:10.1239/aap/1013540179.
  11. Reed, J. (2009). "The G/GI/N queue in the Halfin–Whitt regime". The Annals of Applied Probability. 19 (6): 2211–2269. arXiv: 0912.2837 . doi:10.1214/09-AAP609.
  12. Whitt, W. (2004). "Efficiency-Driven Heavy-Traffic Approximations for Many-Server Queues with Abandonments" (PDF). Management Science . 50 (10): 1449–1461. CiteSeerX   10.1.1.139.750 . doi:10.1287/mnsc.1040.0279. JSTOR   30046186.
  13. Gross, D.; Shortie, J. F.; Thompson, J. M.; Harris, C. M. (2013). "Bounds and Approximations". Fundamentals of Queueing Theory. pp. 329–368. doi:10.1002/9781118625651.ch7. ISBN   9781118625651.
  14. Chen, H.; Yao, D. D. (2001). "Technical Desiderata". Fundamentals of Queueing Networks. Stochastic Modelling and Applied Probability. Vol. 46. pp. 97–124. doi:10.1007/978-1-4757-5301-1_5. ISBN   978-1-4419-2896-2.
  15. Chen, H.; Yao, D. D. (2001). "Single-Station Queues". Fundamentals of Queueing Networks. Stochastic Modelling and Applied Probability. Vol. 46. pp. 125–158. doi:10.1007/978-1-4757-5301-1_6. ISBN   978-1-4419-2896-2.
  16. Asmussen, S. R. (2003). "Steady-State Properties of GI/G/1". Applied Probability and Queues. Stochastic Modelling and Applied Probability. Vol. 51. pp. 266–301. doi:10.1007/0-387-21525-5_10. ISBN   978-0-387-00211-8.