Mean value analysis

Last updated

In queueing theory, a discipline within the mathematical theory of probability, mean value analysis (MVA) is a recursive technique for computing expected queue lengths, waiting time at queueing nodes and throughput in equilibrium for a closed separable system of queues. The first approximate techniques were published independently by Schweitzer [1] and Bard, [2] [3] followed later by an exact version by Lavenberg and Reiser published in 1980. [4] [5]

Contents

It is based on the arrival theorem, which states that when one customer in an M-customer closed system arrives at a service facility he/she observes the rest of the system to be in the equilibrium state for a system with M  1 customers.

Problem setup

Consider a closed queueing network of K M/M/1 queues, with M customers circulating in the system. Suppose that the customers are indistinguishable from each other, so that the network has a single class of customers. To compute the mean queue length and waiting time at each of the nodes and throughput of the system we use an iterative algorithm starting with a network with 0 customers.

Write μi for the service rate at node i and P for the customer routing matrix where element pij denotes the probability that a customer finishing service at node i moves to node j for service. To use the algorithm, we first compute the visit ratio row vector v, a vector such that v = v P.

Now write Li(n) for the mean number of customers at queue i when there is a total of n customers in the system (this includes the job currently being served at queue i) and Wj(n) for the mean time spent by a customer in queue i when there is a total of n customers in the system. Denote the throughput of a system with m customers by λm.

Algorithm

The algorithm [6] starts with an empty network (zero customers), then increases the number of customers by 1 at each iteration until there are the required number (M) of customers in the system.

To initialise, set Lk(0) = 0 for k = 1,...,K. (This sets the average queue length in a system with no customers to zero at all nodes.)

Repeat for m = 1,...,M:

1. For k = 1, ..., K compute the waiting time at each node using the arrival theorem:
2. Then compute the system throughput using Little's law:
3. Finally, use Little's law applied to each queue to compute the mean queue lengths for k = 1, ..., K:

End repeat.

Bard–Schweitzer method

The Bard–Schweitzer approximation estimates the average number of jobs at node k to be: [1] [7]

which is a linear interpolation. From the above formulas, this approximation yields fixed-point relationships which can be solved numerically. This iterative approach often goes under the name of approximate MVA (AMVA) and it is typically faster than the recursive approach of MVA. [8] :38

Pseudocode

set Lk(m) = M/K

repeat until convergence:

Multiclass networks

In the case of multiclass networks with R classes of customers, each queue k can feature different service rates μk,r for each job class r=1,...,R, although certain restrictions exist in the case of first-come first-served stations due to the assumptions of the BCMP theorem in the multiclass case.

The waiting time Wk,r experienced by class-r jobs at queue k can still be related to the total mean queue-length at node k using a generalization of the arrival theorem:

where is a vector of customer population for the R classes and subtracts one from the r-th element of , assuming that .

For networks with a single customer class the MVA algorithm is very fast and time taken grows linearly with the number of customers and number of queues. However, in multiclass models the number of multiplications and additions and the storage requirements for MVA grow exponentially with the number of customer classes. Practically, the algorithm works well for 3-4 customer classes, [9] although this generally depends on the implementation and the structure of the model. For example, the Tree-MVA method can scale to larger models if the routing matrix is sparse. [10]

Exact values for mean performance metrics can be obtained in large models using the method of moments, which requires log-quadratic time. The method of moments can solve in practice models with up to 10 classes of customers or sometimes larger, which are typically inaccessible by means of exact MVA. [9] [11] This technique however does not use the arrival theorem and relies on solving systems of linear equations involving the normalizing constant of state probabilities for the queueing network.

Approximate MVA (AMVA) algorithms, such as the Bard-Schweitzer method, offer instead an alternative solution technique that provides low complexity also on multiclass networks and typically deliver highly accurate results. [12]

Extensions

The mean value analysis algorithm has been applied to a class of PEPA models describing queueing networks and the performance of a key distribution center. [13]

Software

See also

Related Research Articles

<span class="mw-page-title-main">Queueing theory</span> Mathematical study of waiting lines, or queues

Queueing theory is the mathematical study of waiting lines, or queues. A queueing model is constructed so that queue lengths and waiting time can be predicted. Queueing theory is generally considered a branch of operations research because the results are often used when making business decisions about the resources needed to provide a service.

<span class="mw-page-title-main">Decision tree learning</span> Machine learning algorithm

Decision tree learning is a supervised learning approach used in statistics, data mining and machine learning. In this formalism, a classification or regression decision tree is used as a predictive model to draw conclusions about a set of observations.

In queueing theory, a discipline within the mathematical theory of probability, a Jackson network is a class of queueing network where the equilibrium distribution is particularly simple to compute as the network has a product-form solution. It was the first significant development in the theory of networks of queues, and generalising and applying the ideas of the theorem to search for similar product-form solutions in other networks has been the subject of much research, including ideas used in the development of the Internet. The networks were first identified by James R. Jackson and his paper was re-printed in the journal Management Science’s ‘Ten Most Influential Titles of Management Sciences First Fifty Years.’

<span class="mw-page-title-main">F-score</span> Statistical measure of a tests accuracy

In statistical analysis of binary classification, the F-score or F-measure is a measure of a test's accuracy. It is calculated from the precision and recall of the test, where the precision is the number of true positive results divided by the number of all positive results, including those not identified correctly, and the recall is the number of true positive results divided by the number of all samples that should have been identified as positive. Precision is also known as positive predictive value, and recall is also known as sensitivity in diagnostic binary classification.

In queueing theory, a discipline within the mathematical theory of probability, Buzen's algorithm (or convolution algorithm) is an algorithm for calculating the normalization constant G(N) in the Gordon–Newell theorem. This method was first proposed by Jeffrey P. Buzen in his 1971 PhD dissertation and subsequently published in a refereed journal in 1973. Computing G(N) is required to compute the stationary probability distribution of a closed queueing network.

In queueing theory, a discipline within the mathematical theory of probability, a G-network is an open network of G-queues first introduced by Erol Gelenbe as a model for queueing systems with specific control functions, such as traffic re-routing or traffic destruction, as well as a model for neural networks. A G-queue is a network of queues with several types of novel and useful customers:

In queueing theory, a discipline within the mathematical theory of probability, the Pollaczek–Khinchine formula states a relationship between the queue length and service time distribution Laplace transforms for an M/G/1 queue. The term is also used to refer to the relationships between the mean queue length and mean waiting/service time in such a model.

<span class="mw-page-title-main">M/M/1 queue</span> Queue with Markov (Poisson) arrival process, exponential service time distribution and one server

In queueing theory, a discipline within the mathematical theory of probability, an M/M/1 queue represents the queue length in a system having a single server, where arrivals are determined by a Poisson process and job service times have an exponential distribution. The model name is written in Kendall's notation. The model is the most elementary of queueing models and an attractive object of study as closed-form expressions can be obtained for many metrics of interest in this model. An extension of this model with more than one server is the M/M/c queue.

In queueing theory, a discipline within the mathematical theory of probability, a BCMP network is a class of queueing network for which a product-form equilibrium distribution exists. It is named after the authors of the paper where the network was first described: Baskett, Chandy, Muntz, and Palacios. The theorem is a significant extension to a Jackson network allowing virtually arbitrary customer routing and service time distributions, subject to particular service disciplines.

In probability theory, a product-form solution is a particularly efficient form of solution for determining some metric of a system with distinct sub-components, where the metric for the collection of components can be written as a product of the metric across the different components. Using capital Pi notation a product-form solution has algebraic form

In queueing theory, a discipline within the mathematical theory of probability, the Gordon–Newell theorem is an extension of Jackson's theorem from open queueing networks to closed queueing networks of exponential servers where customers cannot leave the network. Jackson's theorem cannot be applied to closed networks because the queue length at a node in the closed network is limited by the population of the network. The Gordon–Newell theorem calculates the open network solution and then eliminates the infeasible states by renormalizing the probabilities. Calculation of the normalizing constant makes the treatment more awkward as the whole state space must be enumerated. Buzen's algorithm or mean value analysis can be used to calculate the normalizing constant more efficiently.

In queueing theory, a discipline within the mathematical theory of probability, the arrival theorem states that "upon arrival at a station, a job observes the system as if in steady state at an arbitrary instant for the system without that job."

<span class="mw-page-title-main">Fork–join queue</span> Type of queue

In queueing theory, a discipline within the mathematical theory of probability, a fork–join queue is a queue where incoming jobs are split on arrival for service by numerous servers and joined before departure. The model is often used for parallel computations or systems where products need to be obtained simultaneously from different suppliers. The key quantity of interest in this model is usually the time taken to service a complete job. The model has been described as a "key model for the performance analysis of parallel and distributed systems." Few analytical results exist for fork–join queues, but various approximations are known.

In queueing theory, a discipline within the mathematical theory of probability, the M/M/c queue is a multi-server queueing model. In Kendall's notation it describes a system where arrivals form a single queue and are governed by a Poisson process, there are c servers, and job service times are exponentially distributed. It is a generalisation of the M/M/1 queue which considers only a single server. The model with infinitely many servers is the M/M/∞ queue.

In queueing theory, a discipline within the mathematical theory of probability, an M/G/1 queue is a queue model where arrivals are Markovian, service times have a General distribution and there is a single server. The model name is written in Kendall's notation, and is an extension of the M/M/1 queue, where service times must be exponentially distributed. The classic application of the M/G/1 queue is to model performance of a fixed head hard disk.

In queueing theory, a discipline within the mathematical theory of probability, a fluid queue is a mathematical model used to describe the fluid level in a reservoir subject to randomly determined periods of filling and emptying. The term dam theory was used in earlier literature for these models. The model has been used to approximate discrete models, model the spread of wildfires, in ruin theory and to model high speed data networks. The model applies the leaky bucket algorithm to a stochastic source.

In queueing theory, a discipline within the mathematical theory of probability, the M/M/∞ queue is a multi-server queueing model where every arrival experiences immediate service and does not wait. In Kendall's notation it describes a system where arrivals are governed by a Poisson process, there are infinitely many servers, so jobs do not need to wait for a server. Each job has an exponentially distributed service time. It is a limit of the M/M/c queue model where the number of servers c becomes very large.

In queueing theory, a discipline within the mathematical theory of probability, a heavy traffic approximation is the matching of a queueing model with a diffusion process under some limiting conditions on the model's parameters. The first such result was published by John Kingman who showed that when the utilisation parameter of an M/M/1 queue is near 1 a scaled version of the queue length process can be accurately approximated by a reflected Brownian motion.

In queueing theory, a discipline within the mathematical theory of probability, an M/D/1 queue represents the queue length in a system having a single server, where arrivals are determined by a Poisson process and job service times are fixed (deterministic). The model name is written in Kendall's notation. Agner Krarup Erlang first published on this model in 1909, starting the subject of queueing theory. An extension of this model with more than one server is the M/D/c queue.

<span class="mw-page-title-main">Polling system</span>

In queueing theory, a discipline within the mathematical theory of probability, a polling system or polling model is a system where a single server visits a set of queues in some order. The model has applications in computer networks and telecommunications, manufacturing and road traffic management. The term polling system was coined at least as early as 1968 and the earliest study of such a system in 1957 where a single repairman servicing machines in the British cotton industry was modelled.

References

  1. 1 2 Schweitzer, P. J.; Serazzi, G.; Broglia, M. (1993). "A survey of bottleneck analysis in closed networks of queues". Performance Evaluation of Computer and Communication Systems. Lecture Notes in Computer Science. Vol. 729. p. 491. doi:10.1007/BFb0013865. ISBN   978-3-540-57297-8.
  2. Bard, Yonathan (1979). Some Extensions to Multiclass Queueing Network Analysis. North-Holland Publishing Co. pp. 51–62. ISBN   978-0-444-85332-5.{{cite book}}: |journal= ignored (help)
  3. Adan, I.; Wal, J. (2011). "Mean Values Techniques". Queueing Networks. International Series in Operations Research & Management Science. Vol. 154. pp. 561–586. doi:10.1007/978-1-4419-6472-4_13. ISBN   978-1-4419-6471-7.
  4. Reiser, M.; Lavenberg, S. S. (1980). "Mean-Value Analysis of Closed Multichain Queuing Networks". Journal of the ACM . 27 (2): 313. doi: 10.1145/322186.322195 . S2CID   8694947.
  5. Reiser, M. (2000). "Mean Value Analysis: A Personal Account". Performance Evaluation: Origins and Directions. Lecture Notes in Computer Science. Vol. 1769. pp. 491–504. doi:10.1007/3-540-46506-5_22. ISBN   978-3-540-67193-0.
  6. Bose, Sanjay K. (2001). An introduction to queueing systems. Springer. p. 174. ISBN   978-0-306-46734-9.
  7. Schweitzer, Paul (1979). "Approximate analysis of multiclass closed networks of queues". Proceedings of International Conference on Stochastic Control and Optimization.
  8. Tay, Y. C. (2010). "Analytical Performance Modeling for Computer Systems". Synthesis Lectures on Computer Science. 2: 1–116. doi:10.2200/S00282ED1V01Y201005CSL002. S2CID   207318911.
  9. 1 2 Casale, G. (2011). "Exact analysis of performance models by the Method of Moments" (PDF). Performance Evaluation. 68 (6): 487–506. CiteSeerX   10.1.1.302.1139 . doi:10.1016/j.peva.2010.12.009.
  10. Hoyme, K. P.; Bruell, S. C.; Afshari, P. V.; Kain, R. Y. (1986). "A tree-structured mean value analysis algorithm". ACM Transactions on Computer Systems. 4 (2): 178–185. doi: 10.1145/214419.214423 .
  11. Casale, G. (2008). "CoMoM: A Class-Oriented Algorithm for Probabilistic Evaluation of Multiclass Queueing Networks". IEEE Transactions on Software Engineering. 35 (2): 162–177. CiteSeerX   10.1.1.302.1139 . doi:10.1016/j.peva.2010.12.009.
  12. Zahorjan, John; Eager, Derek L.; Sweillam, Hisham M. (1988). "Accuracy, speed, and convergence of approximate mean value analysis". Performance Evaluation. 8 (4): 255–270. doi:10.1016/0166-5316(88)90028-4.
  13. Thomas, N.; Zhao, Y. (2010). "Mean value analysis for a class of PEPA models". Comput. J. 54 (5): 643–652. doi:10.1093/comjnl/bxq064. S2CID   12824669.
  14. Bertoli, M.; Casale, G.; Serazzi, G. (2009). "JMT: performance engineering tools for system modeling" (PDF). ACM SIGMETRICS Performance Evaluation Review. 36 (4): 10. doi:10.1145/1530873.1530877. S2CID   6920559.
  15. Marzolla, M. (2014). "The Octave Queueing Package". Quantitative Evaluation of Systems. Lecture Notes in Computer Science. Vol. 8657. pp. 174–177. doi:10.1007/978-3-319-10696-0_14. ISBN   978-3-319-10695-3. S2CID   4978676.