Figures
Abstract
This study applied a closed BCMP queueing network to a real-world model, examining the limitations of the theoretical solution and the possibility of replacing theoretical values with those from parallel simulation. Parallel computing was applied to mean value analysis (MVA). We first obtained computational and theoretical values by varying the number of nodes from 33 to 300 and customers from 250 to 1500 in a system with three customer classes. The computation time increased proportionally with the number of nodes but exponentially with the number of customers, reaching 146,798.86 seconds for 33 nodes, 3 customer classes, and 1500 customers. We then considered a system with more customer classes; due to the greater computational burden, we proposed addressing this problem with simulation. By using a large-scale computing environment (a supercomputer), it was possible to obtain the theoretical solutions for up to three customer classes and verify the simulation accuracy. The parallel simulations’ performance-evaluation indices, such as the average number of people in the system, converged to the theoretical values within an acceptable error range after 100,000 simulation hours for systems with four or more customer classes. These results demonstrate that the proposed parallel simulation approach can serve as an accurate and computationally efficient alternative to theoretical solutions for large-scale closed BCMP queueing networks.
Citation: Mizuno S, Ohba H (2024) Limitations of calculating theoretical solutions for closed BCMP queueing networks and verification of alternative theoretical values by parallel simulation. PLoS ONE 19(12): e0311533. https://doi.org/10.1371/journal.pone.0311533
Editor: Palaniyappan Sathyaprakash, SASTRA Deemed University: Shanmugha Arts Science Technology and Research Academy, INDIA
Received: February 17, 2024; Accepted: September 19, 2024; Published: December 17, 2024
Copyright: © 2024 Mizuno, Ohba. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The datasets generated during and/or analysed during the current study are available in the GitHub repository (https://github.com/smzn/Limitations-of-Calculating-Theoretical-Solutions-for-Closed-BCMP-Queueing-Networks-and-Verification-).
Funding: This work was supported by JSPS (Japan Society for the Promotion of Science) KAKENHI (grant number JP21K11774). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
1 Introduction
Queueing theory has been a fundamental tool for analyzing and optimizing the performance of various systems, such as communication networks, manufacturing systems, and service facilities. The Baskett, Chandy, Muntz, and Palacios (BCMP) queueing network, introduced by Baskett et al. in 1975 [1], is a versatile and widely-used model that extends the earlier works of Jackson [2] and Gordon/Newell [3]. The BCMP network allows for open, closed, and mixed network types, four service disciplines, and multiple customer classes, making it applicable to a wide range of real-world problems [4–7].
Despite its theoretical elegance, the application of queueing theory to large-scale systems has been hindered by the computational complexity involved. Papadimitriou and Tsitsiklis [8] showed that finding the optimal control policy in a multi-class closed queueing network is an intractable problem with an exponentially large computational time. Consequently, the application of queueing theory to large-scale systems has been limited in practice.
Recently, "waiting"—an essential concept of queuing theory—has been revisited because of its connection to crowding. The global spread of the novel coronavirus has increased awareness of social distance [9]. The importance of managing queues and reducing waiting times has become more evident in the context of the pandemic, as crowding in public spaces can increase the risk of disease transmission [10]. Before the pandemic, crowding while "waiting in line" was accepted as a natural occurrence. However, reduction of crowding is now essential to prevent the spreading of infection. Effective queue management strategies, informed by queueing theory, can play a crucial role in redesigning public spaces and facilities, such as theme parks, shopping malls, and schools, to ensure public safety and well-being [11].
Nearly 50 years after the introduction of the BCMP queueing network, advances in computing power and parallel processing have opened up new possibilities for applying queueing theory to large-scale systems [12, 13]. This study aims to investigate the computational limits of solving large-scale closed BCMP queueing networks using exact methods, such as mean value analysis (MVA) [14], in a modern parallel computing environment. Furthermore, we propose an alternative approach using parallel simulations to approximate performance measures when the number of customer classes is large, making exact solutions computationally intractable [15–17]. The calculation time and memory used for parallel computing were evaluated for various numbers of nodes, customer classes, and people in the network, indicating the required computing resources for different model scales. Calculations involving four or more customer classes are challenging for existing methods; thus, we proposed using simulations for performance evaluation. The simulation results showed sufficiently acceptable errors compared to the theoretical values obtained. Additionally, the simulation approach can be applied when the number of customer classes is four or more, significantly reducing the computational resources required. This alternative simulation method confirms that theoretical values can be computed with significantly fewer computational resources.
Previous simulation studies have proposed the maximum and the minimum number of customers at each node, temporal correlation of the number of customers in the system at each node, and distribution of the number of people when travel time is considered [18]. Combining these with the proposed simulations is expected to provide insights that cannot be obtained with theoretical values, thus making it easier to apply the model to real-world situations.
The rest of this paper is organized as follows. Section 1.1 reviews the related work on computational methods for closed BCMP queueing networks and their applications, while Section 1.2 outlines the contributions of this study. Section 2 describes the methodology, including the definition of a closed BCMP queueing network, the mean value analysis method, the proposed algorithm for parallel computing using MPI, and the parallel simulation approach. Section 3 presents the experimental results and discussion, covering the massively parallel computing environment for MVA, factors affecting the calculation of theoretical solutions, considerations for the number of parallels, and the calculation of performance metrics using parallel simulation. This section also includes an accuracy verification of parallel simulations and explores the use of parallel simulation as an alternative to theoretical values. Additionally, it discusses the limitations of calculating theoretical values for closed BCMP and the effects of parallel simulation. Finally, Section 4 concludes the paper, summarizing the key findings and suggesting potential future research directions.
1.1 Related work
Although there are both open and closed types of queueing networks, performance metrics, such as the mean number of people in the system, can be obtained by finding the normalization constant based on the results of the product-form solution [4–6]. For open-type queuing networks, the normalization constant can be easily obtained numerically, and it is possible to make the network large. By contrast, when obtaining the normalization constant for closed-type queuing networks, all combinations that depend on the number of nodes, customer classes, and people in the network must be calculated, which requires massive computational resources [7]. This makes it difficult to upscale the queueing network in a closed system.
Recent studies have focused on developing efficient computational methods for analyzing large-scale closed queueing networks and their applications in various domains. Patel and Bhathawala [19] proposed a performance analysis method for closed queueing networks with multiple customer classes and non-exponential service times, extending the applicability of the BCMP model. Li and Fang [20] developed a parallel simulation approach for large-scale closed queueing networks with multi-class customers, demonstrating the scalability of simulation-based methods.
The integration of machine learning techniques with queueing theory has also gained attention in recent years. Osakwe et al. [21] proposed a deep reinforcement learning approach for performance optimization of queueing systems, showcasing the potential of data-driven methods in queueing network control. Sato and Yokota [22] applied closed queueing network models to analyze the performance of smart factories, highlighting the relevance of queueing theory in modern manufacturing systems.
Approximation methods have been developed to tackle the computational complexity of closed BCMP queueing networks. Chen and Yuan [23] proposed an approximate analysis method for closed queueing networks with load-dependent service times, providing a computationally efficient alternative to exact solutions.
Previous studies of computational methods for closed queueing networks, shown in Table 1, have presented numerical calculations on small networks. Moreover, the type of computational environment required was not clearly indicated. In the real world, closed queueing networks tend to be large. Therefore, estimating the computing resources required in the current computing environment is essential.
Typical algorithms for computing closed BCMP are convolution [30] and mean value analysis [31–33]. The convolution method directly calculates performance evaluation indices, such as the stationary distribution and the mean number of people in the system, by finding the normalization constant. However, recursive calculations are often used because of the formula structure for obtaining normalization constants. Although recursive calculations are effective in programming techniques such as memorization [34], they are not effective in parallelizing a large-scale computing environment. The calculation method depends on CPU performance, making it difficult to increase the scale of the model. This is exacerbated by several factorial and power calculations in the formulas and the fact that there is a dropout of orders of magnitude.
By contrast, although the mean value analysis method uses an enormous amount of memory, its calculation structure is suitable for parallel computation, suggesting great potential for large-scale applications. Many approximation algorithms have also been proposed [24–27], some of them using Brownian models [35, 36] or adding complex service conditions [37]. However, in today’s large-scale computing environment, sufficient memory and large-scale calculations can be expected without using approximations.
Simulations of closed BCMP have been studied for a long time, largely because of the difficulty of obtaining theoretical values for closed queueing networks, which have been used as an alternative method [33, 38, 39]. Simulations provide dynamic information that cannot be obtained under static conditions [40, 41], which is crucial when dealing with real-world models. Parallelization has been proposed to speed up simulation [42, 43]; however, the scale of the queueing network has remained small, and in terms of computational resources, the number of parallels has remained limited. Therefore, simulations have not reached a level appropriate for real-world applications.
Several studies have proposed real-world applications of queuing [44], e.g. to urban transportation networks [45] and theme parks [46]. For instance, Mizuno et al. [18, 46] computed a closed queueing network for mobile vehicles in a theme park and performed optimal node placement of mobile vehicles. Nevertheless, only one type of mobile vehicle was used because of computational complexity. The application of machine learning to social systems is also progressing and being increasingly linked with queueing theory. Links with supervised learning (e.g., neural networks [47, 48]) and with the game theory that incorporates customers’ experiential behavior [49] have been proposed, but a computational environment for large-scale implementation is essential for social implementation.
The growing use of automation, seen in self-driving cars and in robots performing picking tasks in factories, makes it important to evaluate large, closed queueing networks. If the performance evaluation values for closed BCMP queues can be rapidly calculated, evaluation and optimization of the performance of realistic queueing networks would become feasible.
1.2 Contributions of this study
This study makes the following contributions:
- For closed BCMP, we used MVA, an exact method for obtaining theoretical solutions, to show the limits computable in a modern computing environment. The performance-evaluation index of the closed BCMP could be obtained adequately for large-scale networks, especially when the number of customer classes was three or less. The scale of the systems considered in this study, in terms of the number of nodes, customer classes, and people in the network, is significantly larger than those in previous studies, as shown in Table 1. This demonstrates the novelty and importance of our work in tackling the challenges of large-scale system analysis.
- When the number of customer classes is four or more, the number of combinations in MVA becomes enormous, making it difficult to obtain a theoretical solution. Therefore, simulations were conducted and their results were compared with the theoretical solution.
- Using simulations with verified accuracy, we calculated performance metrics for closed BCMP with four or more customer classes. This simulation-based approach serves as an efficient alternative to exact methods for analyzing large-scale closed BCMP queueing networks that were previously intractable.
- The degree of simulation convergence was verified using the set simulation error-index to confirm the convergence of the simulation.
- Using the results of this study, it is possible to choose parameter settings such as the number of nodes, computational resources, and computation time for the construction of a closed BCMP optimization model.
2. Materials and methods
This section defines the proposed closed BCMP queueing network and presents a parallel computation algorithm for this large-scale network. We also propose using simulations as an alternative method for calculating the theoretical values.
2.1 Definition of a closed BCMP queueing network
The basic model of the closed BCMP queueing network proposed in this study is defined as follows [1]:
- There are C types of customer classes served in a network, and a customer belongs to one of the classes. There shall be no change of customer class during the process.
- There are N nodes in the network.
- When the total number of customers in the network is K, the number of customers of class c (1≤c≤C) in node n (1≤n≤N) is knc≥0, and
. Moreover,
.
- There shall be no arrival of guests from outside the network, as it is of closed type.
- Each node consists of a single server performing a first-come-first-service (FCFS).
- At location n, the service time follows an exponential distribution with service rate μn (1≤n≤N), independent of the customer class.
- The total arrival rate of class c customers arriving from within the network at node n is αnc.
- A customer of class v served at node i moves to node j in class w according to a Markov chain R = (riv,jw) that satisfies
(1)
As the class transition of customers is not considered this time, it is assumed that riv,jw = 0(v≠w). - The arrival rate αiv satisfies the following traffic equation for this closed network:
The state of node n is represented by . Let snj denote the class of the jth customer in line in order of arrival at node n. Let s = (s1,s2,⋯,sN) denote the state in the network and k = (k1,k2,⋯,kc) denote the number of people by customer class.
From this, the stationary distribution π(s) in the closed BCMP model is given by
(3)
where G(k) is the normalization constant
(4)
Additionally, fn(sn) obeys
(5)
Eqs (1)–(5) describe the fundamental properties of the closed BCMP queueing network model, as defined in the original paper by Baskett et al. [1].
2.2 Calculation of BCMP performance-evaluation index using mean value analysis
The mean number of people in the system for customer class c at node n using the mean value analysis method is calculated using the following updated formulas [4]: The mean intra-system time Tn,c(k) (1≤n≤ N,1≤c≤C) is
(6)
The mean number of people in the system Ln,c(k) is
(8)
Eqs (6)–(8) represent the basic steps of the mean value analysis (MVA) method for closed BCMP queueing networks, as described in Bolch et al. [4].
2.3 Proposed algorithm for using parallel computing (MPI) in MVA
The mean value analysis method can calculate performance evaluation indicators, such as the mean number of people in the system, through a simple calculation process. However, as the number of sites, classes, and people in the system increases and the scale of the model grows, the computation time becomes longer. Therefore, parallel computation is used to reduce computation time. The following algorithm describes the parallel computation flow when Message Passing Interface (MPI) [50] is applied to MVA:
- Registration of the configuration information
- Specify the number of nodes N; the number of classes C; the number of customers by class kc (1≤c≤C); the node service rate μn (1≤n≤N); and the number of parallels MP. The process number is mp (0≤mp≤MP−1), where mp = 0 is called the root process and is represented by mp0.
- mp0: Set transition probability R, find arrival rate αnc, and broadcast to other processes.
- MVA calculation for j in 1,⋯, K
- For each iteration, mp0 obtains a set satisfying
. For this, we define the set
. Using this, the direct product set K(j) of the number of classes C of Kc follows
(9) when the number of people in the system is j. Thus,
, is obtained.
- mp0 divides K(j) into
, and passes it to each process.
whereis each m-th class people vector passed to process i when the total number of customers is j.
- Using
passed in each process, find
from Eqs (6)–(8).
- mp0 aggregates the
obtained by each process and broadcasts Ln,c(∙) to each process.
- If index = K is exceeded, the iteration is terminated, and performance evaluation indicators, such as the average number of people in the system, are stored.
- For each iteration, mp0 obtains a set satisfying
Fig 1 illustrates the detailed steps of the parallel computation algorithm for MVA using MPI, providing a visual representation of the process described in this section. Eq (9) is an original set definition introduced in this study to calculate the overlap combination for each iteration j in the proposed parallel computation algorithm for MVA. In the above algorithm, Eq (9) in step B.a calculates the overlap combination for each iteration j such that the number of customers in each class does not exceed j, and the total number of customers in all classes is j. In B.b, the overlap combination obtained in B.a is distributed to parallel process i. In B.c, the calculation of Eqs (6)–(8) is performed in each parallel process. Finally, in B.d, the iterative calculation is performed using the root process mp0 to aggregate.
2.4 Calculation of performance metrics using parallel simulation
When the network becomes large, e.g., when there is an increase in the number of customer classes, the computing resources required become enormous, making computation challenging. Therefore, as an alternative to the theoretical calculation method of MVA, we propose using the mean number of people in the system in parallel simulations when calculating performance evaluation indices. The process is as follows:
- Registration of the configuration information
- Specify the number of nodes N; the number of classes C; the number of customers by class kc (1≤c≤C); the node service rate μn (1≤n≤N); the number of simulations S; and the simulation end condition ε. The process number is mp (0≤mp≤MP−1), where mp = 0 is called the root process and is represented by mp0.
- mp0: Set transition probability R, find arrival rate αnc, and broadcast to other processes.
- Perform event-driven parallel simulation
- Initial event setup: In each process, randomly assign customers in the network to nodes and give the first customer at each node a randomly distributed service time.
- Event processing: In each process, the minimum-service-time customer is served from the entire node; after the service is completed, the customer is moved to another node and the simulation time is updated.
- Calculate performance indicators such as the mean number of people in the system for each process.
- In
is used to calculate the mean
, where
is the mean number of people in the system for node n and class c of simulation number s.
- The allowable error in the number of people is given by
(10)
Repeat until Eq (10) is satisfied.
Eq (10) is an original formula for evaluating the error precision of the parallel simulation approach proposed in this study. As shown in Fig 2, the algorithm consists of two main parts: (A) Registration of the configuration information, and (B) Performing the event-driven parallel simulation. The flowchart clearly depicts the iterative nature of the simulation process and the error checking mechanism, which ensures the accuracy of the results. This visual representation enhances the understanding of our proposed parallel simulation approach for BCMP queueing networks.
3. Results and discussion
In this chapter, we present and discuss the results of our study on the limitations of calculating theoretical values for closed BCMP queueing networks and the effectiveness of parallel simulation as an alternative method. We begin by describing the massively parallel computing environment used for the mean value analysis (MVA) in Section 3.1. In Section 3.2, we examine the factors affecting the calculation of the theoretical solution for a closed BCMP queueing network, followed by a consideration of the number of parallels for MVA parallel computation in Section 3.3.
Next, we propose the use of parallel simulation to calculate performance metrics for closed BCMP when the number of customer classes is large, making MVA computationally challenging. We verify the accuracy of parallel simulations in Section 3.4.1 and present the results of using parallel simulation as an alternative to theoretical values in Section 3.4.2.
Finally, in Section 3.5, we discuss the limitations of calculating theoretical values for closed BCMP and the effects of parallel simulation, highlighting the contributions and implications of our study.
3.1 Massively parallel computing environment for MVA
In this study, the large-scale parallel computing environment for MVA was OCTOPUS [51], a large-scale computing system at the Cybermedia Center of Osaka University. The specifications of the computing environment are listed in Table 2.
3.2 Factors affecting the calculation of the theoretical solution for a closed BCMP queueing network
Here, we present the results of applying MPI to MVA to compute closed BCMP theoretical solutions. Table 3 summarizes the computation time, memory used, and number of combinations when N, C, and K are varied for 128 parallels. For example, consider the case N = 33,C = 3,K = 500, and MP = 128; the number of people per customer class was assumed to be equally K/C for every class. In this case, the computation time in MVA was 4166.07 seconds (approximately 70 minutes), and the maximum memory used was 114.44 GB.
Fig 3 shows the change in computation time as N and K increased. Note that the computation time grew proportionally to the increase in N (Fig 4). The computational complexity [4] also increased linearly as ON = D1N,(D1 is constant). However, Fig 5 shows that the computation time increased exponentially with K. In this case, the computational complexity was OK = D2(D3+K)C,(D2 and D3 are constants). The calculated amount of memory used was
[4]. Thus, the calculation results show that the increase in calculation time is proportional to N but grows exponentially when K is increased.
Table 3 shows the results up to C = 3. (The calculation could not be performed for C≥4 in this calculation environment.) With N = 33 and K = 500, the total number of combinations when calculating with MVA was 501 for C = 1, 63,001 for C = 2, 4,713,408 for C = 3, and 252,047,375 for C = 4, requiring several combination calculations. The number of combinations and the memory that can be held by C≥4 must be allocated, which is a significant burden for a computing environment.
3.3 Consideration of the number of parallels for MVA parallel computation for closed BCMP
The effect on computational time of the number of parallels in the parallelization of MVA for the closed BCMP is illustrated in Table 4. In particular, the table summarizes the computation time for the cases N = 33,C = 3, and K = 500, varying the number of parallels from 1 to 256. Because the maximum number of parallels on a single compute server is 128, two compute servers were used when the number of parallels was larger than this. Increasing the number of parallels reduces the amount of computation time required. However, the amount of memory used increases, and consequently, so does the utilization cost. Considering the rate of decrease in computation time, memory usage, and computer usage fees, 96~128 parallels were considered appropriate for using one compute server.
3.4 Calculation of performance metrics for closed BCMP using parallel simulation
In the closed BCMP, the calculation of performance evaluation indices using parallel computation in MVA has become a major burden due to the increase in computation volume as the number of customer classes increases. Herein, instead of calculating the performance evaluation index strictly using MVA, we consider calculating it using parallel simulation. First, the simulation accuracy is confirmed using the obtained MVA calculation results. Next, performance evaluation indexes are calculated for patterns with a large number of customer classes that could not be obtained using MVA. The simulation can be performed even without the large-scale computing environment used in the parallel computation of MVA, although the accuracy must be carefully verified. The following simulation results were calculated on a Mac Pro (model: Late 2013, processor: 3GHz 8-core Intel Xeon E5, memory: 32GB 1866MHz DDR3), with eight parallel calculations.
3.4.1 Accuracy verification of parallel simulations.
First, simulations with N = 33,C = 3, and K = 500 were performed to calculate the error relative to the theoretical values calculated by MVA. With a total number of people K in the network, the error tolerance Ep at each node and each class is
(11)
If it is assumed that p = 0.3, an error of 30.0% is assigned to each node and class on average. Subsequently, for K = 500, Eq (11) for each node and class yields 4.54 for C = 1; 2.27 for C = 2; and 1.51 for C = 3. The root mean squared error between the simulation and theoretical values for each node and class is
(12)
where
represents the theoretical value for node n and class c. Each simulation was performed eight times, and the mean
over the six simulations, excluding the best and worst cases, was used. Table 5 lists the result obtained using Eq (12). The bolded area is the value of Eq (12) below Ep. Approximately 7,000 simulation hours were required to reach the error tolerance. Similarly, the simulation accuracy was verified when N and K increased. Table 6 summarizes the change in the value of Eq (12) when N and K were varied for C = 3. In all cases, the simulations were found to be sufficiently accurate.
3.4.2 Alternative theoretical value using parallel simulation.
Because the simulation accuracy was confirmed in the previous section, simulations were performed for C≥4, a challenging scenario to compute. As a performance-evaluation index, the mean number of people in the network was calculated. The simulation considered eight simultaneous independent simulations, and the average was obtained from the mean number of people in the system obtained for each simulation value. The squared error with that average was obtained, and the remaining six simulation values, excluding those with the largest and smallest squared errors, were used to calculate the error of the simulation values at each time from Eq (10). Table 7 summarizes the change in Eq (10) when the number of classes increased under N = 33 and K = 500. In Eq (10), even when ε = 0.01, the value converged after 100,000 simulation hours, and an approximation to the theoretical value was obtained.
Consider the simulation results for C = 4, shown in Fig 6. The horizontal axis of the figure is the standard deviation of the average number of customers in the system obtained in the simulation for each process, averaged over all processes. The vertical axis is the value of Eq (12). From this, we can see that larger values along the horizontal axis correspond to smaller root-mean-square errors. The correlation coefficient between the two axes was –0.8920. The transition probability matrix of this simulation was generated for each program. It is easier for a simulation to converge when the queueing network is characterized by locations where the number of customers in the system is concentrated. Thus, this simulation is more likely to obtain good values in environments with localized crowding, thus increasing the accuracy of congestion assessment.
The real-time rate is the ratio of the real time taken in the simulation to the simulation time. When C = 3,K = 500, and simulation time was 100000, the real time rate was 0.8200 for N = 33; 12.1941 for N = 100; and 26.7621 for N = 133. The impact of N was larger than that of C because N > C and the transition probability, which goes as o((N∙C)2), must be used when a customer moves to the next node after service is completed.
3.5 Limitations of calculating theoretical values for closed BCMP and effects of parallel simulation
The computational environment to calculate the theoretical values for the closed BCMP was described in Section 3.1, and the results obtained in that environment were verified in Section 3.2. We also showed the appropriate computational environment for the closed BCMP in Section 3.3. In the presented computational environment, C = 3 is the limit for N = 33 and K = 500. In Section 3.4, we confirmed the accuracy of the simulation using the values obtained in Section 3.2. Moreover, we obtained simulation times for which the simulated values could replace the theoretical values. The results in these sections show the limit of the theoretical value calculation for the closed BCMP using the computational environment. Moreover, we showed that simulation can replace the theoretical value when C≥4 or more for N = 33 and K = 500. Furthermore, we have shown that, to obtain the theoretical value of the closed BCMP, an exact method, such as MVA, can be used, rather than an approximate method—even for a larger closed BCMP model than in previous studies. Furthermore, the computational environment and method are explicit and implemented in parallel simulation. This environment can be used as an alternative for models where calculations are challenging or impossible.
In a closed BCMP, simulation requires fewer computational resources and provides dynamic information, but requires longer actual computation time. The closed BCMP model can be utilized in many situations using the method of calculating performance evaluation indices for parallel computation of MVA. This study showcases the limitations of such parallel computation; moreover, the simulation accuracy is verified in this study according to the parameters of the closed BCMP.
4. Conclusion
This study performed the parallelization and simulation of MVA to BCMP, the traditionally used and a representative model of a closed queueing network, to scale up the closed BCMP successfully in a large-scale computing environment. We could accommodate changes in parameters, such as the number of nodes and people in the network, up to realistic values. The computation time and memory used for parameter changes were also found, indicating the necessary preparation for the computation environment. However, in this case, the number of customer classes was limited to three because MVA consumes a large amount of memory for larger numbers than this. Therefore, to obtain the mean number of people in a system with more than four customer classes, we performed simulations with the theoretical values of the obtained range and confirmed that the simulation values converged within the acceptable range. This indicates that the simulation method proposed in this study can be used as an alternative to calculating theoretical values of performance-evaluation indicators in a closed BCMP. This has important practical implications: although the number of customer classes is expected to increase as modern society becomes more diverse, performance evaluation indices, such as the mean number of people in the system, can still be obtained using highly accurate simulations.
This paper contributes to the literature in that it reports on the possibility of scaling queueing theory to real-world applications. Queueing theory is mathematically compelling but hard to apply in the real world because of the flexibility of its definition and its large scale. As shown in this paper, simulation allows flexible BCMP to be constructed on a large scale.
Queueing theory is also applicable to optimization, because it can calculate performance evaluation indices such as the mean number of people in the system. The application of queueing theory to optimization has nevertheless been limited, primarily because of its computational complexity. During optimization, the objective function is iteratively calculated several times. If the performance evaluation value of the closed BCMP is used for the objective function and only the computation time of one MVA is considered, the computation time to obtain the optimal solution is substantially larger. Here, we considered a large-scale closed BCMP model; its computation time was reduced by parallel computation. Therefore, this study has shown a computational environment that increases the possibility of applying queuing theory to optimization.
In the present study, the number of servers for each node was limited to one. Increasing the number of servers might provide more realistic values. Using service types other than FCFS and allowing for customer class transition should be considered for handling realistic models. The use of MVA also makes it difficult to calculate higher-order moments. Higher-order moments in conjunction with the normalization constants used in the convolution method could be calculated to evaluate realistic models. In the computational environment, memory space savings in recursive calculations in MVA and the use of GPUs are required for more generalized, large-scale closed BCMP calculations.
Acknowledgments
This work was (partly) achieved through the use of OCTOPUS at the Cybermedia Center, Osaka University.
References
- 1. Baskett F, Chandy KM, Muntz RR, Palacios FG (1975) Open, closed, and mixed networks of queues with different classes of customers. JACM 22:248–260.
- 2. Jackson JR (1957) Networks of waiting lines. Oper Res 5:518–521.
- 3. Gordon WJ, Newell GF (1967) Closed queuing systems with exponential servers. Oper Res 15:254–265.
- 4.
Bolch G, Greiner S, De Meer H, Trivedi KS (2006) Queueing networks and Markov chains: modeling and performance evaluation with computer science applications. John Wiley & Sons.
- 5. Allen AO (1990) Probability, statistics, and queueing theory. Gulf Professional Publishing.
- 6. Stewart WJ (2009) Probability, Markov chains, queues, and simulation: the mathematical basis of performance modeling. Princeton University Press.
- 7. Kino I, Queueing Network, Asakura Shoten, 2003.
- 8. Papadimitriou C, Tsitsiklis J (1999). The complexity of optimal queueing network. Math Oper Res 24:293–305.
- 9. Mizuno S, Ohba H (2022) Optimizing intra-facility crowding in Wi-Fi environments using continuous-time Markov chains. Discov Internet Things 2:5. https://doi.org/10.1007/s43926-022-00026-x
- 10. Chu D.K., Akl E.A., Duda S., Solo K., Yaacoub S., Schünemann H.J., et al., Physical distancing, face masks, and eye protection to prevent person-to-person transmission of SARS-CoV-2 and COVID-19: a systematic review and meta-analysis, The Lancet 395 (2020) 1973–1987. pmid:32497510
- 11. Jiang R., Shen S., Zhang Y., Analyzing the impact of COVID-19 on transportation systems using queueing theory, Transportation Research Part A: Policy and Practice 145 (2021) 244–262.
- 12. Awan I., Ahmad B., & Ahmad S. (2007). Performance analysis of networks of queues under active queue management scheme. Simulation Modelling Practice and Theory, 15(4), 416–425.
- 13. Mesheryakov R., Moiseev A., Demin A., Dorofeev V., & Sorokin V. (2016). Using Parallel Computing in Queueing Network Simulation. In Key Engineering Materials (Vol. 685, pp. 943–947). Trans Tech Publications, Ltd. https://doi.org/10.4028/www.scientific.net/kem.685.943
- 14. Reiser M, Lavenberg SS (1980) Mean-value analysis of closed multichain queuing networks. JACM 27:313–322.
- 15.
Lagershausen S., Performance analysis of closed queueing networks, Lecture Notes in Economics and Mathematical Systems, Springer, 2013.
- 16. Rege K.M., Multiclass queueing models for performance analysis of computer systems, Sadhana 15 (1990) 355–363.
- 17. Bertsimas D., Gamarnik D., Tsitsiklis J.N., Performance analysis of multiclass queueing networks, SIGMETRICS Performance Evaluation Review 27 (1999) 11–14.
- 18. Mizuno S, Suzuki K, Imai H, Yamaki N (2019) Effect of simulated cooperation on optimal node placement using a queuing network—A case study at Hamanako Garden Park—. J Jpn Ind Manag 69:220–228. (ここまでがIntroduction)
- 19. Patel R., Bhathawala P.H., Performance analysis of closed queueing networks with multiple classes and non-exponential service times, Journal of Industrial and Management Optimization 17 (2021) 1825–1842.
- 20. Li Z., Fang W., Parallel simulation of large-scale closed queueing networks with multi-class customers, in: 2020 IEEE 22nd International Conference on High Performance Computing and Communications, 2020, pp. 822–829.
- 21.
Osakwe I.C., Aneiba A., Misra S., Performance optimization of queuing systems using deep reinforcement learning, in: Machine Learning for Predictive Analysis, Springer, Cham, 2021, pp. 25–42.
- 22. Sato K., Yokota Y., Modeling and performance evaluation of a closed queueing network for a smart factory, in: 2020 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM), 2020, pp. 1165–1169.
- 23. Chen X., Yuan D., Approximate analysis of closed queueing networks with load-dependent service times, Performance Evaluation 148 (2021) 102180.
- 24. Wang H, Sevcik KC (2000) Experiments with improved approximate mean value analysis algorithms. Perform Eval 39:189–206.
- 25. Suri R, Sahu S, Vernon M (2007) Approximate mean value analysis for closed queuing networks with multiple-server stations. In Proceedings of the 2007 Industrial Engineering Research Conference 1–6.
- 26. Wang H, Sevcik KC, Serazzi G, Wang S (2008) The general form linearizer algorithms: A new family of approximate mean value analysis algorithms. Perform Eval 65:129–151.
- 27. Casale G (2008) A note on stable flow-equivalent aggregation in closed networks. Queueing Syst 60:193–202.
- 28. Kumar J, Shinde VA (2016) Comparative study of mean value analysis and convolution algorithm for queueing networks. Int J Comput Intell Syst 6: 8–17.
- 29.
Legato P, Mazza RM (2019) Class Aggregation for Multi-class Queueing Networks with FCFS Multi-server Stations. In International Conference on Queueing Theory and Network Applications 221–239. Springer, Cham.
- 30. Buzen JP (1973) Computational algorithms for closed queueing networks with exponential servers. Commun. ACM 16:527–531.
- 31.
Zhang L, Down DG (2019) SMVA: a stable mean value analysis algorithm for closed systems with load-dependent queues. In Systems Modeling: Methodologies and Tools 11–28. Springer, Cham.
- 32.
Boucherie RJ, Van Dijk NM, editors (2010) Queueing networks: a fundamental approach. Springer Science & Business Media.
- 33. Kobayashi H, Mark BL (2009) System modeling and analysis: Foundations of system performance evaluation. Pearson Education, India.
- 34. Guttag JV (2017) Introduction to Computation & Programming Using Python. Phi Learning.
- 35. Harrison JM, Nguyen V (1993) Brownian models of multiclass queueing networks: Current status and open problems. Queueing Syst 13:5–40. https://doi.org/10.1007/BF01158927
- 36. Schwerer E, Van Mieghem JA (1994) Brownian models of closed queueing networks: Explicit solutions for balanced three-station systems. Ann Appl Probab 4:448–477. https://doi.org/10.1214/aoap/1177005068
- 37. Shin YW, Moon DH (2021) A unified approach for an approximation of tandem queues with failures and blocking under several types of service-failure interactions. Computers & Operations Research 127:105161. https://doi.org/10.1016/j.cor.2020.105161
- 38.
Taylor S, editor (2014) Agent-based modeling and simulation. Springer.
- 39. Whitt W (1983) Performance of the queueing network analyzer. Bell Syst Tech J 62:2817–2843.
- 40. Shen H, Hong LJ, Zhang X (2018) Enhancing stochastic kriging for queueing simulation with stylized models. IISE Transactions 50:943–958. https://doi.org/10.1080/24725854.2018.1465242
- 41. Lin Z, Matta A, Shanthikumar JG (2019) Combining simulation experiments and analytical models with area-based accuracy for performance evaluation of manufacturing systems. IISE Transactions 51:266–283. https://doi.org/10.1080/24725854.2018.1490046
- 42. Wagner DB, Lazowska ED (1989) Parallel simulation of queueing networks: Limitations and potentials. ACM SIGMETRICS Perform Eval Rev 17:146–155.
- 43. Mesheryakov R, Moiseev A, Demin A, Dorofeev V, Sorokin V (2016) Using parallel computing in queueing network simulation. Key Eng Mater 685:943–947. Trans Tech Publications Ltd.
- 44.
Mizuno S, Seki M, Fujisawa Y, Kakehi A (2015) Expression of Relation of Medical Institutions and the Simulation of the Relation Recovery for Disaster. In International Conference on Computer Information Systems and Industrial Applications 976–979. Atlantis Press.
- 45. Dad J, Ouali M, Lebbah Y (2011) A multiclass BCMP queueing modeling and simulation-based road traffic flow analysis. World Acad Sci Eng Tech 5:312–317.
- 46. Mizuno S, Iwamoto S, Seki M, Yamaki N (2016) Proposal for optimal placement platform of bikes using queueing networks. SpringerPlus 5:1–4.
- 47. Khayyati S, Tan B (2022) Supervised-learning-based approximation method for multi-server queueing networks under different service disciplines with correlated interarrival and service times. International Journal of Production Research 60:5176–5200. https://doi.org/10.1080/00207543.2021.1951448
- 48. Dieleman NA, Berkhout J, Heidergott B (2023) A neural network approach to performance analysis of tandem lines: The value of analytical knowledge. Computers & Operations Research 152: 106124. https://doi.org/10.1016/j.cor.2022.106124
- 49. Sakuma Y, Masuyama H, Fukuda E (2020) A discrete-time single-server Poisson queueing game: Equilibria simulated by an agent-based model. European Journal of Operational Research 283:253–264. https://doi.org/10.1016/j.ejor.2019.11.003
- 50.
MPI for Python, https://mpi4py.readthedocs.io/en/stable/ Accessed May 2022
- 51.
OCTOPUS (Osaka university Cybermedia cenTer Over-Petascale Universal Supercomputer), http://www.hpc.cmc.osaka-u.ac.jp/en/octopus/ Accessed May 2022