Abstract
Consider the problem of establishing a service quota that sets the maximum number of items, entities, or people that should be scheduled for service during a particular period. Service is performed by a multiserver system using a first-come, first-serve discipline. If the number of customers seeking service is less than or equal to the quota, then they are all scheduled for service; otherwise, the quota is serviced, and remaining customers are delayed. Some delayed customers may leave the system before receiving service. Determination of an economic service quota requires the trade-off of possible overtime cost of scheduling too many customers for service in the period and the postponement cost of scheduling too few. A Markov chain analysis is used to develop a distribution for the number of customers postponed. Simulation is needed to estimate the expected total overtime given a fixed number of parallel servers. This method permits the estimation of the expected total cost of using the service quota policy and finding the optimal quota. A numerical example is provided.
Get full access to this article
View all access options for this article.
