Abstract
A queuing model analysis is given for the performance evaluation of a dual scalar processor (DSP), which is composed of two scalar units and one vector unit. The performance evaluation is for three different processor models with equivalent hardware capacity: homogeneous DSP (DSP1), heterogeneous DSP (DSP2), and multiprocessor (MP), with particular attention given to the case of short vector lengths. It is found that the performance of DSP1 is preferable to that of MP for most actual workloads, and in particular for the workload of the Japan Atomic Energy Research Institute. The effect of the dual scalar capacity, that is, of attaching a secondary scalar unit, is also investigated. It is shown that the throughput of DSP1 is about 1.8 times that of the uniprocessor for a vectorization ratio of 90%.
