Abstract
This study examined whether the average usability score for a series of tasks matched the usability score obtained when usability was measured only once, after all the tasks had been completed. Fifty participants completed a set of tasks for five websites and fourteen mock voting ballots. Subjective usability was assessed with the System Usability Scale (SUS). Participants completed the SUS either after each task (five or fourteen SUS administrations, respectively) or after completing the entire set of tasks (one SUS administration). The results show that the average SUS scores for the task-level assessments were significantly higher than the SUS scores for the test-level assessments, and results were similar for the website and ballot conditions. Task-level SUS scores for the websites (M = 65.5) were significantly higher than the test-level SUS scores (M = 42.8), p < 0.0001. Similar results were observed in the ballot condition, where task-level usability assessments were higher (M = 59.5) than test-level assessments (M = 38.5), p < 0.0001. Practitioners and those interpreting SUS scores need to be aware of how these methodological differences can lead to different assessment metrics.
