How Many Participants: A Simple Means for Concurrent Monitoring

Abstract

User trialling, particularly in a design context with tight time and budget constraints, begs the question what makes up a sufficient number of participants for observing proportionally enough of the phenomena at issue; e.g. usability problems. Statistical approaches such as estimating the number of species do not seem to be applicable since the required, mathematically ‘neat’ sampling conditions do not match the gathering of observations with consecutively involved participants in a user trial. Therefore, we resorted to the well-known binomial model, precipitating (without sampling restrictions) an anticipated increase in overlap, i.e., a rising proportion of shared observations between participants in an ongoing trial, or, in other words, diminishing returns in terms of unique observations. In Ergonomics/Human Factors (E/HF) literature, the application of the binomial model has given rise to retrospective assessments involving the number of participants that would have been enough, by hindsight, to discover e.g. 80% of all usability problems, which, by reference to case studies, eventually gave rise to the rule of thumb that about five participants are sufficient. The present paper summarises and extends two earlier papers in providing a simple statistic in order to monitor concurrently the proportion of information gained so far in a trial. Careful scrutiny is given to the origin of estimates being biased downward, that is: the underestimation of the asymptotic number of usability problems to be discovered given the observations after a number of participants. On the basis of both hypothetical examples and empirical studies it is shown that the ‘five-is-enough’ rule of thumb may hold, but may equally well be much too optimistic. With the proposed statistic for concurrent monitoring, it can arguably be decided on whether or not to continue a trial.

Get full access to this article

View all access options for this article.

References

Arisz

Kanis

(1999). Towards concurrent monitoring of the number of subjects in user trials. Contemporary Ergonomics, 417–412.

Arisz

Kanis

Rooden

M.J.

(2000). How many participants: A simple statistic with some limitation. Contemporary Ergonomics (in press).

Bunge

Fitzpatrick

(1993). Estimating the Number of Species: A Review. Journal of the American Statistical Association, 88, 364–373.

Chao

Shen-Ming

(1992). Estimating the Number of Classes via Sample Coverage. Journal of the American Statistical Association, 87, 210–217.

Kanis

Weegels

M.F.

Green

W.S.

(1999). Scientific research in a design context. Contemporary Ergonomics (pp.374–378).

Kanis

Green

W.S.

(2000) Research for usage oriented design: quantitative? qualitative? This procceedings.

Lewis

J.R.

(1994). Sample Sizes for Usability Studies: Additional Considerations. Human Factors, 36, 368–378.

Nielsen

Landauer

T.K.

(1993). A Mathematical Model of the Finding of Usability Problems. In Interchi 1993 (pp. 206–213).

Nielsen

(1994). Estimating the number of subjects needed for a thinking aloud test. International Journal of Computer Studies, 41, 385–397.

10.

Rooden

M.J.

Green

W.S.

Kanis

(1999). Difficulties in usage of a coffee maker predicted on the basis of design models. Proceedings HFES Annual Meeting (pp. 476–480).

11.

Virzi

R.A.

(1992). Refining the Test Phase of Usability Evaluation: How many subjects is enough ? Human Factors, 34, 457–468.