Usability Testing: How to statistically compare the usability of different designs
Usability testing can be used to quantify usability. This can allow you to compare a site’s statistical performance before and after a re-design, in order to find out if it has improved. This method is not perfect, but can be very useful.
Introduction – a useful application of usability testing
Usability testing is mostly used to identify and explore usability issues on a site. This normally employs the ‘talkaloud’ method of usability testing, where a user is asked to complete a task and then verbalise their thoughts during the usability testing. Such a use of usability testing is very valuable and almost all significant web development projects will conduct this kind of usability testing before launch.
Usability testing can, however, be used in a different way. The ‘benchmarking’ method of usability testing allows you to get a statistical measure of a site’s usability during a usability testing session. This is a very powerful idea, because it allows us to use usability testing sessions to statistically compare the usability of different designs and/or sites.
How it works – using usability testing to get a measure of usability
The generally-accepted definition of usability is that provided by ISO 9241, which states that usability is “the extent to which a product can be used by specified users to achieve specified goals with effectiveness, efficiency, and satisfaction in a specified context of use”. So, in order to use usability testing to get a quantification of a site’s usability, we need to find a way of statistically measuring effectiveness, efficiency and satisfaction.
In the ‘benchmarking’ usability testing we have done for some of our clients, we have measured these on a per-task basis. Here are the ways we have found to statistically measure these parameters during a usability testing session:
- Effectiveness – This is usually the simplest one to measure during usability testing. For most tasks, you will be able to use a pass/fail measure of whether the user has managed to complete the task successfully. The percentage of users that manage to complete a task successfully therefore becomes a measure of the design’s effectiveness.
- Efficiency – A useful way to measure this characteristic can be timing the user on a task during the usability testing. The average task completion time of all the users participating in your usability testing therefore becomes a measure of efficiency.
- Satisfaction – The easiest way of measuring users’ satisfaction is normally in the form of a questionnaire. This questionnaire is typically administered after the usability testing session has ended and asks the user to rank their experience of the site (along a number of parameters) from 1 – 5 (1=poor, 5= good). The questions can relate to the entire site experience, or ask specific questions about each task.
Using these simple techniques you can get data which tells you (in a quantified way) how usable the site was found to be during usability testing. It should also be noted that during ‘benchmarking’ usability testing sessions, users are not encouraged to verbalise their thoughts, as this interrupts their thought-process and slows down task completion.
Conclusion – a good use of usability testing
Using usability testing to quantify a site’s usability is a very good idea. If you measure your site’s usability in this way before and after a re-design, it will help you ascertain whether the re-design was a success. It will not give you a 100% guarantee that your re-designed site will perform better in the real world, but it’s a good indication. After all, it’s normally a good thing when more people can complete tasks, in a shorter amount of time and are happier doing it!
This article was written by Tim Fidgeon, of Spotless Interactive – a leading usability consultancy offering usability testing