Evaluation studies: From controlled to natural settings

Similar documents
Evaluation studies: From controlled to natural settings

Evaluation studies: From controlled to natural settings

Introducing Evaluation

Introducing Evaluation

Usability Testing. November 9, 2016

MMI 2. Tutorials. Winter Term 2017/18. LMU München - LFE Medieninformatik. Prof. Andreas Butz Renate Häuslschmid Christina Schneegaß

Overview of Today s Lecture. Analytical Evaluation / Usability Testing. ex: find a book at Amazon.ca via search

Learnability of software

Concepts of Usability. Usability Testing. Usability concept ISO/IS What is context? What is context? What is usability? How to measure it?

Usable Privacy and Security Introduction to HCI Methods January 19, 2006 Jason Hong Notes By: Kami Vaniea

EVALUATION OF PROTOTYPES USABILITY TESTING

User Testing & Automated Evaluation. Product Hall of Shame! User Testing & Automated Evaluation. Visual Design Review. Outline. Visual design review

..in a nutshell. credit: Chris Hundhausen Associate Professor, EECS Director, HELP Lab

Goals of Usability Evaluation

Foundation Level Syllabus Usability Tester Sample Exam

Software Development and Usability Testing

Course Outline. Department of Computing Science Faculty of Science. COMP 3450 Human Computer Interaction Design (3,1,0) Fall 2015

Usability Testing. November 14, 2016

Chapter 4. Evaluating Interface Designs

Usability Testing, Cont.

EVALUATION OF PROTOTYPES USABILITY TESTING

evaluation techniques goals of evaluation evaluation by experts cisc3650 human-computer interaction spring 2012 lecture # II.1

HCI Research Methods

Introducing Evaluation

How to Choose the Right UX Methods For Your Project

03 Usability Engineering

Usability Testing! Hall of Fame! Usability Testing!

Evaluation and Testing. Andreas Butz, LMU Media Informatics slides partially taken from MMI class

USABILITY EVALUATIONS OF Health Information Technology

Stream Features Application Usability Test Report

USABILITY REPORT A REPORT OF USABILITY FINDINGS FOR OFF THE BEATEN PATH WEBSITE

THE TITLE OF MY THESIS GOES HERE: USE ALL CAPS AND PUT THE SUBTITLE ON THE SECOND LINE

Usability and User Experience of Case Study Prof. Dr. Wiebke Möhring

Introduction What is Usability and Usability Testing?

User-Centered Analysis & Design

Problem and Solution Overview: An elegant task management solution, that saves busy people time.

Nektarios Kostaras, Mixalis Xenos. Hellenic Open University, School of Sciences & Technology, Patras, Greece

Interaction Design. Chapter 7 (June 22nd, 2011, 9am-12pm): Evaluation and Testing

MTAT : Software Testing

The Pluralistic Usability Walk-Through Method S. Riihiaho Helsinki University of Technology P.O. Box 5400, FIN HUT

Evaluation techniques 1

Evaluation techniques 1

Interfaces Homme-Machine

Product Development for Medical, Life Sciences, and Consumer Health

Cascading versus Indexed Menu Design

Evaluation in Information Visualization. An Introduction to Information Visualization Techniques for Exploring Large Database. Jing Yang Fall 2005

evision Review Project - Engagement Simon McLean, Head of Web & IT Support Information & Data Services.

UX Research. research methods overview. UX Research research methods overview. KP Ludwig John

Integrating User Evaluation into Software Development Environments

Age-Related Differences in Subjective Ratings of Hierarchical Information

Interaction Design. Recap Session (Previous lecture April 21, 2016, 9am-12pm): Process Models, Elements and Usability

IBM MANY EYES USABILITY STUDY

The Effects of Semantic Grouping on Visual Search

Interaction Design. Chapter 3 (May 11, 2017, 9am-12pm): Approaches to IxD Recap Session (Previous lecture): Process Models, Elements and Usability

User Experience Design I (Interaction Design)

Empirical Studies on the Security and Usability Impact of Immutability

8.1 Goals of Evaluation 8.2 Analytic Evaluation 8.3 Empirical Evaluation 8.4 Comparing and Choosing Evaluation Techniques

SFU CMPT week 11

CS 315 Intro to Human Computer Interaction (HCI)

Usability and Evaluation of BCKOnline PORTAL Prototype Early breast cancer Recurrent Advanced

Improving Government Websites and Surveys with Usability Testing

Cognitive walkthrough: Analytical evaluation without users. 9 lecture Dr. Kristina Lapin

HUMAN COMPUTER INTERACTION

Page 1. Ideas to windows. Lecture 7: Prototyping & Evaluation. Levels of prototyping. Progressive refinement

EVALUATING DIFFERENT TOUCH-BASED INTERACTION TECHNIQUES IN A PUBLIC INFORMATION KIOSK

8.1 Goals of Evaluation 8.2 Analytic Evaluation 8.3 Empirical Evaluation 8.4 Comparing and Choosing Evaluation Techniques

Experimental Validation of TranScribe Prototype Design

CS3205 HCI IN SOFTWARE DEVELOPMENT PROTOTYPING STRATEGIES. Tom Horton. * Material from: Floryan (UVa) Klemmer (UCSD, was at Stanford)

Team : Let s Do This CS147 Assignment 7 (Low-fi Prototype) Report

CUE-10: Moderation Page 1. Comparative Usability Evaluation 10. Moderation. Observing usability test moderators

Testing is a very big and important topic when it comes to software development. Testing has a number of aspects that need to be considered.

Interaction Design. Ruben Kruiper

Design and User Evaluation of Augmented-Reality Interfaces Topic Usability Evaluation Methods. Payel Bandyopadhyay MSc Student University of Helsinki

User Centered Design in Multimedia Broadcasting

Evaluation and Design Issues of Nordic DC Metadata Creation Tool

MIT GSL week 4 Wednesday. User Interfaces II

Foundation Level Syllabus Usability Tester Sample Exam Answers

USABILITY IN HEALTHCARE IT: DATA COLLECTION AND ANALYSIS APPROACHES

Usability Testing Methodology for the 2017 Economic Census Web Instrument

Usability of an adjusted IndoorTubes map design for indoor wayfinding on mobile devices

Usefulness of Nonverbal Cues from Participants in Usability Testing Sessions

Integrating Usability Design and Evaluation: Training Novice Evaluators in Usability Testing

Usability Report for Online Writing Portfolio

Detection of Web-Site Usability Problems: Empirical Comparison of Two Testing Methods

CPSC 444 Project Milestone III: Prototyping & Experiment Design Feb 6, 2018

Visual Appeal vs. Usability: Which One Influences User Perceptions of a Website More?

CSCI 3160: User Interface Design

User-centered Instructional Design

Eyetracking Study: Kent State University Library. Janie Ralston

Human-Computer Interaction

Evaluation. Why, What, Where, When to Evaluate Evaluation Types Evaluation Methods

UX Research. research methods overview. UX Research research methods overview. KP Ludwig John

Are Senior Smartphones really better?

THE EFFECT OF SCENT ON USER RECALL OF WEB SITE NAVIGATION

Implementing Games User Research Processes Throughout Development: Beyond Playtesting

EVALUATING USER INTERFACES. 15 & 17 January 2014

CS 160: Evaluation. Outline. Outline. Iterative Design. Preparing for a User Test. User Test

Evaluation Continued. Lecture 7 June 20, 2006

Preliminary Evidence for Top-down and Bottom-up Processes in Web Search Navigation

Usability. HCI - Human Computer Interaction

Transcription:

Evaluation studies: From controlled to natural settings Lecture 14 Kristina Lapin Vilnius University Slides adapted from www.id-book.com 1

The aims: Explain how to do usability testing Outline the basics of experimental design Describe how to do field studies www.id-book.com 2

Motivation http://www.asktog.com/columns/042butterlyballot.html 3

Usability testing Involves recording performance of typical users doing typical tasks. Controlled settings. Users are observed and timed. Data is recorded on video & key presses are logged. The data is used to calculate performance times, and to identify & explain errors. User satisfaction is evaluated using questionnaires & interviews. Field observations may be used to provide contextual understanding. www.id-book.com 4

www.id-book.com Usability testing Goals & questions focus on how well users perform tasks with the product. Comparison of products or prototypes common. Focus is on time to complete task & number & type of errors. Data collected by video & interaction logging. Testing is central. User satisfaction questionnaires & interviews provide data about users opinions. 5

Usability lab with observers watching a user & assistant www.id-book.com 6

Usability lab 1-3 video cameras, microphons Camera remote control 7

Testing a paper prototype http://www.iua.upf.es/~jblat/material/hci/2008/paper_prototyping/p21-rettig.pdf 8

Use case: the testing of NetMeeting, an early videoconferencing product Chris Nodder, Gayna Williams, Deborah Dubrow (1999) Evaluating the usability of an evolving collaborative product - changes in user type, tasks and evaluation methods over time. Proceedings of the international ACM SIGGROUP conference on Supporting group work, 1999. http://www.cis.gvsu.edu/~tao/cs623/p150-nodder.pdf 9

Use case: the testing of NetMeeting, an early videoconferencing product Chris Nodder, Gayna Williams, Deborah Dubrow (1999) Evaluating the usability of an evolving collaborative product - changes in user type, tasks and evaluation methods over time. Proceedings of the international ACM SIGGROUP conference on Supporting group work, 1999. http://www.cis.gvsu.edu/~tao/cs623/p150- nodder.pdf 10

Portable equipment for use in the field Tracksys portable lab include: camera with direct plug to PC, software GoToMeeting, remote control system, new eye-tracking devices www.id-book.com 11

Mobile head-mounted eye tracker www.id-book.com Picture courtesy of SensoMotoric Instruments (SMI), copyright 2010 12

www.id-book.com 13

Remote usability testing Handreds of users can be tested Participation in the natual context from geographically spread locations No human moderation needed http://www.slideshare.net/userzoom/case-study-lab-online-usability-testing- 4041695?from=ss_embed 14

Testing conditions Usability lab or other controlled space. Emphasis on: selecting representative users; developing representative tasks. 5-10 users typically selected. Tasks usually last no more than 30 minutes. The test conditions should be the same for every participant. Informed consent form explains procedures and deals with ethical issues. www.id-book.com 15

Some type of data Time to complete a task. Time to complete a task after a specified. time away from the product. Number and type of errors per task. Number of errors per unit of time. Number of navigations to online help or manuals. Number of users making a particular error. Number of users completing task successfully. www.id-book.com 16

Usability engineering orientation Aim is improvement with each version. Current level of performance. Minimum acceptable level of performance. Target level of performance. www.id-book.com 17

How many participants is enough for user testing? The number is a practical issue. Depends on: schedule for testing; availability of participants; cost of running tests. Typically 5-10 participants. Some experts argue that testing should continue until no new insights are gained. www.id-book.com 18

User Testing in the Design Process Empirical evaluation can happen at every stage Formative evaluation Happens throughout the design process Can evaluate scenarios, sketches, models, prototypes Summative evaluation ` Typically happens at the end Assesses system and interface design quality, i.e., how well have we done? www.id-book.com 19

Methods User Testing Design and implement scenario or prototype Record user behaviour Typical usage, or critical incidents Keystroke recording Thinking aloud protocols Videotape protocols Interviews for subjective impressions Analyze user behaviour Roles Understanding user methods Understanding user problems Discovering user thought processes 20

Think aloud protocol In this approach the user says out what she is thinking while she is carrying out a task or doing some problem solving. User s thinking is recorded as an audio record. Using this protocol we collect the reactions of the users. This is quite helpful because many aspects of human behavior and mind is not predictable by engineering models. 21

Carrying out the study User Testing Let users know that complete anonymity will be preserved Let them know that they may quit at any time Stress that the system is being tested, not the participant Indicate that you are only interested in their thoughts relevant to the system Demonstrate the thinking-aloud method by acting it out for a simple task, e.g., figuring out how to load a stapler Hand out instructions for each part of the study individually, not all at once Maintain a relaxed environment free of interruptions Occasionally encourage users to talk if they grow silent If users ask questions, try to get them to talk (e.g., What do you think is going on? and follow predefined rules on when to help or interrupt to help. Debrief each user after the experiment Improving the study The pilot study should debug the study. This minimize changes during the study, allowing quantitative data analysis. But improvements may be warranted. Experimenters role can be improved Tasks given to participant can be improved Written materials can be improved 22

Use case: name 3 features for each that can be tested by usability testing ipad 23

Testing goals Ar user expectations different for the ipad compared with iphone? Previous study of the iphone: people preferred using apps than browsing the web because the latter is slow and cumbersome Whether it is worth developing specific websites for the ipad (like for smartphones)? Or the desktop versions are acceptable? http://www.nngroup.com/reports/mobile/ipad/ 24

Participants Seven participants: All experienced iphone users who had owned iphone for at least 3 months had used a variety of apps. Age: 20-60 Occupations: food server, legal, medical staff, retired driver, homemaker, accounter 3 males, 4 females http://www.nngroup.com/reports/mobile/ipad/ 25

Tasks In the beginning: Ad-hoc tasks. Examples of used apps: http://www.nngroup.com/reports/mobile/ipad/ 26

Specific tasks http://www.nngroup.com/reports/mobile/ipad/ 27

The equipment Mobile usability kit http://www.nngroup.com/reports/mobile/ipad/ Procedure Camera recorded interactions and gestures using ipad Webcam xpressions of participants faces and think-aloud commentary. Observers watched the video rather than observing directly 28

The findings The participants were able to interact with websites on ipad but it was not optimal Links too small to tap on reliably The fonts sometimes dificult to read Usability problems were classified to interaction design principles: Mental models, navigation, the quality of images, touschscreen problems, lack of affordances, getting lost in the application, working memory, and the received feedback. http://www.nngroup.com/reports/mobile/ipad/ 29

www.id-book.com 30

31

Lack of Affordances: Where Can I Tap? www.id-book.com 32

Getting Lost in an Application 33

Experiments Predict the relationship between two or more variables. A/B testing Independent variable is manipulated by the researcher. Dependent variable depends on the independent variable. Typical experimental designs have one or two independent variable. Validated statistically & replicable. www.id-book.com 34

Hypotheses testing A hypothesis tests the effect of the independent variable on the dependent variable A null hypothesis No difference between dependent variables Alternative hypothesis There is a difference www.id-book.com 35

Experimental designs Different participants - single group of participants is allocated randomly to the experimental conditions. Same participants - all participants appear in both conditions. Matched participants - participants are matched in pairs, e.g., based on expertise, gender, etc. www.id-book.com 36

Example: structure in web page design The goals of experiment was to find the optimal depth versus breadth structure of hyperlinks Condition 1: 8 x 8 x 8 Condition 2: 16 x 32 Condition 3: 32 x 16 A same-participant experiment, random tasks Results C1: reaction time = 58 sek., SD = 23 C2: reaction time = 36 sek, SD=16 C3: reaction time = 46 sek, SD=26 Conclusion: breadth is preferable to depth. Larson, Czerwinski 1998 37

Different, same, matched participant design Design Advantages Disadvantages Different No order effects Many subjects & individual differences a problem Same Matched Few individuals, no individual differences Same as different participants but individual differences reduced Counter-balancing needed because of ordering effects Cannot be sure of perfect matching on all differences 38

Statistics: t-tests The measure results are used to compute the means and standard deviations (SD) SD statistical measure of the spread or variabilityarount the mean. T-test tests a significance of the diference beteen the means for the two condidition www.id-book.com 39

T-test (MS Excel) 40

T-test (MS Excel) Null hypothesis Experts are faster than novices 41

Usability testing & research Usability testing Improve products Few participants Results inform design Usually not completely replicable Conditions controlled as much as possible Procedure planned Results reported to developers Experiments for research Discover knowledge Many participants Results validated statistically Must be replicable Strongly controlled conditions Experimental design Scientific report to scientific community www.id-book.com 42

Field studies Field studies are done in natural settings. in the wild is a term for prototypes being used freely in natural settings. Aim to understand what users do naturally and how technology impacts them. Field studies are used in product design to: - identify opportunities for new technology; - determine design requirements; - decide how best to introduce new technology; - evaluate technology in use. www.id-book.com 43

Data collection & analysis Observation & interviews Notes, pictures, recordings Video Logging Analyzes Categorized Categories can be provided by theory Grounded theory Activity theory www.id-book.com 44

Data presentation The aim is to show how the products are being appropriated and integrated into their surroundings. Typical presentation forms include: vignettes, excerpts, critical incidents, patterns, and narratives. www.id-book.com 45

UbiFit Garden: An in the wild study www.id-book.com 46

Key points Usability testing is done in controlled conditions. Usability testing is an adapted form of experimentation. Experiments aim to test hypotheses by manipulating certain variables while keeping others constant. The experimenter controls the independent variable(s) but not the dependent variable(s). There are three types of experimental design: differentparticipants, same- participants, & matched participants. Field studies are done in natural environments. In the wild is a recent term for studies in which a prototype is freely used in a natural setting. Typically observation and interviews are used to collect field studies data. Data is usually presented as anecdotes, excerpts, critical incidents, patterns and narratives. www.id-book.com 47

References Rogers, Sharp, Preece (2011). Interaction design: Beyond Human Computer Interaction. Wiley Nielsen Norman Group Reports. Usability of ipad Apps and Websites: 2 Reports With Research Findings Larson, K., M. Czerwinski (1998) Web page design: implications of memory, structure and scent for information retrieval. In Proceedings of CHI 98, pp. 25-32 S. Consolvo, D. W. McDonald, T. Toscos, M. Chen, J.E. Froehlich, B. Harrison, P. Klasnja, A. LaMarca, L. LeGrand, R. Libby, I. Smith & J. A. Landay. Activity Sensing in the Wild: A Field Trial of UbiFit Garden, Proceedings of the Conference on Human Factors in Computing Systems: CHI 08, Florence, Italy, (2008), pp.1797-806. www.id-book.com 48