CS 160: Evaluation
Professor John Canny
Spring 2006 (2/15/2006)

Outline
* User testing process
* Severity and Cost ratings
* Discount usability methods
* Heuristic evaluation
* HE vs. user testing

Iterative Design
* Design: task analysis, contextual inquiry, scenarios, sketching
* Prototype: low-fi paper, DENIM
* Evaluate: low-fi testing, heuristic evaluation

Preparing for a User Test
* Objective: narrow or broad?
* Design the tasks
* Decide whether to use video/audio
* Choose the setting
* Representative users

User Test
Roles:
* Greeter
* Facilitator: help users to think aloud
* Observers: record critical incidents

Critical Incidents
Critical incidents are unusual or interesting events during the study. Most of them are usability problems. They may also be moments when the user:
* got stuck, or
* suddenly understood something, or
* said "that's cool", etc.

The User Test
The actual user test will look something like this:
* Greet the user
* Explain the test
* Get the user's signed consent
* Demo the system
* Run the test (maybe ½ hour)
* Debrief

10 steps to better evaluation
1. Introduce yourself. Some background will help relax the subject.

10 steps (contd.)
2. Describe the purpose of the observation (in general terms), and set the participant at ease:
* "You're helping us by trying out this product in its early stages."
* "If you have trouble with some of the tasks, it's the product's fault, not yours. Don't feel bad; that's exactly what we're looking for."

10 steps (contd.)
3. Tell the participant that it's okay to quit at any time, e.g.:
* "Although I don't know of any reason for this to happen, if you should become uncomfortable or find this test objectionable in any way, you are free to quit at any time."

10 steps (contd.)
4. Talk about the equipment in the room.
* Explain the purpose of each piece of equipment (hardware, software, video camera, microphones, etc.) and how it is used in the test.

10 steps (contd.)
5. Explain how to think aloud.
* Explain why you want participants to think aloud, and demonstrate how to do it. E.g.:
* "We have found that we get a great deal of information from these informal tests if we ask people to think aloud. Would you like me to demonstrate?"

10 steps (contd.)
6. Explain that you cannot provide help.

10 steps (contd.)
7. Describe the tasks and introduce the product.
* Explain what the participant should do and in what order. Give the participant written instructions for the tasks.
* Don't demonstrate what you're trying to test.

10 steps (contd.)
8. Ask if there are any questions before you start; then begin the observation.

10 steps (contd.)
9. Conclude the observation. When the test is over:
* Explain what you were trying to find.
* Answer any remaining questions.
* Discuss any interesting behaviors you would like the participant to explain.

10 steps (contd.)
10. Use the results.
* When you see participants making mistakes, you should attribute the difficulties to faulty product design, not to the participant.

Using the Results
Update task analysis and rethink design:
* Rate severity & ease of fixing problems
* Fix both the severe problems & make the easy fixes
Will thinking aloud give the right answers?
* Not always
* If you ask a question, people will always give an answer, even if it has nothing to do with the facts
* Try to avoid leading questions

Questions?
High-order summary:
* Follow a loose master-apprentice model
* Observe, but help the user describe what they're doing
* Keep the user at ease

Outline
* User testing process
* Severity and Cost ratings
* Discount usability methods
* Heuristic evaluation
* HE vs. user testing

Severity Rating
* Used to allocate resources to fix problems
* Estimate of consequences of that bug
* Combination of:
  * Frequency
  * Impact
  * Persistence (one time or repeating)
* Should be calculated after all evaluations are in
* Should be done independently by all judges

Severity Ratings (cont.)
0 - don't agree that this is a usability problem
1 - cosmetic problem
2 - minor usability problem
3 - major usability problem; important to fix
4 - usability catastrophe; imperative to fix
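Since the slides ask each judge to rate severity independently and combine ratings only after all evaluations are in, a minimal bookkeeping sketch might look like the following (the judge names, problem names, and ratings are invented for illustration; the 0-4 scale is the one above):

```python
# Sketch: combine independent 0-4 severity ratings from several judges.
# Judges, problems, and ratings below are made up for illustration.
from statistics import mean

SCALE = {0: "not a usability problem", 1: "cosmetic", 2: "minor",
         3: "major; important to fix", 4: "catastrophe; imperative to fix"}

ratings = {                      # judge -> {problem: rating}
    "judge_a": {"save-vs-write-file": 3, "tiny-undo-button": 1},
    "judge_b": {"save-vs-write-file": 4, "tiny-undo-button": 2},
    "judge_c": {"save-vs-write-file": 3, "tiny-undo-button": 1},
}

problems = {p for per_judge in ratings.values() for p in per_judge}
for problem in sorted(problems):
    scores = [per_judge[problem] for per_judge in ratings.values() if problem in per_judge]
    avg = mean(scores)
    print(f"{problem}: mean severity {avg:.1f} ({SCALE[round(avg)]})")
```

Averaging is only one reasonable choice; a team could just as well take the median, or discuss disagreements in the debriefing.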

Cost (to repair) ratings
Later in the development process, it will be important to rate the cost (programmer time) of fixing usability problems. A similar rating system is usually used, but the ratings are made by programmers rather than usability experts or designers. With both sets of ratings, the team can optimize the benefit of programmer effort.
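As a rough sketch of that last point (not from the lecture; the problems and numbers are invented), one way to optimize the benefit of programmer effort is to rank problems by severity relative to repair cost and fix from the top down:

```python
# Sketch: rank usability problems by severity relative to repair cost.
# Severity uses the 0-4 scale above; cost is a programmer's 0-4 estimate.
problems = [
    # (description, severity, cost_to_fix)
    ("inconsistent Save / Write file labels", 3, 0),
    ("no undo after delete",                  4, 3),
    ("cluttered settings dialog",             2, 2),
    ("typo in help text",                     1, 0),
]

def priority(item):
    _, severity, cost = item
    return severity / (cost + 1)   # +1 so free fixes don't divide by zero

for desc, severity, cost in sorted(problems, key=priority, reverse=True):
    print(f"severity {severity}, cost {cost}: {desc}")
```

This matches the earlier advice to fix both the severe problems and the easy fixes: cheap fixes and severe problems both rise relative to problems that are neither severe nor cheap.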

Debriefing
* Conduct with evaluators, observers, and development team members.
* Discuss general characteristics of UI.
* Suggest potential improvements to address major usability problems.
* Make it a brainstorming session: little criticism until end of session.

Break
Note: midterm coming up on Monday 2/27

Outline
* User testing process
* Severity and Cost ratings
* Discount usability methods
* Heuristic evaluation
* HE vs. user testing

Discount Usability Engineering
Cheap
* No special labs or equipment needed
* The more careful you are, the better it gets
Fast
* On the order of 1 day to apply
* Standard usability testing may take a week
Easy to use
* Can be taught in 2-4 hours

Cost of user testing
* It's very expensive: you need to schedule (and normally pay) many subjects.
* It takes many hours of the evaluation team's time.
* A user test can easily cost tens of thousands of dollars.

Discount Usability Engineering
Based on:
* Scenarios
* Simplified thinking aloud
* Heuristic evaluation
* Some other methods

Scenarios
* Run through a particular task execution on a particular interface design
* Build just enough of the interface to support that
* A scenario is the simplest possible prototype

Scenarios
* Eliminate parts of the system
* Compromise between horizontal and vertical prototypes

Simplified thinking aloud
* Bring in users
* Give them real tasks on the system
* Ask them to think aloud as in other methods
* No video-taping; rely on notes
* Less careful analysis and fewer testers

Other budget methods
Walkthroughs
* Put yourself in the shoes of a user
* Like a code walkthrough
Action analysis
* GOMS (later)
On-line, remote usability tests
Heuristic evaluation

Outline
* User testing process
* Severity and Cost ratings
* Discount usability methods
* Heuristic evaluation
* HE vs. user testing

Heuristic Evaluation
* Developed by Jakob Nielsen
* Helps find usability problems in a UI design
* A small set (3-5) of evaluators examines the UI
  * Independently check for compliance with usability principles ("heuristics")
  * Different evaluators will find different problems
  * Findings are aggregated afterwards
* Can be done on a working UI or on sketches

Why Multiple Evaluators?
* No single evaluator finds every problem
* Good evaluators find both easy & hard ones

Heuristic Evaluation Process
Evaluators go through the UI several times
* Inspect various dialogue elements
* Compare with the list of heuristics
Heuristics
* Nielsen's heuristics
* Supplementary list of category-specific heuristics
  * Get them by grouping usability problems from previous user tests on similar products
Use violations to redesign/fix problems

Heuristics (original)
H1-1: Simple & natural dialog
H1-2: Speak the users' language
H1-3: Minimize users' memory load
H1-4: Consistency
H1-5: Feedback
H1-6: Clearly marked exits
H1-7: Shortcuts
H1-8: Precise & constructive error messages
H1-9: Prevent errors
H1-10: Help and documentation

Revised Heuristics
* Also developed by Nielsen
* Based on factor analysis of 249 usability problems
* A prioritized, independent set of heuristics

Revised Heuristics
H2-1: Visibility of system status
H2-2: Match system and real world
H2-3: User control and freedom
H2-4: Consistency and standards
H2-5: Error prevention
H2-6: Recognition rather than recall
H2-7: Flexibility and efficiency of use
H2-8: Aesthetic and minimalist design
H2-9: Help users recognize, diagnose and recover from errors
H2-10: Help and documentation

Heuristics (revised set)
H2-1: Visibility of system status
* Keep users informed about what is going on
* 0.1 sec: no special indicators needed
* 1.0 sec: user tends to lose track of the data
* 10 sec: maximum delay if the user is to stay focused on the action
* For longer delays, use percent-done progress bars
[Slide shows a "searching database for matches" progress dialog as an example]
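As a sketch of how these response-time thresholds might be applied in code (the function and the indicator names are hypothetical, not from the lecture), a client could choose its feedback style from an operation's expected duration:

```python
# Sketch: pick a feedback indicator from an operation's expected duration,
# following the 0.1 s / 1 s / 10 s guidelines above. Names are hypothetical.
def feedback_for(expected_seconds: float) -> str:
    if expected_seconds <= 0.1:
        return "none"                    # feels instantaneous; no indicator needed
    if expected_seconds <= 1.0:
        return "busy cursor"             # user notices the delay
    if expected_seconds <= 10.0:
        return "generic busy indicator"  # about the limit for staying focused
    return "percent-done progress bar"   # longer delays: show remaining work

for t in (0.05, 0.5, 5, 60):
    print(f"{t:>6} s -> {feedback_for(t)}")
```

The two middle labels are one reasonable reading of the guideline; the slide itself only prescribes percent-done progress bars for the longest delays.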

Heuristics (cont.)
H2-2: Match between system & real world
* Speak the users' language
* Follow real-world conventions
Bad example: Mac desktop
* Dragging a disk to the trash should delete it, not eject it

Heuristics (cont.)
H2-3: User control & freedom
* Exits for mistaken choices, undo, redo
* Don't force users down fixed paths like the BART ticket machine

Heuristics (cont.)
H2-4: Consistency & standards

Heuristics (cont.)
H2-5: Error prevention
H2-6: Recognition rather than recall
* Make objects, actions, options, & directions visible or easily retrievable
Example: MS Web Publishing Wizard
* Before dialing, asks for id & password
* When connecting, asks again for id & password

Heuristics (cont.)
H2-7: Flexibility and efficiency of use
* Accelerators for experts (e.g., gestures, keyboard shortcuts)
* Allow users to tailor frequent actions (e.g., macros)
[Slide shows an Edit menu with Cut (ctrl-x), Copy (ctrl-c), Paste (ctrl-v) as examples of keyboard shortcuts]

Heuristics (cont.)
H2-8: Aesthetic and minimalist design
* No irrelevant information in dialogues

Heuristics (cont.)
H2-9: Help users recognize, diagnose, and recover from errors
* Error messages in plain language
* Precisely indicate the problem
* Constructively suggest a solution

Heuristics (cont.)
H2-10: Help and documentation
* Easy to search
* Focused on the user's task
* List concrete steps to carry out
* Not too large

Phases of Heuristic Evaluation
1) Pre-evaluation training
* Give evaluators needed domain knowledge and information on the scenario
2) Evaluation
* Individuals evaluate and then aggregate results
3) Severity rating
* Can do this first individually and then as a group
4) Debriefing
* Discuss the outcome with the design team

How to Perform Evaluation
At least two passes for each evaluator
* First to get a feel for the flow and scope of the system
* Second to focus on specific elements
If the system is walk-up-and-use or the evaluators are domain experts, no assistance is needed
* Otherwise you might supply evaluators with scenarios
Each evaluator produces a list of problems
* Explain why, with reference to a heuristic or other information
* Be specific and list each problem separately

Examples
Can't copy info from one window to another
* Violates "Minimize users' memory load" (H1-3)
* Fix: allow copying
Typography uses a mix of upper/lower case formats and fonts
* Violates "Consistency and standards" (H2-4)
* Slows users down
* Probably wouldn't be found by user testing
* Fix: pick a single format for the entire interface

Severity Ratings Example
1. [H1-4 Consistency] [Severity 3] [Fix 0]
The interface used the string "Save" on the first screen for saving the user's file, but used the string "Write file" on the second screen. Users may be confused by this different terminology for the same function.
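A minimal sketch of a finding record that prints in the same "[heuristic] [Severity] [Fix]" style (the class and field names are made up for illustration):

```python
# Sketch: one heuristic-evaluation finding, printed in the report style above.
from dataclasses import dataclass

@dataclass
class Finding:
    heuristic: str     # e.g. "H1-4 Consistency"
    severity: int      # 0-4 severity scale from the earlier slide
    fix_cost: int      # 0-4 cost-to-repair estimate
    description: str

    def report_line(self, number: int) -> str:
        return (f"{number}. [{self.heuristic}] [Severity {self.severity}] "
                f"[Fix {self.fix_cost}] {self.description}")

finding = Finding("H1-4 Consistency", 3, 0,
                  'Uses "Save" on the first screen but "Write file" on the second.')
print(finding.report_line(1))
```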

Questions?
Summary: HE is a discount usability method
* Based on common usability problems across many designs
* Have evaluators go through the UI twice
* Ask them to see if it complies with heuristics
* Have evaluators independently rate severity

Outline
* User testing process
* Severity and Cost ratings
* Discount usability methods
* Heuristic evaluation
* HE vs. user testing

HE vs. User Testing
HE is much faster
* 1-2 hours per evaluator vs. days-weeks
HE doesn't require interpreting users' actions
User testing is far more accurate (by definition)
* Takes into account actual users and tasks
* HE may miss problems & find false positives
Good to alternate between HE & user testing
* They find different problems
* Don't waste participants

Results of Using HE
Discount: benefit-cost ratio of 48 [Nielsen94]
* Cost was $10,500 for a benefit of $500,000 ($500,000 / $10,500 ≈ 48)
* Value of each problem found: ~$15K (Nielsen & Landauer)
Tends to find more of the high-severity problems

Number of Evaluators
A single evaluator achieves poor results
* Only finds 35% of usability problems
* 5 evaluators find ~75% of usability problems
* Why not more evaluators? 10? 20?
  + Adding evaluators costs more
  + Many more evaluators won't find many more problems
But it always depends on the market for the product:
* Popular products -> high support cost for even small bugs

Decreasing Returns
[Slide shows two curves against the number of evaluators: proportion of problems found, and benefits/cost ratio]
Caveat: graphs for a specific example
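The shape of these curves is usually explained with Nielsen and Landauer's model, in which each evaluator independently finds a fraction lam of all problems, so i evaluators together find about 1 - (1 - lam)^i of them. The sketch below uses assumed values of lam chosen to roughly bracket the figures quoted on the previous slide; the 35% and ~75% numbers there are empirical averages across projects, not the output of a single curve:

```python
# Sketch: diminishing returns from adding evaluators, following the
# Nielsen & Landauer model  found(i) = 1 - (1 - lam) ** i,
# where lam is the fraction of problems a single evaluator finds.
# The two lam values are assumptions, not numbers from the lecture.
for lam in (0.24, 0.35):
    print(f"per-evaluator detection rate lam = {lam}")
    for i in (1, 2, 3, 5, 10, 15):
        found = 1 - (1 - lam) ** i
        print(f"  {i:>2} evaluators: ~{found:.0%} of problems found")
```

Because the cost of an evaluation grows roughly linearly with the number of evaluators while the problems found saturate, the benefit/cost ratio peaks at a small number of evaluators, which is where the 3-5 evaluator rule of thumb comes from.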

Questions?
Summary: HE vs. user testing
* User testing is more accurate
* For HE, combine findings from 3 to 5 evaluators
* Discuss problems with the design team
* Alternate with user testing