GOMS

GOMS is a specialized human information processor model for human-computer interaction that describes a user's cognitive structure in terms of four components. In the 1983 book The Psychology of Human Computer Interaction, [1] Stuart K. Card, Thomas P. Moran and Allen Newell introduce these as "a set of Goals, a set of Operators, a set of Methods for achieving the goals, and a set of Selection rules for choosing among competing methods for goals." [1] GOMS is widely used by usability specialists working for computer system designers because it produces quantitative and qualitative predictions of how people will use a proposed system.

Overview

Figure: The concepts behind a GOMS model and their relationships.

A GOMS model is composed of methods that are used to achieve specific goals. These methods are then composed of operators at the lowest level. The operators are specific steps that a user performs and are assigned a specific execution time. If a goal can be achieved by more than one method, then selection rules are used to determine the method.
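The relationships described above can be sketched as a small data model. This is an illustrative sketch rather than a notation from the GOMS literature: the class names, the `hands_on_keyboard` context key, the operator names and the operator times are all assumptions chosen for the example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Operator:
    name: str
    seconds: float  # assumed execution time, for illustration only


@dataclass
class Method:
    name: str
    operators: List[Operator]

    def execution_time(self) -> float:
        # A method's time is the sum of its operators' times.
        return sum(op.seconds for op in self.operators)


@dataclass
class Goal:
    name: str
    methods: List[Method]
    # Selection rule: maps the current context to the name of a method.
    selection_rule: Optional[Callable[[Dict[str, bool]], str]] = None

    def select(self, context: Dict[str, bool]) -> Method:
        # With a single method (or no rule) there is nothing to choose.
        if len(self.methods) == 1 or self.selection_rule is None:
            return self.methods[0]
        wanted = self.selection_rule(context)
        return next(m for m in self.methods if m.name == wanted)


# Hypothetical goal with two competing methods.
use_keyboard = Method("USE-KEYBOARD", [Operator("PRESS-SHORTCUT", 0.4)])
use_mouse = Method(
    "USE-MOUSE",
    [Operator("POINT-TO-MENU-ITEM", 1.1), Operator("CLICK", 0.2)],
)
copy_goal = Goal(
    "ISSUE-COPY-COMMAND",
    [use_keyboard, use_mouse],
    lambda ctx: "USE-KEYBOARD" if ctx["hands_on_keyboard"] else "USE-MOUSE",
)

chosen = copy_goal.select({"hands_on_keyboard": False})
print(chosen.name, round(chosen.execution_time(), 2))
```

Keeping the selection rule as a plain function of the context mirrors the idea that rules are only consulted when more than one method can achieve a goal.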

Several GOMS variations exist, each allowing different aspects of an interface to be studied and predicted accurately. The definitions of the major concepts are the same across all variants, but the designer or analyst has some flexibility in how each entity is defined. For instance, an operator in one method may be a goal in a different method. The level of granularity is adjusted to capture what the particular evaluator is examining. For a simple applied example see CMN-GOMS.

Qualification

Advantages

The GOMS approach to user modeling has strengths and weaknesses. While it is not necessarily the most accurate method for measuring human-computer interaction, it does make all procedural knowledge visible. With GOMS, an analyst can estimate the time a particular interaction will take quickly and easily. This is only possible if the average Methods-Time Measurement data for each specific task has previously been measured experimentally to a high degree of accuracy. [2]

Disadvantages

GOMS applies only to skilled users. It does not work for beginners or intermediate users, because their errors can alter the data. [3] The model also does not apply to learning the system, or to a user returning to the system after a long period of not using it. [3] Another major disadvantage is that GOMS does not account for errors, even though skilled users also make them. [3] Mental workload is not addressed in the model, making it an unpredictable variable; the same applies to fatigue. [3] GOMS addresses only the usability of a task on a system, not the system's functionality. [3]

User personalities, habits and physical restrictions (for example, disabilities) are not accounted for in any of the GOMS models; all users are assumed to be exactly the same. Some extensions of GOMS have been developed that make it possible to formulate GOMS models describing the interaction behavior of disabled users. [4] [5] [6]

Variations

There are five main GOMS models: the Keystroke-Level Model, CMN-GOMS, NGOMSL, CPM-GOMS, and SGOMS. The models differ in complexity and in the activities they cover.

KLM

The Keystroke-Level Model (KLM) is the first and simplest GOMS technique created by Stuart Card, Thomas P. Moran and Allen Newell. [6] The execution time for a task is estimated by listing the sequence of operators and then totaling the execution times of the individual operators. With KLM, the analyst must specify the method used to accomplish each particular task instance. Furthermore, the specified methods are limited to sequences containing only keystroke-level primitive operators.

The biggest difference between GOMS and KLM is how time is assigned to cognitive and perceptual operators in execution time predictions. Another major difference is that the goal hierarchy is explicit in GOMS but implicit in KLM. The nature of unobservable operators is a third important difference. KLM has a single M operator that precedes each cognitive unit of action, whereas GOMS assigns no time to such cognitive overhead. Both models do include M-like operators for substantial, time-consuming mental actions such as locating information on the screen and verifying entries. Both methods assign roughly the same time to unobservable perceptual and cognitive activities, but they make different assumptions about these operators and so distribute the time in different ways. [2] [7]

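KLM's execution part is described in terms of four physical-motor operators:

  • K: keystroking (pressing a key or button)
  • P: pointing with a mouse to a target
  • H: homing the hands on the keyboard or another device
  • D: drawing a line segment on a grid

In addition, there is one mental operator M, which stands for the time a user needs to mentally prepare for an action, and a system response operator R, during which the user has to wait for the system. Execution time is the sum of the times spent executing the different operator types: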

T_execute = T_K + T_P + T_H + T_D + T_M + T_R [1]

Each of these operators has an estimate of execution time, either a single value or a parameterized estimate.
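The summation can be sketched in a few lines. The operator times below are commonly cited KLM estimates (about 0.28 s per keystroke for an average typist, 1.1 s for pointing, 0.4 s for homing, 1.35 s for mental preparation); they, the `klm_time` helper and the example operator string are assumptions for illustration, not values stated in this article. Drawing and system response times depend on the task and the system, so the sketch takes them as explicit arguments.

```python
# Commonly cited KLM estimates in seconds; assumed here for illustration.
KLM_SECONDS = {"K": 0.28, "P": 1.10, "H": 0.40, "M": 1.35}


def klm_time(sequence: str, drawing: float = 0.0, response: float = 0.0) -> float:
    """Sum operator times for a sequence of K, P, H and M operators.

    `drawing` and `response` are the total drawing and system response
    times, supplied by the analyst because they are task- and
    system-dependent.
    """
    total = sum(KLM_SECONDS[op] for op in sequence)
    return total + drawing + response


# Hypothetical task: prepare mentally, reach for the mouse,
# point at a button, press the mouse button.
print(round(klm_time("MHPK"), 2))
```

A parameterized estimate, such as a keystroke time that depends on typing skill, could be modeled by passing a different table instead of the module-level constant.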

Touch Level Model (TLM)

GOMS and its variants were designed for keyboard interfaces, but touchscreen interfaces are now omnipresent. An addition to the GOMS family that covers them, together with updates to the existing KLM operators, is called the Touch Level Model (TLM). Andrew D. Rice and Jonathan W. Lartigue propose this model to describe human task performance on a constrained-input touchscreen device and, with proper benchmarking, to accurately predict actual user performance. [8]

The goal is to provide an instrument for quantitative analysis of touchscreen interfaces.
A number of operators are added for touchscreen interactions:

  • Distraction (X): a multiplicative operator that is applied to other operators to model real-world distractions.
  • Gesture (G): gestures are conceptualized as specialized combinations of finger movements across the device's screen.
  • Pinch (P): the common two-finger gesture.
  • Zoom (Z): the reverse application of the Pinch operator. Value: 200 ms.
  • Initial Act (I): KLM assumed the user is prepared to begin an action, whereas touchscreen devices require users to prepare them for use (for example, by pressing the home button or entering a password).
  • Tap (T): the physical action of tapping an area on the touchscreen device in order to initiate some change or action.
  • Swipe (S): usually a horizontal or vertical swipe, like turning the page in a book. Value: 70 ms.
  • Tilt (L(d)): used when interacting with devices equipped with accelerometers.
  • Rotate (O(d)): a gesture in which two or more fingers are placed on the screen and then rotated about a central point.
  • Drag (D): similar to Swipe, Drag also involves tapping a location on the screen and then moving one or more fingers in a specific direction.
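How a multiplicative Distraction operator combines with the other operators can be sketched as follows. Only the Swipe (70 ms) and Zoom (200 ms) values come from the list above; the Tap value, the `tlm_time` helper and the 10% distraction factor are placeholders assumed for the example, not published estimates.

```python
# Operator times in milliseconds. S and Z are the values given in the list;
# T is a placeholder chosen for this example, not a published estimate.
TLM_MS = {"S": 70.0, "Z": 200.0, "T": 100.0}


def tlm_time(sequence, distraction=0.0):
    """Total time in ms for a list of operator codes.

    `distraction` is a hypothetical fractional slowdown (0.1 means 10%
    slower) applied multiplicatively to the other operators.
    """
    base = sum(TLM_MS[op] for op in sequence)
    return base * (1.0 + distraction)


task = ["S", "S", "Z"]          # swipe twice, then zoom
print(tlm_time(task))           # undistracted
print(tlm_time(task, 0.10))     # with an assumed 10% distraction
```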

CMN-GOMS

CMN-GOMS is the original GOMS model proposed by Stuart Card, Thomas P. Moran and Allen Newell.
CMN stands for Card, Moran and Newell. The model takes the KLM as its basis and adds subgoals and selection rules. A CMN-GOMS model can be represented in program form, making it amenable to analysis as well as execution. CMN-GOMS has been used to model word processors [1] and CAD systems for ergonomic design (see CAD). [2] The CMN method can predict the operator sequence and the execution time of a task on a quantitative level, and can focus attention on the methods used to accomplish goals on a qualitative level.
Bonnie E. John and David E. Kieras give a simple CMN-GOMS example on editing a manuscript: [2]

GOAL: EDIT-MANUSCRIPT
    GOAL: EDIT-UNIT-TASK ...repeat until no more unit tasks
        GOAL: ACQUIRE UNIT-TASK ...if task not remembered
            GOAL: TURN PAGE ...if at end of manuscript
            GOAL: GET-FROM-MANUSCRIPT
        GOAL: EXECUTE-UNIT-TASK ...if a unit task was found
            GOAL: MODIFY-TEXT
                select: GOAL: MOVE-TEXT* ...if text is to be moved
                        GOAL: DELETE-PHRASE ...if a phrase is to be deleted
                        GOAL: INSERT-WORD ...if a word is to be inserted
                VERIFY-EDIT

Shown below is a simple copy and paste example. [9]

GOAL COPY-AND-PASTE-TEXT
    GOAL COPY-TEXT
        GOAL HIGHLIGHT-TEXT
            Operator MOVE-CURSOR-TO-BEGINNING
            Operator CLICK-MOUSE-BUTTON
            Operator MOVE-CURSOR-TO-END
            Operator SHIFT-CLICK-MOUSE-BUTTON
            Operator VERIFY-HIGHLIGHT
        GOAL ISSUE-COPY-COMMAND
            Select*:
                GOAL USE-MOUSE
                    Operator MOVE-CURSOR-TO-EDIT-MENU
                    Operator PRESS-MOUSE-BUTTON
                    Operator MOVE-CURSOR-TO-COPY-ITEM
                    Operator VERIFY-HIGHLIGHT
                    Operator RELEASE-MOUSE-BUTTON
                GOAL USE-KEYBOARD
                    Operator PRESS-KEY-CTRL
                    Operator PRESS-KEY-C
                    Operator RELEASE-KEYS
    GOAL PASTE-TEXT[...]

*Selection rule for GOAL ISSUE-COPY-COMMAND:
    if HANDS-ARE-ON-KEYBOARD then
        select GOAL USE-KEYBOARD
    else
        select GOAL USE-MOUSE

NGOMSL

NGOMSL is a structured natural language notation for representing GOMS models, together with a procedure for constructing them. Its program form provides predictions of operator sequences, execution time and the time needed to learn the methods. An analyst constructs an NGOMSL model by performing a top-down, breadth-first expansion of the user's top-level goals into methods, until the methods contain only primitive operators, typically keystroke-level operators. Like CMN-GOMS, this model explicitly represents the goal structure and can therefore represent high-level goals. [10]
Shown below is a simple example. [2]

NGOMSL Statements

METHOD for GOAL: MOVE TEXT
    STEP 1: ACCOMPLISH GOAL: CUT TEXT
    STEP 2: ACCOMPLISH GOAL: PASTE TEXT
    STEP 3: RETURN WITH GOAL ACCOMPLISHED

METHOD for GOAL: CUT TEXT
    STEP 1: ACCOMPLISH GOAL: HIGHLIGHT TEXT
    STEP 2: RETAIN THAT COMMAND IS CUT, AND
            ACCOMPLISH GOAL: ISSUE A COMMAND
    STEP 3: RETURN WITH GOAL ACCOMPLISHED

etc.
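The top-down, breadth-first expansion used to build an NGOMSL model can be sketched as a queue-based traversal. The method table extends the MOVE TEXT example: the decompositions of MOVE TEXT and CUT TEXT follow the statements above, while the entries for PASTE TEXT, HIGHLIGHT TEXT and ISSUE A COMMAND, and all primitive operator names, are hypothetical.

```python
from collections import deque

# Goal -> ordered sub-steps. Names absent from this table are treated as
# primitive operators. The last three entries are hypothetical fillers.
METHODS = {
    "MOVE TEXT": ["CUT TEXT", "PASTE TEXT"],
    "CUT TEXT": ["HIGHLIGHT TEXT", "ISSUE A COMMAND"],
    "PASTE TEXT": ["POINT TO INSERTION POINT", "ISSUE A COMMAND"],
    "HIGHLIGHT TEXT": ["POINT TO START", "DRAG TO END"],
    "ISSUE A COMMAND": ["PRESS COMMAND KEYS"],
}


def expand_breadth_first(top_goal):
    """Return goals in the order in which their methods would be written."""
    order = []
    seen = {top_goal}
    queue = deque([top_goal])
    while queue:
        goal = queue.popleft()
        order.append(goal)
        for step in METHODS[goal]:
            # Only non-primitive steps need a method of their own.
            if step in METHODS and step not in seen:
                seen.add(step)
                queue.append(step)
    return order


print(expand_breadth_first("MOVE TEXT"))
```

The expansion stops on its own once every remaining step is a primitive operator, which matches the stopping condition described for NGOMSL.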

CPM-GOMS

Bonnie E. John and David Kieras describe four different types of GOMS. CMN-GOMS, KLM and NGOMSL assume that all operators occur in sequence and do not contain operators below the activity level. CPM-GOMS, the fourth method, uses operators at the level of the Model Human Processor, which assumes that the operators of the cognitive, perceptual and motor processors can work in parallel with each other. The most important feature of CPM-GOMS is that, by modeling overlapping actions, it can predict skilled behavior. [11] [12]
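CPM-GOMS takes part of its name from the critical path method, so its time prediction can be sketched as a longest-path computation over operators that may overlap. The operator names, durations and dependencies below are invented for illustration and are not taken from a published CPM-GOMS model.

```python
# Each operator: (duration in ms, operators that must finish first).
# All names, durations and dependencies are made up for illustration.
OPERATORS = {
    "perceive-target": (100, []),
    "decide-move": (50, ["perceive-target"]),
    "move-hand": (300, ["decide-move"]),
    "perceive-label": (100, ["perceive-target"]),
    "verify-label": (50, ["perceive-label"]),
    "press-key": (100, ["move-hand", "verify-label"]),
}


def critical_path_time(operators):
    """Length of the longest dependency chain: the predicted task time."""
    finish = {}

    def finish_time(name):
        # Memoized: an operator starts when its slowest prerequisite ends.
        if name not in finish:
            duration, deps = operators[name]
            start = max((finish_time(d) for d in deps), default=0)
            finish[name] = start + duration
        return finish[name]

    return max(finish_time(name) for name in operators)


serial_time = sum(duration for duration, _ in OPERATORS.values())
print(critical_path_time(OPERATORS), serial_time)
```

Comparing the critical path with the serial sum shows how much time a purely sequential model would overestimate when perceptual and motor activity overlap.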

SGOMS

SGOMS stands for Sociotechnical GOMS [13] [14] [15] and was created to allow GOMS to model work in complex sociotechnical systems. GOMS is meant to model an individual user working in isolation, with no unexpected interruptions, much like a cognitive psychology experiment. This level of analysis is sometimes referred to as microcognition to distinguish it from macrocognition, which refers to real-world cognition. SGOMS is meant to extend the applicability of GOMS to the macrocognitive level of analysis. To do this, SGOMS adds a high-level control structure to GOMS, called the planning unit, which allows GOMS to deal with unexpected interruptions.

A planning unit is a list of unit tasks. Planning units can be ordered (the unit tasks must be done in order) or situated (the unit tasks in the list are done as the situation demands). Consistent with CPM-GOMS, SGOMS assumes that the agent can monitor the situation in parallel in order to detect threats (neurophysiologically, this function is associated with the amygdala). Planning units can be interrupted and bookmarked so they can be resumed later. When a planning unit is interrupted the agent considers the situation and can resume the same planning unit or bookmark it and switch to a different planning unit. SGOMS does not prescribe how this choice is made but, if the decision is based on routine expertise, it can be included in the SGOMS model.
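The interrupt-and-resume behavior of an ordered planning unit can be sketched as a list of unit tasks with a bookmark index. The class, the `run` function, the fixed interruption point and the task names are assumptions made for this example; SGOMS itself does not prescribe this representation.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PlanningUnit:
    name: str
    unit_tasks: List[str]
    position: int = 0  # bookmark: index of the next unit task to perform

    def done(self) -> bool:
        return self.position >= len(self.unit_tasks)

    def step(self) -> str:
        task = self.unit_tasks[self.position]
        self.position += 1
        return task


def run(primary: PlanningUnit, interruption: PlanningUnit, interrupt_after: int):
    """Run `primary`, switch to `interruption` once `interrupt_after` tasks
    are complete, then resume `primary` from its bookmark."""
    log = []
    while not primary.done():
        if primary.position == interrupt_after and not interruption.done():
            # primary.position is left untouched and acts as the bookmark.
            while not interruption.done():
                log.append(interruption.step())
        log.append(primary.step())
    return log


primary = PlanningUnit("serve-customer", ["greet", "take-order", "collect-payment"])
alarm = PlanningUnit("handle-alarm", ["check-alarm", "reset-alarm"])
print(run(primary, alarm, interrupt_after=1))
```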

Assumptions and errors

Importance of assumptions in GOMS analysis

Accurate assumptions are vital in GOMS analysis. Before applying the average times for detailed functions, the experimenter should account for as many variables as possible by stating assumptions. Experimenters should design their GOMS analysis for the users who will most likely be using the system being analyzed. Consider, for example, an experimenter who wishes to determine how long it will take an F-22 Raptor pilot to interact with an interface he or she has used for years. It can probably be assumed that the pilot has outstanding vision and is in good physical health. In addition, it can be assumed that the pilot can interact with the interface quickly because of the many hours of simulation and previous use. All things considered, it is fair to use fastman times in this situation. In contrast, consider an 80-year-old person with no flight experience attempting to interact with the same F-22 Raptor interface. The two people would have very different skill sets, and those skill sets should be accounted for in the analysis, even if only subjectively.

Accounting for errors

The only way to account for errors in GOMS analysis is to predict where the errors are most likely to occur and measure the time it would take to correct the predicted errors. For example, assume an experimenter thought that in typing the word "the" it was likely that a subject would instead incorrectly type "teh". The experimenter would calculate the time it takes to type the incorrect word, the time it takes to recognize that a mistake has been made, and the time it takes to correct the recognized error.
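The three time components in this example can be added up as follows. The sketch assumes KLM-style estimates of 0.28 s per keystroke and 1.35 s for recognizing the mistake, and assumes the correction is made by backspacing to the first wrong character and retyping; none of these choices comes from the article.

```python
K = 0.28  # assumed time per keystroke, in seconds
M = 1.35  # assumed time for one mental operation, in seconds


def error_episode_time(wrong: str, intended: str) -> float:
    """Time to type `wrong`, notice the slip, and retype from the first
    incorrect character."""
    # Count the leading characters that are already correct.
    common = 0
    for a, b in zip(wrong, intended):
        if a != b:
            break
        common += 1
    type_wrong = len(wrong) * K
    recognize = M
    backspaces = (len(wrong) - common) * K
    retype = (len(intended) - common) * K
    return type_wrong + recognize + backspaces + retype


print(round(error_episode_time("teh", "the"), 2))  # predicted error episode
print(round(len("the") * K, 2))                    # error-free typing
```

Weighting the episode time by an estimated error probability would give an expected cost per word, but that probability has to come from observation.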

Applications of GOMS

Workstation efficiency

A successful implementation of CPM-GOMS was in Project Ernestine, conducted by New England Telephone. New ergonomically designed workstations were compared with old workstations in terms of telephone operators' performance. CPM-GOMS analysis estimated a 3% decrease in productivity. Over the four-month trial 78,240 calls were analysed, and it was concluded that the new workstations produced an actual 4% decrease in productivity. Because the proposed workstation required fewer keystrokes than the original, it was not clear from the time trials why the decrease occurred. However, CPM-GOMS analysis made it apparent that the new workstations did not make use of the workers' slack time. Not only did CPM-GOMS give a close estimate, it also provided more information about the situation. [16]

CAD

GOMS models were employed in the redesign of a CAD (computer-aided design) system for industrial ergonomics. [17] An applied GOMS model shows where the interface needs to be redesigned and provides an evaluation of design concepts and ideas. In Richard Gong's example, when GOMS revealed a frequent goal supported by a very inefficient method, he changed the method to a more efficient one. If GOMS showed that there were goals not supported by any method at all, new methods were added. GOMS also revealed where similar goals were supported by inconsistent methods, a situation in which users are likely to have problems remembering what to do, and showed how to make the methods consistent. [17]

Software tools

Various tools exist for the creation and analysis of GOMS models.

References

  1. Card, Stuart; Thomas P. Moran; Allen Newell (1983). The Psychology of Human Computer Interaction. Lawrence Erlbaum Associates. ISBN 0-89859-859-1.
  2. John, Bonnie E.; David E. Kieras (1996). "The GOMS Family of User Interface Analysis Techniques: Comparison and Contrast". ACM Transactions on Computer-Human Interaction. United States of America: ACM. doi:10.1145/235833.236054. ISSN 1073-0516. S2CID 13514458.
  3. Rogers, Yvonne; Helen Sharp; Jenny Preece (2002). Interaction Design. United States of America: John Wiley & Sons. p. 454. ISBN 0-471-49278-7.
  4. Tonn-Eichstädt, H. (2005). Measuring website usability for visually impaired people – A modified GOMS analysis. In: ACM SIGACCESS Conference on Assistive Technology. New York: ACM Press. pp. 55–62. doi:10.1145/1168987.1168998. S2CID 14257284.
  5. Schrepp, M. (2006). On the efficiency of keyboard navigation in web sites. Universal Access in the Information Society, Vol. 5, No. 2, pp. 180–188.
  6. Card, Stuart; Thomas P. Moran; Allen Newell (1980). The keystroke-level model for user performance time with interactive systems. Lawrence Erlbaum Associates. doi:10.1145/358886.358895. ISBN 0-13-444910-X. S2CID 5918086.
  7. John, Bonnie E.; David E. Kieras (1996). "Using GOMS for User Interface Design and Evaluation: Which Technique?". ACM Transactions on Computer-Human Interaction. United States of America: ACM. doi:10.1145/235833.236050. ISSN 1073-0516. S2CID 1187310.
  8. Rice, Andrew D.; Jonathan W. Lartigue (2014). Touch-level model (TLM): evolving KLM-GOMS for touchscreen and mobile devices. ACM. doi:10.1145/2638404.2638532. ISBN 978-1-4503-2923-1. S2CID 25139034.
  9. Voss, David (2010). Analyse, Evaluierung und Optimierung der intraoperativen Mensch-Maschine-Interaktion. Eberhard-Karls-Universität Tübingen. hdl:10900/45648.
  10. Kieras, David (1994). "GOMS modeling of user interfaces using NGOMSL". Conference companion on Human factors in computing systems - CHI '94. ACM. pp. 371–372. doi:10.1145/259963.260467. ISBN 0-89791-651-4. S2CID 16999909.
  11. John, Bonnie E.; Alonso Vera; Michael Mattesa (2002). Automating CPM-GOMS. ACM. doi:10.1145/503376.503404. ISBN 1-58113-453-3. S2CID 675250.
  12. John, Bonnie E.; Wayne D. Gray (1995). CPM-GOMS: an analysis method for tasks with parallel activities. ACM. ISBN 0-89791-755-3.
  13. West, R. L.; Nagy, N. (2007). "Using GOMS for Modeling Routine Tasks Within Complex Sociotechnical Systems: Connecting Macrocognitive Models to Microcognition". Journal of Cognitive Engineering and Decision Making. 1 (2): 186–211. doi:10.1518/155534307X232848. S2CID 62229701.
  14. West, R. L.; Provnost, S. (2009). "Modeling SGOMS in ACT-R: Linking Macro- and Microcognition". Journal of Cognitive Engineering and Decision Making. 3 (2): 194–207. doi:10.1518/155534309X441853. S2CID 62563894.
  15. West, R. L.; Macdougal, W. (2015). "The Macro Architecture Hypothesis: Modifying Newell's System Levels to Include Macro Cognition". Biologically Inspired Cognitive Architectures.
  16. Gray, Wayne D.; John, Bonnie E.; Atwood, Michael E. (1992). "The Precis of Project Ernestine or an overview of a validation of GOMS". Proceedings of the SIGCHI conference on Human factors in computing systems. doi:10.1145/142750.142821. ISBN 0897915135.
  17. Gong, Richard; David Kieras (1994). A Validation of the GOMS Model Methodology in the Development of a Specialized, Commercial Software Application. ACM. doi:10.1145/191666.191782. ISBN 0-89791-650-6. S2CID 1477759.

A previous version of this article was largely a derivative work of GOMS Analysis Techniques - Final Essay (1997).