Usability Evaluation of Modeling Languages by Christian Schalles
Usability Evaluation of Modeling Languages by Christian Schalles
of Modeling Languages
Christian Schalles
Usability Evaluation
of Modeling Languages
An Empirical Research Study
RESEARCH
Christian Schalles
Leidersbach, Germany
Springer Gabler
© Springer Fachmedien Wiesbaden 2013
This work is subject to copyright. All rights are reserved by the Publisher, whether the whole
or part of the material is concerned, specifically the rights of translation, reprinting, reuse of
illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical
way, and transmission or information storage and retrieval, electronic adaptation, computer
software, or by similar or dissimilar methodology now known or hereafter developed.
Exempted from this legal reservation are brief excerpts in connection with reviews or schol-
arly analysis or material supplied specifically for the purpose of being entered and executed
on a computer system, for exclusive use by the purchaser of the work. Duplication of this pub-
lication or parts thereof is permitted only under the provisions of the Copyright Law of the
Publisher’s location, in its current version, and permission for use must always be obtained
from Springer. Permissions for use may be obtained through RightsLink at the Copyright
Clearance Center. Violations are liable to prosecution under the respective Copyright Law.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this
publication does not imply, even in the absence of a specific statement, that such names are
exempt from the relevant protective laws and regulations and therefore free for general use.
While the advice and information in this book are believed to be true and accurate at the
date of publication, neither the authors nor the editors nor the publisher can accept any legal
responsibility for any errors or omissions that may be made. The publisher makes no warranty,
express or implied, with respect to the material contained herein.
Springer Gabler is a brand of Springer DE. Springer DE is part of Springer Science+Business Media.
www.springer-gabler.de
To Carina, Heidi, Wilhelm and Andreas. Thanks for your
patience and support
Foreword
Conceptual modeling often is considered to be at the heart of the Business and Information
Systems Engineering discipline. Without conceptual modeling, it would be extraordinarily costly
and time-consuming, if not impossible, to communicate concepts, structures and artifacts of a
project domain between a larger number of team members and to finish a project successfully
and efficiently. Graphical modeling languages are the key instruments of conceptual modeling
and are widely used not only throughout most diverse kinds of software projects, but also in
process management, quality management or compliance management projects.
Compared to its central importance, surprisingly little is known about the usability evaluation
of modeling languages. Graphical modeling languages are invented and developed by persons,
teams or organizations, often based upon prior concepts, and can evolve into being a certified or a
de facto industry standard. But during these development processes, usability evaluations are not
yet common. Thus it often remains unclear, when a language should be chosen for a given project
task, what intellectual and practical skills the use of the concepts and elements of the language
requires, or which aspects influence the content and the quality of models developed. Still,
these questions are of considerable relevance for the success of modeling tasks in organizational
practice.
Christian Schalles investigates and sheds light on this exciting and important, however rather
neglected field of research. For this purpose he examines the variables that influence the usability
of modeling languages. His inquiry not only brings together conceptual arguments, but is also
driven by empirical research. In an innovative way, the usability evaluation concept presented
in this book combines approaches from different disciplines. Relevant usability attributes are
derived from literature; metrics and a framework for evaluation are developed; hypotheses are
formulated and tested empirically. Thus, usability evaluation of graphical modeling languages is
put on a sound basis.
VIII
The book not only widens and enhances research about modeling languages and sets an agenda for
future investigation. It is also of immediate use for developers of graphical modeling languages,
as it contains recommendations for the refinement of existing languages or the development
of new ones – wherever this, despite the plethora on hand, may still be necessary. It is also
useful for business practice, as it provides a decision framework that helps project managers to
determine the appropriate modeling language for a given application context. I hope that the
book will be well received by the different audiences it addresses and that it spurs many fruitful
discussions in research teams, standardization bodies, public administrations and businesses.
Foreword VII
Acknowledgements XI
Abstract XIII
3. Background Theories 31
3.1. Communication Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
3.2. Activity Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
3.3. Cognitive Theories . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
3.3.1. Cognitive Load Theory . . . . . . . . . . . . . . . . . . . . . . . . . . 35
3.3.2. Cognitive Theory of Multimedia Learning . . . . . . . . . . . . . . . 37
3.3.3. Perception Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
12.Outlook 159
Bibliography 163
Nomenclature 179
Glossary 181
List of Figures
1.2.1.Research Questions and Deductions . . . . . . . . . . . . . . . . . . . . . . . 6
1.3.1.Structure of this Book - Recommended Reading Process . . . . . . . . . . . . 7
INTRODUCTION AND
BACKGROUND
1. Introduction
Application and business process modeling has received considerable attention recently by both
business administration and computer science communities. Modeling has always been at the
core of organizational design and information systems development. Models enable decision-
makers to filter out the irrelevant complexities of the real world so that efforts can be directed
towards the most important parts of the system or business process under study.
However, both business analysts and information system professionals may find it difficult to
navigate through a maze of theoretical paradigms, methodological approaches, and represen-
tational formalisms that have been proposed for business process modeling and information
systems modeling.
The aim of the study presented in this thesis is to analyze the usability of existing visual modeling
formalisms, i.e. graphical modeling languages.
1.1. Motivation
Complex software systems form the basis of the modern information society. Software en-
gineering is about developing, maintaining and managing high-quality software systems in a
cost-effective and predictable way [Sjoberg et al., 2007].
Modeling is a concept fundamental for software engineering and business process management.
Models can be found in all areas and applications of software engineering [Ludewig, 2003].
For accurate interpretation it is important that a model reproduces the knowledge contained in a
clearly arranged and well-structured manner. To ensure this, users need to understand the model,
i.e., they have to be familiar with the modeling language used [Mendling and Strembeck, 2008].
In organizations, models are important for documenting business processes and specifying
information system requirements under development. Models are represented by using graphical
modeling languages such as the Business Process Modeling Notation (BPMN), Event Driven
Process Chains (EPC) and the Unified Modeling Language (UML) providing a set of elements,
relations and rules for combining them.
The research presented in this thesis focuses on the usability evaluation of graphical modeling
languages established in software engineering and business process modeling.
Avison and Fitzgerald (1995) defined the two main reasons for evaluating graphical modeling
languages [Avison and Fitzgerald, 1995]:
1. The academic reason is to better understand the nature of modeling languages in order to
improve future modeling languages development.
2. The practical reason is to support the customizing process and the decision process for or
against specific modeling languages in companies.
In general, graphical modeling languages aim to support the expression of relevant aspects of
real world domains such as business processes or application system structures [Sjoberg et al.,
2007].
Previous research has focused on the evaluation of technical and functional aspects of graphical
modeling languages. Just a few number of researchers connect their evaluation studies with the
user of modeling languages [Birkmeier et al., 2010].
In this thesis the connection between modeling languages and users is investigated. Furthermore,
a theoretical framework defining usability in the domain of graphical modeling languages is
developed. Based on this two empirical surveys prove the quality of the developed evaluation
framework and explore important findings about the impact of modeling languages on usability.
Mendling and Strembeck (2008) recommend future surveys for analyzing the understandability
of process models [Mendling and Strembeck, 2008].
Overhage and Schlauderer (2010) compared the usability of BPMN and UML Activity diagrams
based on the Bunge-Weber-Wand Ontology [Birkmeier et al., 2010].
Daniel L. Moody and his research group analyzed the physics and visual syntax of graphical
modeling languages theoretically [Moody, 2004, Moody and Hillegersberg, 2009, Moody, 2009,
Moody and Heymans, 2010]. They did not support their findings with the conduction of empirical
surveys.
As the analyzed related work has shown, previous studies are mainly evaluating partitions of
usability regarding modeling languages. The presented related past research activities show
a current research need for an overall survey evaluating the usability of graphical modeling
languages.
The first research question focuses on the thesis that different modeling concepts influence
usability in different ways. Both behavioural and structural modeling need to be investigated.
Past and current research has not focused on this question intensely. The answer to this main
research question is given in chapter 6 of this thesis.
The second research question has a causal character and aims to specify the way different
modeling languages impact different usability attributes. The answer for this question is given in
chapter 8 of this thesis.
It is possible to deduce further questions based on the presented two main research questions:
• How can usability be measured in the domain of graphical modeling languages?
– What are properties of modeling languages, which influence the usability attributes?
How can they be measured?
– Do different user scenarios exist, which influence the usability of graphical modeling
languages?
– Do further language-independent variables exist that influence the usability of graph-
ical modeling languages?
These questions are answered within the development of a usability evaluation framework in the
domain of graphical modeling languages. The framework is presented in chapter 4 of this thesis.
The following figure shows the defined main research questions and their connected deductions.
6 1 Introduction
Deduced Q3 How
Q H can usability
bilit be
b measuredd in
i
Research the domain of graphical modeling
languages?
Questions
1.3. Structure
The general structure of this thesis is similar to an empirical research thesis.
Chapter 2 defines the theoretical background for graphical modeling and connected modeling
languages.
In chapter 3 necessary theories acting as a background for the empirical surveys are defined. Due
to the fact, that those theories are mainly user and usability-oriented, the underlain theories have
their traditional roots in psychology.
Chapter 4 presents the development of a framework for usability evaluation of graphical modeling
languages. This chapter acts as a basis chapter of this thesis. All forthcoming empirical studies
are based on this evaluation framework.
Chapter 5 deals with the development of a generic metric for measuring model complexity. In
the further course of this thesis it becomes clear, that model complexity acts as a significant inter-
fering variable. Consequently, a method for measuring model complexity has to be developed.
Thus, the influence of model complexity in the upcoming empirical surveys is considered.
In chapter 6 the presented evaluation framework is applied. Therefore, hypotheses, survey design,
data collection, data analysis and implications are presented.
Chapter 7 focuses on challenges when using a specific statistical approach for analyzing causal
relations between several statistical constructs within the scope of the presented research.
The application and realization of a structural equation modeling approach is presented in chapter
8. Concerning this, hypotheses, structural models, measurement models, data analysis and data
interpretation are included in this chapter.
In chapter 9 the results of the empirical surveys are represented and summarized. Furthermore,
a management decision framework supporting the selection process for or against specific
1 Introduction 7
.0 +0.,"1!0',++"!(%.,1+"
!(%.,1+",$
,$03.#+" !(%.,1+"
+0.,"1!0',+
1/'+#//.,!#// &#,.'#/
,"#)'+%
.0 2)10',+.*#3,.(+"*-'.'!)+)5/'/
#+#.'!,"#)
'+
,*-)#4'05#0.'! / ')'05#/#.!&
,*-.'/,+
2)10',+ *-!001"5
01"5
.*#3,.(
.0
#/1*#,$0&#*-'.'!)#/1)0/
.'+!'-)#/$,.
1.0&#.
101.##2#),-*#+0,$
.,!#//'+%,$0&#
%.-&'!),"#)'+%
*-'.'!)#/1)0/
+%1%#/
.0 ,+!)1/',++"10),,(
,+!)1/',+ 10),,(
Model
modeled
attributes to
the original
The mapping criterion does not imply the actual existence of the original; it may be planned,
suspected, or fictitious. The cost estimation of a software project is a speculative model of the
future. A model may act as the original of another model. A program design is a model of the
code to be written, while the code is a model of the computation performed by the computer
when the code is executed.
At first glance, the reduction criterion seems to describe a weakness of models, because some-
thing is lost in the model that was present in the original. But that loss is the real strength of
models: very often, the model can be handled while the original cannot.
Additionally, the pragmatic criterion is the reason models are applied. Since it is not possible to
use the original, the model is used instead.
As an effect of the reduction, many features of the original (the waived attributes) are not found
in the model. For example, the name of a person is not visible in his photograph. On the other
hand, features that do not stem from the original are added (abundant attributes). For example,
the size of the picture does not tell anything about the person.
Figure 2.1.1 shows the resulting relation between a original real-world domain and its associated
model.
However, a weakness of Stachowiak’s concept of a model is that it implies an epistemological
position of positivism. This is criticized in Schuette & Rotthowe (1998), where the authors
propose an alternative position based on insights from critical realism and constructivism. This
position regards a model as a “result of a construct done by a modeler” [Schuette and Rotthowe,
1998].
Consequently, a relationship between a model and the modeler exists leading to limitations of
resulting models, which depend on the modeler’s subjective view of modeling and the domain
modeled [Krogstie, 2003].
As such, it is heavily influenced by the subjective perception of the modeler. This makes
modeling a non-deterministic task that requires standards in order to achieve a certain level of
2 Background of Software and Business Process Modeling 11
inter-subjectivity.
A graphical modeling language meets this demand for standardization in the domain of soft-
ware and business process modeling.
both focusing on the physical level what nowadays may be called graph models [Angles and
Guteierrez, 2008].
Techniques for knowledge integration as well as data abstraction were introduced by Abrial
(1974). He proposed a definition of the semantics of classes by access procedures [Abrial, 1974].
Because of its simplicity, Chen’s Entity-Relationship Diagram became popular and the de facto
standard in data modeling and database design [Chen, 1976].
Semantic data models allow for designing models at a higher level and enable the database
practitioners to naturally and directly incorporate in the schema a larger portion of the semantics
of the data [Hammer and McLeod, 1978].
The Structured Analysis and Design Technique (SADT), introduced by Ross (1977) in the mid
seventies, was one of the most significant early steps in the area of requirements specifications
[Ross, 1977]. Among the features is the emphasis on modeling data as well as activities connected
by edges representing the flow of information.
Smith and Smith (1977) have introduced concepts for abstraction and generalization in database
design in database research. They combined ‘generalization’ and ‘aggregation’ into one struc-
turing discipline. However, aggregation was still not easily modeled using ER; it became the
main thrust in Object Oriented databases. The advantage of aggregation is that it provides an
easier understanding of complex models and a more systematic approach to database design.
It mainly supports the development of highly structured models without loss in intellectual
manageability [Smith and Smith, 1977].
The first high level data definition languages for defining conceptual schemas such as the
Conceptual Schema Language were discussed in the late seventies. Descriptive elements as well
as procedural elements are provided within this language.
Hence, static aspects and dynamic behavior of data could be described by providing standard
types, object types and association types. A prominent example of a database design language
covering the concepts is Taxis [Mylopoulos et al., 1978]. Taxis provides relational database
management facilities, means of specifying semantic integrity constraints incorporated into
transactions, and an exception-handling mechanism. Taxis applies the concepts of class, property
and generalization relationship to all aspects of program design.
In the domain of requirements engineering, Greenspan et al. (1994) adopted the approach of
Taxis and attempted to formalize the SADT notation [Greenspan et al., 1994]. The Requirements
Modeling Language embodies a notation for requirements modeling which combines object-
orientation and organization, with an assertional sublanguage used to specify constraints and
deductive rules [Roussopoulos and Yeh, 1984].
The Semantic Database Design takes not only the environment of the data processing system
into account, but also focuses on the environment of the entire company. This approach proceeds
on the assumption that without a complete understanding of how the enterprise operates, it is not
possible to develop an effective design. Thus, it is recommended to start with an environment
analysis phase followed by a system analysis phase, capturing and analyzing the operational
behavior of an organisation.
describing the context in which the intended system will function. The above UML emphasizes
concepts for modeling and analysis during the later requirements phases, which usually focus on
completeness, consistency, and automated verification of functional requirements.
With Tropos, a development method supporting the early phases of requirement engineering is
provided. Tropos is founded on the idea of using the agent paradigm and related mentalistic
notions during all phases of the development software process [Bresciani et al., 2004].
Modelling developed
described by Model
Language with
described by
describes
Notation visualizes Syntax meaning Semantics
of Component
System
arranges
Semantic Semantic
according
Mapping Schema
to
The following figure was developed under consideration of Karagiannis & Kühn (2002) and
shows specific terms and their relations in the domain of graphical modeling languages:
play important roles in the design and analysis of complex systems in general and software
systems in particular.
It is instructive to explain these levels in a bottom-up order, starting with the instance level. The
instance level reflects the concrete entities that are involved in business processes or applications.
Executed activities, concrete data values, and resources and persons are represented at the
instance level. To organize the complexity of the real-world scenarios, a set of similar entities at
the instance level are identified and classified at the model level.
For instance, a set of similar business process instances are classified and represented by a
business process model. In object modeling, a set of similar entities is represented by a class, and
in data modeling using the Entity Relationship approach, a set of similar entities is represented
by an entity type, and similar relationships between entity types are represented by a relationship
type.
In this thesis, the UML in the version 2.0 plays a significant role due to its current ubiquity in the
domain of software engineering. The UML is a standardized modeling language in the field of
object-oriented software engineering. The standard was created and is managed by the OMG.
UML includes a set of graphic notation techniques to create visual models of object-oriented
software-intensive systems. UML diagrams represent two different types of a system model
[OMG, 2011c]:
• Static or structural models: emphasizes the static structure of the system using objects,
attributes, operations and relationships. The structural view includes class diagrams and
composite structure diagrams.
• Dynamic or behavioral models: emphasizes the dynamic behavior of the system by
showing collaborations among objects and changes to the internal states of objects. This
view includes sequence diagrams, activity diagrams and state machine diagrams.
The following figure shows an overall view of the diagrams of the UML:
Models can be found in all areas and applications of software engineering. While software
developers create concrete models people who do research in software engineering work on
notations and methods for developing such concrete models. Class Diagrams and State Charts,
Petri Nets and Data Flow Diagrams are a few examples of models that use such notations. Most
of the models used in software engineering are prescriptive, for instance:
• process models, such as UML Activity Diagrams
• information flow models such as the diagrams used in structured Analysis and Design
18 2 Background of Software and Business Process Modeling
Technique (SADT)
• design models, such as UML Class Diagrams,
• models of user interaction, like UML Use Cases, or UML Interaction Diagrams
In the following sections significant modeling languages of the software engineering domain
that are relevant in the forthcoming empirical surveys of this thesis are introduced.
A decision node has one incoming flow and several outgoing flows. A decision node splits up
into several alternative flows. Only one outgoing flow will be chosen for further processing.
The merge node merges the outgoing flows of the decision node. A merge node brings multiple
alternate flows together.
In an activity the flow of control form one node to another is modeled using control flow edges
and data flow edges.
The control flow models the flow between actions, and the data flow between object nodes and
actions.
20 2 Background of Software and Business Process Modeling
<<include>>
<<extend>>
Generalization
Actor Association
Figure 2.5.4.: Elements of UML 2.0 Use Case Diagrams [OMG, 2011d]
Flight Name
flightNumber: Integer
Attributes
departureTime: Date
delayFlight(): Date
Operations
getArrivalTime():Date
Association:
An association represents a family of links between classes. Binary associations are normally
represented as a line, with each end connected to a class box. Higher order associations can be
drawn with more than two ends. In such cases, the ends are connected to a central diamond.
An association can be named, and the ends of an association can be adorned with role names,
ownership indicators, multiplicity, visibility, and other properties. There are four different
types of association: bi-directional, uni-directional, aggregation, composition and reflexive. For
instance, a flight class is associated with a plane class bi-directionally. Association represents
the static relationship shared among the objects of two classes.
Aggregation:
Composition:
Generalization:
The generalization relationship indicates that one of the two related classes i.e. the subclass is
considered to be a specialized form of the other class and is considered as ’generalization’ of the
subclass. This means that any instance of the subtype is also an instance of the superclass. An
2 Background of Software and Business Process Modeling 23
exemplary tree of generalizations of this form is found in binomial nomenclature: human beings
are a subclass of simian, which are a subclass of mammal etc. The relationship is most easily
understood by the phrase ’an A is a B’.
Multiplicity:
The association relationship indicates that one of the two related classes makes reference to the
other. In contrast with the generalization relationship, this is most easily understood through the
phrase ’A has a B’, for example a building has one or more rooms, an employee belongs to one
company.
appropriate IT-support.
Since over time various different modeling languages have been developed, as a result differing
legacy models exist. Business Process Models describe sequences of activities, expressed in
a certain modeling language, with the model elements being labelled following the business
terminology in use in the applicable domain.
One of the core tasks in business management is the design and continuous improvement of
business processing according to changing needs and expectations and the allocation of all
necessary resources. The increasing speed of globalization demands from enterprises of all sizes
to adequately adapt in an ever quickening pace to changing business conditions and varying
market requirements.
The motivation mostly arises from increasing cost pressure and intensifying competition as
well as new legal regulations, the need to follow standards or for incorporating new innovative
technologies. Therefore, it is mandatory to engineer business in an agile manner for on-going
optimization or reengineering.
Possible motivations for applying Business Process Modeling in organizations are defined by
Havey (2005):
• Reducing complexity by developing abstract process models
• Formalize existing process and spot needed improvements
• Facilitate automated, efficient process flow
• Increase productivity and decrease head count
• Simplify regulations and compliance issues [Havey, 2005]
In the following sections significant modeling languages of the business process modeling
domain that are relevant in the empirical surveys of this thesis are introduced.
Organization
Event Process Path
Unit
Function
Information
Object
Order is
received
XOR
V Order is
created
Production Planning
Production Leader
V
Letter E-Mail Fax Purchasing
V Assign to
Create
Production
Purchase Raw
employee Material
Plan
Rejection is V
written
Check Order
Send rejection Rejection
XOR
Rejection is
sent
Order is ok Order is not ok
Confirmation
Send Communicate
Confirmation
Confirmation Changes
XOR
Customer is
informed
Syntax Semantics
composition rules
Referring to Moody (2010) it can be concluded that the survey is influenced by the metalevel
and the instance level of modeling languages. The metalevel (i.e. the metamodel) contains the
graphical shapes, the relationships, semantics and the syntax.
This language-specific level influences the usability of graphical modeling languages. In other
words, two relevant components of the metamodel can be extracted:
• Visual Syntax (Language Complexity)
• Graphical Shapes (Visual Properties)
On the other hand, the instance level is important as well. For testing the usability with individuals
in a survey it is necessary to confront them with concrete models/diagrams of particular languages.
Consequently, the influence of those concrete models/diagrams has to be considered and the data
material has to be adjusted. For this, a specific algorithm is developed and introduced in chapter
5 of this thesis.
Furthermore, relevant modeling languages of the software engineering and business process
modeling domain were introduced. Concerning this,
• EPC’s,
• BPMN,
• UML Activity Diagrams,
• UML Class Diagrams and
• UML Use Case Diagrams were focused on.
In addition to this, similarities of different modeling concepts (behavioral vs. structural concepts)
were defined. Those similarities are:
• metalevel including shapes, relations, semantics and syntax
• instance level including concrete models/diagrams
This definition is important since the following survey aims to be a cross-language survey over
different modeling concepts. In general, this chapter shows the theoretical basis of this thesis
concerning the used terms and languages.
3. Background Theories
In this chapter the theoretical background of the study conducted in this thesis is
developed. Usability theory has its roots in cognitive psychology and is a relatively
young branch of computer science. A common understanding of the theoretical fac-
tors influencing usability and their interrelations is currently not existing. Therefore,
the theoretical context for this thesis consists of several parts of strong theories,
which are introduced in the following chapter. Those theories are necessary for the
surveys conducted in this thesis and are applied in Part III of this thesis.
Information transmitted
Encoding Decoding
Model
(Signal)
Model Model
Develope Interprete
r r
Channel
Noise
(Medium)
Object Orientedness
Activity Theory includes a subject, which refers to the individual or sub-group whose agency
is chosen as the point of view in the analysis. The object refers to the raw material or problem
space at which the activity is directed and which is modelled and transformed into outcomes
with the help of physical and symbolic, external and internal mediating instruments, including
both tools and signs. Transferring this to the domain of graphical modeling languages leads to
the fact that
• a subject is characterized by the user of graphical modeling languages whereas
• the object can be defined as graphical model/diagram.
Hierarchy
Leontjev (1978) proposes that an activity has a hierarchical structure with three distinct levels:
• the activity level,
• the action level and
• the operation level [Leontiev, 1978].
Activities consist of actions, which consist of operations. Actions are basic components of
activities. Different actions may be undertaken to meet the same goal. Operations are ways of
executing actions, and represent the concrete conditions required to achieve goals.
In the domain of graphical modeling languages an
• activity can occur as model development task,
• a possible action may be encoding or decoding of information to be developed or inter-
preted and
• a possible operation may be the handling of a modeling tool.
34 3 Background Theories
Experience Motivation
Activity
Model
User
Modeling
Modeling Tool Design
Language
Mediation
Activity Theory emphasizes that human activity is mediated by artefacts. The mediating artifact
can be external (e.g. modeling tool) or internal (e.g. motivation, modeling experience).
Figure 3.2.1 subsumes the concepts of Activity Theory and shows the adoption of this theory on
graphical modeling languages.
Model/
Diagram Perceptual Perceptual Attention Working Long Term
Discrimination Configuration Memory Memory
Figure 3.3.1.: Human Information Processing in the Domain of Graphical Modeling Languages
as well as short and long-term memory are affected in learning how to use specific modeling
languages, creating models, and understanding models.
Figure 3.3.1 shows a model of human graphical information processing, which reflects current
research in visual perception and cognition [Moody, 2009].
Processing is divided into two phases: perceptual processing (seeing) and cognitive processing
(understanding). Perceptual processes are automatic, very fast, and mostly executed in parallel,
while cognitive processes operate under conscious control of attention and are relatively slow,
effortful, and sequential.
A major explanation for the cognitive advantages of diagrams is computational offloading: They
shift some of the processing burden from the cognitive system to the perceptual system, which
is faster and frees up scarce cognitive resources for other tasks. The extent to which diagrams
exploit perceptual processing largely explains differences in their effectiveness [Larkin and
Simon, 1987].
First, cognitive theory is underlain, which generally defines the external impact of human learning
and acting. The theoretical constructs of cognitive psychology have direct analogies in model
development and model interpretation scenarios. From the traditional cognitive point of view,
the usability system in this study is composed of three basic units generating and processing
information:
• the human being acts as model developer and model interpreter processing information,
• the model task, which provides the information and
• the modeling language, which determines the interpretation or graphical representation of
given information.
This theory focuses on the impact of memory load to human learning and knowledge acquisition.
Figl et al. (2010) mapped the cognitive load theory to the context of graphical modeling
languages [Figl et al., 2010].
Cognitive load theory differs between three types of cognitive load:
The extraneous cognitive load is influenced by the way the information is represented. From
the language perspective, the use of geometrics and colours for designing elements of modeling
languages seems to be important.
The intrinsic cognitive load is determined by information complexity and interaction. For
example, learning elements of modeling languages results in a low intrinsic cognitive load. In
this case, the difficulty of learning a language and consequently the intrinsic cognitive load is
strongly connected with the range of elements a language consists of. Contrariwise, the element
interaction by means of syntactical and semantic element relations leads to a high intrinsic
cognitive load. Furthermore, the intrinsic cognitive load is influenced by prior knowledge of
human beings [Sweller, 2005].
Considering this in the survey leads to the importance of including experience in the proposed
research model in Part II of this thesis.
Finally, germane cognitive load is the result of beneficial cognitive processes such as abstractions
and elaborations that are promoted by the instructional presentation [Sweller, 2005, Plass et al.,
2010].
Cognitive load is determined by the amount of elements needed to be paid attention to at a point
of time.
Consequently, the concepts of this theory has to be considered in the empirical surveys of this
thesis.
Auftrag
Auftrag ist eingegangen
durchgeführt
XOR
XOR
Auftrag
Disposition Auftrag
Auftrag anlegen
Berichtsdaten Berichtsdaten ablehnen
Sekretariat inhaltlich
erfassen
prüfen
Berichtsdaten
erfasst
V Auftrag
angelegt
XOR
Produktionsplanung
Produktionsleitung
V
V
Mitarbeiter Produktionspl Rohmaterial
Disposition
zuteilen an anlegen bestellen
Absage
Sekretariat Aufwand Serviceauftrag Bericht an FiBu Vertrieb Absagedokument
schreiben
berechnen prüfen schicken Mitarbeiter Produktionspl Rohmaterial
zugeteilt an angelegt bestellt
FiBu
Absage V
geschrieben
XOR
XOR
Rechnung FiBu
erstellen
Absage
verschickt
Auftrag nicht
Auftrag ok
ok
Rechnung
erstellt
Änderungsliste
Bestätigung Änderungen
Bestätigung
senden mitteilen
V
XOR
Rechnung als Rechnung an
offener Posten Kunde
verbucht versenden Kunde wurde
informiert
Proximity Closure
Similarity Good
Continuation
modeling methods and modeling languages exist. These studies can be structured in three main
categories [Siau and Rossi, 2008]:
1. Feature comparison studies
2. Theoretical and conceptual evaluation studies
3. Empirical evaluation studies
Feature Comparison Studies Feature comparison studies are mostly based on the idea of
using different languages to model the same domain and determining how various modeling
languages tackle the same problem [Olle et al., 1986]. For example, within these studies
the comparison of object oriented and structured development methods and process modeling
methods was conducted
[Barbier and Henderson-Sellers, 2000, Loy, 1990, Strom, 1986].
Eloranta et al. (2006) analyzed the two modeling languages BPMN and UML. The evaluation
was based on different frameworks known as the Workflow Patterns Framework and the Bunge-
Weber-Wand Model (BWW-model) [Eloranta et al., 2006].
Empirical Evaluation Studies Empirical evaluation studies target observations and propo-
sitions based on sense experience with considering methods of inductive logic including mathe-
matics and statistics [Cooper and Schindler, 2005].
Recker and Dreiling (2007) conducted a survey on understanding process modeling languages
including EPC’s and BPMN. They dealt with the investigation of how learning affects modeling
works. Particularly, this empirical study focuses the interpretation of diagrams developed with
EPC’s and BPMN and comes up with the result that process modelers with training in any
process modeling language perform reasonably well in understanding other unknown process
models [Recker and Dreiling, 2007].
Mendling and Strembeck (2008) analyzed influence factors of understanding business process
models with applying an online questionnaire [Mendling and Strembeck, 2008]. The results of
this survey support the hypothesis that personal, model and content related factors influence the
understandability of process models. As the analyzed related work has shown, previous studies
are mainly evaluating partitions of usability regarding modeling languages.
Siau and Rossi (2007) concluded a lack of empirical evaluation considering practical scenarios.
Furthermore they strongly propose further surveys to study the usability of modeling languages
[Siau and Rossi, 2008].
Mendling and Strembeck (2008) recommend future surveys for analyzing the understandability
of process models [Mendling and Strembeck, 2008].
Overhage and Schlauderer (2010) compared the usability of BPMN and UML Activity diagrams
based on the Bunge-Weber-Wand Ontology [Birkmeier et al., 2010].
The results of this literature research suggest current research activities in the field of user-
oriented assessment of graphical modeling languages. Furthermore, they show a current research
need for an overall survey evaluating the usability of graphical modeling languages. Thus, the
usability evaluation framework associated with graphical modeling languages presented in this
chapter is the first step to study usability in the domain of graphical modeling languages.
appearing in literature and standards have to be analyzed. This procedure builds up a basis
for defining usability in the domain of graphical modeling languages. Subsequently, different
attributes are operationalized by defining specific metrics for the domain of graphical modeling
languages.
Preece et al. (1994) combined effectiveness and efficiency to throughput [Preece et al., 1994].
Constantine and Lockwood (1999) and Nielsen (2006) collected the attributes defining usability
and developed an overall definition of usability attributes consisting of learnability, memorability,
effectiveness, efficiency and user satisfaction [Constantine and Lockwood, 1999, Nielsen, 2006a].
The variety of definitions concerning usability attributes led to the use of different terms and
labels for the same usability characteristics, or different terms for similar characteristics, without
full consistency across these standards; in general, the situation in the literature is similar.
For example, learnability is defined in ISO 9241-11 as a simple attribute, “time of learning”,
whereas ISO 9126 defines it as including several attributes such as “comprehensible input and
output, instructions readiness, messages readiness . . . ” [ISO/IEC9241-11, 1998, Abran et al.,
2003, ISO/IEC9126-1:2001, 2004].
Model
Interpretation
Model
User of Modeling
Development
Languages
Learnability is probably the most important attribute of usability, since a modeling language
needs to be easy to learn. Learning to use a modeling language seems to be the first
experience most users are confronted with a new modeling language [Siau and Rossi,
2008, Mayer, 1989].
Memorability describes the “remembering rate” of a modeling language. Overall it describes
the fact that a modeling language should be easy to remember regarding its elements,
syntax and semantics [Recker and Dreiling, 2007, Mayer, 1989].
Effectiveness characterises the fact, that it should be possible to reach a successful task
accomplishment. In this regard, a user should be able to develop and comprehend models
with low error rates [Bobkowska, 2005b, Wand and Weber, 1993].
Efficiency refers to users with medium-high modeling experience. Once a user has learned a
modeling language it should be possible to reach a high level of productivity. A modeling
language is efficient to use when the users are able to develop or comprehend a model
relatively quickly and correctly regarding the regulations of the modeling language [Wand
and Weber, 1993, Bobkowska, 2005a].
User Satisfaction focuses on the user and his/her subjective contentment when modeling or
interpreting a model [Siau and Wang, 2007].
Perceptibility is a very important attribute for evaluating the usability of modeling languages.
In most instances this attribute relates to the interpretation of models. This attribute is
measured by using the method of eye-tracking, which can be described as a technique to
determine eye movement and eye fixation patterns of a user [Moody and Heymans, 2010].
Regarding this evaluation framework, the additional information by using this technique
can answer questions such as does the user recognize the process flow or the diagram
structure of the model in an easy way?
A general definition of usability indicating how it is applied to modeling languages in this thesis
is presented as follows:
The usability of modeling languages is specified by learnability, memorability,
effectiveness, efficiency, user satisfaction and perceptibility: The learnability of
modeling languages describes the capability of a modeling language to enable
the user to learn how to use it. The modeling language and its semantics, syntax
and elements should be easy to remember, so that a user is able to return to
the language after some period of non-use without having to learn the language
again. The modeling language should be effective for reaching a successful task
accomplishment. The modeling language should be efficient to use, so that a high
level of working productivity is possible. The modeling language should be pleasant
to use. Users have to be satisfied when using the language. The language should
offer a convenient perceptibility regarding structure, overview, elements and shapes.
50 4 A Framework for Usability Evaluation of Modeling Languages (FUEML)
ψG(e) = {n, n }
then e is said to join n and n , and the nodes n and n are called the ends of e. As a consequence
we have
No graph and consequently no graphical modeling language can exist without nodes and edges
[Diestel, 2005].
Concerning the usability of modeling languages some metrics are basically functions that are
defined in terms of a formula but others are simple countable data. In the following, usability
metrics for modeling languages are developed and assigned to the attributes defined in the
previous section.
In order to assess the usability of modeling languages an additional possibility is to use the
method of eye-tracking [Gordon, 2004, Das et al., 2008, Ehmke and Wilson, 2007, Pretorius et al.,
2005].
4 A Framework for Usability Evaluation of Modeling Languages (FUEML) 51
4.3.1. Effectiveness
Evaluating Effectiveness F requires the analysis of a task output with measuring quantity and
quality of goal achievement [Rengger et al., 1993].
Quantity is defined as the proportion of task goals represented in the output of a task. Quality
is the degree to which the task goals represented in the output have been achieved [Bevan and
Macleod, 1994].
Bevan (1995) defined Effectiveness as a product of Quantity and Quality [Bevan, 1995].
Transferring this issue to FUEML, effectiveness can be expressed in the following formula with
applying the number of nodes N, edges E and errors R of a model developed or interpreted as
metrics for proband’s task completion and task goals [Kan, 2002]:
∑ (Ntask + Etask )
Grade o f completeness = , D = {0; 1} (4.3.2)
∑ Ngoals + Egoals
∑ Ngoals + Egoals − R
Grade o f correctness = , D = {0; 1} (4.3.3)
∑ Ngoals + Egoals
A limitation of this metric exists when the nodes and edges of a task model are greater than the
nodes and edges of the goal model. Consequently the domain of this metric is restricted to values
between zero and one.
4.3.2. Efficiency
The Efficiency G is the amount of human, economical and temporal resources. Measures of
efficiency relate to the level of effectiveness achieved to the expenditure of resources [Bevan and
Macleod, 1994].
Measure values of efficiency include time taken to complete tasks [Vuolle et al., 2008].
52 4 A Framework for Usability Evaluation of Modeling Languages (FUEML)
Hence, this can be expressed by the ratio of Effectiveness F and Task Completion Time T in
minutes:
F
E f f iciency (G) = (4.3.4)
T
4.3.3. Learnability
Learnability describes the ease of learning the semantics and syntax including different shapes
and the application of these elements. For this characteristic, the standard measure values are
based on time behavior (task completion time, interpretation time) and the accuracy [Seffah et al.,
2006].
In general, learnability is a development and can be graphically described by learning curves
[Tamir et al., 2008].
Hence, learnability can be measured by the rate of difference when the user repeats evaluation
sessions [Bevan, 1995].
Nielsen (1993) insists that highly learnable systems could be categorized as “allowing users to
reach a reasonable level of usage proficiency within a short time” [Nielsen, 1993].
Furthermore, Nielsen (2006) proposes measuring proficiency by quantity, quality and time
behavior of goal achievement. Thus, the defined efficiency metric is equal to proficiency and
can be applied for measuring learnability [Nembhard and Napassavong, 2002, Grossman et al.,
2009].
With conducting two measuring points mp and mp + 1, it is possible to analyze the relative
difference between mp and mp + 1 based on efficiency (G) for indicating Δ Learnability in
percent :
Gmp+1 − Gmp
ΔLearnability = (4.3.5)
Gmp
In the following, the presented metrics are demonstrated and proved by an example. Participants
are asked to develop a process model based on a given textual process description. The example
models are presented in Figure 4.3.1.
The goal model with N = 4 ∧ E = 3 is the model requested and the task model with N = 2 ∧ E = 1
is the possible output of one participant.
The task model contains one error due to the fact that an EPC-model must start with a triggering
event (purple). Furthermore, it is presumed that the task model was generated in 120 seconds by
participant.
4 A Framework for Usability Evaluation of Modeling Languages (FUEML) 53
Example
Customer Order
O
confirmed
Goal Task
Model Model
Based on this, the calculation of the developed metrics for effectiveness, efficiency and learnabil-
ity is shown:
E f f ectiveness (F) = 2+1
4+3 ∗ 4+3−1
4+3 = 0.43 ∗ 0.86 = 0.37
0.37
E f f iciency = = 0.19
2m
Considering that a 0.19 is the value for efficiency in session 1 and a possible value for session 2
is 0.30 leads to the following calculation of learnability:
0.30 − 0.19
Learnability = = 0.58
0.19
The given examples show that the metrics result in values that are comparable between different
modeling languages in a survey.
4.3.4. Memorability
Memorability is best measured as proficiency after a period of non-use provided a user has
already learned a language [Olle et al., 1986].
54 4 A Framework for Usability Evaluation of Modeling Languages (FUEML)
The non-use period can be minutes for simple element meanings, hours for simple syntactic
regulations and days or weeks for measuring a complete modeling language. Accordingly, the
measure values for memorability are neglect curves and time-delayed knowledge tests [Nembhard
and Uzumeri, 2000].
Concerning the usability of modeling languages, the user must remember the different elements
and its intended meaning (semantics), the syntax and the application. In due consideration of
Nielsen (2006), the measuring points interval should be several weeks regarding memorability
[Nielsen, 2006a].
Thus, for measuring memorability a knowledge test consisting of items focusing on
• elements and relations,
• syntax and
• application
of particular language is applied. The developed knowledge test contains closed questions and
multiple-choice questions. The detailed knowledge tests that are part of the upcoming surveys
are shown in appendix A.1.3. The knowledge test was carried out in German. However, the
impressions provide a translation in English.
4 A Framework for Usability Evaluation of Modeling Languages (FUEML) 55
Knowledge Test
5.) How many UML Class Diagrams did you develop or interpret approximately?
none
< 10
> 10
6.) Please name all elements of UML Class Diagrams you can remember spontaneously!
________________________________________________________________________________________________
________________________________________________________________________________________________
________________________________________________________________________________________________
7.) Please name all possible relations between the elements of UML Class Diagrams you can remember spontaneously!
________________________________________________________________________________________________
________________________________________________________________________________________________
________________________________________________________________________________________________
________________________________________________________________________________________________
56 4 A Framework for Usability Evaluation of Modeling Languages (FUEML)
8.) Evaluate the following statements by ticking one box per phrase
Correct Wrong
A class consist of a name, attributes
(4.1)
and operations
(4.7)
A composition is a directed edge
4 A Framework for Usability Evaluation of Modeling Languages (FUEML) 57
<<Class>>::Vehicle
<<Class>>::Customer
-Speed : double
-Forename : string
-Direction : double
-Surename : string
+accelerate() -Address : string
+brake()
+direction shift()
<<Class>>::Customer number
<<Class>>::Car <<Class>>::Plane <<Class>>::Boat
<<Class>>::Order item
<<Class>>::Customer -Number : int
-Quantity : int
-Forename : string
-Price : int
-Surename : string
-Disposability : int
-Address : string
1..* 1
1..*
1
<<Class>>::Bank Account
<<Class>>::Product
-Account Number : int
-Label : string -Label : string
+deposit() -Number : int
+withdraw()
+balance calculation()
<<Class>>::Lecture <<Class>>::Department
1..*
<<Class>>::Student <<Class>>::Employee
58 4 A Framework for Usability Evaluation of Modeling Languages (FUEML)
Questionnaire
5.) How many <<language the survey is focused on>> did you develop or interpret approximately?
none
< 10
> 10
6.) Please rate if you disagree or agree with the following statements
7. Please mark your overall satisfaction with the modelling language on the following line!
5.) How many UML Class Diagrams did you develop or interpret approximately?
none
< 10
> 10
Planar Retinal
Variables Variables
Horizontal Shape Size Color
Position
Vertical
Position
Textual languages encode information using sequences of characters, while visual languages
encode information using spatial arrangements of graphic and textual elements. Textual represen-
tations are 1-dimensional, whereas visual representations are 2-dimensional: A widely accepted
definition of a diagram is a representation in which information is indexed by 2-dimensional
location [Larkin and Simon, 1987].
According to dual channel theory, the human mind has separate systems for processing pictorial
and verbal material [Mayer and Moreno, 2003].
Visual representations are processed in parallel by the visual system, while textual representations
are processed serially by the auditory system [Bertin, 1983].
These differences mean that fundamentally different principles are required for evaluating and
designing visual languages. However, such principles are far less developed than those available
for textual languages [Gurr, 1999, Winn, 1990].
Moody (2010) deduces visual variables from Bertin’s Semiology of Graphics that need to be
optimized for processing by the human mind [Bertin, 1983].
Bertin (1983) identified eight visual variables that can be used to graphically encode information.
These are divided into planar variables (the two spatial dimensions) and retinal variables (features
of the retinal image). Planar variables are position-based and subsume the horizontal and vertical
position of diagram components while retinal variables describe element-based properties such
as shape, size, color, brightness, orientation and texture [Moody and Heymans, 2010].
Structuring those variables leads to variables with color properties (color, brightness, texture)
and one variable with geometric properties (shape), which are defined in the metamodel of a
graphical modeling language.
Contrariwise, the planar variables and additionally size and orientation depend on the particular
model developed or interpreted.
64 4 A Framework for Usability Evaluation of Modeling Languages (FUEML)
However, the model perspective i.e. model complexity is already considered in this survey.
Consequently, the following variables are defined for measuring the visual dimension of graphical
modeling languages:
• Number of different colors
• Number of different geometric shapes
Color is one of the most cognitively effective visual variable [Moody, 2009].
The human visual system is highly sensitive to variations in color and can quickly and accurately
distinguish between them [Winn, 2002, Mackinlay, 1986].
Differences in color are detected three times faster than shape and are also more easily remem-
bered [Treisman, 1982, Lohse, 1993].
However, color is rarely used in SE notations and is specifically prohibited in UML 2.0:
“UML avoids the use of graphic markers, such as color, that present challenges
for certain persons (the color blind) and for important kinds of equipment (such as
printers, copiers, and fax machines)” [OMG, 2005b] .
However, color should never be used as the sole basis for distinguishing between symbols as it is
sensitive to variations in visual perception (e.g., color blindness) and screen/printer characteristics
(e.g., black-and-white printers).
To avoid loss of information (robust design), color should only be used for redundant coding.
Event-driven Process Chains (EPCs) is one of few graphical modeling languages using color to
encode information. But this language makes the mistake of using it in a non-redundant way.
When diagrams are reproduced in black and white, differences between some symbols disappear
[Moody, 2009].
Language O R P C(M)
EPC 15 5 11 19.26
BPMN 90 6 143 169.07
UML Class Diagrams 7 18 18 26.40
UML Activity Diagrams 8 5 6 11.18
UML Use Case Diagram 6 6 6 10.39
UML Component Diagram 8 10 9 15.65
Table 4.1.: Complexity Values for heterogeneous Modeling Languages [Recker et al., 2009,
Indulska et al., 2009]
average number of properties per object type, average number of properties per relationship type,
and the average number of relationship types that can be linked with a particular object type in a
given method. All of which indicate the complexity of of describing relationship types or object
types in a given method. These metrics form the basis for the calculation of the total conceptual
complexity of a method, which can then be used as a benchmark for comparison of conceptual
complexity of different methods [Rossi and Brinkkemper, 1996].
C(M) = O2 + R2 + P2 (4.4.1)
Several researchers such as Indulska et al. (2009) and Recker et al. (2009) transferred those
metrics to UML diagrams, EPC’s and the BPMN [Indulska et al., 2009, Recker et al., 2009].
Table 4.1 shows a brief comparison concerning the complexity C(M) of heterogenous modeling
languages.
66 4 A Framework for Usability Evaluation of Modeling Languages (FUEML)
Language Properties:
Number of object types - Number of relationship types – Number of property types - Number of different shapes - Number of different colours
Model Interpretation
4 A Framework for Usability Evaluation of Modeling Languages (FUEML)
4.6. Conclusion
This chapter presented the development of a usability evaluation framework for modeling
languages. After elaborating basic usability definitions the relevant usability attributes were
extracted, adopted and applied to modeling languages. Subsequently, measure values for each
attribute were defined and different user situations such as interpreting or developing a model
have been considered. It was shown that previous studies focusing on the usability evaluation
of modeling languages are mainly evaluating partitions regarding the attributes of usability.
These studies are limited to particular modeling languages or usability attributes. An overall
usability survey for modeling languages is currently not available. Thus, the developed evaluation
framework provides a basis for the empirical studies conducted in the following chapters of this
thesis.
5. Developing a Generic Metric For
Measuring Model Complexity
In recent years, various object and process oriented modeling methods were de-
veloped to support the process of modeling in enterprises. When applying these
methods, graphical models are generated and used to depict various aspects of
enterprise architectures. Concerning this, surveys analyzing modeling languages in
different ways were conducted. In many cases these surveys include experimental
data collection methods. At this juncture the complexity of concrete models often
affects output of these studies. To ensure complexity value comparability of different
models, a generic metric for measuring complexity of models is proposed. Another
variable impacting on the output of a usability study in the domain of graphical
modeling languages is the complexity of different tasks applied. Since we move in
the area of modeling languages, a metric measuring task complexity by defining
complexity of models that have to be developed or interpreted by survey participants
is developed. This ensures comparability of survey results crossing different model-
ing languages. In this chapter the development of a generic metric measuring model
complexity (GCMM) is proposed. The developed metric is evaluated and applied in
the empirical surveys in this thesis [Schalles et al., 2010c].
Even though software and process modeling have been used intensely over the last decades,
only a small number of research analyzed understandability and comprehension of graphical
models [Mendling and Strembeck, 2008].
Past researches either were focusing on process models or structural models. For example,
Mendling (2008) developed metrics for process models such as Event Driven Process Chains
(EPC) [Mendling, 2008].
Metrics in software engineering have shown their potential as guidance to improve software
designs and make them more understandable and easier to maintain [Vanderfeesten et al., 2007a].
Surveys focusing on the evaluation of modeling languages include metrics measuring model
complexity in order to operationalize the influence of model complexity on particular outputs.
When analyzing these studies it appears that a great number of empirical researches apply
easy structured metrics for measuring the complexity of models. This chapter focuses on the
development of a generic metric for measuring the complexity of process models e.g. EPC as
well as structure models and UML Class Diagrams. Several researchers concluded that business
process and software program designs have a lot in common [Vanderfeesten et al., 2007b, Reijers
and Vanderfeesten, 2005]. In general, this metric aims at researchers conducting empirical
surveys on modeling languages.
The quality of a complexity measure rests on its explanatory power and applicability. Explanatory
power refers to the measure’s ability to explain the interrelationships among complexity, quality,
and other programming and design parameters. Applicability refers to the degree to which the
measure can be applied to improve the quality of work during the design, coding and testing
stages [Siau and Cao, 2001].
5.2. Size
For developing a generic model complexity metric analogous partitions of complexity metrics are
transformed. Halstead (1977) and Mc Cabe (1976) propose a set of metrics including primitive
measure values for measuring software complexity [Halstead, 1977, Mc Cabe, 1976].
Firstly, in this approach it is suggested to map model elements and relations to a set of primitive
measures proposed by Halstead (1977) and Mc Cabe (1976). For example, the number of unique
operator occurences and the number of operand occurrences are the number of nodes N and
edges E in a model. With the size S is dependent on E and R the following formula can be
72 5 Developing a Generic Metric For Measuring Model Complexity
defined:
S = ∑ (N + E) (5.2.1)
L = ∑ Ndi f + Edi f (5.3.1)
5.4. Connectivity
Beside size and semantic spread a further important part of the metric is density. One essential
element for measuring model density is described by connectivity degree of contained arcs and
vertices [Mendling, 2008].
In general, the developed connectivity degree metric is based on Yang et al. (2006) [Yang et al.,
2006]. For measuring the connectivity degree of ontologies they propose the ratio of relations
and arcs. With adding Henry and Kafura’s approach to the metric, the complexity of element’s
connections to its environment is captured [Henry and Kafura, 1981].
Hence, the fan-in and fan-out metric maps to number of element inputs NIn and number of
element outputs NOut in a particular model. Finally, the described concepts result in the following
5 Developing a Generic Metric For Measuring Model Complexity 73
∑ (NIn + NOut ) ∑ E ∗ 2
c= = (5.4.1)
∑N ∑N
Considering Ince and Hekmatpur (1988) the term ∑ (NIn + NOut ) can be simplified by ∑ E ∗ 2
[Ince and Hekmatpur, 1988].
CM = (S + L2 ) ∗ c (5.5.1)
Model S L c CM
Class Diagram I 11 3 1.67 7.45
EPC I 9 4 1.60 8.00
Activity Diagram I 14 5 2.00 12.49
Class Diagram II 25 7 1.92 16.49
EPC II 45 5 2.09 17.49
EPC III 63 8 1.90 21.45
Check Check
Warehousing
warehouse Inventory list warehouse
department
stock stock
XOR XOR
Activity
Goods Goods not Goods Goods not Diagram
available available available available
Customer
information
V V
Choose menu
Inform Purchasing Inform
Order goods Order goods Service team
customer department customer
XOR XOR
Confirm order
Goods Goods
packaged packaged
XOR XOR
XOR XOR
The other values range in between. Table 5.3 subsumes the survey complexity results CS , the
relative distribution of CM and CS and additionally the difference D of CM ∗ and CS ∗. As shown
in Table 5.3 the difference D of CM and CS is negligible.
In conclusion, the presented metric offers traceable results for evaluating the complexity of
models developed with graphical modeling languages such as UML-diagrams, EPC, BPMN,
ER-diagrams.
The metric calculates complexity of models developed with different process based and structure
based modeling languages. In the forthcoming survey it is assumed that model complexity acts
as a significant impact factor and, therefore, the impact on the survey output has to be controlled.
5.7. Conclusion
This developed metric aims for supporting empirical surveys on modeling languages. Therefore
a metric analyzing and comparing complexity of models developed with different process and
structure modeling languages was proposed.
It is important to consider semantic spread and connectivity degree in addition to model size.
Considering generality of this approach it is important to mention some restrictions: To ensure
generality this problem was solved on an abstract graph-based level. It is obvious that an
EPC-event, UML-activity and UML-class are semantically different and cannot be compared by
implication.
Hence, the proposed metric was built up focusing on graph theory i.e. nodes and edges. Subse-
quently, the development was influenced by moving from an abstract level to a concrete level
adding semantic spread. Typical application domains for this metric are empirical surveys on
modeling languages including model complexity. Another domain is the practical application of
the proposed metric in organizations.
5 Developing a Generic Metric For Measuring Model Complexity 77
Currently organizations are designing process and structure models without considering model
complexity and understandability. As a result, it may happen that simple business cases are
modelled in a complex and unsuitable way. This leads to lower understandability and higher
maintenance costs in an organization. Applying this metric might result in transparent models
that are easy to understand for interpreting users. The developed metric is evaluated and applied
in the empirical surveys in this thesis.
6. Comparing the Usability of Graphical
Modeling Languages Using FUEML
Documenting, specifying and analyzing complex domains such as information sys-
tems or business processes have become unimaginable without the support of graph-
ical models. Generally, models are developed using graph-oriented languages
such as Event Driven Process Chains (EPC) or diagrams of the Unified Modeling
Language (UML). For industrial use, modeling languages aim to describe either
information systems or business processes. Heterogeneous modeling languages
allow different grades of usability to their users. In the following chapter an eval-
uation of four heterogeneous modeling languages and their different impact on
user performance and user satisfaction is conducted [Schalles et al., 2012]. The
evaluation is based on the FUEML framework, which was developed in chapter 4 of
this thesis.
Almost all notations for software and business process specifications use diagrams as the primary
basis for documenting and communicating them. The large number of available languages
confronts companies with the problem of selecting the language most suitable to their needs.
Beside functional and technical evaluation criteria, user-oriented characteristics of modeling
languages are becoming more and more a focal point of interest in research and industry (Siau
and Wang 2007).
In the following chapter a comparative study on usability of selected modeling languages is
conducted. The remainder of this chapter is structured as follows: First, hypotheses that are
relevant for analyzing the usability of modeling languages are defined. Secondly, the research
methodology and design are introduced. Subsequently, the resulting empirical findings are
presented. Based on the empirical results implications for different application domains of
graphical modeling languages are deduced.
textually. For model development the Bflow*-toolbox1 and ArgoUML2 modeling tools were
applied. Those two modeling tools are quite similar and offer a drawing area and a general
overview of all elements of one specific modeling language. It is assumed that the modeling
tools might have a significant influence on the usability metrics, however, the aim of this study
is to analyze and to compare the usability of modeling languages. Since the design of the two
modeling tools applied is quite similar, the values of this study are comparable.
In the second part the participants were confronted with the interpretation of given models.
The interpretation scenario was structured in two parts. The first part was focusing on general
observation while the second part included verbal interpretation of given models. Data was
collected by using Tobii Eye Tracker T603 integrated a webcam and a microphone.
However, the interpretation task generates time, error, completeness and additionally eye tracking
values. At the beginning of second data collection phase the knowledge tests for measuring the
ability of remembering specific metalevel properties were distributed. Subsequently, the User
Satisfaction Questionnaire was administered to the participants. Figure 6.2 shows the overall
procedure of data collection.
and model interpretation scenarios. Furthermore, the framework contains metrics for measuring
and comparing the resulting values germane to all graphical modeling languages. The concrete
development and interpretation tasks are shown in the appendix A.1. of this thesis.
6.2.2. Sample
The sample includes second year students of business informatics. The experimental data
collection, the questionnaire and the knowledge test were conducted with these students. The
overall sample size amounts 114 students, 47% female and 53% male.
Even though a choice of students for experiments has sometimes been criticized for lack of
external validity, it is agreed with Gemino and Wand (2004) and Batra et al. (1992). They
confirmed that the selection of students over practitioners could in fact be advisable [Gemino
and Wand, 2004, Batra and Davis, 1992].
Participants that are able to bring to bear prior knowledge in software and process engineering
are sufficient for analyzing human impact criteria in the domain of graphical modeling languages
[Siau and Loo, 2006].
Hence, the selection of students overcomes the problem of controlling for any bias in technique
or domain familiarity.
With calculating the estimated means the influence of the covariate is partialled out. Concern-
ing this, the covariates are user’s general modeling experience, language experience and task
complexity.
The aim of this analysis is to determine whether there is a statistically significant difference
between EPCs, UML Activity Diagrams, UML Class Diagrams and UML Use Case Diagrams
impacting on defined usability attributes considering user experience and task complexity as
covariates. The ANCOVA including the analysis of the adjusted means was conducted using the
statistical software package SPSS 19.
For calculating an ANCOVA, the dataset has to meet several assumptions that are analyzed in
the following section:
Stevens (1999) concluded that the F-statistic is robust with respect to the normality assumption
[Stevens, 1999].
The basic reason is the Central Limit Theorem, which states that the sum of independent
observations have any distribution whatsoever approaches a normal distribution as the number
of observations increases. Bock (1975) notes, even for distributions which depart markedly
from normality, sums of 50 or more observations approximate to normality. For moderately
non-normal distributions the approximation is good with 10 to 20 observations [Bock, 1975].
Due to the fact that the sample in this thesis concsist of 114 cases, it is concluded that the lack of
normality in all variables has only a slight effect on the results of the ANCOVA.
UML
UML Use Case
EPC Activity UML Class Diagram
Diagram
Diagram
Language
19.26 11.18 10.39 26.40
Complexity
Visual
Geometrics* 1.00 0.75 0.33 0.86
Differentiation
Colors** 0.80 0.25 0.33 0.29
Effectiveness Development
(=Grade of (p=0.007), 0.943**** 0.897 0.912 0.836
completeness*Grade of (F=4.231)
correctness)
Interpretation
(p=0.000), 0.983 0.970 0.891 0.882
(F=7.463)
Efficiency Development
(=Effectiveness/Task (p=0.071) 0.074 0.099 0.051 0.033
time***) (F=2.405)
Interpretation
(p=0.046) 0.411 0.438 0.382 0.321
(F=2.466)
Learnability Development
(=Rise of Efficiency in% (p=0.042), 121.134 13.549 45.576 116.469
between Session 1 and (F=5.236)
Session 2)
Interpretation
(p=0.024), 45.267 0.294 63.321 -8.422
(F=6.563)
Memorability Both Development
(Normalized between 0 and Interpretation
and 1) 0.804 0.600 0.815 0.496
(p=0.000),
(F=18.255)
User Satisfaction General Impression
(Normalized between 0 (p=0.000), 0.734 0.679 0.649 0.577
and 1) (F=8.947)
Development
(p=0.017), 0.748 0.717 0.704 0.614
(F=3.540)
Interpretation
(p=0.037) 0.786 0.777 0.749 0.734
(F=4.789)
Perceptibility Interpretation
(Information (p=0.000),
search=Fixation count) (F=19.971) 111.000 188.000 153.000 248.000
(Information
extraction=fixation Information Search
Duration)
Interpretation
(p=0.001),
(F=8.768) 97.920 74.280 84.570 73.870
Information
Extraction
Note. All shown values are significant at least at the 0.05 level. Additional information for Significance and F-values are presented
in appendix. Bold values show best language results for each usability attribute.
* Number of different shapes/total number of different elements
** Number of different element colors/total number of different elements
*** Value taken from Appendix
****Values represent adjusted means
Table 6.2.: ANCOVA Results for Usability Attributes across heterogeneous Modeling Languages
6 Comparing the Usability of Graphical Modeling Languages Using FUEML 89
6.4. Discussion
With the exception of H4, the results for H1-H3 are indeed surprising because it is not possible
to confirm or delete a hypothesis with 100 percent. As shown in Table 6.2 every language has
advantages in usability results on the attribute level. For example, EPCs are the most usable
language for model development scenarios whereas UML Class Diagrams have high values in
extracting information out of models in model interpretation scenarios. This leads to the first
deduction from the empirical results.
It becomes obvious that it is not possible to calculate a single usability measure for evaluating
modeling languages. The results show that different languages have different advantages and
disadvantages on the usability attribute level. This finding is contributing to conclusions of
Birkmeier et al. (2010). They conducted a survey including EPC and UML Activity Diagrams on
selected usability measures. Their general conclusion was that it is not possible to make general
recommendations. Rather, it comes clear that different modeling languages have advantages or
disadvantages regarding different usability attributes [Birkmeier et al., 2010].
The results confirm the strict separation between model development and model interpretation
scenarios proposed in the usability evaluation framework (FUEML) underlying this survey
6 Comparing the Usability of Graphical Modeling Languages Using FUEML 91
(Schalles et al. 2010b). For example, UML Class Diagrams, and EPCs have a comparable
learnability in model development scenarios. Contrariwise, the learnability of these languages is
significantly less in model interpretation scenarios.
For almost all usability attributes the results are headed by behavioral languages. The empirical
results confirm that behavioral modeling languages result in higher measuring values concerning
the different usability attributes than structural modeling languages. Obviously, human beings
show a neuro-processing advantage for concrete concepts over abstract concepts [Crutch and
Warrington, 2005].
Transferring this into the domain of graphical modeling languages leads to the conclusion that
users of behavioral modeling languages have greater availability of contextual information (i.e.
concrete imagination, similarities etc.) compared to structural modeling languages. This is
extended by the fact, that EPCs show high usability in many attributes. From this can be deduced,
that process modeling is more usable than system modeling due to different abstraction levels.
Additionally, the results show that languages offering low complexity result in high values
indicating memorability. This finding is contributing to Kintsch (1998), who shows that cognitive
processes underlie the comprehension of a specific domain [Kintsch, 1998].
Nembhard and Napassavong (2002) found out that the complexity of a specific domain influences
memorability negatively [Nembhard and Napassavong, 2002]. Since our results contribute to
the intrinsic Cognitive Load, this fact underlines the application of Cognitive Load Theory
in usability assessment of graphical modeling languages. The results indicate that languages
offering visual differentiation in the metamodel are generally more usable than languages offering
low visual differentiation. This contributes to the extraneous cognitive load, which is influenced
by the way the information is represented [Sweller, 2005]. These results underline findings
of Moody and Hillegersberg (2009). They found out, that increasing visual differentiation in
language specifications could optimize the visual effectiveness and consequently the visual
perceptibility of graphical modeling languages [Moody and Hillegersberg, 2009].
The search for information in diagrams developed by using specific languages is most efficient
in EPC diagrams. A probable reason for this may be due to the fact that the specification and
the metamodel of EPCs strictly set the use of different colors and different shapes for particular
elements. Accordingly, the information structuredness in EPC diagrams is higher compared to
UML Activity, UML Use Case and UML Class Diagrams leading to more efficient information
search procedures in model interpretation scenarios [Hall and Hanna, 2004].
92 6 Comparing the Usability of Graphical Modeling Languages Using FUEML
Contrariwise, the extraction of information out of class diagrams is easier compared to EPCs,
UML Activity and UML Use Case Diagrams. However, at this stage it is essential to consider
the quality of information extraction.
With comparing the values for effectiveness it is concluded that information extraction out of
class diagrams is comparable easy but the quality of information extraction is comparable low.
A possible reason for that lies in the fact that the size of UML Class Diagrams is comparable
low in relation to information density. This increases the efficiency of information extraction.
Indeed, the information density in UML Class Diagrams is comparable higher than the density of
information in EPCs, UML Activity Diagrams and UML Use Case Diagrams. This is based on
the fact that UML Class Diagrams consist of comparable less elements and relations. But various
properties such as multiplicities can be added to them resulting in a higher grade of variation.
This fact lets class diagrams appear more difficult to use than EPCs, UML Activity Diagrams
and UML Use Case Diagrams.
Furthermore, the results imply that users prefer process modeling languages for modeling and
interpreting scenarios. On the one hand we suppose that this behavior is based on the fact that
process modeling languages are comparable less complex and easy to understand due to greater
availability of contextual and concrete information [Schwanenflugel and Shoben, 1983].
Another interesting finding is that UML Activity Diagrams support most efficient task accom-
plishment in the survey in model development as well as model interpretation scenarios. A
possible reason might be that they are not as complex as EPCs. For example, UML Activity
Diagrams include the swim lane concept for adding information about specific task (=activity)
responsibilities whereas EPCs need the organizational unit for every task (=function). This
increases the visual spread of an EPC-diagram significantly.
6.5. Implications
The implications that can be deduced from our results give insights into
• Modeling and Education,
• Modeling Language Development and
• Business Process Modeling.
6 Comparing the Usability of Graphical Modeling Languages Using FUEML 93
6.6. Conclusion
The results imply deeper potential for analyzing the usability of modeling languages. In this
chapter a comparison of different selected modeling languages and their impact of usability
attributes was conducted. What is missing yet, are the impacting characteristics on usability in
the domain of graphical modeling languages specifically.
This could be very interesting for defining recommendations focusing on further development
of existing modeling languages. Thus, a further inductive, causal study is conducted in the
following chapters to bring out new and important findings of how language specific criteria
impact on usability attributes on different causal stages.
7. Using Structural Equation Modeling
in Usability Research
The following two chapters extend the empirical analysis of chapter six by analyzing
language’s metaproperties and their causal impact on different usability attributes.
The findings show how the metaproperties of modeling languages influence usability
and connected attributes. Chapter seven introduces a statistical methodology for
detecting and analyzing causal interactions [Schalles et al., 2011a].
In the area of usability research structural equation models have not been applied intensely.
A possible reason for that might be the fact that principles of usability theory are gradually
making their way to the mainstream software applications but the underlying research is less
known [Ilomäki, 2008].
With this chapter the statistic concept of Structural Equation Modeling (SEM) is introduced.
Furthermore, a suitable algorithm for the SEM for calculating SEMs is derived. Additionally, a
method for using SEM in usability research is proposed.
The findings of this chapter are essential for the empirical analysis conducted in chapter eight of
this thesis.
these structural relations can be modelled graphically to enable a clearer conceptualization of the
theory under study.
Subsequently, the hypothesized model can be tested statistically in a simultaneous analysis of
the entire system of variables to determine the extent to which it is consistent with the data. If
goodness of fit is adequate, the model argues for the plausibility of postulated relations among
variables; if it is inadequate, the tenability of such relations is rejected.
In general, a structural equation model consists of (1) structural relations between latent variables
and (2) indicating variables for measuring latent constructs. subsection 7.1.1 shows the general
structure of a structural equation model.
The measurement model formulation depends on the direction of the relationships between the
latent variables and the corresponding manifest variables
[Fornell and Bookstein, 1982]. Consequently, two different types of measurement models are
available: the reflective model and the formative model.
In a reflective model the block of manifest variables related to a latent variable is assumed to
measure a unique underlying concept. Each manifest variable reflects the corresponding latent
variable and plays a role of endogenous variable in the block specific measurement model.
In the formative model, each manifest variable or each sub block of manifest variables represents
a different dimension of the underlying concept. Therefore, unlike the reflective model, the
formative model does not assume homogeneity nor unidimensionality of the block. The latent
variable is defined as a linear combination of the corresponding manifest variables, thus each
7 Using Structural Equation Modeling in Usability Research 97
According to Stevens (2001) a good rule of thumb is 15 cases per predictor in a standard ordinary
least squares multiple regression analysis [Stevens, 2001]. Actually, the assumptions for the
sample size are influenced by the algorithm chosen for model calculation. An intensive discussion
on SEM calculation algorithms is offered in chapter 7.1.3 of this thesis.
One of the most important assumptions for SEM is that the variables follow a normal distribution.
The normality assumption in SEM applies to all observed variables [Blanthorne et al., ]. However,
this assumptions is strongly dependend on the specific algorithm chosen for the calculation of
the structural equation model. For a more detailed discussion on that topic see chapter 7.1.3.
In educational and social science research it is impossible to collect data that is complete. For
example, when administering a survey participants may answer some questions and not others.
This missing data causes a problem for researchers using SEM techniques for data analysis.
Because SEM and multivariate methods require complete data, several methods such as the
casewise deletion, pairwise deletion and the imputation have been proposed for dealing with
missing data [Lynn, 2006]. At last, appropriate handling methods are offered in most SEM
calculation tools and additionally in statistics software such as SPSS 19.
Generally, SEM needs a strong theoretical basis for the development of the structural and the
measurement model. Famous SEM calculation algorithms such as LISREL meet the assumption
for a strong theoretical basis in the developed structural equation model. However, algorithms
offering less assumptions on the theoretical basis do exist. The Partial Least Squares (PLS)
98 7 Using Structural Equation Modeling in Usability Research
method comes into the process of theory construction at an early stage when theories have not
been developed or adapted sufficiently to propose causal relationships [Wallenburg and Weber,
2005].
Prediction
Parameter oriented:
oriented:
Explanation of
Main objective Explanation of
empirical data
latent and manifest
structures
variables
Optimal for
Optimal for
Implication prediction
parameter accuracy
accuracy
Multivariate normal
distribution and Predictor
Assumptions
independent specification
observations
Reflective and/or
Measurement Model Reflective
formative
Also appropriate
Large sample size
for small sample
(minimum
sizes (minimum
Sample Size recommendations
recommendation
reach from 200 to
reach from 30 to
800 cases)
100 cases)
Table 7.1.: Comparison of LISREL and PLS [Chin and Newsted, 1999, Fornell and Bookstein,
1982]
LISREL of 200 cases in usability research and consequently for the survey conducted in this
thesis. Table 7.1 shows a detailed comparison of LISREL and PLS.
The usability concept in the domain of graphical modeling languages is specified in the FUEML
framework by
• Learnability,
• Memorability,
• Effectiveness,
• Efficiency and
• User Satisfaction [Schalles et al., 2010a].
In the special case of the model interpretation scenario the attribute of
• Visual Perceptibility
is introduced.
The proposition is that these attributes and especially their causal interaction influence the
usability of modeling languages.
The general development of metrics for measuring usability attributes in the modeling domain are
proposed in the developed FUEML framework (see chapter 4). For reasons of transparency and
readability the following section subsumes the most important findings for measuring different
usability attributes.
Evaluating effectiveness requires analysis of task output with measuring quantity and quality
of goal achievement [Rengger et al., 1993]. Quantity is defined as the proportion of task goals
represented in the output of a task. Quality is the degree to which the task goals represented in
the output have been achieved [Bevan, 1995]. Bevan (1995) defined effectiveness as a product of
quantity and quality. Transferring this, indicating manifest variables for measuring effectiveness
are the grade of completeness and the grade of correctness of a task conducted in usability
experiments.
7 Using Structural Equation Modeling in Usability Research 101
The efficiency is the amount of human, economical and temporal resources. Measures of
efficiency relate to the level of effectiveness achieved to the expenditure of resources [Bevan,
1995]. Measure values of efficiency include time taken to complete tasks, i.e. duration time for
performing an experimental task [Vuolle et al., 2008]. Learnability describes the ease of learning
the application of applications, devices or websites. For this characteristic, the standard measure
values are based on task completion rates and the task accuracy [Seffah et al., 2006]. In general,
learnability is a development and can be graphically described by learning curves [Tamir et al.,
2008].
Hence, learnability can be measured by the rate of difference when the user repeats evaluation
sessions [Bevan, 1995]. Nielsen (2006) insists that highly learnable systems could be categorized
as “allowing users to reach a reasonable level of usage proficiency (. . . )” [Nielsen, 2006b].
Furthermore, Nielsen (2006) proposes measuring proficiency by quantity and quality and of task
fulfilment. Thus, it is proposed choosing grade of completeness and grade of correctness as basic
variables for measuring learnability. With conducting two measuring points mp and mp+1, it is
possible to analyze the relative difference between mp and mp+1 for indicating Δ learnability,
i.e. individual learning progress in percent [Nembhard and Napassavong, 2002, Grossman et al.,
2009].
The visual perceptibility is measured by using the method of eye-tracking with analyzing the
user’s visual attention [Gordon, 2004]. In our research we aim to include eye-tracking for
measuring user’s cognitive processes i.e. information search and information extraction. The
pioneering work regarding the use of eye-tracking was first carried out by Fitts et al. (1950).
They proposed that fixation length is a measure of difficulty of information extraction and
interpretation [Fitts et al., 1950]. Fixations are eye movements that stabilize the gaze over an
object of interest. During this, the brain starts to process the visual information received from the
eyes [Duchowski, 2007]. The number of fixations overall is thought to be negatively correlated
with search efficiency [Goldberg and Kotval, 1999]. Consequently, a larger number of fixations
indicates less efficient information search in a website etc. Furthermore, we aim to analyze the
difficulty of information extraction out of devices, applications and websites. Byrne et al. (1999)
propose tracking fixation duration time as a measure for information extraction [Byrne et al.,
1999]. From this follows that longer fixations times indicate a participant’s difficulty extracting
information from a website etc.
Compared to the other attributes defining usability, the individual satisfaction of a user is a user
subjective criterion that can be measured best by using standardized questionnaires [Vuolle et al.,
2008]. Currently no unified standardized method for measuring user satisfaction does exist.
Therefore, questionnaires focusing on system and website usability were adapted [Kirakowski
and Corbett, 1993, Armstrong et al., 2005]. For evaluating user satisfaction a questionnaire,
which consists of thirty items structured in 1) General impression, 2) Recommendation rate and
3) Language application was developed. The development of this questionnaire is generally
contributing to the Questionnaire for User Interaction Satisfaction (QUIS) and additionally the
Software Usability Measurement Inventory (SUMI) [Chin et al., 1988, Kirakowski and Corbett,
102 7 Using Structural Equation Modeling in Usability Research
1993].
Memorability is best measured as proficiency after a period of non-use provided a user has
already learned a language [Olle et al., 1986, Seffah et al., 2006]. Accordingly, the measure
values for memorability are neglect curves and time-delayed knowledge tests [Nembhard and
Napassavong, 2002]. Concerning usability, the user must remember the different elements and
its intended meaning (semantics), the syntax and the application. In due consideration of Nielsen
(2006), the measuring points interval should be several weeks regarding memorability [Nielsen,
2006b].
Besides, the consideration of variables influencing causal relationships between latent variables
is very important when developing the structural model. These effects are called moderating
variables and they are evoked by variables whose variation influences the strength or the direction
of a relationship between manifest variables.
1 see chapter 8 of this thesis for a detailed description of the structural model development
7 Using Structural Equation Modeling in Usability Research 103
Moderator variables can either be metric (e.g., user psychological constructs like experience
or intelligence) or categorical (e.g., gender or education level) in nature. One example of the
examination of moderating effects is a paper by [Homburg and Giering, 2001].
They find that age and income have significant effects on the strength of the relationship between
customer satisfaction and customer loyalty. In that context, age and income serve as moderator
variables. In usability context we found out that users of applications, websites etc. differ
regarding their experience. This fact influences the task accomplishment and consequently the
usability and has to be considered in a research model [Nielsen, 2006b].
Furthermore, when conducting a survey on usability evaluation, the complexity of particular
tasks must be controlled and treated as moderator variables for minimizing its influence on the
outcome.
Table 7.2 and Table 7.3 show further quality metrics and their description in the context of
PLS-calculation of structural equation models.
7.3. Conclusion
In this chapter, a general methodology for using SEM in usability research was developed.
Furthermore, a short discussion on SEM and connecting assumptions and calculation algorithms
was presented. Additionally, one possibility of developing manifest variables for latent usability
attributes was proposed.
Finally, extracted metrics for ensuring statistical quality of measurement and structural model
were deduced.
A short discussion on SEM calculation algorithms is presented. It is confirmed that PLS gives
reliable results if the following requirements are fulfilled:
• Phenomena explored are new without existing construct and measuring theories
• Structural model includes a large number of indicating variables
• Research design focuses on relative small sample size - Detection of causal paths and
predictions is focused on
7 Using Structural Equation Modeling in Usability Research 105
PLS is a powerful method of analysis because of the minimal demands on measurement scales,
sample size, and residual distributions. Although PLS can be used for theory confirmation, it can
also be used to suggest where relationships might or might not exist and to suggest propositions
for later testing.
As a result of this chapter the following method for conducting a PLS-analysis in usability
research is proposed:
1. Theoretical deduction of Hypotheses based on causal relations
2. Definition of manifest variables for measuring latent constructs
3. Calculation of the PLS-Algorithm
4. Quality assessment of structural and measurement model results
5. Result interpretation
PLS is based on least squares estimation with the primary objective of maximizing the explanation
of variance in a structural equation model’s dependent constructs. Literature suggests that PLS
is primarily intended for causal-predictive analysis in situations of high complexity but low
theoretical information [Chin, 1998].
The PLS method does not require strong theory and can be used as a theory-building method
[Gefen et al., 2000]. Considering this, it is concluded that this method is appropriate to calculating
structural models in usability research due to incomplete theoretic background findings and
missing unified empirical proved measurement scales by now. As a consequence, this chapter
builds the basis for the following empirical analysis conducted in chapter 8.
8. Impact of Metaproperties on the
Usability of Graphical Modeling
Languages
In this chapter the impact of metamodel properties on usability attributes in the
domain of graphical modeling languages is analyzed. The study is based on a model
of hypotheses including two structural models depending on particular user scenario.
The models were developed under consideration of psychological cognitive theories
and usability theory. Survey data is collected and the causal relations hypotheses
are assessed using a structure equation modeling approach. The outcome of this
study shows important findings for practical and theoretical issues of how differing
modeling languages are influencing usability attributes on causal stages in the
modeling domain. The main focus of this chapter is to define differences and
similarities of how language metaproperties impact on usability attributes in model
development and model interpretation scenarios. This chapter aims to answer the
research question Q2 “How do different modeling languages impact on different
usability attributes” [Schalles et al., 2011b].
Considering the FUEML Framework, the usability concept in the domain of graphical modeling
languages is specified by learnability, memorability, effectiveness, efficiency, and user satisfac-
tion [Schalles et al., 2010a]. In case of model interpretation scenarios the attribute of visual
perceptibility is introduced. The proposition is that these attributes and especially their causal
interaction influence the usability of modeling languages.
Usability literature and transferred theories only set the different attributes on one causal level.
For example, Nielsen (2006) and Abran et al. (2003) state that usability is affected by attributes
with same weightings [Nielsen, 2006a, Abran et al., 2003]. The general idea behind this analysis
is that the usability of modeling languages is defined by chosen attributes on different stages.
Furthermore, a causal interaction between usability attributes, which is examined in this empirical
research, is proposed.
Control Variables
semantics, syntax, element colors and element shapes of specific modeling language [Burton et al.,
2009]. Consequently, these variables are subsumed as exogenous variables, visual properties
(VP) and language complexity (LC), in the structural model.
modeling languages, which are difficult to learn, offer a limited user individual application. This
influences task completion rates and task error rates, which are manifest variables for measuring
the latent construct effectiveness.
HYPOTHESIS 6a. The user’s ability to remember the range of elements, relations
and syntactic regulations (MA) is positively related to the user’s ability of performing
tasks with minimal errors and maximal completeness (ES)
Memorability describes the “remembering rate” of a modeling language. Overall it describes the
fact that a modeling language should be easy to remember regarding its elements, syntax and
semantics [Mayer, 1989]. Memorability is a very important attribute for measuring the usability
of modeling languages considering that users may not be using a modeling language all the
time [Nielsen, 2006a]. Hence, it is hypothesized that a modeling language, which is easy to
remember results in less errors and higher completion rates in model development tasks.
HYPOTHESIS 7a. The gradient of a language‘s learning curve (LA) is positively
related to the efficiency (EY) that is offered by modeling languages during applying
them
Learnability is probably the most important attribute of usability, since a modeling language
needs to be easy to learn. Learning to use a modeling language in development scenarios seems
to be the first experience most users are confronted with a new modeling language [Siau and
Rossi, 2008,Mayer, 1989]. Easy to learn languages offer a higher user-individual learning growth
and consequently higher curve gradients based on task completion time values than difficult to
learn modeling languages [Tamir et al., 2008]. It is stated that this effect supports efficiency in
the modeling domain.
HYPOTHESIS 8a. The user’s ability to remember the range of elements, relations
and syntactic regulations (MA) is positively related to efficient task accomplishment
(EY) offered by the modeling language
Usability research shows that memorability is an initial basis for applying a system or a website
[Nielsen, 2006a]. Transferring this it is stated that some modeling languages are easier to
remember than other. For example, it seems that BPMN elements are not easy to remember
because of its high range of different element types. From this fact can be deduced that an
efficient use and consequently a fast task completion is influenced by the memorability of the
different metamodel properties a language consists of.
HYPOTHESIS 9a. The ability to perform a task with minimal errors and maximal
completeness (ES) is positively related to user‘s individual satisfaction (US) with a
modeling language
Effectiveness characterizes the fact, that it should be possible to reach a successful task accom-
plishment. In this regard, a user should be able to develop models with low error rates and high
task completion rates [Bobkowska, 2005a, Wand and Weber, 1993]. Regarding the usability
of modeling languages in the development scenario it can be deduced that languages offering
112 8 Impact of Metaproperties on the Usability of Graphical Modeling Languages
Session 2
high effectiveness result in higher user satisfaction values. In contrast, it is stated that languages
offering low effectiveness values affect user’s individual satisfaction negatively.
HYPOTHESIS 10a. The Efficiency of task completion (EY) is positively related to
user’s individual satisfaction (US) of modeling languages
A modeling language is efficient to use when users are able to develop or comprehend a model
relatively quickly and correctly regarding the regulations of the modeling language. Once a user
has learned a modeling language it should be possible to reach a high level productivity regarding
task completion time [Bobkowska, 2005b, Wand and Weber, 1993]. Hence, it is hypothesized
that languages, which afford an efficient task completion in model development scenarios (H10a),
result in higher values concerning user satisfaction.
The data collection was based on two different modeling concepts and connected languages.
On the one hand process based languages, Event driven Process Chains (EPC), UML Activity
Diagrams and on the other hand structure based modeling languages, UML Use Case and UML
Class Diagrams. For measuring learnability a second measuring point was introduced.
For model development the Bflow*-toolbox1 and ArgoUML2 modeling tools were applied. At
the beginning of second data collection phase the knowledge tests for measuring the ability of
remembering specific metalevel properties were distributed. Subsequently, the User Satisfaction
Questionnaire was administered to the participants.
1 http://www.bflow.org
2 http://www.argouml.tigris.org
8 Impact of Metaproperties on the Usability of Graphical Modeling Languages 113
In this section chosen manifest variables working as indicators for latent constructs in the research
model are theoretically underlain. The general idea is transferred from the FUEML evaluation
framework proposed in chapter 4. However, various adaptions to the structural models has been
made and are introduced in the following paragraphs.
Effectiveness Evaluating effectiveness requires analysis of task output with measuring quan-
tity and quality of goal achievement [Rengger et al., 1993]. Quantity is defined as the proportion
of task goals represented in the output of a task. Quality is the degree to which the task goals
represented in the output have been achieved [Bevan and Macleod, 1994]. Bevan (1995) defined
effectiveness as a product of quantity and quality [Bevan, 1995]. Transferring this to the pro-
posed research model, indicating manifest variables for measuring effectiveness are the grade of
completeness and the grade of correctness of model development tasks [Schalles et al., 2010a].
Efficiency The efficiency is the amount of human, economical and temporal resources. Mea-
sures of efficiency relate to the level of effectiveness achieved to the expenditure of resources [Be-
van and Macleod, 1994]. Measure values of efficiency include time taken to complete tasks, i.e.
duration time for performing model development tasks [Vuolle et al., 2008].
1. Model Development Time in seconds
Learnability Learnability describes the ease of learning the application of modeling languages
in model development scenarios. For this characteristic, the standard measure values are based
on task completion rates and the task accuracy [Seffah et al., 2006]. In general, learnability
is a development and can be graphically described by learning curves [Tamir et al., 2008].
Hence, learnability can be measured by the rate of difference when the user repeats evaluation
sessions [Bevan, 1995]. Nielsen (2006) insists that highly learnable systems could be categorized
as “allowing users to reach a reasonable level of usage proficiency (. . . )” [Nielsen, 2006a].
Furthermore, Nielsen (2006) proposes measuring proficiency by quantity and quality and of task
fulfillment. Thus, grade of completeness and grade of correctness were chosen as basic variables
for measuring learnability. With conducting two measuring points mp and mp + 1, it is possible
to analyze the relative difference for indicating Δ learnability, i.e. individual learning progress in
percent [Nembhard and Napassavong, 2002, Grossman et al., 2009].
The applied variables for measuring learnability in this survey are calculated as follows:
1. Grade o f completenessmp+1 − Grade o f completenessmp
2. Grade o f correctnessmp+1 − Grade o f correctnessmp
114 8 Impact of Metaproperties on the Usability of Graphical Modeling Languages
User Satisfaction Compared to the other latent variables in our research model, the individual
satisfaction of a user while developing or interpreting a model is a user subjective criterion that
can be measured best by using standardized questionnaires [Vuolle et al., 2008]. Currently no
standardized method for measuring user satisfaction in the modeling domain exists. Therefore,
questionnaires focusing on system and website usability were mapped [Kirakowski and Corbett,
1993, Armstrong et al., 2005]. For evaluating user satisfaction a questionnaire was developed,
which consists of thirty items structured in
1. general impression,
2. recommendation rate and
3. language application.
The constructs were measured with 5-point Likert-scales. The development of this questionnaire
is generally contributing to the Questionnaire for User Interaction Satisfaction (QUIS) and
additionally the Software Usability Measurement Inventory (SUMI) [Chin and Lee, 2000,
Kirakowski and Corbett, 1993]. In addition to that, variables focusing on user perception for
measuring user satisfaction in the research model were specified [Maes and Poels, 2007].
Language Complexity The exogenous variable language complexity was tracked track by
number of different elements, number of different relations and number of different properties
(LC) under consideration of Rossi and Brinkkemper’s (1996) OPRR-model and particular
expansions by Recker et al. (2009) and Indulska et al. (2009) [Rossi and Brinkkemper, 1996,
Recker and Dreiling, 2007, Indulska et al., 2009]. A detailed deduction of those properties is
given in the FUEML framework in chapter 2.
1. number of elements
8 Impact of Metaproperties on the Usability of Graphical Modeling Languages 115
2. number of relations
3. number of properties
Visual Properties Furthermore, for indicating visual properties findings of Bertin (1983) and
Moody and Heymans (2010)were adopted. Hence, there are six retinal visual variables, which
can be used to graphically encode information: Shape, Size, Color, Brightness, Orientation and
Texture. Those variables are structured in color (color, brightness, texture) and shapes (shape,
size, orientation), which are defined in a language’s metamodel [Bertin, 1983] [Moody and
Heymans, 2010].
Consequently, the number of different colors and the number of different geometric shapes act
as variables for measuring visual properties (VP).
1. number of different shapes
2. number of different colors
Moderators Additionally, two moderating variables, which affect causal relations in our
model were analyzed. First, the participant’s experience of developing or interpreting models
and secondly the particular complexities of development or interpretation tasks influence the
causal relations in our study. For measuring modeling experience, participant’s individual
experiences in
1. general modeling experience and
2. language experience on a 5-point Likert-scale are tracked.
Finally, model complexity is operationalized by three indicator variables:
1. number of elements and relations (size),
2. connectivity degree
3. semantic spread.
Table 8.1 provides an overview of applied items for measuring the latent constructs:
8.1.2.3. Pretest
A pretest was conducted prior collecting data for the field test. The research instruments were
tested for reliability, content validity and construct validity. Necessary changes were made to
improve measuring instruments. All pilot test participants were excluded from the analysis
sample.
116 8 Impact of Metaproperties on the Usability of Graphical Modeling Languages
Number of relations
Number of properties
Δ Grade of Completeness
Syntax
Application
User Satisfaction (US) I am likely to choose this language for modeling my business cases
The number of different elements and relations in language’s metamodel are confusing
Connectivity Degree
Semantic Spread
Modeling Experience
An exploratory factor analysis in SPSS 19 for each construct of the model was conducted
including all defined items using an oblique rotation (Promax). Compared to orthogonal rotation
methods, Promax assumes that the factors are correlated. Fabrigar et al. (1999) concluded that
human behavior is rarely partitioned into neatly packaged units that function independently of
one another [Fabrigar et al., 1999]. Hence, some correlation among factors is expected.
In all cases the Bartlett-test of sphericity indicating independency of construct items among was
accepted. Consequently, different factors were analyzed and variables were assigned to specific
factors considering Kaiser’s criterion [Kaiser, 1974]. Indicating acceptable validity, items with
loadings smaller than 0.5 should be excluded from the research model. However, in our case, all
item loadings exceed the threshold of 0.5. Consequently, no items were excluded. A detailed
presentation of these values is given in the appendix A.3. (pnline supplement)
Multiple indicators measured all but one construct. The exception was EY, which represents
a discrete value and therefore can be appropriately measured with a single item focusing on
task completion time. Language Complexity (LC), Memorability (MA), Learnability (LA),
Effectiveness (ES) and User Experience (EX) were conceptualized and measured as aggregations
of different manifestations; thus the direction of causality is from indicator to construct (i.e.
formative). The other constructs were operationalized as reflective indicators.
As the research instrument of this study contains several collection methods such as questionnaire,
knowloedge test and empirical tasks the possibility of missing data was deemed reasonable. The
rate of missing values in the dataset is less than two percent. For dealing with missing values the
multiple imputation method offered by SPSS 19 was chosen.
Multiple imputation accounts for missing data by restoring not only the natural variability in
the data, but also by incorporating the incertainty caused by estimating missing data. The
3 SmartPLS is a software application for the modeling of Structural Equation Models (SEM) and their calculation
with the methodology of the Partial Least Squares (PLS) approach [Ringle et al., 2011].
118 8 Impact of Metaproperties on the Usability of Graphical Modeling Languages
performance of multiple imputation in a variety of missing data situations has been well-studied
and it has been shown to perform favorably in connection with PLS [Lynn, 2006].
Internal consistency reliability for our model was evaluated using Cronbach’s Alpha, corrected
item total correlation and average variance extracted (AVE)
[Fornell and Larcker, 1981]. Cronbach’s Alpha coefficients were all higher than the proposed
minimum cutoff score of 0.70 [Nunnally and Bernstein, 1994]. Alpha values for Experience
(EX) are marginally less (i.e.0.68) than minimum threshold. However, [Barker et al., 1994]
conclude that values between 0.60 and 0.70 are marginal. Furthermore, all reflective constructs
had an minimum AVE of 0.5, indicating adequate internal consistency of our model [74].
For testing reliability of formative constructs, R2-value proposed by Chin (1998) with a minimum
cutoff of 0.3 was analyzed [Chin, 1998]. Furthermore, Diamantopoulos and Winklhofer (2001)
concluded that sufficient significant regression weights between formative constructs and other
constructs in the path model are indicating formal construct validity [Diamantopoulos and
Winklhofer, 2001].
As shown in the following section all relevant path regression weights are at least significant
at 0.05-level. According to Fornell and Larcker (1981), constructs have adequate discriminant
validity if the square root of AVE is higher than variance shared between construct and other
constructs in the model [Fornell and Larcker, 1981]. In all cases the correlations between each
pair of constructs were lower than the square root of the AVE for specific construct. In conclusion,
these results confirm that all constructs in our model are empirically distinct.
Table 8.2 shows the applied quality metrics for ensuring reliability of the SEM.
Figure 8.1.3 and Figure 8.2.3 present the results of structural model testing including regression
weights and significance of the paths. According to Lohmöller (1989), path regression weights
should be at least 0.10 in order to be considered meaningful for discussion [Lohmöller, 1989].
Our results confirmed the general assumption that language’s metamodel properties are influenc-
ing usability attributes on different stages. According to Chin (1998) and for ensuring complete
model assessment, effect size f2, which is indicating whether a path’s latent exogenous variable
has a significant influence (effect) on latent endogenous variable or not is shown additionally.
Thresholds for f2 are 0.02 (weak), 0.15 (medium) and 0.35 (strong) [Chin, 1998].
LC has a strong negative and highly significant impact on MA (beta=-0.934, f2=0.59, p<0.001).
This result supports hypothesis H4a. Against the theoretic based expectation the influence of
LC on LA is neither strong nor significant (beta=-0.038, f2=0.01). Furthermore, VP is strongly
8 Impact of Metaproperties on the Usability of Graphical Modeling Languages 119
Constructs ES EX EY LA LC MA TC US VP
ES NA
EX 0.21 0.85
EY 0.21 -0.76 1.00
LA 0.02 0.31 0.42 NA
LC 0.14 0.17 0.06 -0.04 NA
MA 0.29 0.23 0.05 0.05 -0.93 NA
TC -0.52 0.30 -0.34 0.27 0.23 0.06 0.96
US 0.46 0.13 -0.12 0.24 0.27 0.34 -0.42 0.59
VP 0.17 0.59 0.25 0.82 0.60 -0.05 0.53 0.40 0.99
Notes. (1) Diagonal elements are the square root of average variance extracted (AVE). These values should exceed
the interconstruct correlations for adequate discriminant validity. (2) The italic values show correlations included
in our structural models. (3) NA - Does not apply for formative measures.
Control Variables
0.756** -0.339*
0.228*
-0.516**
-0.419***
0.091
***
(R2=0.335)
16 (R2=0.436)
0.8 0.4
59
***
8
.03
-0
(R2=0.203)
0*
0.
23
42
0.
4*
2*
2
-0.93 .1
4*** -0
0.048
-0.047 (R2=0.263)
(R2=0.486)
connected to LA, which is contributing to H1a (beta=0.816, f2=0.23, p<0.001). Contrariwise, the
impact of VP on MA is not significant (beta=-0.047, f2=0.029). Consequently, H2a is rejected.
MA is positively influencing ES, which is contributing to H6a (beta=0.230, f2=0.21, p<0.01).
However, MA is not significantly influencing EY and consequently H8a is not supported by the
results (beta=0.048, f2=0.01). LA is positively impacting on EY (beta=0.422, f2=0.14, p<0.05).
This result is supporting H7a. Furthermore, the results show that H5a, which is defining causal
path between LA and ES, is not supported (beta=0.091, f2=0.16). Additionally, a strong positive
correlation between ES and US (beta=0.459, f2=0.28, p<0.001) exists. This result is supporting
H9a. Finally, the results explain a negative correlation between EY and US (beta=-0.124, f2=0.17,
p<0.05). This fact is not contributing to H10a.
Turning to model fit, the R-square values for MA, LA, EY, ES and US were 0.486, 0.335, 0.263,
0.436, and 0.203, indicating that the model explains substantial variation in these variables. For
example, the R-square value for MA implies that the causes specified in this model, VP and
LC, jointly explain 49% of the total variance in MA. In summary, the results show that most
hypotheses in the research model for model development scenario are fully supported.
However, H10a is not supported by the results. Furthermore, H2a, H3a, H5a and H8a could not
be confirmed by significant results. As a consequence, particular hypotheses are not confirmed
8 Impact of Metaproperties on the Usability of Graphical Modeling Languages 121
No. Path
1 V P → LA → EY → US
2 V P → MA → ES → US
for further comparable samples. The resulting regression weights are valid for the specific sample
and should be proved in further surveys based on this research model.
Causal Path 1 The use of multiple colors and geometric shapes (VP) in language’s metamodel
supports LA and EY positively in model development scenarios. The theoretical reason is that
various colors and shapes lead to more information structuredness [Hall and Hanna, 2004]. We
deduce that the grade of structuredness of information has positive effects on learnability and
consequently the efficiency [Sweller, 2005]. From this result we deduce, that languages including
various element shapes and colors in the metamodel are easier to learn regarding developing
models than languages offering less shape and color variation. Consequently, we conclude that
for example learning developing models with EPC’s is easier than with UML-Activity-Diagrams
122 8 Impact of Metaproperties on the Usability of Graphical Modeling Languages
or BPMN. Furthermore we reason that VP impacts on EY indirectly. From this follows, that
languages including various element shapes and colors in the metamodel support a fast task
accomplishment in development scenarios. We found out that time used for task completion in
model development scenarios is not positively impacting on User Satisfaction. This might be
underlining former findings of Walker (1998). In their studies they found out that users have
demonstrated preferences for systems with which they performed less efficiently [Walker et al.,
1998].
Causal Path 2 Form the results is derived that the number of elements, relations and properties
(LC) is negatively influencing memorability in development tasks. For finding a theoretical
background for this result we start with applying Sweller’s Cognitive Load Theory [Sweller,
2005]. They analyzed three loads, which are defining human working memory during instruction.
Human’s working memory is a basis for extracting, processing and storing information. In our
case we focus on intrinsic cognitive load. Intrinsic cognitive load is influenced by information
complexity. Transferring this to our research we conclude that LC impacts user’s intrinsic
cognitive load. Sweller (2005) proposes reducing information complexity to keep intrinsic
cognitive load low. With considering this we come to the conclusion that complex modeling
languages such as BPMN are offering worse memorability than less complex languages such as
UML-activity-diagrams. Additionally, MA is positively supporting ES and consequently US.
From this we conclude that ability to complete model development tasks completely and correctly
is highly correlating to User Satisfaction values. In conclusion, modeling languages offering
low metamodel complexity i.e. EPC’s, UML-Use-Case-diagrams, support user’s individual
satisfaction with developing a model based on particular language.
influences all defined usability attributes in development scenarios negatively. Concerning this,
memorability is influenced most by complexity rise (-0.93).
Another interesting finding is that an effective task accomplishment (ES) is not caused by
language complexity (-0.00) in development scenarios. Furthermore, the total effect of language
complexity on learnability is comparable low (-0.04).
Second, for visual properties (VP), the total effect on the defined usability attributes is in all cases
positive. It becomes obvious that the total effect variance of VP between usability attributes is
not as high as LC.
In contrast, visual language properties seem not to be influencing the memorability of elements
and relations in model development scenarios. A possible reason can be found in cognitive
information research and especially in research focusing on human screen and icon recognition.
Lansdale (1988) found out that task performance is essentially easier when the participant
has to select between given items, rather than having to generate the missing information
from memory [Lansdale, 1988]. Transferring this to the presented research model leads to the
conclusion that in model development scenarios the combination of different given elements
(e.g. in the modeling tool side-bar) is the basis for developing models. Consequently, by using
modeling tools the model developer doesn’t have to generate the missing elements from memory
to full extent. Model developers have to remember elements and relations abstract semantics and
as a consequence their intended meaning.
Additionally, our results show a weak regression weight between complexity of modeling lan-
guages and learnability of language application. This result supports the thesis that in information
science learnability is influenced by graphical information representation. Concerning learnabil-
ity, information complexity plays a secondary role in model development scenarios. Furthermore,
the learnability of language application is strongly influencing efficiency in task performance. In
contrast to that, learnability is not influencing effectiveness significantly. Finally, effectiveness,
i.e. accurate and complete task performance is highly impacting on user satisfaction. On the
other hand, efficiency is influencing user satisfaction negatively. This might be underlining
former findings of Walker (1998). In their studies they found out that users have demonstrated
preferences for systems with which they performed less efficiently [Walker et al., 1998].
Control Variables
the elements and syntax of the modeling language in model interpretation scenarios
(MA)
HYPOTHESIS 3b. The complexity of a modeling language (LC) affects negatively
the probands‘ ability to learn this language in model interpretation scenarios (LA)
HYPOTHESIS 4b. Language complexity (LC) affects negatively the user‘s ability to
remember elements, relations and syntax within a period of non use/training (MA)
HYPOTHESIS 5b. The gradient of a language‘s learning curve (LA) is positively
related to the ability of completing a task with minimal errors and maximal com-
pleteness (ES)
HYPOTHESIS 6b. The user’s ability to remember the range of elements, relations
and syntactic regulations (MA) is positively related to the user’s ability of performing
tasks with minimal errors and maximal completeness (ES)
HYPOTHESIS 7b. The gradient of a language‘s learning curve (LA) is positively
126 8 Impact of Metaproperties on the Usability of Graphical Modeling Languages
related to the efficiency (EY) that is offered by modeling languages during applying
them
HYPOTHESIS 8b. The user’s ability to remember the range of elements, relations
and syntactic regulations (MA) is positively related to efficient model interpretation
(EY)
HYPOTHESIS 9b. The ability to perform a task with minimal errors and maximal
completeness (ES) is positively related to user‘s individual satisfaction (US) with a
modeling language
HYPOTHESIS 10b. The Efficiency of task completion (EY) is positively related to
user’s individual satisfaction (US) of modeling languages
In the following section specific hypotheses H11-H15 for model interpretation scenarios including
the method of eye-tracking are proposed:
HYPOTHESIS 11. The variance of visual language properties (VP) set in the meta-
model of the modeling language is positively influencing language’s perceptibility
(PA)
Many researchers analyzed the influence of visual differentiation caused by varying geometric
shapes and colors in usability and primarily neurophysical research. For example, Westphal
and Würtz (2009) investigated that visual differentiation is supporting object recognition and
consequently information search and information extraction [Westphal and Würtz, 2009]. How-
ever, in the structural model language’s perceptibility is measured by values indicating cognitive
processes e.g. information search and information extraction [Underwood, 2005]. Furthermore,
Underwood (2009) corroborates the hypothesis that visual characteristics of an image are in-
fluencing eye movements [Underwood, 2009]. From this it is deduced, that visual language
properties, i.e. colors, geometric shapes, are positively influencing language’s perceptibility due
to stronger visual differentiation in model diagrams.
HYPOTHESIS 12. The complexity of modeling languages (LC) is negatively influ-
encing visual perceptibility (PA)
The complexity of modeling languages, which is set in the language’s metamodel, is strongly
connected with syntactical and semantical complexity. For example, UML Class Diagrams
contain a high range of syntactically different relations (e.g. association, aggregation etc.), which
can be expanded by cardinalities. Furthermore, a UML Class Diagram generally includes two
different class types: standard and abstract classes. Pan et al. (2004) analyzed the viewing
behavior of web pages by using an eye-tracker [Pan et al., 2004]. They come to the conclusion
that visual complexity negatively contributes to eye-movement behavior due to difficulty of
information search and information extraction. The resulting hypothesis is that syntactic and
semantic language properties are negatively influencing the perceptibility of a diagram developed
by the application of specific modeling languages.
8 Impact of Metaproperties on the Usability of Graphical Modeling Languages 127
The data collection was based on two different modeling concepts and connected languages.
On the one hand process based languages, Event driven Process Chains (EPC), UML Activity
Diagrams and on the other hand structure based modeling languages, UML Use Case and UML
Class Diagrams. In the model interpretation scenario the participants were confronted with
several models developed by using the defined languages. Hereby, each interpretation process
per session consists of two parts. In the first part participants were asked to gaze the presented
model generally. During this period the basic values for eye tracking, fixation duration and the
number of fixations, were collected. After that, participants were asked to analyze the entire
128 8 Impact of Metaproperties on the Usability of Graphical Modeling Languages
Session 2
model verbally. During the whole data collection phase the specific models were presented to
the participants on screen.
The following table shows the items used for measuring the variables in model interpretation
scenarios:
Similar to the development scenario, an exploratory factor analysis in SPSS for each construct
of our models including all defined items using an oblique rotation (Promax) was conducted.
In all cases the Bartlett-test of sphericity indicating independency of construct items among
was accepted. Consequently, different factors were analyzed and variables were assigned to
specific factors considering Kaiser’s criterion [Kaiser, 1974]. Indicating acceptable validity,
items with loadings smaller than 0.5 should be excluded from the research model. However, in
the interpretation scenario, all item loadings exceed the threshold of 0.5. Consequently, no items
were excluded. A detailed presentation of these values is given in the appendix A.3. (online
supplement).
4 SmartPLS is a software application for the modeling of Structural Equation Models (SEM) and their calculation
with the methodology of the Partial Least Squares (PLS) approach [Ringle et al., 2011].
8 Impact of Metaproperties on the Usability of Graphical Modeling Languages 129
Number of relations
Number of properties
Syntax
Application
Fixation length
User Satisfaction (US) I am likely to choose this language for interpreting my business cases
The number of different elements and relations in language’s metamodel are confusing
Modeling Experience
As the research instrument of this study contains several collection methods such as questionnaire,
knowloedge test and empirical tasks the possibility of missing data was deemed reasonable. The
rate of missing values in the dataset is less than two percent. For dealing with missing values the
multiple imputation method offered by SPSS 19 was chosen.
Multiple imputation accounts for missing data by restoring not only the natural variability in
the data, but also by incorporating the incertainty caused by estimating missing data. The
performance of multiple imputation in a variety of missing data situations has been well-studied
and it has been shown to perform favorably in connection with PLS [Lynn, 2006].
Referring to the quality metrics deduced in subsubsection 8.1.3.3 it becomes obvious that the
measurement model as well as the structural model are empirically distinct. Consequently,
Cronbach’s Alpha, Composite Reliability, Average Variance Extracted, Stone-Geisser-criterion
Q2 and R2 fulfil the minimum cutoff proposed in literature.
8 Impact of Metaproperties on the Usability of Graphical Modeling Languages 131
Control Variables
"
"
!
#
!
#
!
!
#" #"
!
"
!
#"
#
LC has a strong negative and highly significant influence on MA (beta=-0.715, f2=0.80, p<0.001).
This empirical result supports the hypothesis H4b. LC has also a strong significant negative
impact on PA underlining H12b (beta=-0.418, f2=0.16, p<0.001). Furthermore, LC has a negative
significant relation to LA contributing to H3b (beta=-0.068, f2=0.02, p<0.05). However, this
path disposes not to Lohmöller’s (1989) proposed threshold for path weighting of 0.1.
VP is positively influencing LA of applied modeling languages (beta=0.208, f2=0.02, p<0.01).
In addition to that, VP is positively influencing PA (beta=0.303, f2=0.03, p<0.05). Considering
this, all hypotheses in the research model connected with VP are accepted.
Additionally, LA is strongly positively related to ES on a high significance level (beta=0.648,
f2=0.72, p<0.001), which is contributing to H5b. Furthermore, LA is positively affecting time
based latent construct EY and MA is positively correlating with ES. These path regression
weights are not significant (p>0.05). Deducing from that, the null hypothesis with probability
132 8 Impact of Metaproperties on the Usability of Graphical Modeling Languages
Constructs ES EX EY LA LC MA TC PA US VP
ES NA
EX -0.11 NA
EY 0.05 0.02 0.77
LA 0.65 0.14 0.10 NA
LC -0.01 -0.45 0.00 -0.07 NA
MA 0.06 0.24 -0.12 0.03 -0.72 NA
TC -0.44 0.17 -0.18 -0.21 -0.35 0.17 0.79
PA -0.01 0.30 0.44 0.03 -0.42 0.26 -0.30 0.94
US 0.36 -0.03 0.04 -0.24 -0.16 0.30 0.50 0.28 0.82
VP 0.22 0.16 -0.13 0.21 -0.41 0.22 0.64 0.30 0.35 0.98
Notes. (1) Diagonal elements are the square root of average variance extracted (AVE). These values should exceed the
interconstruct correlations for adequate discriminant validity. (2) The italic values show correlations included in our
structural models. (3) NA - Does not apply for formative measures.
No. Path
1 V P → LA → ES → US
2 V P → PA → US
3 LC → PA → US
hypothesis that Visual Properties and Language Complexity both set in language’s metamodel
act as exogenous latent variables in the research model.
Causal Path 1 This path shows, that multiple colors and geometric shapes (VP) in language’s
metamodel support LA of interpreting models developed with particular language. Consequently,
a possible theoretical reason is that various colors and shapes lead to more information struc-
turedness supporting learnability of interpreting models [Hall and Hanna, 2004]. Compared
to model development tasks LA is supporting user’s ability accomplishing tasks completely
and accurately. In former findings of cognitive psychology low gradients of learning curves
causes ineffective application of a construct in a specific domain [Anderson, 1985]. Finally,
high ES-values result in high US-values. From this path can be deduced that languages offering
multiple colors and geometric shapes support LA, ES consequently US.
Causal Path 3 This path is characterized by the positive effect of low level of language
complexity (LC) on PA. Consequently it is deduced that PA is not only impacted by VP but
also by LC. This result contributes to Pan et al. (2004). They analyze the viewing behavior of
web pages by using an eye-tracker [Pan et al., 2004]. They concluded that visual complexity
negatively contributes to eye-movement behavior due to difficulty of information search and
information extraction. It is concluded, that languages offering high metamodel complexity (i.e.
134 8 Impact of Metaproperties on the Usability of Graphical Modeling Languages
UML-Class Diagrams etc.) complicate user’s ability of searching, extracting and processing
information in model interpretation tasks.
An important result of our survey is the causal impact of visual language properties, i.e. variability
in shape geometrics and shape colors, in the field of model interpretation. The output of our
study shows that visual language properties are positively influencing the visual perceptibility of
modeling languages. This result underlines the finding that visual differentiation supports object
information search and information extraction [Westphal and Würtz, 2009].
As a consequence, the application of different colors and geometrics in a model supports inter-
preting users in searching and extracting information. Furthermore the variability in shape color
and geometrics is positively influencing learnability of model interpretation and memorability
of language’s elements and relations. Consequently, languages offering higher variability in
geometrics and colors are easier to learn concerning model interpretation. The learnability of
interpreting a model based on a certain language is strongly impacting the ability of performing
an interpretation task completely and correctly. For example, in industry and education it is
important that users can interpret developed models with a high level of completeness and
correctness [Mendling and Strembeck, 2008].
This study shows that learnability, which is positively influenced by visual language properties
acts as a basic independent variable strongly impacting on user’s ability of complete and
correct model interpretation. Furthermore, learnability is positively influencing efficiency of
model interpretation. In conclusion, learnability is a basic construct in model interpretation
scenarios. A theoretical basis might be cognitive load theory and especially intrinsic cognitive
load [Sweller, 2005]. The intrinsic cognitive load is determined by information complexity.
The interdependency of information to be learned is positively impacting cognitive load and
consequently the more important learnability appears in a causal system. Concerning modeling
languages and model interpretation, the cognitive load is high because of strong information
interdependency occurring in models. Considering our results and cognitive load theory the
importance of learnability in model interpretation is emphasized.
In due consideration of our results it consequently becomes clear that learnability is positively
impacted by visual language properties. From this follows that languages offering high visual
variability are easier to learn than other. As a consequence languages containing high visual
variability allow higher task completion and accuracy rates in model interpretation. In conclusion,
if a language should support effectiveness of model interpretation, the metamodel should offer
high visual variability in elements and relations. Concerning this, the complexity of language’s
metamodel is not determining language support of effectiveness in model interpretation.
Part III.
Specification
Desired Output
complete
Agreement
fair
common view
Representation
informal semi- formal
formal
The representation dimension copes with the different representations (informal and formal
languages, shapes, colors) used for expressing knowledge about the process.
The agreement dimension deals with the degree of agreement reached on a specification. At
the beginning of the modeling procedure each person involved has its own personal view of the
domain modelled. For example, a few steps of a business process may be shared among the
team at this stage, but the whole business process including activities, states, organization units
etc. exist only within personal views of the people stemming from the various roles they have
(business analyst, manager, user, developer etc.).
Usability
Target
Conflicts
Specification Representation
The ability of a formal representation and complete specification increases the complexity of a
modeling language compared to informal representations and opaque specifications.
The outcome of these relationships is a goal conflict between usability and the dimensions of
specification and representation.
Domain of
Elicitation
Domain Informal/opaque
Expert Specification
Validation Modelling
Business/
Software Semi-formal/fair
Analyst Specification
Validation Modelling
Developer Formal/complete
Specification
Verification
For the model interpretation scenario, questions such as whether employees understand the
information modelled do appear. Thus, companies aiming for fast, complete and correct model
interpretation, e.g. business process consulting companies, typically apply modeling languages
offering high variability in visual properties. In many cases those companies customize languages
such as the BPMN by
1. adding colors or shapes to support complete and accurate model interpretation.
2. reducing language complexity (i.e. element limitation) to support the user in model
interpretation and model development scenarios
Since EPCs offer highest usability values in development and interpretation scenarios, the
empirical results support this course of action.
Recommendations
The recommendations for the future improvement of graphical modeling languages based on the
empirical results of this thesis are as follows:
R1) Enrichment of the metamodel with color and shape information
⇒ Development Scenarios: Support of Effectiveness, Efficiency and Learnability
144 9 Further Processing of the Empirical Results
Which
Scenario is
relevant?
"
") ')"% )
9 Further Processing of the Empirical Results
Which
Attribute is Which
preferred? Attribute is
preferred?
'
!
'
'
"
!
'
$
!
"
"
' % "
"%
"
!'
" !'
single model is limited by human perceptual and cognitive abilities. Considering the following
recommendations in further language development or language customization activities lead to a
usability-oriented complexity management with particular modeling languages:
• Perceptual limits: The ability to discriminate between diagram elements increases with
diagram size
• Cognitive limits: The number of diagram elements and syntactic regulations that can be
comprehended at a time is limited by working-memory capacity. When this is exceeded, a
state of cognitive overload ensues and comprehension degrades rapidly
Due to the fact that the survey conducted in this research crosses over several modeling languages,
the different task complexities (i.e. model complexity) have to be calculated. Consequently, the
data output has to be adjusted considering the criterion of model complexity.
[Belady and Evangelisti, 1981] Belady, L. A. and Evangelisti, C. J. (1981). System partitioning
and its measure. Journal of Systems and Software, 2(1):23–29.
[Bertin, 1983] Bertin, J. (1983). Semiology of graphics: diagrams, networks, maps. University
of Wisconsin Press, Madison.
[Bevan, 1995] Bevan, N. (1995). Measuring usability as quality of use. Software Quality
Journal, 4:115–150.
[Bevan and Macleod, 1994] Bevan, N. and Macleod, M. (1994). Usability measurment in
context. Behaviour and Information Technology, 13(1):132–145.
[Birkmeier et al., 2010] Birkmeier, D., Klöckner, S., and Overhage, S. (2010). An empirical
comparison of the usability of bpmn and uml activity diagrams for business users. In
Proceedings of the 18th European Conference on Information Systems (ECIS), South Africa.
[Blalock, 1982] Blalock, H. M. (1982). Conceptualization and Measurement in the Social
Sciences. SAGE Publications.
[Blanthorne et al., ] Blanthorne, C., Jones-Farmer, A., and Almer, E. D. Why you should
consider sem: A guide to getting started. In Schmitt, D. B., editor, Advances in Accounting
Behavioral Research, volume 9, pages 179–207. Emerald Group Publishing Limited.
[Bobkowska, 2005a] Bobkowska, A. (2005a). A framework for methodologies of visual model-
ing language evaluation. ACM International Conference Proceeding Series, 214(2).
[Bobkowska, 2005b] Bobkowska, A. (2005b). Modeling pragmatics for visual modeling lan-
guage evaluation (tamodia ’05). In Proceedings of the 4th Workshop on Task Models and
Diagrams, volume 127, Gdansk.
[Bock, 1975] Bock, R. (1975). Multivariate statistical Methods in behavioural Research.
McGraw-Hill Book Company.
[Bodart et al., 2001] Bodart, F., Patel, A., Sim, M., and Weber, R. (2001). Should optional
properties be used in conceptual modelling? a theory and three empirical tests. Information
Systems Research, 12(1):384–405.
[Booch et al., 1998] Booch, G., Rumbaugh, J., and Jacobson, I. (1998). The Unified Modeling
Language User Guide. Addison-Wesley Professional.
[Bresciani et al., 2004] Bresciani, P., Perini, A., Giorgini, P., Giunchiglia, F., and Mylopoulos, J.
(2004). Tropos: An agent-oriented software development methodology. Autonomous Agents
and Multi-Agent Systems, 8(3):203–236.
[Bubenko, 1986] Bubenko, J. (1986). Information system methodologies; a research view.
[Burton et al., 2009] Burton, Jones, A., Wand, Y., and Weber, R. (2009). Guidelines for empiri-
cal evaluations of conceptual modeling grammar. Journal of the Association for Information
Systems, 10(6):495.532.
Bibliography 165
[Byrne et al., 1999] Byrne, M. D., Anderson, J., Douglass, S., and Matessa, M. (1999). Eye
tracking the visual search of click-down menus. In Proceedings of CHI’99, pages 402–409.
[Chen, 1976] Chen, P. P. (1976). The entity-relationship model: toward a unified view of data.
ACM Transactions on Database Systems, 1(1):9–36.
[Chiese et al., 1979] Chiese, H., Spilich, G., and Voss, J. (1979). Acquisition of domain related
information in relation to high and low domain knowledge. Journal of Verbal Learning and
Verbal Behavior, 18(1):257–273.
[Chin et al., 1988] Chin, J., Diehl, V., and Norman, K. (1988). Development of an instrument
measuring user satisfaction of the human-computer interface. Proceedings of the SIGCHI
conference on Human factors in computing systems, pages 213–218.
[Chin, 1998] Chin, W. W. (1998). Issues and opinion on structural equation modeling. MIS
Quarterly, 22(1):7–16.
[Chin and Lee, 2000] Chin, W. W. and Lee, M. (2000). A proposed model and measurement
instrument for the formation of is satisfaction: the case of end-user computing satisfaction. In
International Conference on Information Systems (ICIS) Proceedings, Brisbane.
[Chin et al., 2003] Chin, W. W., Marcolin, B. L., and Newsted, P. R. (2003). A partial least
squares latent variable modeling approach for measuring interaction effects: Results from a
monte carlo simulation study and an electronic–mail emotion/adoption study. Information
Systems Research, 14(2):189–217.
[Chin and Newsted, 1999] Chin, W. W. and Newsted, P. R. (1999). Structural equation modeling
analysis with small samples using partial least squares. In Hoyle, R., editor, Statistical
Strategies for Small Sample Research. Sage Publications.
[Codd, 1970] Codd, E. (1970). A relational model of data for large shared data banks. Commu-
nications of the ACM, 13(6):377–387.
[Comber and Maltby, 1997] Comber, T. and Maltby, J. (1997). Layout complexity: does it
measure usability?
[Constantine and Lockwood, 1999] Constantine, L. and Lockwood, L. (1999). Software for Use:
A practical Guide to the Models and Methods of Usage-Centered Design. Addison-Wesley,
New York.
[Cook and Campbell, 1979] Cook, T. and Campbell, D. (1979). Quasi-Experimentation: Design
and Analysis for Field Settings. Rand McNally.
[Cooper and Schindler, 2005] Cooper, D. and Schindler, P. (2005). Business Research Methods.
McGraw-Hill, New York.
[Coursaris and Kim, 2006] Coursaris, C. and Kim, D. (2006). A qualitative review of empirical
mobile usability studies. In Proceedings of the 12th Americas International Conference on
Information Systems (AMCIS), pages 85–94.
166 Bibliography
[Cronbach, 1951] Cronbach, L. (1951). Coefficient alpha and the internal structure of tests.
Psychometrika, 16(3):297–324.
[Crutch and Warrington, 2005] Crutch, S. and Warrington, E. (2005). Abstract and concrete
concepts have structurally different representational frameworks. Brain, 128(1):615–627.
[Dahl and Nygaard, 1966] Dahl, O.-J. and Nygaard, K. (1966). Simula: an algol-based simula-
tion language. Commun. ACM, 9:671–678.
[Dancey and Reidy, 2011] Dancey, C. P. and Reidy, J. (2011). Statistics Without Maths for
Psychology. Prentice Hall, Harlow, 5th edition.
[Das et al., 2008] Das, S., McEwan, T., and Douglas, D. (2008). Using eye-tracking to evaluate
label alignment in online forms. In Proceedings of the 5th Nordic conference on Human-
Computer-Interaction (NordiCHI08), pages 451–454. ACM.
[De Angeli et al., 2006] De Angeli, A., Sutcliffe, A., and Hartmann, J. (2006). Interaction,
usability and aesthetics: what influences users’ preferences? Proceedings of the 6th conference
on Designing Interactive systems, pages 271–280.
[Diamantopoulos and Winklhofer, 2001] Diamantopoulos, A. and Winklhofer, H. (2001). Index
construction with formative indicators: An alternative to scale development. J. Mark. Res.,
38(2):269–277.
[Diestel, 2005] Diestel, R. (2005). Graph Theory (Graduate Texts in Mathematics). Springer,
3rd edition.
[Duchowski, 2007] Duchowski, A. T. (2007). Eye Tracking Methodology - Theory and Practice.
Springer, New York, 2 edition.
[Dumas and Redish, 1999] Dumas, J. and Redish, J. (1999). A practical guide to usability
testing. Greenwood Publishing Group, Westport, 2nd edition edition.
[Dumas et al., 2005] Dumas, M., Hofstede, A., and Russel, N. (2005). Pattern-based analysis of
bpmn - an extensive evaluation of the control-flow, the data and the resource perspectives.
[Ehmke and Wilson, 2007] Ehmke, C. and Wilson, S. (2007). Identifying web usability prob-
lems from eye-tracking data. In Proceedings of the 21st British HCI Group Annual Conference
on People and Computers (BCS-HCI’07), pages 119–128. British Computer Society.
[Eloranta et al., 2006] Eloranta, L., Kallio, E., and Thero, I. (2006). A notation evaluation of
bpmn and uml ad.
[Elsuwe and Schmedding, 2001] Elsuwe, H. and Schmedding, D. (2001). Metriken für uml-
modelle. Informatik Forschung und Entwicklung, 18(1):22–31.
[Esswein et al., 2004] Esswein, W., Gehlert, A., and Seiffert, G. (2004). Towards a framework
for model migration. Advanced Information Systems Engineering, Proceedings, 3084:463–
476. Bah71 Times Cited:0 Cited References Count:23 Biomedical Sciences Instrumentation.
Bibliography 167
[Evermann and Wand, 2005] Evermann, J. and Wand, Y. (2005). Toward formalizing domain
modeling semantics in language syntax. IEEE Transactions on Software Engineering,
31(1):21–37.
[Fabrigar et al., 1999] Fabrigar, L., Wegener, D., MacCallum, R., and Strahan, E. (1999). Evalu-
ating the use of exploratory factor analysis in psychological research. Psychological Methods,
62(4):272–299.
[Figl et al., 2010] Figl, K., Mendling, J., Strembeck, M., and Recker, J. (2010). On the cognitive
effectiveness of routing symbols in process modeling languages. Springer Lecture Notes in
Business Information Processing (LNBIP), 47(1).
[Fitts et al., 1950] Fitts, P., Jones, R., and Milton, J. (1950). Eye movements of aircraft pilots
during instrument-landing approaches. Aeronautical Engineering Review, 9(2):24–29.
[Fornell and Bookstein, 1982] Fornell, C. and Bookstein, F. (1982). Two structural equation
models: Lisrel and pls applied to consumer exit-voice theory. Journal of Marketing Research,
19(2):440–452.
[Fornell and Larcker, 1981] Fornell, C. and Larcker, D. F. (1981). Evaluating structural equation
models with unobservable variables and measurment error. Journal of Marketing Research,
18(2):39–50.
[Frederiks and van der Weide, 2006] Frederiks, P. and van der Weide, T. (2006). Information
modeling: The process and the required competencies of its participants. Data and Knowledge
Engineering, 58(1):4–20.
[Gefen et al., 2000] Gefen, D., Straub, D. W., and Boudreau, M.-C. (2000). Structural equation
modeling and regression: Guidelines for research practice. Communications of the Association
for Information Systems, 4(7):22–56.
[Gemino and Wand, 2003] Gemino, A. and Wand, Y. (2003). Evaluationg modeling techniques
based on models of learning. Communications of the ACM, 46(10):79–84.
[Gemino and Wand, 2004] Gemino, A. and Wand, Y. (2004). A framework for empirical evalu-
ation of conceptual modeling techniques. Requirements Engineering, 9(4):248–260.
[Glass et al., 1972] Glass, G., Peckham, P., and Sanders, J. (1972). Consequences of failure to
meet assumptions underlying the fixed effects analyses of variance and covariance. Review of
Educational Research, 42(2):237–288.
[Goldberg and Kotval, 1999] Goldberg, J. and Kotval, X. (1999). Computer interface evalu-
ation using eye movements: methods and constructs. International Journal of Industrial
Ergonomics, 24(6):631–645.
[Gordon, 2004] Gordon, I. E. (2004). Theories of visual perception. Psychology Press, Hove, 3.
ed. edition.
168 Bibliography
[Greenspan et al., 1994] Greenspan, S., Mylopoulos, J., and Borgida, A. (1994). On formal
requirements modeling languages: Rml revisited. In Proceedings of the 16th International
Conference on Software Engineering, ICSE 94, pages 135–147, Los Alamitos, CA, USA.
IEEE Computer Society Press.
[Grossman et al., 2009] Grossman, T., Fitzmaurice, G., and Attar, R. (2009). A survey of
software learnability: metrics, methodologies and guidelines.
[Gurr, 1999] Gurr, C. (1999). Effective diagrammatic communication: Syntactic, semantic and
pragmatic issues. Journal of Visual Languages and Computing, 10(4):317–342.
[Hall and Hanna, 2004] Hall, R. and Hanna, P. (2004). The impact of web page text-background
colour combinations on readability, retention, aesthetics and behavioural intention. Behaviour
and Information Technology, 23(3):183–195.
[Halstead, 1977] Halstead, M. (1977). Elements of Software Science (Operating and program-
ming systems series). Elsevier Science Inc., New York.
[Hammer and McLeod, 1978] Hammer, M. and McLeod, D. (1978). The semantic data model:
A modelling mechanism for database applicaitons. In Proceedings of the 1978 ACM SIGMOD
international conference on management of data, pages 26–36.
[Havey, 2005] Havey, M. (2005). Essential business process modeling. O’Reilly, Beijing, 1. ed.
edition.
[Henry and Kafura, 1981] Henry, S. and Kafura, D. (1981). Software structure metrics based
on information flow. IEEE Transactions on Software Engineering, 7(5):510–518.
[Henseler et al., 2009] Henseler, J., Ringle, C. M., and Sinkovics, R. R. (2009). The use of
partial least square path modeling in international marketing. In Sinkovics, R. R. and Ghauri,
P., editors, Advances in international marketing, volume 20, pages 277–320. Bingley, London.
[Homburg and Giering, 2001] Homburg, C. and Giering, A. (2001). Personal characteristics
as moderators of the relationship between customer satisfaction an loyaltya: An empirical
analysis. Psychology and Marketing, 18(1):43–66.
[IEEE610.12-1990, 1990] IEEE610.12-1990 (1990). Standard glossary of software engineering
terminology. Technical report, Institute of Electrical and Electronics Engineers.
[Ilomäki, 2008] Ilomäki, T. (2008). The usability of music theory software: The analysis of
twelve-tone music as a case study. In Computer Music Modeling and Retrieval. Sense of
Sounds: 4th International Symposium, CMMR 2007, Lecture Notes in Computer Science
(LNCS), pages 98–109. Springer-Verlag.
[Ince and Hekmatpur, 1988] Ince, D. and Hekmatpur, S. (1988). An approch to automated
software design based on product metrics. Software Engineering Journal, 3(2):6–53.
[Indulska et al., 2009] Indulska, M., Zur Muehlen, M., and Recker, J. C. (2009). Measuring
method complexity: The case of the business process modeling notation. property complexity.
Bibliography 169
[Mendling and Strembeck, 2008] Mendling, J. and Strembeck, M. (2008). Influence factors of
understanding business process models. Proceedings of the 11th International Conference on
Business Information Systems, 7(1):142–153.
[Mitchell and Jolley, 2001] Mitchell, M. and Jolley, J. (2001). Research Design Explained,
volume 4. Harcourt, New York.
[Moody, 2009] Moody, D. (2009). The "physics" of notations: Toward a scientific basis for
constructing visual notations in software engineering. IEEE Transactions on Software Engi-
neering, 35(6):756–779.
[Moody, 2004] Moody, D. L. (2004). Theoretical and practical issues in evaluating the quality
of conceptual models: current state and future directions. Data and Knowledge Engineering,
55:243–276.
[Moody and Heymans, 2010] Moody, D. L. and Heymans, P. (2010). Visual syntax does matter:
improving the cognitive effectiveness of the i* visual notation. Requirements Engineering,
15(1):141–175.
[Moody and Hillegersberg, 2009] Moody, D. L. and Hillegersberg, J. (2009). Evaluating the
visual syntax of uml: An analysis of the cognitive effectiveness of the uml family of diagrams.
Lecture Notes in Computer Science, 5452(1):16–34.
[Mylopoulos et al., 1978] Mylopoulos, J., Bernstein, P. A., and Wong, H. K. T. (1978). A
language facility for designing interactive database-intensive applications. In Proceedings
of the 1978 ACM SIGMOD International Conference on Management of Data, pages 17–17.
ACM.
[Nembhard and Napassavong, 2002] Nembhard, D. and Napassavong, O. (2002). Task com-
plexity effects on between-individual learning/forgetting variability. International Journal of
Industrial Ergonomics, 29(2):297–306.
[Nembhard and Uzumeri, 2000] Nembhard, D. and Uzumeri, M. (2000). Experimental learning
and forgetting for manual and cognitive tasks. International Journal of Industrial Ergonomics,
25(2):315–326.
[Nielsen, 1993] Nielsen, J. (1993). Usability Engineering. Morgan Kaufmann, 1st edition.
[Nielsen, 2006a] Nielsen, J. (2006a). Prioritizing Web usability. New Riders.
[Nielsen, 2006b] Nielsen, J. (2006b). Quantitative studies: How many users to test?
[Nunnally and Bernstein, 1994] Nunnally, J. and Bernstein, I. (1994). Psychometric Theory.
McGraw-Hill Humanities/Social Sciences/Languages, 3 edition.
[Oei et al., 1992] Oei, J., van Hemmen, L., Falkenberg, L., and Brinkkemper, S. (1992). The
meta model hierarchy: A framework for information systems concepts and techniques. Tech-
nical Report No. 92-17, pages 1–30.
172 Bibliography
[Olle et al., 1986] Olle, T., Sol, H., and Verijin-Stuart, A. (1986). A comparative evaluation of
system development methods.
[OMG, 2005a] OMG (2005a). Introduction to omg’s unified modeling languageTM (uml ).
R
Technical report, Object Management Group.
[OMG, 2005b] OMG (2005b). Unified modeling language. Technical report, Object Manage-
ment Group.
[OMG, 2011a] OMG (2011a). Business process model and notation (bpmn 2.0). Technical
report, Object Management Group, http://www.omg.org/spec/BPMN/2.0/PDF.
[OMG, 2011b] OMG (2011b). Meta object facility (mof) core specification. Technical report,
Object Management Group, http://www.omg.org/spec/MOF/2.4.1/PDF.
[OMG, 2011c] OMG (2011c). Unified modeling language infrastructure. Version 2.4.1, Object
Management Group, http://www.omg.org/spec/UML/2.4/Infrastructure/Beta2/PDF/.
[OMG, 2011d] OMG (2011d). Unified modeling language superstructure. Technical report,
Object Management Group, http://www.omg.org/spec/UML/2.4.1/Superstructure/PDF/.
[Paivio, 1986] Paivio, A. (1986). Mental representations: a dual coding approach. Oxford
University Press, Oxford.
[Palmer and Rock, 1994] Palmer, S. and Rock, I. (1994). Rethinking perceptual organi-
zation: The role of uniform connectedness. Psychonomic Bulletin; Review, 1:29–55.
10.3758/BF03200760.
[Pan et al., 2004] Pan, B., Hembrooke, H. A., Gay, G. K., Granka, L. A., Feusner, M. K., and
Newman, J. K. (2004). The determinants of web page viewing behavior: an eye-tracking
study. Proceedings of the 2004 symposium on Eye tracking research and ypplication, pages
147–154.
[Parnas, 1972] Parnas, D. (1972). A technique for software module specification with examples.
Communications of the ACM, 15(5):330–336.
[Pearl, 2000] Pearl, J. (2000). Causality: Models, Reasoning, and Inference. Cambridge
University Press.
[Petre, 2006] Petre, M. (2006). Cognitive dimensions ’beyond the notation’. Journal of Visual
Languages and Computing, 17(4):292–301.
[Pfeiffer, 2007] Pfeiffer, D. (2007). Constructing comparable conceptual models with domain
specific languages. In Proceedings of the 15th European Conference on Information Systems
(ECIS2007), St.Gallen, Switzerland.
[Plass et al., 2010] Plass, J., Moreno, R., and Brünken, R. (2010). Cognititve Load Theory.
Cambridge University Press.
Bibliography 173
[Pohl, 1994] Pohl, K. (1994). The three dimensions of requirements engineering. In CAISE93
Selected papers from the fifth international conference on Advanced information systems
engineering, pages 275–292. Springer.
[Preece et al., 1994] Preece, J., Rogers, Y., Sharp, H., Benyon, D., Holland, S., and Carey, T.
(1994). Human Computer Interaction. Addison-Wesley, Wokingham.
[Pretorius et al., 2005] Pretorius, M. C., Calitz, A. P., and van Greunen, D. (2005). The added
value of eye tracking in the usability evaluation of a network management tool. In Proceedings
of the annual Research Conference of the South African Institute for Computer Scientists and
Information Technologists, pages 1–10. South African Institute for Computer Scientists and
Information Technologists.
[Recker and Dreiling, 2007] Recker, J. C. and Dreiling, A. (2007). Does it matter which process
modelling language we teach or use? an experimental study on understanding process mod-
elling languages without formal education. In Proceedings of 18th Australasian Conference
on Information System (ACIS 2007). University of Southern Queensland.
[Recker et al., 2006] Recker, J. C., Indulska, M., Rosemann, M., and Green, P. (2006). How
good is bpmn really? insights from theory and practice. In Ljungberg, J. and Andersson, M.,
editors, 14th European Conference on Information Systems (ECIS), pages 178–190.
[Recker et al., 2009] Recker, J. C., Zur Muehlen, M., Keng, S., Erickson, J., and Indulska, M.
(2009). Measuring method complexity: Uml versus bpmn. Proceedings of the Fifteenth
Americas Conference on Information Systems, San Francisco, California.
[Reijers and Vanderfeesten, 2005] Reijers, H. and Vanderfeesten, I. (2005). Cohesion and cou-
pling metrics for workflow process design. Lecture Notes in Computer Science, 3080(1):290–
305.
[Rengger et al., 1993] Rengger, R., Macleod, M., Bowden, R., Blaney, M., and Bevan, N. (1993).
MUSiC Performance Measurement Handbook. National Physical Laboratory, Teddington,
UK.
[Ringle et al., 2011] Ringle, C. M., Wende, S., and Will, A. (2011). Smartpls.
[Ross, 1977] Ross, D. (1977). Structured analysis: A language for communicating ideas. IEEE
Transactions on Software Engineering, 3(1):16–34.
[Rossi and Brinkkemper, 1996] Rossi, M. and Brinkkemper, S. (1996). Complexity metrics for
systems development methods and techniques. Information Systems, 21(2):209–227.
[Roussopoulos and Yeh, 1984] Roussopoulos, N. and Yeh, H. (1984). An adaptable methodol-
ogy for database design. ACM Transactions on Database Systems, 17(5):64–80.
[Schafer and Graham, 2002] Schafer, J. and Graham, J. (2002). Missing data: Our view of the
state of the art. Psychological Methods, 7(2):147–177.
174 Bibliography
[Schalles et al., 2010a] Schalles, C., Creagh, J., and Rebstock, M. (2010a). Developing a
usability evaluation framework (fueml) for modeling languages. In Fox, R. and Golubski, W.,
editors, Proceedings of the IASTED International Conference on Software Engineering (SE),
volume 3, pages 126–135, Innsbruck. Acta Press.
[Schalles et al., 2010b] Schalles, C., Creagh, J., and Rebstock, M. (2010b). Ein generischer
ansatz zur messung der benutzerfreundlichkeit von modellierungssprachen. In Engels, G.,
Karagiannis, D., and Mayr, H. C., editors, Modellierung 2010, volume 161 of Lecture Notes
in Informatics (LNI), pages 15–30, Klagenfurt. Gesellschaft für Informatik (GI).
[Schalles et al., 2010c] Schalles, C., Creagh, J., and Rebstock, M. (2010c). A generic metric for
measuring complexity of models. In Filipe, J. and Cordeiro, J., editors, Proceedings of the
12th International Conference on Enterprise Information Systems (ICEIS), volume 3, pages
436–439, Funchal (Madeira).
[Schalles et al., 2011a] Schalles, C., Creagh, J., and Rebstock, M. (2011a). Experiences on
using partial least squares in usability research. In Bleimann, U., Walsh, P., and Humm,
B., editors, Proceedings of the 2nd Collaborative European Research Conference (CERC),
volume 2, pages 7–18, Cork.
[Schalles et al., 2011b] Schalles, C., Creagh, J., and Rebstock, M. (2011b). Usability of mod-
elling languages for model interpretation: An empirical research report. In Bernstein, A. and
Schwabe, G., editors, Proceedings of the 10th International Conference on Wirtschaftsinfor-
matik (WI2011), volume 2, pages 787–797, Zurich.
[Schalles et al., 2012] Schalles, C., Creagh, J., and Rebstock, M. (2012). Exploring usability-
driven differences of graphical modeling languages: An empirical research report. In Sinz, E.,
editor, Lecture Notes in Informatics (LNI), volume 201, pages 91–106, Bamberg.
[Scheer, 1992] Scheer, A.-W. (1992). Architecture of Integrated Information Systems: Principles
of Enterprise Modeling. Springer Berlin Heidelberg.
[Schuette and Rotthowe, 1998] Schuette, R. and Rotthowe, T. (1998). The Guidelines of Model-
ing - An Approach to Enhance the Quality in Information Models, volume 1507 of Lecture
Notes in Computer Science (LNCS), pages 240–254. Springer.
[Schwanenflugel and Shoben, 1983] Schwanenflugel, P. and Shoben, E. (1983). Differential
context effects in the comprehension of abstract and concrete verbal materials. Journal of
Experimental Psychology, 9(1):82–102.
[Seffah et al., 2006] Seffah, A., Donyaee, M., Kline, R., and Padda, H. (2006). Usability
measurement and metrics: A consolidated model. Software Quality Control, 14(2):159–178.
[Shackel, 1991] Shackel, B. (1991). Usability - context, framework, definition, design and
evaluation. In Shackel, B. and Richardson, S., editors, Human Factors for Informatics
Usability, pages 21–38. University Press, Cambridge.
Bibliography 175
[Shaft and Vessey, 2006] Shaft, T. and Vessey, I. (2006). The role of cognitive fit in the relation-
ship between software comprehension and modification. MIS Quarterly, 30(1):29–55.
[Shannon et al., 1998] Shannon, C., Weaver, W., and Shannon (1998). The Mathematical Theory
of Communication. University of Illinois Press.
[Siau and Cao, 2001] Siau, K. and Cao, Q. (2001). Unified modeling language (uml) - a
complexity analysis. Journal of Database Management, 12(1):26–34.
[Siau and Loo, 2006] Siau, K. and Loo, P.-P. (2006). Identifying difficulties in learning uml. IS
Management, 23(3):43–51.
[Siau and Rossi, 2008] Siau, K. and Rossi, M. (2008). Evaluation techniques for systems
analysis and design modelling methods; a review and comparative analysis. Information
Systems Journal.
[Siau and Wang, 2007] Siau, K. and Wang, Y. (2007). Cognitive evaluation of information
modeling methods. Information and Software Technology, 49(5):455–474.
[Sibert and Jacob, 2000] Sibert, L. E. and Jacob, R. J. (2000). Evaluation of eye gaze interaction.
In Proceedings of the SIGCHI conference on Human factors in computing systems, pages
281–288. ACM.
[Sjoberg et al., 2007] Sjoberg, D., Dyba, T., and Jorgensen, M. (2007). The future of empirical
methods in software engineering research. Future of Software Engineering, pages 358–378.
[Smith and Smith, 1977] Smith, J. M. and Smith, D. C. P. (1977). Database abstractions:
aggregation and generalization. ACM Transactions on Database Systems, 2:105–133.
[Sonderegger and Sauer, 2009] Sonderegger, A. and Sauer, J. (2009). The influence of design
aesthetics in usability testing: Effects on user performance and perceived usability. Applied
Ergonomics, 41(3):403–410.
[Stachowiak, 1973] Stachowiak, H. (1973). Allgemeine Modelltheorie. Springer, Wien.
[Stevens, 1999] Stevens, J. (1999). Intermediate Statistics - A Modern Approach. Lawrence
Erlbaum, page 77, London.
[Stevens, 2001] Stevens, J. (2001). Applied Multivariate Statistics for the Social Sciences
(Applied Multivariate STATS). Lawrence Erlbaum.
[Strom, 1986] Strom, R. (1986). A comparison of the object-oriented and process paradigms.
[Sweller, 2005] Sweller, J. (2005). Implications of cognititve load theory for multimedia learn-
ing. In Mayer, R. E., editor, The Cambridge Handbook of Multimedia Learning. Cambridge
University Press, Cambridge.
[Tamir et al., 2008] Tamir, D., Komogortsev, O. V., and Mueller, C. J. (2008). An effort and time
based measure of usability. In Proceedings of the 6th international Workshop on Software
Quality (WoSQ’08). ACM. 1370111 47-52.
176 Bibliography
[Tenenhaus et al., 2005] Tenenhaus, M., Vinzi, V., Chatelin, Y., and Lauro, C. (2005). Pls path
modeling. Computational statistics and Data analysis, 48(2):159–205.
[Treisman, 1982] Treisman, A. (1982). Perceptual grouping and attention in visual search for
features and for objects. Journal of Experimental Psychology, 8:194–214.
[Underwood, 2005] Underwood, G. D. (2005). Cognititve Processes in Eye Guidance. Oxford
University Press, New York.
[Underwood, 2009] Underwood, G. D. (2009). Cognitive processes in eye guidance: Algorithms
for attention in image processing. Cognititve Computation, 1(1):64–76.
[Vanderfeesten et al., 2007a] Vanderfeesten, I., Cardoso, J., Mendling, J., Reijers, H., and
Van der Aalst, W. (2007a). Quality metrics for business process models. In Fischer, L., editor,
Workflow Handbook 2007. Workflow Management Coalition, Lighthouse Point, Florida.
[Vanderfeesten et al., 2007b] Vanderfeesten, I., Cardoso, J., and Reijers, H. (2007b). A weighted
coupling metric for business process models. In The 19th International Conference on
Advanced Information Systems Engineering (CAiSE’07), Trondheim, Norway.
[Vogt, 1999] Vogt, W. P. (1999). Dictionary of Statistics and Methodology - A nontechnical
guide for the social Sciences. Sage Publications, California.
[Vuolle et al., 2008] Vuolle, M., Aula, A., Kulju, M., and Vainio, T. (2008). Identifying usability
and productivity dimensions for measuring the success of mobile business services. Advances
in Human-Computer Interaction.
[Wahl and Sindre, 2005] Wahl, T. and Sindre, G. (2005). An analytical evaluation of bpmn
using a semiotic quality framework.
[Walker et al., 1998] Walker, M., Fromer, J., Di Fabbrizio, G., Mestel, C., and Hindle, D. (1998).
What can i say?: evaluating a spoken language interface to email. pages 582–589.
[Wallenburg and Weber, 2005] Wallenburg, C. M. and Weber, J. (2005). Structural equation
modeling as a basis for theory development within logistics and supply chain management
research. In Kotzab, H., Seuring, S., Müller, M., and Reiner, G., editors, Research Methodolo-
gies in Supply Chain Management. Springer Berlin Heidelberg.
[Wand and Weber, 1993] Wand, Y. and Weber, R. (1993). On the ontological expressiveness of
information systems analysis and design grammars. Information Systems Journal, 3(4):217–
237.
[Weiber and Mühlhaus, 2009] Weiber, R. and Mühlhaus, D. (2009). Strukturgleichungsmodel-
lierung. Springer, Heidelberg.
[Welke, 1992] Welke, R. (1992). The case repository: more than another database application.
In Cottermann, W. and Senn, J., editors, Challenges and strategies for research in systems
development, pages 181–218. Wiley Inc.
Bibliography 177