Towards understanding the understandability of UML models

Seediscussions,stats,andauthorprofilesforthispublicationat:http://www.researchgate.net/publication/261100463

TowardsunderstandingtheunderstandabilityofUMLmodels

CONFERENCEPAPER·JUNE2014

DOI:10.1145/2593770.2593779

CITATION

1

DOWNLOADS

87

VIEWS

155

2AUTHORS:

ChintanAmrit

UniversityofTwente

49PUBLICATIONS77CITATIONS

SEEPROFILE

NiekTax

UniversityofTwente

3PUBLICATIONS1CITATION

SEEPROFILE

Availablefrom:ChintanAmrit

Retrievedon:21September2015

http://www.researchgate.net/publication/261100463_Towards_understanding_the_understandability_of_UML_models?enrichId=rgreq-b3fa6c2c-3e7d-4bb5-9527-13a1d2cc664c&enrichSource=Y292ZXJQYWdlOzI2MTEwMDQ2MztBUzoxMDE2NDIzNDMwMjY2ODhAMTQwMTI0NDgzMTAzMA%3D%3D&el=1_x_2

http://www.researchgate.net/publication/261100463_Towards_understanding_the_understandability_of_UML_models?enrichId=rgreq-b3fa6c2c-3e7d-4bb5-9527-13a1d2cc664c&enrichSource=Y292ZXJQYWdlOzI2MTEwMDQ2MztBUzoxMDE2NDIzNDMwMjY2ODhAMTQwMTI0NDgzMTAzMA%3D%3D&el=1_x_3

http://www.researchgate.net/?enrichId=rgreq-b3fa6c2c-3e7d-4bb5-9527-13a1d2cc664c&enrichSource=Y292ZXJQYWdlOzI2MTEwMDQ2MztBUzoxMDE2NDIzNDMwMjY2ODhAMTQwMTI0NDgzMTAzMA%3D%3D&el=1_x_1

http://www.researchgate.net/profile/Chintan_Amrit?enrichId=rgreq-b3fa6c2c-3e7d-4bb5-9527-13a1d2cc664c&enrichSource=Y292ZXJQYWdlOzI2MTEwMDQ2MztBUzoxMDE2NDIzNDMwMjY2ODhAMTQwMTI0NDgzMTAzMA%3D%3D&el=1_x_4


http://www.researchgate.net/institution/University_of_Twente?enrichId=rgreq-b3fa6c2c-3e7d-4bb5-9527-13a1d2cc664c&enrichSource=Y292ZXJQYWdlOzI2MTEwMDQ2MztBUzoxMDE2NDIzNDMwMjY2ODhAMTQwMTI0NDgzMTAzMA%3D%3D&el=1_x_6


http://www.researchgate.net/profile/Niek_Tax?enrichId=rgreq-b3fa6c2c-3e7d-4bb5-9527-13a1d2cc664c&enrichSource=Y292ZXJQYWdlOzI2MTEwMDQ2MztBUzoxMDE2NDIzNDMwMjY2ODhAMTQwMTI0NDgzMTAzMA%3D%3D&el=1_x_4


http://www.researchgate.net/institution/University_of_Twente?enrichId=rgreq-b3fa6c2c-3e7d-4bb5-9527-13a1d2cc664c&enrichSource=Y292ZXJQYWdlOzI2MTEwMDQ2MztBUzoxMDE2NDIzNDMwMjY2ODhAMTQwMTI0NDgzMTAzMA%3D%3D&el=1_x_6


Towards Understanding the Understandability of UML Models

Chintan Amrit IEBIS Department

University of Twente Enschede, Netherlands [email protected]

Niek Tax EWI Faculty

University of Twente Enschede, Netherlands

[email protected]

ABSTRACT Metrics for conceptual diagram understandability are generally based on a particular modelling notation and do not consider multiple perspectives in arriving at measurable metrics. This paper aims to be a step towards integration of multiple perspectives of conceptual model understandability. We demonstrate how one can arrive at a suite of metrics for conceptual diagram understandability. We use a questionnaire based on a few selected metrics to gain insights on the relative impact of different perspectives (namely, the notation and aesthetic perspectives) on conceptual diagram understandability. The main result of this study is that given a particular modelling notation (UML) and novice modellers, certain aesthetic aspects seem to have a greater impact on perceived diagram understandability than the notation aspects. We conclude with a call for further research in order to draw valid conclusions about the overall impact of the aesthetic and notation perspectives on diagram understandability.

Categories and Subject Descriptors D.3.3 [Programming Languages]: Language Constructs and Features – abstract data types, polymorphism, control structures.

General Terms Measurement, Design, Languages.

Keywords UML, Diagram understandability, Model syntax

1. INTRODUCTION Communication is critical for efficient workplace coordination [1]. For efficient stakeholder communication it is of importance that the conceptual models used for communication are easily understandable [2]. Lindland et al [3] state that the quality of a conceptual model is determined largely by how easily all the concerned parties understand the model.

The understandability and the ability to process information in a conceptual diagram depend on the appearance of the diagram. Aesthetic properties like the number of Line Crossings and Line Bends have been known to influence the degree to which stakeholders are able to read and cognitively process the information in a conceptual diagram [4]. Other non-aesthetic factors like diagram structure (diagram complexity) [5] and modelling notation choice [6] have been shown to influence the degree to which stakeholders can read and cognitively process the information in a conceptual diagram.

Literature on Model Understandability can be broadly categorised into three perspectives: (i) Notation Perspective: Studies in the notation perspective focus on the fit between a chosen modelling notation and the people working with the diagram and the

diagramming task. (ii) Aesthetics Perspective: Studies in the aesthetics perspective focus on visual properties of the diagram. The symmetry of the diagram and the number of Line Crossings in the diagram are examples of a diagram’s aesthetics, and (iii) Structure Perspective: Studies in the structure perspective focus on structural properties of a diagram, like the number of entities and the number of relationships in a diagram. The structure of a diagram is related to the semantic concepts that are modelled in the diagram.

We base this three perspective categorisation on Moody’s [6] categorisation into sentence level and language level perspectives. Moody (2009) defines language perspective as the fit among a chosen modelling notation, the people working with the diagram and the diagramming task (our notation perspective). Sentence level can be described as all aspects that describe the actual model visualisation, such as line bending, line crossings as well as the number of instances of particular tokens (which is both our aesthetics perspective and structure perspective). Although several models exist that look at different perspectives of conceptual diagram understandability, they mostly focus on one of these perspectives (for example, aesthetic or non-aesthetic) to analyse diagrams. As an example, we could not find any papers that looked at sentence level as well as language level understandability by considering and comparing the different aspects and metrics and critically validating them. To be able to improve conceptual diagram understandability, multiple perspectives are relevant and cannot be neglected. This leads to the following research question:

Which of the perspectives (notation, aesthetics or structure) has a higher impact on understandability of a particular modelling notation for a particular stakeholder? This question would help us gain some insight into what conceptual modellers can do to improve conceptual model understandability for a particular stakeholder. In this paper we attempt at a partial answer of this question by combining two perspectives to analyse model understandability; namely the notation and the aesthetic perspective. We then test which of these diagram characteristics impact the understandability of UML more, through a case study performed on students.

The next section details the literature background; this is followed by a description of our questionnaire design and the survey performed. In the following section 4 we analyse the results of the questionnaire and end in section 5 with a discussion of the results.

2. MODEL APPEARANCE The following four aspects are understood to have an influence on the understandability of a diagram:

• Modelling notation [6] • Model semantics [4] MiSE'14, May 31 - June 07 2014, Hyderabad, India

Copyright is held by the owner/author(s). Publication rights licensed to ACM. ACM 978-1-4503-2849-4/14/05…$15.00. http://dx.doi.org/10.1145/2593770.2593779

• Sentence level design choices [4, 7] • Modelling tool [8]

Table 1 describes the particular perspective for which each of the four aspects fit.

Table 1. Model appearance perspectives and their aspects

Perspective Aspect

Notation perspective Modelling notation Modelling tool

Aesthetics perspective Sentence level design choices Modelling tool

Structure perspective Model semantics

In the following sections, we explain the aspects influencing diagram appearance. We then apply the categorisation of perspectives from Table 1 to discuss the aspects in a structured manner.

2.1 Notation perspective Moody [6] describes eight principles that determine the degree to which a chosen conceptual modelling language fits the characteristics of the task that it is used for, and also fits the problem solving skills of the stakeholders [2].

In order to have an optimal fit, different modelling notations or visual dialects may be required for different tasks and audiences. One of the reasons for considering fit is the big difference in the understanding of conceptual diagrams between novices and experts [9]. Moody defines the following eight principles:

§ Perceptual Discriminability § Graphic Economy § Semantic Transparency § Semiotic Clarity § Visual Expressiveness § Cognitive Integration § Complexity Management § Dual Coding

In order to measure the impact of these principles on diagram understandability, we need a way to measure them. Hence, we can arrive at metrics for measuring the extent to which each principle is met in a given diagram (sentence level). These metrics are shown in Table 2 (# indicates number). In this paper we propose and then validate some of these metrics, as well as use them to draw conclusions on the objective application of Moody’s [4] principles to diagrams. Note that we provide metrics for all the principles at the sentence level; hence, the metric for Graphic Economy principle cannot be used, as it is defined at the level of the language. The metrics can be used to gain some insight on how well some of Moody’s [4] principles can be applied in a particular diagram.

At the language level, a tool can influence the graphical appearance; especially in the way it fits the notation. E.g., a tool may not support all the graphical symbols of a particular modelling notation. This would limit the number of graphical symbols the modeller can use and may thereby influence the understandability of a diagram.

Table 2: Sentence Level metrics based on Moody's [4] Principles

Metric Description at the Sentence Level

Semiotic Clarity: Number of instances of symbol redundancy, symbol overload and symbol excess in a diagram Perceptual Discriminability: Visual Distance in a diagram Semantic Transparency: Number of symbols that are semantically opaque and perverse in a diagram Complexity Management: Diagrammatic complexity = no. of instances/tokens in a diagram Cognitive Integration: # Instances of lack of conceptual and perceptual integration Visual Expressiveness: Instances of: 8 - # information carrying variables Dual Coding: +1 for Instances of lack of dual coding Graphic Economy: (# symbols supported in the language) Cognitive Fit: Dependent Variable

When a tool uses a set of graphical symbols for some of the language’s semantic constructs that differs from the formal specification of the notation, we can consider the modelling language offered by the tool to be a different dialect of the notation that the tool says it offers.

Hence, for determining cognitive fit [6] of a conceptual diagram one should look at the notation in the way it is offered by the tool instead of the way the notation is formally specified. By using Moody’s [6] principles and the related metrics on the tool implementation of a notation, conclusions can be drawn regarding the fit between the notation used with a particular tool and task characteristics of the involved stakeholders.

2.2 Aesthetics perspective 2.2.1 Sentence Level Design Choices The modelling notation and semantics are not the only influencers for diagram appearance. For a given modelling notation and semantics, the graph designer can make a lot of design decisions that can affect the diagram’s understandability. These decisions include size of graphical elements, place of graphical elements, and font selection.

2.2.2 Diagramming Tool influencing Aesthetics At the sentence level, a tool can influence the appearance of a diagram in the following ways:

§ By applying an Automatic Graph Layout algorithm.

§ By using the different graphical symbols offered by the tool

Automatic Graph Layout algorithms make some aesthetic choices independently of the diagram designer. At the same time, it is conceivable that an independently made aesthetic choice of the Automatic Graph Layout algorithm persuades the designer to make different sentence level design choices. The Automatic Graph Layout algorithm assists the diagram designer in several ways. These include automatic alignment of graphical elements and in avoiding Line Crossings and Line Bends. Hence, sentence level graph aesthetics are usually an interplay between sentence level design choices and the Automatic Graph Layout algorithm of the tool.

A modelling notation determines the appearance of a model largely by the graphical symbols it uses. However, there are still

some free variables available for the tool to vary. Examples of free aesthetic variables often implemented by the tool are line thickness and font choices within graphical symbols

Currently, most diagram modelling tools do not implement state-of-the-art Automatic Graph Layout algorithms and are not capable of generating very understandable diagrams [10].

2.2.3 Purchase Metrics Purchase [11] defines a set of metrics defining understandability for several sentence level diagram aesthetics. She defines seven principles that reach optimal understandability [11]. For each principle, a metric is defined to identify the degree to which a diagram fulfils this principle.

The following seven principles are defined by Purchase [11]:

§ Minimizing edge crossings § Minimizing edge bends § Maximizing symmetry § Maximizing the minimum angle between edges leaving

a node § Maximizing Edge Orthogonality § Maximizing Node Orthogonality § Maximizing consistent flow direction (directed graphs

only)

2.3 Structure perspective The real-world objects and events that are being modelled have influence on the appearance of the graph. This is self-evident, as a different real-world model implies different semantic constructs that result in different symbol instances. The metrics that can be used to determine the understandability of the conceptual diagram structure depend on the notation in which the diagram is created. Genero (2005) created a set of metrics which can be used to determine understandability of ER diagrams [5]. Several metrics exist for UML diagrams, but they are still in early stages of development and are mostly not validated [12].

The real-world objects and events being modelled are almost in all cases not changeable by the diagram designer. Due to this reason we do not consider the influence of diagram semantics in our questionnaire design. Instead, we focus on Notation and the Aesthetic perspectives for designing our questionnaire.

3. QUESTIONNAIRE DESIGN In order to provide answers to our research questions, we created a questionnaire1 containing eleven conceptual models modelling the same real-world scenario.

We varied these eleven conceptual models both on aspects of notation and aesthetic perspectives. Among the many perspectives we chose four aspects of aesthetics perspective and two aspects of notation perspective. By testing the understandability of the diagrams while varying both the aesthetic perspective and notation perspectives, we could gain some insights on the relative impact of these particular perspectives on conceptual diagram understandability. We left out the structure perspective from the questionnaire as we felt that diagram designers are often not able to modify the semantic aspects within the structure perspective.

1 http://www.thesistools.com/web/?id=236082

3.1 Aesthetics perspective aspects Within the aesthetics perspective we chose Line Crossings, Line Bends, Node Orthogonality and Edge Orthogonality to vary the diagrams on, as these aspects have been validated to make a difference on understandability of UML class diagrams [4].

In the following sections, the metrics for Line Crossings, Line Bends, Node Orthogonality and Edge Orthogonality as defined by Purchase [11] will be explained briefly. In these sections, the following definitions are applicable:

𝑛 the number of nodes in the diagram 𝑚 the number of edges in the diagram 𝑢! the ith node in the diagram ℎ the number of vertical grid point in the

diagram 𝑤 the number of horizontal grid point in the

diagram 𝑛′ the number of nodes in a diagram if all bends

in the edges are promoted to nodes 𝑚’ the number of edges in a diagram if all bends

in the edges are promoted to nodes 𝜃! the positive angle between the ith edge and

the x-axis

3.1.1 Line Crossings Purchase [11] defines an approximation for the upper bound of the number of edge crossings. The number of edge crossings is calculated assuming every edge of the graph crosses every other edge

𝑐!"" = (𝑖 − 1)!!!!! = !!(!!!!)

!.

Where m’ is the total number of edges (each denoted by i). This is an overestimate that includes the total number of edge crossings, known to be impossible in a straight-line diagram. In straight-line drawings of connected graphs with at most one edge between nodes, adjacent edges cannot cross. The total number of such impossible crossings is therefore:

𝑐!"#$%%!&'( =12 𝑑𝑒𝑔𝑟𝑒𝑒(𝑢!)(𝑑𝑒𝑔𝑟𝑒𝑒 𝑢! − 1)!!

!!!

Where degree(uj) denotes the degree of the node uj. The upper bound on the number of edge crosses in a diagram is therefore:

𝑐!" = 𝑐!"" − 𝑐!"#$%%!&'(

The actual number of crosses in a diagram is scaled against the maximum number of crossings. To reach a scale in which 1 represents maximum ‘crosslessness’, the scaled measurement of crosses is subtracted from 1.

𝐿𝑖𝑛𝑒 𝑐𝑟𝑜𝑠𝑠𝑖𝑛𝑔𝑠 = 1 − 𝑐𝑐!"

𝑖𝑓 𝑐!" > 0

0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒

3.1.2 Line Bends It is impossible to scale against an upper bound of the number of bends, which is infinite, so we divide it by the total number of edge segments:

𝑏!"# = 𝑚! −𝑚𝑚′

So that 1 represents maximum ‘bendlessness’, this scaled measurement of bends is subtracted from 1.

𝐿𝑖𝑛𝑒 𝑏𝑒𝑛𝑑𝑠 = 1 − 𝑏!"#

3.1.3 Edge Orthogonality The edge deviation factor of the ith edge segment (𝛿!) represents how far away from an orthogonal angle the edge segment has deviated. It is computed as a proportion of the angular deviation of the ith edge segment from the horizontal or vertical gridlines:

𝛿! = min (𝜃! , 90° − 𝜃! , 180° − 𝜃!)

45°

So that 1 represents drawings with optimal edge deviation with respect to the orthogonal grid (i.e., assumed easier to read), the average edge deviation factor over all edge segments is subtracted from 1.

𝐸𝑑𝑔𝑒 𝑜𝑟𝑡ℎ𝑜𝑔𝑜𝑛𝑎𝑙𝑖𝑡𝑦 = 1 −1𝑚! (𝜃!)

!!

!!!

3.1.4 Node Orthogonality The number of available grid-point intersections in the imaginary unit grid is:

𝐴 = 𝑤 + 1 ℎ + 1 Purchase defines the Node Orthogonality metric as the extent to which the drawing makes maximal use of the grid area:

𝑁𝑜𝑑𝑒 𝑜𝑟𝑡ℎ𝑜𝑔𝑜𝑛𝑎𝑙𝑖𝑡𝑦 =𝑛′𝐴

3.2 Notation perspective aspects For the notation perspective we choose Perceptual Discriminability and Semiotic Clarity (Moody [6]) to modify for the different diagrams. We choose these particular Moody principles, as both the principles directly or indirectly influence the cognitive fit of the diagram. While, at the same time, they are the most independent of other Moody principles [6] (Figure 36, page 772). Furthermore, the reasons for leaving out the other aspects of Notational perspectives from our questionnaire are as follows:

§ Graphic Economy – is hard to measure § Visual Expressiveness – impacts the language level § Cognitive Integration – impacts the language level § Complexity Management - by choice we wanted to use

the same Objects and Relationships § Dual Coding – always possible with UML § Cognitive Fit – is an aspect of the dependent variable

(diagram understandability)

In the following sections a brief explanation of Perceptual Discriminability and Semiotic Clarity is given. Metrics for both aspects can be found in Table 2.

3.2.1 Semiotic Clarity Symbol redundancy was added to several of the diagrams in order to decrease the Semiotic Clarity. We expected that the introduction of symbol redundancy to the UML diagrams would have a negative influence on diagram understandability.

3.2.2 Perceptual Discriminability The other notation perspective aspect that we varied in the diagrams was the Perceptual Discriminability between symbols within the notation. Perceptual Discriminability is the ease and accuracy with which different graphical symbols can be differentiated from each other. Accurate discrimination between symbols is a necessary prerequisite for accurate interpretation of diagrams.

Perceptual Discriminability is primarily determined by the visual distance between symbols, which is defined by the number of

visual variables on which models differ and the size of these differences. In general, the greater the visual distance between symbols used to represent different constructs, the faster and more accurately they will be recognised. Bad Perceptual Discriminability has been found to affect modelling novices more than the experts [13]. The UML class diagram uses only the visual shape variable to differ between types of symbols, and even within this visual variable, rectangular shapes are mostly used [14]. The use of colour as visual variable is specifically avoided in UML:

“UML avoids the use of graphic markers, such as colour, that present challenges for certain persons (the colour blind) and for important kinds of equipment (such as printers, copiers, and fax machines).” [15]

In several of the questionnaire diagrams, colour is added to the UML notation by assigning each used graphical symbol a different colour for better symbol discriminability. We expect that this increased symbol discriminability will lead to a better understanding of the diagram, especially with users that are less familiar with conceptual modelling.

3.3 Variance of aspects in the questionnaire diagrams To measure the relative impact of Line crossings, Line Bends, Node Orthogonality, Edge Orthogonality, Semiotic Clarity and Perceptual Discriminability on the perceived understandability of a diagram, we created several diagrams that differed on these aspects. We then used the metrics as defined by Purchase [11] (explained in section 5.1) to measure the extent of each aspect.

The metric for Perceptual Discriminability is defined as the visual distance in a diagram (Table 2). In UML, class diagrams only use shape as visual variable. When colour is added as extra visual variable to discriminate between symbols, the Perceptual Discriminability metric of that UML class diagram increases from 1 to 2. Moody states that the more visual variables the graphical symbols differ on, the more each graphical symbol pops out. We expect Perceptual Discriminability to have a positive effect on diagram understandability [6].

The metric for Semiotic Clarity can be defined as the number of instances of symbol redundancy, symbol overload and symbol excess in a diagram (Table 2). Most UML class diagrams do not include instances of symbol redundancy, symbol overload and symbol excess in a diagram. Those diagrams have a Semiotic Clarity score of 0. In several diagrams an instance of symbol redundancy is added, those diagrams have a score of 1 on the Semiotic Clarity metric. Table 3 provides an overview of how well each of the 11 diagrams scores on each metric. The aesthetic perspective aspects (Line Crossings, Line Bends, Node Orthogonality and Edge Orthogonality) have a score on a scale from 0 to 1. Purchase perceives a higher score to have a positive effect on understandability [11]. The dependent variable “perceived diagram understandability” is measured on a five point Likert scale.

The properties of the diagrams were chosen in such a way that most diagrams differed on one or more of the defined aesthetic perspective properties, one or more of the defined notation perspective properties, or a combination of both.

We need to compare the perceived understandability of these diagrams to the perceived understandability of a clean UML diagram without the defined flaws.

Table 3. Properties of the questionnaire diagrams D

iagr

am

Lin

e C

ross

ings

Lin

e B

ends

Nod

e O

rtho

gona

lity

Edg

e O

rtho

gona

lity

Perc

eptu

al

Dis

crim

inab

ility

Sem

iotic

Cla

rity

1 0.57 1.00 0.29 0.58 2 0

2 1.00 0.80 0.32 1.00 1 1

3 0.79 1.00 0.32 0.81 1 1

4 1.00 0.33 0.04 0.49 1 0

5 1.00 0.80 0.28 1.00 1 0

6 0.79 1.00 0.28 0.49 1 0

7 1.00 0.50 0.06 0.81 2 0

8 0.57 1.00 0.29 0.58 2 0

9 1.00 0.33 0.04 0.49 1 1

10 1.00 0.17 0.20 1.00 1 0

11 1.00 1.00 0.38 0.41 1 0

Hence, we created diagram 5 that scores quite well on all aspects. All the diagrams have the same real-world objects that are being modelled. This is needed to exclude the complexity (Diagram Complexity, Table 2) of the modelling subject from the data. On the other hand, this approach has the potential drawback that participants in the test might get more familiar with the modelling subject throughout the questionnaire. To be able to check to what degree the familiarity with the modelling subject influences the perceived model understandability, the first diagram of the questionnaire is shown again as diagram number eight. Thus, we check the effect of learning the semantics of the diagram on the understandability of the diagram.

We chose the model semantics in such a way that all participants had at least the basic domain knowledge. Thus, we minimised the advantage some of the participants may have had through their previous UML modelling knowledge.

The participants of the questionnaire were students enrolled in the Business Information Technology, Industrial Engineering or Computer Science programme at the University of Twente in the Netherlands. Students of these studies where used as questionnaire participants, as these students were expected to have at least basic knowledge of conceptual modelling. All questionnaire participants had at least completed the introductory course on Information Systems.

The questionnaire can be viewed online at: http://www.thesistools.com/web/?id=236082

4. RESULTS & DISCUSSION In total 47 students filled the questionnaire and we obtained 517 data points, with 11 diagrams per subject. We start by using a one-sample T test to determine if the familiarity of the model subject throughout the questionnaire has an influence on the diagram understandability.

4.1 One-sample T test

Table 4. One-sample T test results on the questionnaire diagrams

One-Sample Statistics

N Mean Std. Deviation Std. Error Mean

Diagram 1 47 3.28 1.136 0.166 Diagram 2 47 4.11 0.914 0.133 Diagram 3 47 2.62 0.922 0.134 Diagram 4 47 3.53 0.905 0.132 Diagram 5 47 4.79 0.463 0.068 Diagram 6 47 3.13 1.076 0.157 Diagram 7 47 3.64 0.942 0.137 Diagram 8 47 3.23 1.127 0.164 Diagram 9 47 3.02 0.944 0.138 Diagram 10 47 3.87 0.924 0.135 Diagram 11 47 3.28 0.902 0.132

As described earlier, diagram 1 and diagram 8 are identical. If both diagrams score equally in their perceived understandability, we can conclude that the increase in semantic model familiarity throughout the questionnaire does not influence diagram understandability.

In table 4, it can easily be seen that the mean perceived diagram understandability and standard deviation of diagram 1 and diagram 8 are almost identical. It can therefore be concluded that the increasing familiarity of the various aspects of the diagrams (while answering the questionnaire) does not significantly influence perceived diagram understandability.

4.2 Multiple regression analysis As previous research showed significant correlation of the differed factors in the diagrams and diagram understandability [4, 6], we presume all independent variables correlate with diagram understandability.

Table 5: Multiple regression analysis coefficients

Coefficients

Model Unstandard Coeff.

Standard Coeff

.

B Std. Error

Beta t Sig.

Constant 1.081 .598 1.808 .071 Line Crossings 1.472 .448 .230 3.285 .001 Line Bends .600 .350 .173 1.715 .087

Node Orthogonality

-.946 .906 -.104 -1.044 .297

Edge Orthogonality

1.743 .263 .359 6.618 .000

Perceptual Discriminability

-.110 .166 -.045 -.660 .510

Semiotic Clarity -.619 .114 -.256 -5.444 .000

Using a multiple regression analysis, we gained insight into the impact of the independent variables on diagram understandability. Studies [16, 17] have shown there is little harm in treating Likert-type scales as continuous variables when five or more categories are used. Table 5 shows the results of running a multiple regression on the different variables with “diagram understandability” as the dependent variable.

From Table 5, we see that the predictor variables Line Bends, Node Orthogonality and Perceptual Discriminability are not statistically significant (p>0.01). As a result, the coefficients (Beta in Table 5) of these metrics provided by the multiple regression analysis can be expected to be unreliable. Therefore, we do not elaborate on these metrics. The Line Crossings metric (p=0.001) and the Edge Orthogonality metric (p=0.00) have a very significant and a reasonably strong impact (Beta=0.23 and -0.104 respectively) on diagram understandability. We find the impact of Edge Orthogonality (Beta = 0.359) on diagram understandability to be more than the impact of Line Crossings (Beta = 0.230). The impact of Semiotic Clarity (Beta=-0.256) on diagram understandability is very significant and is equal to the impact of Line Crossings. It seems, however, that Semiotic Clarity has a negative impact on diagram understandability, which contradicts the findings of Moody [6]. This may be caused by the familiarity of the test group with a colourless UML diagram, a colourful UML class diagram may conflict their expectations of a UML class diagram that might influence their perceived understandability.

5. CONCLUSIONS Our research question in the introduction of this paper was the following: Which of the perspectives (notation, aesthetics or structure) has a higher impact on understandability of a particular modelling notation for a particular stakeholder?

It can be concluded that for novice UML modellers, the two notation perspective aspects used in the questionnaire have on average less impact than the four aesthetic perspective aspects.

It cannot be concluded that the aesthetic perspective has an overall larger impact on perceived diagram understandability than notation perspective, as the notation and aesthetics perspective both include more aspects than the aspects included in the questionnaire. However, this study provides us with insight into the relative impact of two different perspectives on diagram understandability. In particular, we measure the comparative impact of two notation perspective aspects and four aesthetic perspective aspects on perceived diagram understandability. Assessing such a relative impact on understandability is important as most papers only focus on one aspect of diagram understandability. This is the primary contribution of this paper. The other contribution of the paper is that we have suggested metrics based on Moody’s [4] principles for diagram understandability (Table 2) in the notation perspective.

We need to emphasise that we focused on perspectives that a conceptual modeller can vary, given a modelling language and domain semantics. We propose that in contexts, conceptual modellers can influence different aesthetic perspectives. Conceptual modellers are however not in all occasions free to choose the modelling notation and the modelling tool they want. Therefore, the choice of modelling tools given a target modelling language would be an interesting area of research for future studies. The questionnaire we designed only compared the impact of the four aesthetic perspective aspects with the impact of the two notation perspective aspects. To be able to fully compare the impact of the notation perspective with the aesthetics perspective, more research is needed where the impact of all aesthetic perspective aspects and notation perspective aspects are taken into account.

6. REFERENCES [1] Amrit, C. Improving coordination in software development through social and technical network analysis. University of Twente, 2008. [2] Hoppenbrouwers, S. J. B. A., Stokkum, W., Iacob, M. E., Wilmont, I., Linden, D. J. T., Amrit, C. and Joosen, M. Stakeholder Communication. Springer Berlin Heidelberg, City, 2012. [3] Lindland, O. I., Sindre, G. and Solvberg, A. Understanding quality in conceptual modeling. Software, IEEE, 11, 2 1994), 42-49. [4] Purchase, H. C., Carrington, D. and Allder, J. A. Empirical evaluation of aesthetics-based graph layout. Empirical Software Engineering, 7, 3 2002), 233-255. [5] Genero, M., Poels, G. and Piattini, M. Defining and validating metrics for assessing the understandability of entity-relationship diagrams. Data & Knowledge Engineering, 64, 3 2008), 534-557. [6] Moody, D. L. The “physics” of notations: Toward a scientific basis for constructing visual notations in software engineering. IEEE Transactions on Software Engineering2009), 756-779. [7] Andriyevska, O., Dragan, N., Simoes, B. and Maletic, J. I. Evaluating UML class diagram layout based on architectural importance. IEEE, City, 2005. [8] Eichelberger, H. and Schmid, K. Guidelines on the aesthetic quality of UML class diagrams. Information and Software Technology, 51, 12 2009), 1686-1698.

[9] Winn, W. An account of how readers search for information in diagrams. Contemporary Educational Psychology1993). [10] Eichelberger, H. Nice class diagrams admit good design? ACM, City, 2003. [11] Purchase, H. C. Metrics for graph drawing aesthetics. Journal of Visual Languages & Computing, 13, 5 2002), 501-516. [12] Genero, M., Piattini, M. and Calero, C. A survey of metrics for UML class diagrams. Journal of Object Technology, 4, 9 2005), 59-92. [13] Britton, C. and Jones, S. The untrained eye: how languages for software specification support understanding in untrained users. Human–Computer Interaction, 14, 1-2 1999), 191-244. [14] Moody, D. and Van Hillegersberg, J. Evaluating the visual syntax of UML: An analysis of the cognitive effectiveness of the UML family of diagrams. Software Language Engineering2009), 16-34. [15] Group, O. OMG Unified Modeling Language (OMG UML), Superstructure, V2.1.2. City, 2007. [16] Johnson, D. R. and Creech, J. C. Ordinal measures in multiple indicator models: A simulation study of categorization error. American Sociological Review1983), 398-407. [17] Zumbo, B. D. and Zimmerman, D. W. Is the selection of statistical methods governed by level of measurement? Canadian Psychology/Psychologie canadienne, 34, 4 1993), 390.

Towards understanding the understandability of UML models

Documents

Transcript of Towards understanding the understandability of UML models