0% found this document useful (0 votes)
24 views13 pages

Duration and Cost Variability of Construction Activities: An Empirical Study

Duration and Cost Variability of Construction Activities

Uploaded by

engftah2006
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
24 views13 pages

Duration and Cost Variability of Construction Activities: An Empirical Study

Duration and Cost Variability of Construction Activities

Uploaded by

engftah2006
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 13

Duration and Cost Variability of Construction Activities:

An Empirical Study
P. Ballesteros-Pérez, Ph.D. 1; E. Sanz-Ablanedo, Ph.D. 2; R. Soetanto, Ph.D. 3;
Ma. C. González-Cruz, Ph.D. 4; G. D. Larsen, Ph.D. 5; and A. Cerezo-Narváez, Ph.D. 6
Downloaded from ascelibrary.org by University of Western Ontario on 11/17/19. Copyright ASCE. For personal use only; all rights reserved.

Abstract: The unique nature of construction projects can mean that construction activities often suffer from duration and cost variability.
Because this variability is unplanned, it can present a problem when attempting to complete a project on time and on budget. Various factors
causing this variability have been identified in the literature, but they predominantly refer to the nature and/or context of the whole project
rather than specific activities. In this paper, the order of magnitude of and correlation between activity duration and cost variability is analyzed
in 101 construction projects with over 5,000 activities. To do this, the first four moments (mean, standard deviation, skewness, and kurtosis) of
actual versus planned duration and cost (log) ratios are analyzed by project, phase of execution, and activity type. Results suggest that,
contrary to common wisdom, construction activities do not end late on average. Instead, the large variability in the activity duration is
the major factor causing significant project delays and cost overruns. The values of average activity duration and cost variability gathered
in this study will also serve as a reference for construction managers to improve future construction planning and project simulation studies
with more realistic data. DOI: 10.1061/(ASCE)CO.1943-7862.0001739. © 2019 American Society of Civil Engineers.
Author keywords: Scheduling; Activity variability; Merge event bias; Network topology; Project delays.

Introduction around. In fact, this would constitute a compelling reason why so


many construction projects end late and exceed their initial budget.
Construction activities usually suffer from variability in terms of Factors that cause projects to end late or result in cost overruns
both duration and cost. With each construction project being unique, have been studied in the construction literature for a long time.
factors behind this variability are plentiful (Ballesteros-Pérez et al. Some of the most recurrent are poor planning and control practices,
2017). These factors, which include project location, clients, regu- deficient construction site management, shortages of labor and/or
lations, labor, equipment, technology, subcontractors, experience, low productivity, problems with the supply chain and/or procure-
stakeholders, and even the project team, are likely to change, at least ment practices, contractor’s and/or client’s financial problems,
partially, among projects (Chudley and Greeno 2016). All these fac- project specifications or design changes, communication and/or
tors, plus many others, make the duration and cost estimation ex- coordination problems among stakeholders, interferences with on-
ercise a challenging task for construction managers. site services, adverse weather conditions, and legal disputes and
It may be easy to believe, however, that construction activities are contract claims (Ballesteros-pérez et al. 2015, 2018b). Among all
apparently more likely to end later and cost more than the other way these, however, poor planning and control practices are consistently
among the most pervasive (AlSehaimi and Koskela 2008).
1
Ramon y Cajal Senior Researcher, Dpto. de Ingeniería Mecánica y Ballesteros-Pérez et al. (2018a) recently showed how the most
Diseño Industrial, Universidad de Cádiz, Avda. Univ. de Cádiz 10, Puerto common scheduling techniques [Gantt chart, critical path method,
Real, Cádiz 11519, Spain (corresponding author). ORCID: https://orcid
and project evaluation and review technique (PERT)] consistently
.org/0000-0002-4629-9664. Email: [email protected]
2
Associate Professor, Dpto. Tecnología Minera, Topográfica y Estruc- underestimate the actual project duration and cost. One of the major
turas, Universidad de León, Avda. Astorga, s/n., Ponferrada, León 24400, causes of this underestimation comes specifically from neglecting
Spain. Email: [email protected] activity duration variability.
3
Senior Lecturer, School of Architecture, Building and Civil En- Apart from the classical scheduling techniques, more advanced
gineering, Loughborough Univ., Loughborough LE11 6DF, UK. Email: techniques for getting improved project duration and/or cost esti-
[email protected] mates have been proposed over the years [e.g., fuzzy logic, neural
4
Associate Professor, Grupo de Investigación en Diseño y Dirección de network analysis, Monte Carlo simulations, artificial intelligence
Proyectos, Departamento de Proyectos de Ingeniería, Universitat Politèc-
nica de València, Camino de Vera s/n, Valencia 46022, Spain. Email:
methods, many variants of PERT, and even more extensions of
[email protected] earned value management (Ballesteros-Pérez 2017a)]. What all
5
Associate Professor, School of the Built Environment, Univ. of Read- these methods have in common, classical and modern alike, is that
ing, Whiteknights, Reading RG6 6DF, UK. Email: g.d.larsen@reading they all require some prior estimates of the potential activity dura-
.ac.uk tions and costs. For example, PERT-related techniques generally
6
Assistant Professor, Dpto. de Ingeniería Mecánica y Diseño Industrial, resort to three-point estimates (pessimistic, optimistic, and most
Universidad de Cádiz, Avda. Univ. de Cádiz 10, Puerto Real, Cádiz 11519, likely durations and costs); Monte Carlo simulations require the
Spain. Email: [email protected]
statistical distributions of each activity as input; and neural network
Note. This manuscript was submitted on February 1, 2019; approved on
May 17, 2019; published online on November 9, 2019. Discussion period analysis and artificial intelligence methods require training sets
open until April 9, 2020; separate discussions must be submitted for of similar construction projects. Access to this information is often
individual papers. This paper is part of the Journal of Construction the major limitation of these methods. Similarly, realistic data on
Engineering and Management, © ASCE, ISSN 0733-9364. the correlation between activity duration and costs is also a rare

© ASCE 04019093-1 J. Constr. Eng. Manage.

J. Constr. Eng. Manage., 2019, 146(1): 04019093


commodity, which forces these techniques to either assume inde- context (same region, client, or type of projects, or a combination
pendence between activities and costs or resort to subjective cor- of these).
relation factors (Banerjee and Paul 2008; Cho 2009). Consequently, Conversely, there have hardly been any studies measuring the
when a sufficient quantity or quality of information is not available, activity durations and costs, let alone their variability in real con-
the forecasting accuracy of the actual project duration and/or cost is struction projects. With the exception of the study by Trietsch et al.
expected to be unreliable. (2012) mentioned previously, perhaps the closest are a handful of
Unfortunately, despite its importance, there is a dearth of re- studies analyzing the sensitivity of the project duration to different
search into activity duration and cost variability in the construction levels of activity mean duration and dispersion (e.g., Elmaghraby
management literature. The only exception would perhaps be the et al. 1999; Elmaghraby 2000).
work of Trietsch et al. (2012), who attempted to establish a distri- Additionally, but from a purely mathematical and simulation per-
bution that satisfactorily describes construction activity durations. spective, some studies have tried to gauge to what extent the adopted
However, as suggested early on by MacCrimmon and Ryavec activity statistical distributions have a significant repercussion on
Downloaded from ascelibrary.org by University of Western Ontario on 11/17/19. Copyright ASCE. For personal use only; all rights reserved.

(1964), trying to find a universal distribution that fits all types of the final project duration. In this regard, a recent study by Hajdu
activities is a futile effort because each type of activity is unique. and Bokor (2014) concluded that the maximum project duration
Furthermore, its context might also have a significant influence that deviation when using alternative activity distributions was generally
is difficult, if not impossible, to parameterize mathematically. well below 10%. This finding resonated with observations from an
Nonetheless, these difficulties should not be a deterrent to at earlier study on the limitations of PERT. MacCrimmon and Ryavec
least attempting to measure the average level of variability of con- (1964) showed that, if triangular distributions for modeling activity
struction activity durations and costs. As argued, this would be an durations had been chosen instead of beta distributions, the prob-
extremely valuable input for future project duration and cost fore- abilistic project duration would have produced almost identical
casting techniques, as well as providing powerful baseline informa- results.
tion for enhancing project control and monitoring. The reason why the choice of a particular statistical distribution
Hence, the present paper precisely attempts to fill this research does not seem that relevant is because the third and fourth moments
gap in the construction management literature by measuring the (skewness and kurtosis) are blurred very quickly in stochastic net-
average level of activity duration and cost variability. It will also work analysis (SNA) (Hajdu and Bokor 2016), which is currently
justify how and why, given this level of activity variability in com- considered the most accurate approach to model project schedule
networks (Ballesteros-Pérez 2017b). In SNA, activity durations and
mon project networks, it is expected that most construction projects
costs are modeled by statistical distributions (with or without cor-
end late and go over budget. To achieve this, the actual/planned
relation with each other). More precisely, distributions are summed
(log) ratios of many project and activity durations and costs will
when computing the total costs of activities or the total duration of
be analyzed. The correlation between activity durations and costs
activities arrayed in series. On the other hand, the maximum of dis-
will also be studied. Finally, the most common network topologies
tributions (instead of a sum) is calculated whenever one calculates
(descriptors of what the project networks are like, that is, how ac-
the total duration of a set of activities placed in parallel. In either
tivities are arranged and connected with each other) will be sum-
case, the third and fourth moments (skewness and kurtosis) have a
marized, and the potential impact of activity variability on these
minor influence on the resulting distribution of a path or project
networks are described in detail. duration.
The paper will be structured as follows. The “Background” However, the first two moments (mean and variance, or alter-
section will provide an overview of the importance of the first four natively, standard deviation) play a major role in the resulting dis-
moments of the activity duration and costs impacting the final tribution modeling the total project duration. When there is some
project duration and cost. This section will introduce the concept correlation between durations and costs (virtually always in con-
of merge event bias and describe how it may cause project delays struction projects), they also have an indirect but still significant
and cost overruns depending of each project network topology. The influence on the final project cost.
“Materials and Methods” section will describe how a data set of To sum up, when two or more distributions are convoluted
101 projects was classified according to different activity catego- (summed for computing the project cost or the duration of activities
ries, and then their log actual versus planned durations and cost in series) the resulting distribution, by the central limit theorem,
deviations analyzed activity by activity. The “Discussion” section quickly converges to a normal distribution. The mean and variance
will provide insights into what the numerical results mean and how of this normal distribution correspond to the sum of means and var-
they are connected to the project network topology in common con- iances, respectively, of the individual activity distributions. There-
struction projects. Finally, the “Conclusions” section will summa- fore, the first two moments will mostly determine what the resulting
rize the whole analysis, highlight the major contributions to the distribution looks like.
body of knowledge, state the study limitations, and propose future When some activities are arranged in parallel and they all need
research continuations. to finish before the project can continue, the resulting distribution
quickly converges to an extreme value distribution of maxima (nor-
mally a Fréchet or a Gumbel distribution) (Dodin and Sirvanci
Background 1990). Again, the first two moments of the involved activity dis-
tributions will determine the location and scale of the resulting ex-
There have been numerous studies analyzing delays and cost over- treme value distribution. This phenomenon is commonly known as
runs in construction projects at project level (e.g., Hamzah et al. the merge event bias (Khamooshi and Cioffi 2013; Vanhoucke
2011; Keane and Caletka 2008; Mahamid et al. 2012; Ogunlana 2012), and it is indeed the major source of inaccuracy of all deter-
et al. 1996; Orangi et al. 2011; Senouci et al. 2016). Most studies ministic scheduling techniques.
have focused on either establishing the causes of delays and cost Real construction project schedules (networks) generally in-
overruns and/or proposing some regression analyses to avoid slip- volve many subsets of activities both arranged in parallel and in
pages in the future. Generally, these studies have been aligned with series. Hence, multiple convolutions (sums) and maxima of distri-
a more reductionist perspective, seeking to emphasize a particular butions need to be computed so that the final project duration and

© ASCE 04019093-2 J. Constr. Eng. Manage.

J. Constr. Eng. Manage., 2019, 146(1): 04019093


cost can be calculated. The influence of each activity’s first two At the time of writing, the complete project data set is curated
moments (mean and standard deviation) will be key in this final by the Operations Research and Scheduling Research Group at
result. This justifies why an order of magnitude of these two mo- Ghent University (OR-AS.be 2018) and contains 125 projects,
ments is worth collecting from a representative data set of real con- of which 24 projects were not used because they did not include
struction activities. tracking information (actual activity durations and costs). All 125
Finally, another factor that determines how the activity distribu- projects, however, can be accessed at the website of OR-AS.be
tions are merged with each other is dependent on the project net- (2018). The major features of the 101 construction projects selected
work topology itself. Network topology refers to the logical layout for this study are summarized in Table 1. The last four columns of
of a network (a project schedule). It defines the way different ac- Table 1 include some project network topological information
tivities (often referred to as nodes) are placed and interconnected (indicators SP, AD, LA, and TF) that will be used later.
with each other. Many metrics have been proposed for describing The authors deem the variety and number of project types,
the network configuration. Some well-known examples are the co- costs, durations, topologies, and number of activities as sufficiently
efficient of network complexity (Davies 1973; Pascoe 1966), the representative for a first representative analysis. Yet, further details
Downloaded from ascelibrary.org by University of Western Ontario on 11/17/19. Copyright ASCE. For personal use only; all rights reserved.

order strength (Mastor 1970) and the complexity index (Bein et al. and specific project information can also be found as individual
1992). However, these only capture the project complexity and will project cards at OR-AS.be (2018).
not be used here.
Instead, this study will make use of four topology measures
that describe the structure of an activity-on-the-node network, not Analysis Outline
just its complexity. These measures were initially proposed by This analysis focuses first on the activity-level deviations of
Valadares Tavares et al. (1999) and later improved by Vanhoucke durations and costs. Project-level data will also be analyzed sub-
(2008). The four measures (also named indicators) used are: serial- sequently, but from a complementary point of view to activities
parallel (SP) indicator, activity distribution (AD), length of arcs analyses. The activity duration and cost deviations are calculated
(LA) indicator, and topological float (TF) which will be explained for each activity i in the first data set according to the following
in the following sections. All these indicators range between 0 two expressions, respectively:
and 1 and constitute simple measures describing to what extent the
first two moments of the construction activities may condition the Activity duration deviation of activity i
final duration and cost of a project.  
Actual duration of activity i
¼ LOG10 ð1Þ
Planned duration of activity i
Materials and Methods
Activity cost deviation of activity i
In this section, the characteristics of the projects and activity data  
sets analyzed are described. Details of how the activity and project Actual cost of activity i
¼ LOG10 ð2Þ
data were filtered and categorized under multiple levels of analysis Planned cost of activity i
are also presented. Next, the first four moments of activity durations
and costs are reported and commented upon separately. Finally, the It is worth emphasizing that both of these ratios are expressed in
correlations between activity durations and costs are reported along logarithmic scale. This is important because ratios of variables that
with their statistical significance. are always positive (e.g., durations and costs) are not symmetrical
respect to the value 1. The scale distortion of these ratios (they
range between 0 and 1 when the denominator is bigger than the
Projects and Activities Data Set
numerator, but between 1 and +infinity when the numerator is big-
This research used two different project data sets. The first (and ger than the denominator) creates an artificial positive skewness in
main) one is analyzed at both activity and project levels. The sec- the data distribution that can only be removed by taking the log
ond data set contains project-level information (planned and actual ratios beforehand. Additionally, in log scale, the variable variances
project durations and costs) and will be used for illustrative pur- are additive, rather than multiplicative.
poses in the “Discussion” section. Therefore, this study will take the logarithm of every ratio before
In order to obtain representative values of the first four moments analyzing their activity duration and cost moments. Logarithms with
of the activity durations and costs, a significant amount of activities base 10 were used because their orders of magnitude are a little more
is necessary. In the first data set, 101 construction projects are an- familiar, but any other base would have been possible.
alyzed initially encompassing 5,697 activities. All these activities Lastly, ratios in natural scale from 0 to 1 correspond to values
analysis can be found in the Supplemental Data. from -infinity to 0 in any log scale, whereas ratios in natural scale
Projects are classified in four types: building, civil engineer- from 1 to +infinity correspond to the (0; þ∞Þ range. Both ranges
ing, industrial, or services. Building projects are mostly aimed at also have a symmetrical correspondence with each other in log
constructing a building or parts of a building. Civil engineering re- scale (e.g., ratios 1=2 and 2 in natural scale have the same values
fers to infrastructure construction in general. Industrial projects re- with opposite signs in log scale, that is −0.301 and 0.301, respec-
fer to installations and/or electromechanical equipment. Services tively) which makes the interpretations of variability results easier.
refer to projects with a significant operational and/or production Bearing this in mind, the next step consists of describing how
component. the activities were grouped to analyze their ratios and produce
The 101-project data set was retrieved from a real projects data robust results. The progressive classification levels can be found
set originally developed by Batselier and Vanhoucke (2015) and in Tables 2–5.
Vanhoucke et al. (2016). Although the exact locations of those proj- In every table, three levels of activity classifications are pre-
ects were not disclosed in most cases (due to a confidentiality clause sented. Each level consists of three types of activities:
with the information donors), it is known that most of them occurred • Planned and performed (P&P): these activities correspond to ac-
in Belgium, the Netherlands, Italy, the US, and Azerbaijan. tivities that were initially planned and were also finally executed

© ASCE 04019093-3 J. Constr. Eng. Manage.

J. Constr. Eng. Manage., 2019, 146(1): 04019093


Table 1. First projects data set summary
Planned Actual
Planned Actual duration duration No. SP AD LA TF
Project ID Project name Project type cost (€) cost (€) (days) (days) activity (%) (%) (%) (%)
C2011-05 Telecom System Agnes Service 180,485.27 180,485.27 43 53 20 60 58 38 9
C2011-07 Patient transport system Service 180,759.44 191,065.06 389 444 49 70 70 7 8
C2011-10 Building a house Building 484,398.41 494,947.71 195 203 32 51 47 27 10
C2011-12 Claeys-Verhelst premises Building 3,027,133.19 3,102,395.91 443 453 49 41 50 5 43
C2011-13 Wind farm Civil engineering 21,369,835.51 26,077,764.74 525 600 107 27 36 0 48
C2012-13 Pumping Station Jabbeke Industrial 336,410.15 350,511.31 125 140 74 64 59 3 27
C2012-15 The Master Project Service 185,472.45 185,113.10 32 32 121 17 66 0 84
C2012-17 Building a Dream Building 241,015.00 314,856.14 145 204 33 65 61 35 19
C2013-01 Wiedauwkaai Fenders Civil engineering 1,069,532.42 1,314,584.58 152 152 39 48 45 0 68
Downloaded from ascelibrary.org by University of Western Ontario on 11/17/19. Copyright ASCE. For personal use only; all rights reserved.

C2013-02 Sewage plant Hove Civil engineering 1,236,603.66 1,146,444.38 403 408 175 12 38 0 62
C2013-03 Brussels Finance Tower Building 15,440,865.89 16,338,027.20 425 426 55 3 82 0 87
C2013-04 Kitchen Tower Anderlecht Building 2,113,684.00 2,512,524.00 333 453 244 47 59 0 63
C2013-05 PET packaging Service 874,554.28 874,554.28 521 632 28 14 69 0 80
C2013-06 Government office building Building 19,429,810.51 21,546,846.18 352 344 275 10 36 0 34
C2013-07 Family residence Building 180,476.47 175,030.65 170 174 46 40 44 3 25
C2013-08 Timber house Building 501,029.51 576,624.05 216 235 41 29 42 0 47
C2013-09 Urban development project Civil engineering 1,537,398.51 1,696,971.79 291 360 71 34 51 6 16
C2013-10 Town square Civil engineering 11,421,890.36 15,218,926.38 786 785 186 18 36 0 62
C2013-11 Recreation complex Building 5,480,518.91 5,451,028.00 359 277 159 27 44 0 32
C2013-12 Young cattle barn Building 818,439.99 879,853.17 115 188 27 64 77 6 54
C2013-13 Office finished works (1) Building 1,118,496.59 955,929.22 236 217 11 20 49 33 6
C2013-14 Office finished works (2) Building 85,847.89 75,468.30 80 88 9 62 80 66 47
C2013-15 Office finished works (3) Building 341,468.11 308,343.78 171 115 17 25 43 21 35
C2013-16 Office finished works (4) Building 248,203.92 198,567.00 196 108 7 33 62 0 75
C2013-17 Office finished works (5) Building 244,205.40 203,605.97 161 107 23 36 38 20 32
C2014-01 Mixed-use building Building 38,697,822.73 39,777,643.30 474 448 41 50 38 3 49
C2014-02 Playing cards Industrial 191,492.70 190,266.50 124 146 21 81 94 0 14
C2014-03 Organizational development Service 43,170.15 83,712.15 229 260 112 9 31 0 36
C2014-04 Compression Station Zelzate Industrial 62,385,597.58 65,526,930.04 522 844 24 95 100 0 100
C2014-05 Apartment building (1) Building 532,410.29 591,410.53 228 274 25 58 71 35 18
C2014-06 Apartment building (2) Building 3,486,375.47 3,599,114.11 547 611 29 57 75 46 15
C2014-07 Apartment building (3) Building 1,102,536.78 1,289,696.78 353 404 25 58 71 35 18
C2014-08 Apartment building (4) Building 1,992,222.09 2,380,299.86 233 275 39 44 29 11 14
C2015-01 Young cattle barn (2) Building 612,769.44 646,473.65 131 210 27 57 73 0 46
C2015-02 Railway station (1) Civil engineering 1,121,316.94 967,988.79 417 501 216 8 66 1 80
C2015-03 Industrial complex (1) Building 2,244,090.74 1,868,796.28 257 278 135 16 43 0 58
C2015-04 Apartment building (5) Building 2,750,938.00 2,590,796.73 160 205 56 27 37 0 57
C2015-06 Family residence (2) Building 143,673.20 186,107.00 260 290 184 18 0 30 38
C2015-07 Industrial complex (2) Building 5,999,600.00 5,414,544.00 297 313 138 27 38 0 49
C2015-08 Garden center Building 467,297.21 461,900.17 191 186 186 14 52 0 79
C2015-09 Railway station (2) Civil engineering 1,457,424.00 2,145,682.26 354 569 340 4 48 0 75
C2015-10 Tax return system (1) Service 18,990.00 8,010.00 85 85 15 10 82 23 21
C2015-11 Staff authorized system Service 14,400.00 9,105.00 55 55 7 25 66 0 52
C2015-12 Premium payment system Service 132,570.00 58,410.00 184 184 35 19 63 9 61
C2015-13 Broker Access Convection System Service 12,735.00 9,990.00 117 117 16 19 60 7 51
C2015-14 Superior Pensions Database Service 34,260.00 18,285.00 124 124 17 17 55 3 50
C2015-15 FACTA System Service 11,700.00 7,035.00 57 57 13 22 57 8 18
C2015-16 Generic document output system Service 64,620.00 64,125.00 270 270 22 10 61 12 26
C2015-17 Insurance bundling system Service 281,430.00 281,070.00 208 236 86 6 77 8 41
C2015-18 Tax return system (2) Service 39,450.00 25,380.00 128 128 15 10 66 16 11
C2015-19 Receipt number system Service 43,800.00 37,530.00 182 182 20 21 46 8 31
C2015-20 Policy numbering system Service 12,645.00 11,100.00 171 161 6 20 62 20 13
C2015-21 Investment product (1) Service 4,020.00 3,240.00 37 37 12 18 35 2 36
C2015-22 Risk profile questionnaire Service 29,880.00 17,400.00 151 151 22 16 70 9 40
C2015-23 Investment product (2) Industrial 46,920.00 32,805.00 122 120 33 17 53 5 39
C2015-24 CRM system Service 44,130.00 36,870.00 233 233 21 7 59 7 29
C2015-25 Beer tasting Service 1,210.00 1,780.00 14 14 18 16 40 21 19
C2015-26 Debt collection system Service 458,112.37 512,546.15 148 154 214 9 43 0 61
C2015-27 Railway station Antwerp Building 22,703.52 25,313.12 68 81 18 23 40 −2 54
C2015-28 Website tennis Vlaanderen Service 219,275.00 382,475.00 201 212 20 15 54 0 67
C2015-29 Fire station Building 1,874,496.82 1,887,087.25 284 298 204 48 34 0 41
C2015-30 Social apartments Ypres (1) Building 440,940.89 440,940.89 244 254 40 25 51 −1 76
C2015-31 Social apts Ypres (2) Building 1,310,723.46 1,282,185.98 271 364 29 32 49 23 43
C2015-32 Social apts Ypres (3) Building 2,509,031.42 2,509,031.42 358 265 48 38 63 3 59
C2015-33 IJzertoren Memorial Square Civil engineering 214,417.71 224,789.67 50 94 12 63 57 0 14

© ASCE 04019093-4 J. Constr. Eng. Manage.

J. Constr. Eng. Manage., 2019, 146(1): 04019093


Table 1. (Continued.)
Planned Actual
Planned Actual duration duration No. SP AD LA TF
Project ID Project name Project type cost (€) cost (€) (days) (days) activity (%) (%) (%) (%)
C2015-34 Roadworks Poperinge Civil engineering 511,325.86 440,394.16 120 193 13 91 99 0 18
C2015-35 Retirement apartments Building 14,956,314.25 16,068,878.30 850 951 11 48 57 21 35
C2016-01 Railway bridge (1) Civil engineering 671,383.50 703,703.50 225 274 26 51 71 0 86
C2016-02 Railway bridge (2) Civil engineering 962,181.56 972,341.56 229 239 23 63 71 0 82
C2016-03 Railway bridge (3) Civil engineering 926,888.01 910,728.01 203 220 25 16 37 0 56
C2016-04 Railway bridge (4) Civil engineering 906,253.87 906,253.87 248 242 26 64 62 0 71
C2016-05 Railway bridge (5) Civil engineering 832,497.46 832,497.46 195 197 32 77 74 0 51
C2016-06 Defense building Service 4,331,260.49 4,331,260.49 252 232 96 14 55 0 76
C2016-07 Shopping village walkways Civil engineering 930,179.09 932,757.25 224 316 110 95 98 0 99
Downloaded from ascelibrary.org by University of Western Ontario on 11/17/19. Copyright ASCE. For personal use only; all rights reserved.

C2016-08 SCM system Service 375,253.34 438,741.66 725 725 99 49 59 8 52


C2016-09 Data loss preventive system Service 584,951.77 1,425,155.96 195 189 113 10 36 1 51
C2016-10 Biofuel refinery Industrial 14,362,625.00 14,466,100.00 360 375 23 18 22 6 21
C2016-11 Residential house (1) Building 162,472.00 163,189.00 241 254 55 57 77 52 16
C2016-12 Residential house (2) Building 222,858.00 226,285.00 291 291 59 56 72 50 19
C2016-13 Residential house (3) Building 367,952.00 379,300.00 306 330 51 64 81 54 14
C2016-14 Residential house (4) Building 218,366.00 222,021.78 321 320 48 68 78 42 10
C2016-15 Residential house structure work Building 95,694.00 100,763.00 126 130 13 66 75 100 0
C2016-16 Residential finishings works (1) Building 54,577.76 64,526.76 90 90 24 69 68 50 28
C2016-17 Residential finishings works (2) Building 54,703.17 64,580.17 86 86 24 69 68 50 28
C2016-18 Residential finishings works (3) Building 51,115.52 60,829.52 91 91 25 66 62 27 31
C2016-19 Residential finishings works (4) Building 51,303.38 53,351.38 91 91 25 66 62 27 31
C2016-20 Residential finishings works (5) Building 52,021.28 53,783.28 91 91 25 66 62 27 31
C2016-21 Residential finishings works (6) Building 54,324.22 54,996.22 101 101 24 69 68 50 28
C2016-22 Residential finishings works (7) Building 56,969.40 57,822.40 101 101 24 69 68 50 28
C2016-23 Residential finishings works (8) Building 56,182.71 56,645.71 101 101 24 69 68 50 28
C2016-24 Residential finishings works (9) Building 52,262.83 53,176.83 101 101 24 69 68 50 28
C2016-25 Residential finishings works (10) Building 54,580.33 56,748.33 91 91 24 69 68 50 28
C2016-26 Residential finishings works (11) Building 51,286.24 53,319.24 91 91 24 69 68 50 28
C2016-27 Apartments building foundation (1) Building 813,663.06 879,701.06 78 88 16 66 59 0 48
C2016-28 Apartments structure work (1) Building 569,177.85 586,086.85 71 79 19 55 29 0 30
C2016-29 Apartments structure work (2) Building 1,797,873.62 1,860,330.62 129 148 19 72 69 0 35
C2016-30 Apartments structure work (3) Building 1,319,736.29 1,353,361.29 85 96 23 81 83 0 31
C2016-31 Apartments structure work (1) Building 488,936.00 498,473.00 105 117 23 31 40 0 11
C2016-32 Apartments structure work (2) Building 477,381.00 496,991.00 89 97 22 52 72 0 27
C2016-33 Apartments structure work (3) Building 377,282.00 394,829.00 116 129 23 50 72 0 30
C2016-34 Apartments structure work (4) Building 362,476.00 383,871.00 83 92 23 40 43 0 26
Average — — 2,647,861.81 2,837,446.83 221 240 Σ ¼ 5,697 41.0 58.2 14.1 40.5
Note: PET = polyethylene terephthalate; CRM = customer relationship management; and SCM = supply chain management.

Table 2. Total number of activities analyzed: Level 0 (all activities from all necessary and had to be eventually carried out. These activities
projects) were removed from the analysis because their ratios converged
Planned and Unplanned but Planned but not to + infinity (because the planned values in the denominators
performed performed performed equal 0), and because most of the time they come from planning
mistakes or omissions.
5,289 279 129
• Planned but not performed (PBNP). These activities corre-
spond to activities that were initially planned, but that were not
in the projects analyzed. These are the most frequent and the executed in the end. These activity ratios would equal zero in nat-
only ones that are considered in the analysis. ural scale but their logarithmic values would converge to − infinity.
• Unplanned but performed (UBP): these activities correspond to They also represent bad estimates of the planned schedule like
activities that were not initially planned but that were deemed UBP activities; hence, they were also removed from the analysis.

Table 3. Number of activities analyzed: Level 1 (activities classified by project type)


Actual duration
No. activities Actual cost (103 €) (days)
Planned and Unplanned but Planned but not
Project type N performed performed performed Average SD Average SD
Building 56 2,894 18 12 48.88 267.20 11.35 29.78
Civil engineering 15 1,092 250 59 40.43 161.26 12.92 15.92
Industrial 5 170 0 5 473.92 1,225.85 21.60 48.60
Services 25 1,133 11 53 8.03 31.15 11.13 31.48
Sum 101 5,289 279 129 — — — —

© ASCE 04019093-5 J. Constr. Eng. Manage.

J. Constr. Eng. Manage., 2019, 146(1): 04019093


Table 4. Number of activities analyzed: Level 2 (activities classified by project phase)
Project type
Building Civil engineering Industrial Service
Planned and Unplanned but Planned and Unplanned but Planned and Unplanned but Planned and Unplanned but
Project phase performed performed performed performed performed performed performed performed
Planning 49 0 38 0 10 0 81 0
Execution 2,810 18 1,034 250 154 0 990 11
Closure 35 0 20 0 6 0 62 0
Downloaded from ascelibrary.org by University of Western Ontario on 11/17/19. Copyright ASCE. For personal use only; all rights reserved.

Table 5. Number of activities analyzed: Level 3 (activities classified by activity type)


Project type
Building Civil Engineering
Planned and Unplanned but Planned but not Planned and Unplanned but Planned but not
Activity type performed performed performed performed Performed performed
Auxiliary works 139 1 0 207 27 9
Substructure 171 2 0 229 11 4
Superstructure 654 1 0 257 104 20
Specialized works 1,272 11 10 264 88 25
Facilities 574 3 2 77 20 1

Concerning activity grouping, four levels of analysis (0–3) were It also reflects that execution activities belonging to services pro-
considered: jects, despite being higher in number, were found to be too hetero-
• Level 0 (Table 2) contains all activities analyzed from all pro- geneous. The latter made it hard to classify these activities within
jects. This allows drawing general average conclusions without similar self-contained categories (services projects are indeed much
paying attention to proportions nor types of those activities. more varied regarding the nature of their activities).
• Level 1 (Table 3). Here, activities are classified under the same
four types of projects stated in Table 1 (building, civil engineer-
Activity Duration Results
ing, industrial, and services). As expected, this level allows ana-
lyzing how the activity durations and costs deviations differ by The first four moments (average, standard deviation, skewness, and
(generic) types of projects. Some group average and dispersion kurtosis) of the activities log ratios were analyzed according to the
results of activity durations and costs are also included for four levels described in Tables 2–5. Table 6 presents the results for
reference on the right columns. the activity duration log ratios [log10 (actual/planned)].
• Level 2 (Table 4). Within the previous four project type For each case and level analyzed, four numerical values are
categories, activities are further classified into three standard displayed: n (sample size, that is, the number of activities used to
phases of the every project life cycle according to the pro- calculate the four moments), and the four moment values (in log-
ject management body of knowledge (PMBoK): planning, arithmic scale). However, due to the major relevance of the first two
execution, and closure (Project Management Institute 2017). moments (average and standard deviation), these two have also
Classifying activities into these three categories is straightfor- been included in natural scale within parentheses right below their
ward with the activity descriptions available in almost all pro- respective logarithmic values. Values in natural scale are expected
jects. The fourth phase considered by the PMBoK (monitoring to help the reader to better grasp the order of magnitude of these
and control) is not relevant for this analysis and therefore not moments. With this information, Table 6 is self-explanatory.
considered. The number of readings and details in this table are numerous,
• Level 3 (Table 5). For the execution phase of building and civil so attention is given here to the most relevant findings.
engineering projects, only activities are further classified into Concerning averages, it is striking to observe how most values
five generic groups, called here activity types (auxiliary works, remain very close to 0 (in log values) or 1 (in natural values). Some
substructure, superstructure, specialized works, and facilities). exceptions may be services projects and the planning-phase activ-
These are also common and relatively straightforward groups ities (Level 2) from building and civil engineering projects. Yet, in
of activities in most construction projects. A more detailed de- the latter, average ratios values remain close to 5% (in log values)
scription of the scope of each group the reader has been given by or 11% (in natural values). Overall, as these log ratios are so close
Chudley and Greeno (2016). to zero; this suggests that construction activities do not end late (on
Level 3 allowed classifying activities into one last level right above average). This may be an unexpected finding because the easier
the nature of the activity itself. Activities in this level were classi- explanation for projects ending late was that its activities ended late
fied mostly thanks to the descriptions of the project summary ac- on average. This result seems to suggest the problem lies some-
tivities (that were indeed not used for anything else in the analysis). where else.
Finally, as highlighted at the beginning of Level 3’s description, Concerning the standard deviation (SD) values, results are
only activities from the execution phase of building and civil very different. SD, by definition, can only be positive, but it is
engineering projects were used. This is due to the number of ex- quite clear that, unlike the averages, SDs are not close to zero. In-
ecution activities in industrial projects being considered too low. stead, with a few exceptions, SD values are almost always above

© ASCE 04019093-6 J. Constr. Eng. Manage.

J. Constr. Eng. Manage., 2019, 146(1): 04019093


Downloaded from ascelibrary.org by University of Western Ontario on 11/17/19. Copyright ASCE. For personal use only; all rights reserved.

Table 6. Activity actual/planned duration log ratios (natural values in parentheses)


Level 0 (all activities) Level 1 (by project type) Level 2 (by project phase) Level 3 (by activity type)

© ASCE
n Average SD Skewness Kurtosis Type n Average SD Skewness Kurtosis Phase n Average SD Skewness Kurtosis Type n Average SD Skewness Kurtosis
5,289 0.010 0.19 0.91 9.90 Building 2,894 0.004 0.15 −0.36 9.88 Planning 49 0.035 0.21 1.51 8.13 Insufficient data sample
(1.023) (1.56) (1.009) (1.43) (1.083) (1.62)
Execution 2,810 0.003 0.15 −0.46 9.92 Auxiliary 139 0.017 0.16 0.36 5.54
(1.007) (1.42) works (1.040) (1.46)
Substructure171 0.035 0.14 1.89 8.38
(1.083) (1.39)
Superstructure 654 −0.018 0.16 −0.98 9.60
(0.960) (1.45)
Specialized 1,272 0.004 0.15 −0.87 10.02
works (1.010) (1.42)
Facilities 574 0.011 0.14 0.53 11.71
(1.026) (1.39)
Closure 35 0.022 0.16 1.33 3.91 Insufficient data sample
(1.052) (1.45)
Civil 1,092 −0.008 0.20 0.53 9.61 Planning 38 0.052 0.18 2.38 12.88 Insufficient data sample
Engineering (0.982) (1.58) (1.126) (1.53)
Execution 1,034 −0.010 0.20 0.49 9.36 Auxiliary 207 0.013 0.13 1.75 9.43
(0.977) (1.58) works (1.030) (1.36)
Substructure 229 −0.005 0.18 −0.39 8.57
(0.990) (1.53)
Superstructure 257 −0.030 0.20 0.90 10.73
(0.934) (1.57)
Specialized 264 −0.012 0.24 0.28 5.81

04019093-7
works (0.972) (1.72)
Facilities 77 −0.018 0.26 1.09 10.65
(0.959) (1.82)
Closure 20 −0.011 0.05 −4.47 20.00 Insufficient data sample
(0.975) (1.12)

J. Constr. Eng. Manage., 2019, 146(1): 04019093


Industrial 170 −0.010 0.22 −0.76 3.37 Planning 10 0.001 0.05 0.43 4.59 Insufficient data sample
(0.977) (1.65) (1.003) (1.12)
Execution 154 −0.009 0.23 −0.76 3.17 Insufficient data sample
(0.981) (1.68)
Closure 6 −0.090 0.25 0.04 0.81 Insufficient data sample
(0.813) (1.77)
Services 1,133 0.045 0.26 1.53 5.87 Planning 81 0.055 0.22 1.05 3.55 Insufficient data sample
(1.110) (1.83) (1.134) (1.67)
Execution 990 0.048 0.27 1.58 5.70 Insufficient data sample
(1.118) (1.86)
Closure 62 −0.014 0.19 −1.36 7.74 Insufficient data sample
(0.969) (1.54)

J. Constr. Eng. Manage.


0.15 (in log scale) or 43% (in natural scale) between the actual and Activity Duration and Cost Correlations
planned durations. This is an extremely high level of variability
Numerical results of the log ratios of the first four moments offered
and, despite construction activities do not end late on average, they
very interesting information about the nature of duration extensions
do suffer from wide dispersions that condition to a big extent the
and cost overruns at activity level. However, it is not the intention
project-level delays, which will be justified later. On a secondary
of this study to find a distribution that fits these four moments. As
note, industrial and services projects also have a bigger variability
than the other types of projects. Interpretations by project phase suggested by other researchers and also discussed previously, each
(Level 2) and activity type (Level 3) are more varied. activity is different in nature and it is quite likely that a fit-for-all
The results for skewness are relatively uniform. A common rule distribution does not exist. Indeed, on observing the wide range of
of thumb assumes that skewness values ranging from −2 to þ2 are skewness and kurtosis values in Tables 6 and 7, that seems to be
indicative of a low distribution asymmetry (George and Mallery exactly the case.
2010). This is the case in Table 6 with very few exceptions. There- However, a pending but also equally relevant issue is to analyze
fore, the log ratios distribution must be approximately symmetrical the potential correlation between activity duration variation and
Downloaded from ascelibrary.org by University of Western Ontario on 11/17/19. Copyright ASCE. For personal use only; all rights reserved.

and, combined with averages also close to zero, one can conclude cost variation. For this aim, all activities were grouped under the
that there is approximately the same probability of finding early very same levels previously described, and linear correlations were
activities than tardy activities. calculated among the duration log ratios and the cost log ratios. A
Concerning kurtosis, the picture is very different. Values are summary of this analysis is presented in Table 8. Spearman’s rho
generally well above 3, which would describe the kurtosis corre- and Kendall’s tau nonlinear (rank) correlations were also tested.
sponding to the normal distribution. This result means that log ratio However, they only very marginally improved the linear correlation
duration values resemble a peaked distribution with heavy tails. In results and were considered not worth including because they did
other words, the majority of the actual durations are not close to not seem to barely depart from the linear case given in Table 8.
their planned values. As stated previously, many other readings Table 8 is divided in two major blocks. The upper block is de-
may be extracted from Table 6. However, for the sake of clarity, voted to activity-level correlations. The lower block is reserved for
only the most relevant high-level interpretations are presented. project-level correlations. For each correlation, it has been specified
how many data points were used (column labeled n), Pearson’s cor-
relation coefficient (R), and coefficient of determination (R2 ), along
Activity Cost Results
with the gradient (slope column) and intercept of the linear regres-
Table 7 represents the first four moments of the activity actual sion lines. Statistically highly significant correlations have been
versus planned cost log ratios. Parentheses contain the antilogarith- identified separately for R2 tests (with the Snedecor’s F distribu-
mic (natural scale) values of the first two moments as well. Table 7 tion) and slope tests (with the Student’s T distribution). Significant
values differ substantially from those found in Table 6. statistical correlations have also been indicated.
Concerning average values, most of them are clearly positive In the case of activity-level correlations, almost all correlation
and generally above 1.01 (in log values) or alternatively above 3% values are significant. This mean that values of R2 are very unlikely
(in natural scale). A clear exception may be industrial projects, to have happened by chance. This is not the case at project-level
whose average is negative. This may be because industrial projects correlations, where, apart from the Level 0 of analysis (all 101 proj-
are frequently composed of electromechanical equipment whose ects grouped together), R2 values have not been found to be statisti-
procurement prices are relatively easier to estimate more accurately cally significant. This means one cannot count on the reliability
ex ante than other types of projects. Additionally, civil engineering of project-level duration-cost correlations, and hence they will be
and services projects are among the ones whose activities tend to ignored moving forward.
suffer from more cost overruns. This may be due to civil engineer- Correlations at the activity level do offer very interesting results.
ing projects being (generally) less standard than buildings, whose R and R2 evidence weak to moderate correlations (R2 ranging be-
average log ratios remain closer to 0. On the other hand, services tween 0.10 and 0.62), but the slopes of such correlations are rather
projects, as indicated in Table 6, suffered from more delays on aver-
close to 0.50 in some levels and almost all of them are significant.
age than other types of projects. Because these types of projects
More precisely, when there is no differentiation among activities
frequently more labor-intensive, it seems logical that those extra
(Level 0), the slope is as high as 0.704. This means that a 100%
durations are correlated with these extra costs.
activity duration extension (in log scale) would cause a 70.4% cost
Concerning SD, variability is even more evident than in the case
of duration log ratios. On Level 0, one can appreciate how the aver- increment on that activity. This is quite a high gradient.
age activity SD reaches 0.25 (78% of variability in natural scale). Differentiating by project type (Level 1), the slopes become
On Level 1, no project type has a variability below 0.16 (46% of more informative. Building and civil engineering projects boast
variability in the case of building projects), and two of them (civil a gradient close to 0.5, that is, every 100% of duration increment
engineering and services) remain above 0.30 (>100% of variabil- is likely to cause a 50% of cost increment for that activity. The other
ity). SDs on Levels 2 and 3 offer similar readings but with wider two types of projects have no statistically significant slopes; how-
values. ever, it seems clear that industrial projects (probably due to the
Concerning skewness, cost log ratios are more varied than their higher component of electromechanical equipment in the project
duration counterparts. In general, when average values are negative, budget) have lower slopes. On the contrary, services projects, being
the skewness values are also predominantly negative. Similarly, more labor-intensive, have higher slopes.
when the average costs are positive, the cost distribution is also Results by project phase (Level 2) seem more homogeneous.
positively skewed. However, only the execution activities’ slope is statistically signifi-
Concerning kurtosis, values are much higher than its duration cant. This level of correlation seems to replicate the results previ-
ratios counterpart too. This would be indicative again that most ac- ously provided for Level 0.
tivity actual costs substantially differ from their planned values Results at Level 3 are again not that heterogeneous, and they all
(a high proportion of the actual costs tend to be substantially differ- are statistically significant. However, there is nothing remarkable
ent from their planned costs). that has not been highlighted previously.

© ASCE 04019093-8 J. Constr. Eng. Manage.

J. Constr. Eng. Manage., 2019, 146(1): 04019093


Downloaded from ascelibrary.org by University of Western Ontario on 11/17/19. Copyright ASCE. For personal use only; all rights reserved.

Table 7. Activity actual/planned costs log ratios (natural values in parentheses)


Level 0 (all activities) Level 1 (by project type) Level 2 (by project phase) Level 3 (by activity type)

© ASCE
n Average SD Skewness Kurtosis Type n Average SD Skewness Kurtosis Phase n Average SD Skewness Kurtosis Type n Average SD Skewness Kurtosis
5289 0.031 0.25 2.49 15.56 Building 2894 0.015 0.16 2.02 25.27 Planning 49 −0.002 0.19 −1.66 10.45 Insufficient data sample
(1.074) (1.78) (1.035) (1.46) (0.996) (1.56)
Execution 2810 0.015 0.16 2.12 25.91 Auxiliary works 139 0.027 0.11 2.73 14.82
(1.035) (1.46) (1.065) (1.29)
Substructure 171 0.014 0.10 −0.21 8.37
(1.034) (1.26)
Superstructure 654 0.010 0.10 0.01 10.14
(1.023) (1.26)
Specialist 1272 0.014 0.21 2.12 18.98
works (1.034) (1.62)
Facilities 574 0.020 0.12 0.53 13.85
(1.046) (1.33)
Closure 35 0.041 0.16 2.01 6.60 Insufficient data sample
(1.098) (1.43)
Civil engineering 1092 0.057 0.30 1.78 6.28 Planning 38 0.322 0.43 0.99 −0.75 Insufficient data sample
(1.139) (2.01) (2.099) (2.71)
Execution 1034 0.048 0.30 1.77 6.87 Auxiliary works 207 0.059 0.32 2.89 11.31
(1.116) (1.98) (1.147) (2.08)
Substructure 229 0.057 0.30 0.63 2.26
(1.140) (1.98)
Superstructure 257 0.057 0.32 1.40 3.11
(1.141) (2.11)
Specialist 264 0.016 0.24 1.93 13.70

04019093-9
works (1.038) (1.74)
Facilities 77 0.067 0.31 2.03 7.66
(1.166) (2.04)
Closure 20 0.011 0.01 −0.95 −1.24 Insufficient data sample
(1.026) (1.02)

J. Constr. Eng. Manage., 2019, 146(1): 04019093


Industrial 170 −0.011 0.20 −2.05 11.12 Planning 10 0.02 0.05 0.74 0.71 Insufficient data sample
(0.975) (1.59) (1.046) (1.03)
Execution 154 −0.004 0.18 −1.65 12.65 Insufficient data sample
(0.99) (1.50)
Closure 6 −0.27 0.67 −0.12 −2.71 Insufficient data sample
(0.536) (4.63)
Services 1133 0.052 0.36 2.25 9.01 Planning 81 0.021 0.18 0.40 3.89 Insufficient data sample
(1.128) (2.29) (1.05) (1.53)
Execution 990 0.059 0.37 2.23 8.27 Insufficient data sample
(1.145) (2.37)
Closure 62 −0.007 0.29 1.28 13.16 Insufficient data sample
(0.985) (1.93)

J. Constr. Eng. Manage.


Table 8. Duration versus cost (log ratios) linear correlations to increase around 5% by default. These values are in line with the
Group of analysis n R R2 Slope Intercept log ratio cost averages found in Table 7.
Activity level (duration-cost correlations)
Level 0 All activities 5,289 0.55 0.30a 0.704a 0.024
Level 1 Building 2,894 0.46 0.21a 0.488a 0.013 Discussion
Civil engineering 1,092 0.33 0.11a 0.502a 0.061
Industrial 170 0.11 0.01 0.106 −0.010 So far, almost all analyses have focused on individual activities.
Services 1,133 0.79 0.62a 1.074 0.004 Yet, it is acknowledged that the construction process is not an exact
Level 2 Planning 178 0.38 0.15a 0.534 0.055 science and construction managers are often judged upon their
Execution 4,988 0.55 0.30a 0.706a 0.024 capability to manage activity variability. Hence, the key concern is
Closure 123 0.60 0.36a 0.534 0.055 the whole project suffering from delays and cost overruns, not just
Level3 Auxiliary works 349 0.34 0.12a 0.601b 0.037 some of its activities. It was proposed previously that this is because
Substructure 400 0.25 0.06a 0.343b 0.035
Downloaded from ascelibrary.org by University of Western Ontario on 11/17/19. Copyright ASCE. For personal use only; all rights reserved.

activities suffer from variability (both positive and negative), not


Superstructure 912 0.32 0.10a 0.289a −0.028
Specialized works 1,609 0.44 0.20a 0.566b 0.013
because they are delayed on average. This section is devoted to
Facilities 654 0.53 0.28a 0.522a 0.021 analyzing whether this speculation seems acceptable.
The problem will be approached from a graphical perspective
Project level (duration-cost correlations) first. For that purpose, a second data set of 746 road construction
— All projects 101 0.22 0.05b 0.156b 0.029 projects from the Florida Department of Transportation is used.
— Building 56 0.52 0.27 0.957 0.006
— Civil engineering 15 0.01 0.00 0.017 0.080
Given the number of contracts, no descriptive table is included
— Industrial 5 0.56 0.31 0.629 0.083 here, but the complete data set can be found as Supplemental Data.
— Services 25 0.23 0.05 0.039 0.014 This additional project data set has been used here because these
— Road projects (Fig. 1) 746 0.34 0.11 0.108 0.016 data represent relatively similar (homogeneous) contracts, from the
same client, and during a short period of time. Arguably, this is the
a
Snedecor’s F test (for R2 ) or Student’s T-test (for slopes) significant at
α < 0.001. closest to assuming that these projects are 746 different realizations
b
Snedecor’s F test (for R2 ) or student’s T-test (for slopes) significant at (possible outcomes) of the same generic type of project (in this case
α < 0.05. a road construction, that is, a civil engineering project).
Fig. 1 presents the distributions of the log deviation ratios for
durations and costs for the 746 contracts [using Eq. (1) and (2) at
Concerning the regression line intercepts (last column in project level, not activity level].
Table 8), as can be seen, these values remain above 0.02 (in log Concerning project duration deviations (curve with black
scale) most of the time. That is approximately equivalent to an in- open circles), it closely resembles an extreme value distribution of
tercept of 5% in natural scale, which means that, no matter whether maxima (both Fréchet and Gumbel fits have been provided for com-
activity duration extensions are materialized or not, costs are likely parison). This means that the merge event bias takes an important

LOG10 (Act./Plan. cost) Exponential fit Gamma fit


LOG10 (Act./Plan. Dur.) Frechet fit Gumbel fit

Fig. 1. Duration and cost overrun probability distribution of 746 road construction projects from the Florida Department of Transportation.

© ASCE 04019093-10 J. Constr. Eng. Manage.

J. Constr. Eng. Manage., 2019, 146(1): 04019093


role when determining the actual project duration. Results in natural of critical and noncritical activities in a network (Vanhoucke and
scale are, in this occasion, almost identical but they have not been Vandevoorde 2009). Therefore, rounded up values of the inverse of
provided to avoid curve cluttering. the SP (that is d1=SPe) provide an estimate of the minimum number
Furthermore, the average project duration extension is around of paths of a project schedule. Values of SP below 50% would mean
0.21 (in log scale). For civil engineering projects in Table 6, the that construction schedules have (approximately) at least three
average of the duration log ratio was negative (−0.008). This means paths. This agrees with the black, rather than gray, curve of Fig. 1.
the activities from civil engineering projects ended sooner than Industrial projects, despite having on average at least two paths, gen-
planned (on average). It is unlikely then, that the projects repre- erally have a dominant one (which condenses, on average, 55% of the
sented in Fig. 1 could have ended later because a significant pro- activities). In service project schedules, there are at least five paths (on
portion of their activities ended late. However, the activity duration average) and only 20% (a fifth) of the activities are critical.
variability (the standard deviation) was 0.20. In extreme value The AD measures the distribution of project activities along the
theory, the mean of the highest order statistic distribution of a levels of the project. In network topology, the number of project
normal distribution with three or four draws is approximately one
Downloaded from ascelibrary.org by University of Western Ontario on 11/17/19. Copyright ASCE. For personal use only; all rights reserved.

levels can be loosely defined as the number of activities that are


standard deviation. The normal distribution represents very well arrayed in parallel in a project schedule. Hence, AD measures the
the distribution of the durations of each path (before they merge) width of the network. However, activities arrayed in parallel do not
(Ballesteros-Pérez 2017a). Therefore, the average of the duration necessarily have be executed simultaneously (because they may have
distribution coincides very closely with what is to be expected from different time lags and/or activity durations). When AD ¼ 0, all levels
the data from Table 6 for civil engineering projects (0.21 ≈ 0.20). contain a similar number of activities and the number of activities is
Subsequently, it will be shown how more than three paths are quite uniformly distributed over all levels. When AD ¼ 100%, there is one
common in civil engineering construction schedules. level with a maximal number of activities, and all other levels contain a
Concerning project costs (curve with gray crosses), the situation single activity. All four types of projects average AD values are close
is very different. The distribution of costs (log scale) resembles a to 58%, indicating that the longest path has more activities than other
gamma distribution. When a random variable X follows a Pareto paths, but still those other paths contain a significant number of ac-
distribution with parameter λ, the logarithm of X follows an expo- tivities, that is, they can potentially cause project delays.
nential distribution with the same parameter λ. This is relevant be- The LA indicator measures the tightness of each precedence
cause the costs of individual activities are well known to resemble a relationship between two activities as the distance between two ac-
Pareto distribution in almost all construction projects (Love et al. tivities in the project network. When LA ¼ 0, the network has
2014; Love and Sing 2013). Hence, because cost ratios are being many precedence relationships between two activities on levels
processed here in log values, the distribution should also resemble far from each other such that the activity can be shifted further
an exponential distribution (continuous gray line in Fig. 1). Addi- in the network. When LA ¼ 100%, many precedence relationships
tionally, because the sum of exponential distributions is a gamma have a length of 1, resulting in activities with immediate successors
distribution, this would offer some explanation, to why a gamma on the next level of the network and with little freedom to shift.
distribution (dashed gray line) fits almost perfectly the log cost de- Average LA values are much closer to 0 than to 100% (overall aver-
viations in Fig. 1. In this case, the exponential distribution also pro- age of 14.1%). This means that activities tend to have many pred-
vides a good fit, but that is not always the case in other construction ecessors (on average) from different levels (paths), which would
project data sets. reinforce the merge event bias effect.
Having approached the problem from a graphical and statistical Finally, TF measures the degrees of freedom per activity as
perspective, it will be addressed now from a topological perspec- the amount of slack or float an activity has. When TF ¼ 0, the net-
tive. Network topology describes the layout of project schedules. work structure is 100% dense and no activities can be shifted within
The values of four representative topological indicators are dis- its structure. When TF ¼ 100%, the schedule consists of a single
played on the last four columns of Table 1 for the 101 projects an- chain of activities without topological float. The average TF indi-
alyzed. Table 9 now presents the average values of each topological cator value of 40.5% means that the average activity structure of
indicator listed in Table 1, but categorized by project type (building, construction projects is rather dense.
civil engineering, industrial, and services), as well as for all projects Therefore, the highlights of this brief topological analysis for
together (last row). construction projects are that construction schedules are relatively
The SP indicator is probably the most relevant of the four dense (activitywise), usually composed of at least three major paths,
indicators for the purpose of this study. This indicator measures and contain activities whose predecessors usually come not just
the closeness of a network to a serial or parallel network. Namely, from activities located on the same path but also from other paths.
SP ¼ ðm − 1Þ=ðn − 1Þ, where n is the total number of project ac- This means that the merge event bias plays a very important role in
tivities in a project schedule, and m is the number of activities in the construction schedules. Furthermore, precisely thanks to the high
path with a higher number of activities (which may not necessarily level of duration variability existing at the activity level, many de-
be the longest in duration because topological measures ignore lays are expected to cumulate every time two or more paths merge
the activity durations). Hence, SP ¼ 0 means all activities are in into a single successor.
parallel, whereas SP ¼ 100% means all activities are in series. However, mergers are much more frequent toward the end of the
This indicator can also be considered as an estimate of the amount project compared with in the earlier stages of execution. This is be-
cause, in order for any paths to close, they have to open first. There-
Table 9. Average network topological values by project type fore, it is not a surprise that many construction projects get off to
a good start (on time and on budget), but half way across their du-
Project type n SP (%) AD (%) LA (%) TF (%) ration, (local) delays start being detected (whenever two or more
Building 56 48.2 57.4 21.4 35.2 paths are merged into one). As delays emerge, the cost of activities
Civil engineering 15 44.7 59.3 0.5 44.7 will also increase proportionally because the correlation between
Industrial 5 55.0 65.6 2.8 55.0 duration deviations and cost deviations was quite substantial on
Services 25 20.1 57.6 8.3 20.1 average. As a result, it is not that surprising that projects end later
All 101 41.0 58.2 14.1 40.5
and cost more than initially anticipated.

© ASCE 04019093-11 J. Constr. Eng. Manage.

J. Constr. Eng. Manage., 2019, 146(1): 04019093


Conclusions ontologically questionable and certainly methodologically chal-
lenging. Companies are not open to share this information because
The activity duration and cost variability of construction projects it would clearly indicate how competent and efficient their opera-
has been analyzed in this research by different types of projects tions are. Under that perspective, the current sample size probably
(building, civil engineering, industrial, and services), project phase seems satisfactory, at least for a first representative analysis.
(planning, execution, and closure), and activity type (auxiliary A second limitation arises from having removed at the outset the
works, substructure, superstructure, specialized works, and facili- UBP and planned but unperformed (PBU) activities. This was nec-
ties). Correlation factors between activity duration deviations and essary because the ratios (either in natural or log scale) converged to
activity cost deviations have also been studied under the same ac- infinity, causing a distortion in the moments calculation. However,
tivity categories. The research is novel because it describes the first these activities can be found in almost all real projects. Frequently,
four moments (average, standard deviation, skewness, and kurtosis) they are the consequence of scope changes, works reorganization, or
of how actual versus planned durations and costs differ at the changes in the available resources. Obviously, UBP and PBU activ-
activity level in construction projects. A set of 101 projects and
Downloaded from ascelibrary.org by University of Western Ontario on 11/17/19. Copyright ASCE. For personal use only; all rights reserved.

ities add to the total project variability (beyond the activity duration
5,289 activities, plus another set with 746 projects, have been used. and cost variability analyzed here). In the present analysis, however,
The first contribution of this study is providing construction there were only 279 UBP þ 129 PBU ¼ 408 activities out of the
managers with a first, yet rather complete, set of actual-versus- initial 5,697 (7% in total). Hence, although the authors believe the
planned average activity durations and costs deviations with appli- influence of UBP and PBU activities needs to be duly investigated,
cation in multiple contexts (project types, execution phases, and the present analysis (with 93% of the activities) can still be consid-
types of activity). From now on, a construction manager will be ered representative enough to draw valid conclusions. Additionally,
able to more realistically (thus accurately) anticipate how likely it is also expected that some degree of cancellation will occur among
and how much the activities in the project schedule will vary, that those 7% of activities (because frequently, new activities replace
is, last or cost something different. This might potentially improve others that are not eventually performed).
the quality and robustness of all construction schedules, for exam- In the same vein, there are many potential future research con-
ple allowing them to feed more advanced (nondeterministic) sched- tinuations following this research. Again, this study might be ex-
uling and simulation tools with more representative data. These
tended to analyze other types of projects and/or other more specific
techniques generally need a substantial amount of information from
types of activities (maybe at trade-level: concrete, steel, asphalt,
previous similar projects, which is rarely available. With the set of
earthworks, for example). The network topologies for other types
moments provided here, these techniques will be able to resort to
of projects may also be studied to anticipate to what extent current
average values for their activity durations and cost distribution
levels of activity variability might impact their final schedules. The
parameters depending on the type and/or execution phase of the
statistical distribution of activity (duration and cost) variability may
project. These distributions will also be able to assume noninde-
also be analyzed. This was not possible at the general activity level,
pendence between the stochastically generated activity durations
as discussed in this paper, but it should be possible for activities at
and costs values (thanks to the set of duration-cost correlation
their trade level.
values also published in this study). This is expected to enhance
A last conclusion derived from this research is that activity du-
future construction project monitoring and control as well as actual
ration variability is the actual foe in project monitoring and control.
project duration and cost forecasting accuracy.
This may not sound new to Lean Construction researchers and
The analysis developed has also provided some interesting in-
practitioners. However, this research has provided compelling em-
sights from its numerical perspective. One of the most relevant is
pirical evidence suggesting that activity variability really must be
that it has been shown that construction activities do not end late on
average. Instead, their high level of variability (around 60% of its taken more seriously. There is a need to develop more techniques
average duration) is the key factor eventually causing project-level that can effectively handle/restrain this variability. Value stream
delays. Such high levels of activity variability exacerbate the merge mapping and last planner have been some attempts to address this
event bias, a phenomenon by which whenever two or more sched- problem, but more are needed. This will open the door to new and
ule paths converge into a single one, the average completion times more effective approaches for tackling the widespread phenomenon
exceed the maximum average path durations. of construction projects ending late.
Actual activity costs, on the other hand, do tend to be higher
than planned (around 7%). This cannot be the result of price adjust-
ments or inflation because hardly any project lasted longer than a Data Availability Statement
year. Instead, the major project-level cost overruns are expected to
occur as a consequence of delayed start of activities located nearer All data generated or analyzed during the study are included in the
the end of the project. It has been demonstrated how most duration- published article or Supplemental Data.
cost correlation factors range within 0.40 and 0.70. The latter would
mean that activities that cannot start until their predecessors have
finished would start to incur costs before their actual execution. Acknowledgments
Many other interpretations can arise from the numerical results
The first author acknowledges the Spanish Ministry of Science,
of the four moments describing activity duration and cost variabil-
ity that refer to specific types of projects, phases of execution, or Innovation and Universities for his Ramon y Cajal contract (RYC-
activity types that have not been recounted here. Tables 6 and 7 give 2017-22222) cofunded by the European Social Fund. This work
more information for such a purpose. was also supported by the second author’s “Estancias de movi-
A limitation of this study is mostly connected to the composition lidad en el extranjero José Castillejo para jóvenes doctores, 2017
and sample size of the construction projects analyzed. A total of (Grant Ref. CAS17/00488)” and the fourth author’s “Estancias de
101 projects have been used here with a varied composition. How- profesores e investigadores senior en centros extranjeros Salvador
ever, this sample size could have been bigger. It must be clarified, de Madariaga 2018 (Grant Ref. PRX18/00381),” both from the
however, that accessing actual duration and cost information is Spanish Ministry of Science, Innovation and Universities. The first

© ASCE 04019093-12 J. Constr. Eng. Manage.

J. Constr. Eng. Manage., 2019, 146(1): 04019093


and last authors also acknowledge the help received by the research George, D., and P. Mallery. 2010. SPSS for windows step-by-step: A simple
group TEP-955 from the PAIDI (Junta de Andalucía, Spain). guide and reference, 17.0 update. Boston: Pearson.
Hajdu, M., and O. Bokor. 2014. “The effects of different activity distribu-
tions on project duration in PERT networks.” Procedia Soc. Behav. Sci.
119: 766–775. https://doi.org/10.1016/j.sbspro.2014.03.086.
Supplemental Data Hajdu, M., and O. Bokor. 2016. “Sensitivity analysis in PERT networks:
Does activity duration distribution matter?” Autom. Constr. 65 (May):
The two complete data sets of 5,697 activities (from the 101 con-
1–8. https://doi.org/10.1016/j.autcon.2016.01.003.
struction projects) and the 746 road construction projects from the Hamzah, N., M. A. Khoiry, I. Arshad, N. M. Tawil, and A. I. Che Ani.
Florida Department of Transportation are available online in the 2011. “Cause of construction delay: Theoretical framework.” Procedia
ASCE Library (www.ascelibrary.org). Eng. 20 (Jan): 490–495. https://doi.org/10.1016/j.proeng.2011.11.192.
Keane, P. J., and A. F. Caletka. 2008. Delay analysis in construction con-
tracts. Oxford, UK: Wiley-Blackwell.
References Khamooshi, H., and D. F. Cioffi. 2013. “Uncertainty in task duration and
Downloaded from ascelibrary.org by University of Western Ontario on 11/17/19. Copyright ASCE. For personal use only; all rights reserved.

cost estimates: Fusion of probabilistic forecasts and deterministic


AlSehaimi, A., and L. Koskela. 2008. “What can be learned from studies on scheduling.” J. Constr. Eng. Manage. 139 (5): 488–497. https://doi.org
delay in construction?” In Proc., 16th Annual Conf. of the Int. Group /10.1061/(ASCE)CO.1943-7862.0000616.
for Lean Construction, 95–106. Salford, UK: Univ. of Salford. Love, P. E. D., R. Lopez, J. T. Kim, and M. J. Kim. 2014. “Probabilistic
Ballesteros-Pérez, P. 2017a. “M-PERT: Manual project-duration estimation assessment of design error costs.” J. Perform. Constr. Facil. 28 (3):
technique for teaching scheduling basics.” J. Constr. Eng. Manage. 518–527. https://doi.org/10.1061/(ASCE)CF.1943-5509.0000439.
143 (9): 04017063. https://doi.org/10.1061/(ASCE)CO.1943-7862 Love, P. E. D., and C.-P. Sing. 2013. “Determining the probability distri-
.0001358. bution of rework costs in construction and engineering projects.” Struct.
Ballesteros-Pérez, P. 2017b. “Modelling the boundaries of project fast- Infrastruct. Eng. 9 (11): 1136–1148. https://doi.org/10.1080/15732479
tracking.” Autom. Constr. 84 (Dec): 231–241. https://doi.org/10.1016/j .2012.667420.
.autcon.2017.09.006. MacCrimmon, K. R., and C. A. Ryavec. 1964. “An analytical study of the
Ballesteros-Pérez, P., M. L. Campo-Hitschfeld, D. Mora-Melià, and PERT assumptions.” Oper. Res. 12 (1): 16–37. https://doi.org/10.1287
D. Domínguez. 2015. “Modeling bidding competitiveness and posi- /opre.12.1.16.
tion performance in multi-attribute construction auctions.” Oper. Res. Mahamid, I., A. Bruland, and N. Dmaidi. 2012. “Causes of delay in road
Perspect. 2 (1): 24–35. https://doi.org/10.1016/j.orp.2015.02.001. construction projects.” J. Manage. Eng. 28 (3): 300–310. https://doi.org
Ballesteros-Pérez, P., G. D. Larsen, and M. C. González-Cruz. 2018a. “Do /10.1061/(ASCE)ME.1943-5479.0000096.
projects really end late? On the shortcomings of the classical scheduling Mastor, A. A. 1970. “An experimental investigation and comparative evalu-
techniques.” J. Technol. Sci. Educ. 8 (1): 86–102. https://doi.org/10 ation of production line balancing techniques.” Manage. Sci. 16 (11):
.3926/jotse.303. 728–746. https://doi.org/10.1287/mnsc.16.11.728.
Ballesteros-Pérez, P., Y. A. Rojas-Céspedes, W. Hughes, S. Kabiri, Ogunlana, S. O., K. Promkuntong, and V. Jearkjirm. 1996. “Construction
E. Pellicer, D. Mora-Melià, and M. L. del Campo-Hitschfeld. 2017. delays in a fast-growing economy: Comparing Thailand with other
“Weather-wise. A weather-aware planning tool for improving construc- economies.” Int. J. Project Manage. 14 (1): 37–45. https://doi.org/10
tion productivity and dealing with claims.” Autom. Constr. 84 (Dec): .1016/0263-7863(95)00052-6.
81–95. https://doi.org/10.1016/j.autcon.2017.08.022. Orangi, A., E. Palaneeswaran, and J. Wilson. 2011. “Exploring delays in
Ballesteros-Pérez, P., S. T. Smith, J. G. Lloyd-Papworth, and P. Cooke. Victoria-based Australian pipeline projects.” Procedia Eng. 14 (Jan):
2018b. “Incorporating the effect of weather in construction scheduling 874–881. https://doi.org/10.1016/j.proeng.2011.07.111.
and management with sine wave curves: Application in the United OR-AS.be. 2018. “Real project data.” Accessed April 23, 2019. http://www
Kingdom.” Constr. Manage. Econ. 36 (12): 666–682. https://doi.org/10 .projectmanagement.ugent.be/research/data/realdata.
.1080/01446193.2018.1478109. Pascoe, T. 1966. “Allocation of resources: CPM.” Rev. Française Recherche
Banerjee, A., and A. Paul. 2008. “On path correlation and PERT bias.” Eur. Opérationnelle 38: 31–38.
J. Oper. Res. 189 (3): 1208–1216. https://doi.org/10.1016/j.ejor.2007 Project Management Institute. 2017. Project management body of knowl-
.01.061. edge (PMBOK) guide. 6th ed. Newtown Square, PA: Project Manage-
Batselier, J., and M. Vanhoucke. 2015. “Construction and evaluation frame- ment Institute.
work for a real-life project database.” Int. J. Project Manage. 33 (3): Senouci, A., A. Ismail, and N. Eldin. 2016. “Time delay and cost overrun in
697–710. https://doi.org/10.1016/j.ijproman.2014.09.004. Qatari public construction projects.” Procedia Eng. 164 (Jan): 368–375.
Bein, W. W., J. Kamburowski, and M. F. M. Stallmann. 1992. “Optimal https://doi.org/10.1016/j.proeng.2016.11.632.
reduction of two-terminal directed acyclic graphs.” SIAM J. Comput. Trietsch, D., L. Mazmanyan, L. Gevorgyan, and K. R. Baker. 2012. “Mod-
21 (6): 1112–1129. https://doi.org/10.1137/0221065. eling activity times by the Parkinson distribution with a lognormal core:
Cho, S. 2009. “A linear Bayesian stochastic approximation to update Theory and validation.” Eur. J. Oper. Res. 216 (2): 386–396. https://doi
project duration estimates.” Eur. J. Oper. Res. 196 (2): 585–593. https:// .org/10.1016/j.ejor.2011.07.054.
doi.org/10.1016/j.ejor.2008.04.019. Valadares Tavares, L., J. Antunes Ferreira, and J. Silva Coelho. 1999. “The
Chudley, R., and R. Greeno. 2016. Building construction handbook. risk of delay of a project in terms of the morphology of its network.”
11th ed. Abingdon, UK: Routledge. Eur. J. Oper. Res. 119 (2): 510–537. https://doi.org/10.1016/S0377
Davies, E. M. 1973. “An experimental investigation of resource allocation -2217(99)00150-2.
in multiactivity projects.” J. Oper. Res. Soc. 24 (4): 587–591. https://doi Vanhoucke, M. 2008. “Measuring time using novel earned value manage-
.org/10.1057/jors.1973.107. ment metrics.” In Vol. 1 of Proc., 22nd IPMA World Congress, 99–103.
Dodin, B., and M. Sirvanci. 1990. “Stochastic networks and the extreme Nijkerk, Netherlands: International Project Management Association.
value distribution.” Comput. Oper. Res. 17 (4): 397–409. https://doi.org Vanhoucke, M. 2012. Project management with dynamic scheduling. Berlin:
/10.1016/0305-0548(90)90018-3. Springer.
Elmaghraby, S. E. 2000. “On criticality and sensitivity in activity networks.” Vanhoucke, M., J. Coelho, and J. Batselier. 2016. “An overview of project
Eur. J. Oper. Res. 127 (2): 220–238. https://doi.org/10.1016/S0377 data for integrated project management and control.” J. Mod. Project
-2217(99)00483-X. Manage. 3 (3): 6–21.
Elmaghraby, S. E., Y. Fathi, and M. R. Taner. 1999. “On the sensitivity of Vanhoucke, M., and S. Vandevoorde. 2009. “Forecasting a project’s dura-
project variability to activity mean duration.” Int. J. Prod. Econ. 62 (3): tion under various topological structures.” Measurable News, October
219–232. https://doi.org/10.1016/S0925-5273(98)00241-2. 29, 2012.

© ASCE 04019093-13 J. Constr. Eng. Manage.

J. Constr. Eng. Manage., 2019, 146(1): 04019093

You might also like