S1-ln5616706-1570301650-1939656818Hwf1824382083Id V-11449873945616706PDF HI0001
-
Upload
independent -
Category
Documents
-
view
0 -
download
0
Transcript of S1-ln5616706-1570301650-1939656818Hwf1824382083Id V-11449873945616706PDF HI0001
For Peer Review
DATA EXPLORATION ON STANDARD ASPHALT MIX
ANALYSES
Journal: Journal of Chemometrics
Manuscript ID: CEM-08-0052.R2
Wiley - Manuscript type: Research Article
Keyword: asphalt, asphalt mix design, counterpropagation, bitumen, PLS
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
For Peer Review
Data exploration on standard asphalt mix analyses
Page 1/17
DATA EXPLORATION ON STANDARD ASPHALT MIX ANALYSES
Marjan Tušar*, Marjana Novič
National Institute of Chemistry
Hajdrihova 19, P.O.B. 660
SI-1001 LJUBLJANA
SLOVENIA
E-mail: [email protected], [email protected]
Summary
The purpose of our study was the evaluation of the most important factors that
affect the volumetric and conventional mechanical properties of produced asphalt
mix and the volumetric properties of built-in asphalt layer. Asphalt mix design
follows the standard procedure (Marshall procedure). We were interested not only
in the quantity of bitumen specified by the Marshall procedure, but also in the
quantity of stone aggregate fractions, temperatures of production and properties of
bitumen that is used. The influence of these factors was investigated with several
models.
For the building of models we used 444 asphalt samples, analysed by one
laboratory. To select the most important factors, several Multiple Linear
Regression (MLR) models, Partial Least-Squares (PLS) regression models, and
counterpropagation neural network models were made. Obtained models were
tested with Leave-One-Out and leave-10%-out Cross-Validation procedures. The
results of MLR and PLS models show that the independent variables are closely
related. Among 21 variables there is only one found as less important. MLR and
PLS models show better predictive ability than counterpropagation neural network
Page 1 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Data exploration on standard asphalt mix analyses
Page 2/17
models. The best MLR models will be employed for the preparation of the asphalt
mix design (recipe) with some unknown material.
Key words: asphalt, bitumen, asphalt mix design, counterpropagation, PLS
1. Introduction
Asphalt is a complex mix of stone aggregate, bitumen and air. In production process
several fractions of stone aggregates must be prepared. Warmed and dried fractions are mixed
in a mixer with hot bitumen. The produced asphalt mix must be transported warm to the
construction site. With asphalt laying machine, hot asphalt mix is built-in and after that asphalt
layer is additionally compacted with compaction machines (rollers).
Built-in asphalt layer consists of bitumen, stone aggregates and air voids; only mastic
asphalt type is without voids. It can be seen in Figure 1a that the stone aggregate takes majority
of the volume in the asphalt layer. Bitumen takes 10% and air 6% of volume in built-in asphalt
layer on average. Depending on traffic, asphalt can be built-in one or more layers.
Properties of input materials, produced asphalt mix, and at built-in asphalt layers are
prescribed with standards and other technical regulations (technical approvals,
recommendations, demands of customer, etc.). Standards and other technical regulations
contain the limiting values defining the allowed range of every property that is why asphalt mix
design (recipe) must be within certain limits.
Asphalt mix design is determined with standard Marshall procedure in most European
countries.1 The purpose of Marshall method is to determine the optimum bitumen content for
particular blend of aggregate. According to this procedure, the first stone aggregate sieving
curve is produced from the stone aggregate fractions. Five different contents of bitumen are
added to predetermined stone aggregate. All five asphalt mixtures are compacted with Marshall
Page 2 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Data exploration on standard asphalt mix analyses
Page 3/17
compactor. On compacted specimen first volumetric properties are measured (quantity of voids
and voids in stone aggregate, filled with bitumen) and then Marshall stability and flow are
determined. Marshall stability is defined as the maximum load carried by a compacted
specimen tested at 600C at a loading rate of 50 mm per minute. The flow is defined in the same
time as the vertical deformation of the specimen while being loaded. It is determined as
deformation from the beginnig of loading to the maximum of load. With common regression
curves (most of them quadratic), the proper content of bitumen is determined to gain the
optimal quantity of voids, voids in stone aggregate, filled with bitumen, the optimal stability
and flow. At the described Marshall procedure we only have one independent variable, which is
the quantity of bitumen.
More advanced UPROJ procedure takes two independent variables into account: quantity
of bitumen and quantity of filler. 2 Filler is the smallest fraction of stone aggregate with grain
size up to 0.09 mm (in new European standards 0.063 mm). The upgrade of this procedure is
RAMPEIN procedure, which takes an additional independent variable, absorption of bitumen in
stone aggregate into account.3
SUPERPAVE procedure, developed in the nineties in the USA, is focused on the
performance tests of the bituminous binder. Ageing of bitumen has a high importance in this
procedure.4,5,6 SUPERPAVE volumetric asphalt mix design procedure is a performance
prediction model, based on the results of performance based experiments. Due to the fact that
the performance based experiments were not standardised in Europe before year 2008,
SUPERPAVE volumetric asphalt mix design can not be directly implemented to national
asphalt mix design specifications. European countries had their national standards for testing
the asphalt mix before year 2008. The new European standards allow laboratories to perform
two types of tests on asphalt mix: empirical or fundamental.
Old national standards for testing building materials, used in in most European countries,
were more oriented to quality control and were not directly connected to asphalt properties.
Page 3 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Data exploration on standard asphalt mix analyses
Page 4/17
New European fundamental standards for asphalt are oriented to performance and allow us to
directly predict the properties of asphalt layers from test results. It is an obligation of every EU
member to the use new specification standards from serial EN 13108 and material testing
standards from serial EN 12697 from year 2008 forward.
The problems of most European countries are that there is a lot of test data and
knowledge about the relation between the results of testing according to the old national
standards and performance of pavement, but only few data about the relation between the
results of testing according to the new fundamental European standards and performance of
pavement. In BiTVal project we tried to find all available data about relations between the
results of testing according to new European standards and performance of pavement. 7 But the
standards still do not contain anything about the procedure for preparing an optimal asphalt mix
design (recipe). All European test methods for the determination of input material properties,
asphalt mix design procedures and pavement design methods were validated.8 All of them
provided insufficient information.
Here we present the evaluation of influential variables that affect the volumetric and
conventional mechanical properties of produced asphalt mix and the volumetric properties of
built-in asphalt layer. The data driven models were built using 444 asphalt samples, analysed by
one laboratory.
The analyses in the study follow standards, which are almost identical to the empirical
tests. When produced asphalt mix is tested according to the empirical standards for testing the
asphalt mix, the whole composition of asphalt mix is determined. The target volumetric and
conventional mechanical properties of produced asphalt mix were measured and compared to
values, specified in the requirements. The information about the temperature of production and
type of input bitumen was also available. We tried to use all of the available information to
construct predictive models, based on MLR, PLS, and counterpropagation neural network
modelling methods in our work.
Page 4 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Data exploration on standard asphalt mix analyses
Page 5/17
2. Experimental
For a long time the tests according to the empirical standards were performed in the
asphalt laboratory. Asphalt mix has always been sampled from the construction sites and the
following tests were regularly done:
• determination of bitumen quantity from produced asphalt mix. Quantity of bitumen in
asphalt mix is from 2.9 weight percent to 7.weight percent. It is usually determined
with hot extraction of bitumen soluble in trichloroethylene from asphalt mix.
• determination of stone aggregate quantity on sieves 0.09 mm, 0.25 mm, 0.71 mm, 2
mm, 4 mm, 8 mm, 11.2 mm, 16 mm, 22.4 mm, 32 mm from produced asphalt mix
(Figure 1b). When bitumen is removed, the remaining dried stone aggregate is sieved
through the standardised sieves.
• measurement of density of asphalt mix (maximal density of asphalt mix) with
pyknometer. Weighted amount of asphalt mix is put in pyknometer and the rest of
volume is filled with de-aired water.
• determination of bulk density of asphalt mix, that is the density of specimen,
containing air voids; from maximal density and bulk density the air void content is
calculated. To determine the of bulk density from produced asphalt mix, cylindrical
specimens are compacted with impact compactor. Ratio between weight and volume
of these specimens is bulk density.
• determination of mechanical properties of cylindrical specimen. Cylindrical
specimens from produced asphalt mix are compacted with impact compactor. Stability
is determined by the force, needed to break the asphalt specimen, and the flow is
determined by the shift of press, needed to break the asphalt specimen (Figure 2).
Page 5 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Data exploration on standard asphalt mix analyses
Page 6/17
• determination of bulk density and air void content of cylindrical specimens of asphalt
layer which were drilled at asphalt mix sampling site and do not have uniform
thickness as specimens prepared in laboratory (Figure 3);
Asphalt analyses data were first separated to independent and dependent variables. It was
clear that the quantity of bitumen and sieving curve of stone aggregate are the independent
variables. Additional independent variables which are important for stone aggregates are the
quantity of rounded stone and the quantity of silica. Three tests are always done on the
bitumen, extracted from the asphalt: softening point, penetration at 250C, and low temperature
Fraass breaking point. We always know if the input bitumen contains additional polymer
(mostly styrene-butadiene-styrene) and the class of hardness. Class of bitumen hardness is
designated according to its penetration. Bitumen with 45 mm/10, 60 mm/10 and 90 mm/10
penetrations is most common. For additional independent variables we selected temperature of
asphalt at construction site, thickness of asphalt layer, temperature of asphalt when asphalt mix
is compacted in laboratory, and finally maximal density of asphalt mix. The dependent
variables represent the properties of asphalt mix we would like to predict, such as air void
content and mechanical properties. We end up with 21 independent variables and 6 dependent
variables given in Table I.
In the first training set there were 444 analyses of asphalt. Regarding to sieving curve
there were three types of asphalt mix. 108 analyses were made on Stone Mastic Asphalts
(SMA), which is used for wearing course on highways; 109 analyses were made on Asphalt
Concretes (AC), used for wearing course on other roads; and 227 analyses were made on
Asphalt Concretes (AC), used for binder and base course. Typical values (mean values) of
independent and dependent variables for all three types of asphalt mixes are collected in Table
I.
Another set with 38 analyses of asphalt consists of four analyses of porous asphalt (PA),
which contains enormous content of air voids, two analyses with special asphalts mixes, used
Page 6 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Data exploration on standard asphalt mix analyses
Page 7/17
only at special trials, and 32 analyses of mastic asphalt (MA) without air voids. Analyses of
MA do not contain data about stability and flow, due to experimental circumstances. These are
the missing values of the second data set, consequently only counterpropagation neural network
could be directly used for modelling. The advantage of the counterpropagation neural network
method is in its straightforward applicability for data with missing values without any
assumption about the distribution of data.
Analysed asphalt mixes were collected from 10 asphalt plants. They were produced
according to the 80 different mix designs. Eight different bitumen types and several different
types of stone were used in the analysed asphalts. Before further handling, all data were pre-
processed (standardized).
Our first intention was to find out which are the most important independent variables
influencing the model for prediction of selected dependent variables. The main goal was to
make a universal model for all types of asphalts. With such universal model we would be able
to determine asphalt mix design (recipe) also for new innovative types of asphalt.
We knew for the two hidden independent variables in advance that we were not able to
control. It is known from practice that the main hidden independent variables are the source of
bitumen and the use of baghouse fines as filler. It is hard to get information about the source of
bitumen from the asphalt plants. The second hidden variable, the baghouse fines are dust
particles that are captured from the exhaust gases of asphalt mixing plants. Secondary
collection equipment called baghouses is commonly used to capture these very fine sized
materials. However, it is not advisable to use it in asphalt mix due to unsuitable angularity of
grains. Producers of asphalt should dump their baghouse fines, but to avoid problems with
waste materials, they mix it in asphalt as filler. It is hard to detect such filler, so the producers
of asphalt often claim that they do not use it.
Page 7 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Data exploration on standard asphalt mix analyses
Page 8/17
3. Methods
3.1. Multiple Linear Regression
We started the search of important independent variables for the construction of Multiple
Linear Regression model. We began with the dependence of single dependent variables on
single independent variables (type of equation is y= x1b1+b0), than continued with linear
combinations of two independent variables (type of equation is y= x1b1+ x2b2+b0), linear
combinations of all independent variables (type of equation is y= x1b1+ x2b2+…+ x21b21+b0)
and ended with linear combinations of all independent variable , including mixed terms (type of
equation is y= x1b1+ x2b2+…+ x21b21+…+x1x2b1_2+…+x20x21b20_21+b0). We used the program
Matlab to perform MLR for all 6 dependent variables. We determined several equations for
each dependent variable: 21 linear equations for separate independent variables, 210
combinations of 2 independent variables and 210 combinations of 2 independent variables with
interaction term. Due to rapidly increasing number of combinations we skipped linear equations
with 3 or more independent variables. For all 6 dependent variables we calculated equations
containing 21 independent variables and all 210 interaction terms. The data was standardized
before we did any calculations.
3.2. Partial Least Squares
Our strategy was to precede with a more sophisticated modelling method i.e. Partial
Least Squares method PLS.9,10 PLS is used to find the fundamental relations between
independent and dependent variables. Variables are transformed to the latent space and PLS
model determines the multidimensional direction in the space of independent variables that
explains the maximum multidimensional variance direction in the space of independent
variables. Partial least squares method is particularly suited when the matrix of predictors has
more variables than observations. Standard regression (MLR) fails in these cases.
Page 8 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Data exploration on standard asphalt mix analyses
Page 9/17
We wanted to find the most important independent variables. Evaluation of model
improvement with increasing number of components was also done with PLS. All PLS
calculations data were standardized first. For some results we used Excel module for PLS, but
from the results, obtained with Excel module, we were not able to obtain VIP (Variable
Importance for the projection) parameters. 10 Program SIMCA-P+ 12.0 was mostly used to
perform PLS modelling. We explored only the PLS1 option.
3.3. Counterpropagation neural network
Counter-propagation neural network is based on two-steps learning procedure, which is
unsupervised in the first step11. The first step corresponds to the mapping of objects in the input
layer (also called Kohonen layer). This part is identical to the Kohonen learning procedure. The
second step of the learning is supervised, which means that the response or target value is
required for each input for the learning procedure. The network is thus trained with a set of
input-target pairs {Xs, Ts}, where Xs represents the samples (independent variables) and Ts
represents the dependent variable(s).
The training of the counter-propagation neural network means adjusting the weights of
the nodes in such a way that the network would respond with the output Outs identical to the
target Ts for each input sample Xs from the training set. Once the position (central or winning
node - neuron c) of the input vector is defined, the weights of the input and output layer of the
counter-propagation neural network are corrected accordingly. The corrections in the output
layer are defined as follows:
• the winning node: Out(t+1) = Out(t) + η (T – Out(t))
• the neighbourhood nodes: Out(t+1) = Out(t) + η N(t,r) (T – Out(t))
Again η is the learning rate and N(t,r) is a neighbourhood function, see the details above. The
training is repeated until the network stabilizes.
Page 9 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Data exploration on standard asphalt mix analyses
Page 10/17
Basically the counter-propagation neural network model acts as a pointer device; there
are as many predictions available as there are units in the output layer (equal to the number of
nodes (neurons)). The advantage of counterpropagation model is that it can deal with missing
values without any assumption about the distribution of the data. PLS can also be performed
with missing values, but it is assumed that data is normally distributed with zero mean.12,13
3.4. Model assessment
Data were standardized before all calculations. We made several Multiple Linear
Regression models, PLS and counterpropagation models. For each model the Modelling and the
Predictive abilities (r2 and r2loo) were calculated for evaluation. Predictive ability (r2
loo) shows
the Leave-One-Out (LOO) Cross-Validation prediction results. The K-fold Cross-Validation
was used for most promising MLR and counterpropagation models, because Leave-One-Out
Cross-Validation is the variant of Cross-Validation that is most susceptible to over-fitting. On
the basis of LOO results as most promising MLR model linear combination of all independent
variables (type of equation is y= x1b1+ x2b2+…+ x21b21+b0) was selected. Counterpropagation
model the net with dimension 12X12 and number of epochs 200 was selected as the most
promising. Leave-10-percent-out Cross-Validation data was implemented for these two models.
More general Monte Carlo Cross-Validation that has been recently introduced in chemometrics
seems to give better validation.14,15,16 Because training of counterpropagation models is time
consuming, only leave-10-percent-out Cross-Validation was used. Leave-20-percent-out Cross-
Validation was additionally used for PLS models.
With Kennard&Stone algorithm we selected 70% of data for training set and left other
30% for test set.17 With selected training set we made PLS model and 12 X 12
counterpropagation model, then we tested them with test set.
Page 10 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Data exploration on standard asphalt mix analyses
Page 11/17
4. Results and discussion
4.1. Multiple Linear Regression
We started the search of important independent variables for the construction of Multiple
Linear Regression model. We began with the dependence of single dependent variables on
single independent variables (type of equation is y= x1b1+b0). Modelling and Predictive
abilities (r2 and r2
loo) were calculated to evaluate the model. Predictive ability (r2loo) shows the
Leave-One-Out (LOO) Cross-Validation prediction results. Four independent variables with the
highest correlation coefficient for each dependent variable are listed in Table II. The best
correlation for this type of equation between the observed and the predicted values of bulk
density of cylindrical sample, prepared in laboratory (y1), is in Figure 4a.
We found the most important pairs of independent variables with the linear combinations
of two independent variables (type of equation is y= x1b1+ x2b2+b0). Pairs of independent
variables with the highest correlation coefficient for each dependent variable are listed in Table
II. The best correlation for this type of equation between the observed and the predicted values
of bulk density of cylindrical sample, prepared in laboratory (y1), is in Figure 4b.
We made linear combinations of all independent variables (type of equation is y= x1b1+
x2b2+…+ x21b21+b0) and linear combinations of all independent variables, including mixed
terms (type of equation is y= x1b1+ x2b2+…+ x21b21+…+x1x2b1_2+…+x20x21b20_21+b0). The
Modelling and Predictive abilities are presented in Table III. The results of LOO Cross-
Validation were better for linear combinations of all independent variables, so leave-10-
percent-out Cross-Validation were implemented only on linear combinations of all independent
variables. Results of Leave-10-percent-out Cross-Validation in Table IV are similar to LOO
results. We concluded that the MLR model with linear combinations of all independent
variables is the best of MLR models tested in this study.
Page 11 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Data exploration on standard asphalt mix analyses
Page 12/17
That was the upper limit of our trial with linear equations, due to the fact that the number
of terms of polynomials is becoming too big. We assume that with the increasing complexity
the model would have a tendency of overfitting.
All Modelling and Predictive abilities (r2 and r2loo) are presented in Table III. Leave-10-
percent-out Cross-Validation results are presented in Table IV.
4.2. Partial Least Squares
In Figure 5.a there are VIP (Variable Importance for the projection) parameters for bulk
density of cylindrical sample, prepared in laboratory. It can be seen from Figure 5.a that
independent variable x21 (the maximal density of asphalt mix) was more important than others.
Variable x6 (quantity of stone aggregate passing 8 mm sieve) is the second most important with
VIP value more than 1. The only less important variable is x10 (quantity of rounded stone
aggregate), importance of all other variables is in the grey zone with VIP values between 0.5
and 1. For the other dependent variables the maximal density of asphalt mix was not so
important, but x10 was almost always unimportant (Table II).
We can see the increasing modelling ability of PLS models with increasing number of
components (r2) in Figure 5.b. With only 6 components we can see that the model fits the
experimental data enough. A similar situation is found in the correlations, regarding all other
dependent variables. It can also be seen from Figure 5.b that modelling ability of PLS models
(r2train) for training set containing 70% of data set, selected with Kennard&Stone algorithm, was
similar to r2. Also, the values for test set were nearly as good as for modelling ability (r2test).
We can see the difference between the increasing modelling ability and the predictive
ability of PLS models with the increasing number of components in Figure 6. Modelling ability
(r2) is compared to the Leave-One-Out (r2loo), leave-10%-out Cross-Validation (r2
l10%o), leave-
20%-out Cross-Validation (r2l20%o) and Cross-Validation parameter obtained from SIMCA
Page 12 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Data exploration on standard asphalt mix analyses
Page 13/17
program (q2). All three r squares seem parallel from 10 to 21 components, but the leave-20%-
out Cross-Validation has a slight maximum at 15 components. Cross-Validation q2, obtained
from SIMCA, has a clear maximum at 5 components.
PLS was also applied to the data set, containing independent variables with mixed terms.
The independent variables with mixed terms can be seen from Figure 7.b, where the bulk
density of cylindrical sample, prepared in laboratory, is showing the increasing modelling
ability of PLS models with increasing number of components, but the Root Mean Square of
errors (RMS) is increasing after 70 components. Cross-Validation q2, obtained from SIMCA,
has clear maximum at 7 components.
4.3. Counterpropagation
We trained the counterpropagation neural network with two data sets: with a set of 444
samples and additionally with extended data set of 444+38 samples. Several
counterpropagation networks were tested, differing in the dimension of the Kohonen map. We
first trained neural networks (NN) with 100 epochs. With increasing dimension the fit of the
model was increasing t, but the Cross-Validation showed (Supplementary Material -Table S1)
that the 10X10 NN dimension was optimal (Regression plot is in Figure 4).
Later we trained NN with 200 epochs. At 200 epochs the Cross-Validation showed
(Supplementary Material -Table S1) the best values for the 12X12 NN dimension. It can be
concluded that the NN learning parameters are correlated and there is no simple rule for
determination of the optimal NN learning parameters. The training and test set, obtained with
Kennard Stone, were used to validate 12X12 network (Supplementary Material -Table S1). We
did not get any conflict node regarding the type of asphalt mix with the test set. There was a
clear separation between the 3 different types of asphalt mix in the topmap. We selected the NN
dimension 12X12 for further modelling
Page 13 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Data exploration on standard asphalt mix analyses
Page 14/17
With the NN dimension 20X20, the Cross-Validation showed slight improvement at 300
epochs, compared to the selected NN architecture, but there was no improvement in the
separation between the 3 different types of asphalt mix.
We can see from Figure 4, Tables III, IV and V and Supplementary Material (Table S1)
that the correlation coefficients (r2) for counterpropagation NN are bigger than those for MLR
models, but the Cross-Validation (r2loo and r
2l10%o) shows, that the MLR models are better. With
the counterpropagation NN we can visualize the position of classes of asphalt. The final topmap
of the counterpropagation model (dimension 12X12, after 200 epochs) can be seen in Figure 8.
Stone Mastic Asphalts (SMA) are labelled with number 1, Asphalt Concretes (AC) used for
wearing course are labelled with number 2 and Asphalt Concretes (AC) used for binder and
base course with number 3. The only conflicting nodes are in the first column, row 4 and 5.
We also made a counterpropagation model for classification purposes with asphalt
analyses, without removing the samples with missing data. In Figure 9 we can see that the
Mastic asphalts (MA), labelled with number 4, and the Porous asphalts (PA), labelled with
number 6, are clearly separated from the other asphalts. Special purpose asphalts, labelled with
number 1, are similar to the Stone Mastic Asphalts and they are classified between them in the
counterpropagation model.
5. Conclusion
We implemented a pure chemometrics approach on the extensive asphalt data collected
during the past ten years. To our best knowledge, such a research has not yet been published.
The study was performed without any predetermined assumptions, commonly considered in
previous studies. We were able to collect some information, usually not contained in the test
reports (i.e. temperature of asphalt on the building site, quality of recovered bitumen,
Page 14 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Data exploration on standard asphalt mix analyses
Page 15/17
percentage of rounded stone fractions). Exploration of these additional data enabled us to
extract useful information about the factors, influencing the quality of asphalt layers.
The results of MLR and PLS models show that all independent variables are important.
The lowest importance was ascribed to the quantity of rounded stone aggregate.
The best modelling and predictive abilities were obtained with MLR model, containing
the linear combinations of all independent variables. Counterpropagation models have good
modelling abilities, but predictive abilities are lower then those, obtained with MLR.
Nevertheless, there are two advantages of counterpropagation models. First, it can be used also
for classification and visualization of different types of asphalt, and second, the model can cope
with the missing data without any statistical pre-treatment.
The MLR models will be useful for the preparation of asphalt mix design (recipe) with
some unknown material. It may also be employed to correct mix design if some property of
input material is changed during production process, as for example the change of bitumen type
or sieving curve.
6. Acknowledgment
The authors thank of the Ministry of Higher Education, Science and Technology of the
Republic of Slovenia for the financial support through the Project P1-017: Modeling of
relationship between chemical structure and properties - QSAR –QSPR.
Page 15 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Data exploration on standard asphalt mix analyses
Page 16/17
7. References
1. Lavin P.; Asphalt Pavements: A Practical Guide to Design, Production and
Maintenance for Engineers and Architects, Taylor & Francis, 2003, ISBN
0415247330.
2. Ramljak Z. Bitumen Content Calculation in Designed Asphalt Mixture, proceedings
of 2nd Euroasphalt $Eurobitume Congress, Barcelona, 2000, 704.
3. Ramljak Z. Funkcionalna ovisnost optimalnog udjela bitumena u asfaltnoj mješavini
o uvjetima projektiranja te o gustoči ingredijenta, Zbornik referatov 7. kolokvija o
bitumnih, Gozd Martuljek, 2002, 44-54.
4. Cominsky R J, Huber G A, Kennedy T W, Anderson M. The Superpave mix design
manual for new construction and overlay, SHRP-A-407, National Research Council
Washington, DC, 1994.
5. Anderson R M. Superpave level 1 mixture design example, A first look at volumetric
mix design in Superpave system, Asphalt Institute Research Centre, Lexington, KY,
1993.
6. Kandhal P S, Cooley L A Jr. NCHRP report 464; The Restricted Zone in the
Superpave Aggregate Gradation Specification, National Academy Press,
Washington DC, 2001
7. Bitval final report (http://bitval.fehrl.org/index.php?id=14&no_cache=1)
8. COST 333 Final Report, 1999, 375 pp. - 17.5 x 25.0 cm, ISBN 92-828-6796-X -
EUR 18906
Page 16 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Data exploration on standard asphalt mix analyses
Page 17/17
9. Wold S, Martens S, Wold H. The multivariate calibration problem in chemistry
solved by the PLS method. In: Ruhe A, Kagstrom B(Eds.), Lecture Notes in
Mathematics. Proceedings of the Conference on Matrix Pencils, 1983, 286-293.
10. Brereton R G. Chemometrics: Data Analysis for Laboratory and Chemical Plant,
Wiley, 2003, ISBN 0-471-48978-6
11. Hecht-Nielsen R. Counterpropagation networks. Appl. Opt. 1987; 26: 4979-4984.
12. Zupan J, Novič M, Ruisanchez I. Kohonen and counter-propagation artificial neural
networks in analytical chemistry. Chemom. Intell. Lab. Syst. 1997; 38: 1-23
13. Nelson PRC, MacGregor JF, Taylor PA. The impact of missing measurements on
PCA and PLS prediction and monitoring applications. Chemometrics Intell. Lab.
Syst. 2006; 80: 1-12
14. Xu QS, Liang YZ. Monte Carlo cross validation. Chemometrics Intell. Lab. Syst.
2001; 56: 1-11.
15. Gourvénec S, Fernández Pierna JA, Massart DL, Rutledge DN. An evaluation of the
PoLiSh smoothed regression and the Monte Carlo cross-validation for the
determination of the complexity of a PLS model. Chemometrics Intell. Lab. Syst.
2003; 68: 41-51.
16. Xu QS, Liang YZ, Du YP. Monte Carlo cross-validation for selecting a model and
estimating the prediction error in multivariate calibration. J. Chemometrics 2004;
18: 112-120.
17. Kennard RW, Stone LA. Computer Aided Desing of Experiments. Technometrics,
1969, 11, 137–148.
Page 17 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Page 1/6
Tables:
Table I: Mean values for independent and dependent variables for three types of asphalt
mixes
Abbreviation Independent variables Stone
mastic
asphalt
SMA
Asphalt
concrete
(AC) for
wearing
course
Asphalt
concrete
(AC) for
binder and
base
course
x1 quantity of stone aggregate passing
0.09 mm sieve (range from 4.6 wt %
to 13.2 wt %), 10.8 9.9 6.9
x2 quantity of stone aggregate passing
0.25 mm sieve (range from 6.0 wt %
to 18.0 wt %), 13.0 14.2 9.9
x3 quantity of stone aggregate passing
0.71 mm sieve (range from 10.4 wt %
to 29.4 wt %) 16.3 23.1 15.6
x4 quantity of stone aggregate passing 2
mm sieve (range from 19.1 wt % to
58.9 wt %), 25.6 45.5 29.3
x5 quantity of stone aggregate passing 4
mm sieve (range from 25.2 wt % to
73.1 wt %), 35.0 61.7 40.7
x6 quantity of stone aggregate passing 8
mm sieve (range from 40.4 wt % to
99.8 wt %), 86.6 89.8 57.8
x7 quantity of stone aggregate passing
11.2 mm sieve (range from 47.6 wt %
to 100 wt %), 99.3 99.4 68.9
x8 quantity of stone aggregate passing
16 mm sieve (range from 54.5 wt %
to 100 wt %), 100 100 80.9
x9 quantity of stone aggregate passing
22.4 mm sieve (range from 73.9 wt %
to 100 wt %), 100 100 93.6
x10 quantity of rounded stone aggregate
(range from 0.0 wt % to 19.9 wt %), 0.000 0.005 0.643
x11 quantity of silica type stone aggregate
(range from 0.0 wt % to 88.7 wt %), 76.49 43.78 0.65
x12 quantity of bitumen (range from 2.9
wt % to 7.5 wt %), 6.3 5.5 3.8
x13 penetration of bitumen extracted from
asphalt mix (range from 18 mm/10 to
68 mm/10), 35.2 39.3 30.8
x14 softening point of bitumen extracted
from asphalt mix (range from 47.5 0C
to 860C), 70.4 59.0 63.2
Page 18 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Page 2/6
Abbreviation Independent variables Stone
mastic
asphalt
SMA
Asphalt
concrete
(AC) for
wearing
course
Asphalt
concrete
(AC) for
binder and
base
course
x15 Fraass breaking point of bitumen
extracted from asphalt mix (range
from -23 0C to -2.3
0C ), -13.3 -12.0 -10.3
x16 temperature of asphalt at construction
site (range from 130 0C to 195
0C), 171 162 163
x17 type of input bitumen (with or
without polymer) (for Polymer
modified and 0 for neat bitumen), 0.963 0.046 0.004
x18 hardness of input bitumen (according
to penetration) (1 soft, 2 middle, 3
hard), 1.6 1.3 2.1
x19 temperature of asphalt when asphalt
mix is compacted in laboratory(range
from 138 0C to 180
0C), 170 154 158
x20 thickness of asphalt layer(range from
1.65 cm. to 13 cm), 3.3 3.8 7.5
x21 maximal density of asphalt mix
(range from 2.355 g/cm3 to 2.707
g/cm3), 2.558 2.550 2.570
Abbreviation Selected dependent variables are: Stone
mastic
asphalt
SMA
Asphalt
concrete
(AC) for
wearing
course
Asphalt
concrete
(AC) for
binder and
base
course
y1 bulk density of cylindrical sample
prepared in laboratory (range from
2.204 g/cm3 to 2.556 g/cm
3), 2.446 2.444 2.404
y2 air void content of cylindrical sample
prepared in laboratory (range from
1.2 % (v/v) to 10.5 % (v/v)), 4.4 4.1 6.4
y3 bulk density of sample drilled from
pavement (range from 2.204 g/cm3 to
2.574 g/cm3), 2.439 2.406 2.407
y4 air void content of sample drilled
from pavement (range from 2.4 %
(v/v) to 8.7 % (v/v)), 4.65 5.626 6.367
y5 stability of cylindrical sample
prepared in laboratory (range from
4.2 kN to 15.3 kN) 9.16 9.86 10.66
y6 flow of cylindrical sample prepared
in laboratory(range from 2.4 mm to
8.7 mm) 5.4 4.5 4.1
Page 19 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Page 3/6
Table II: The most important independent variables, obtained with MLR and PLS
Type of model y1 y2 y3 y4 y5 y6
MLR 4 best x
(y=x1b1+b0)
x21, x8, x7,
x1
x2, x1, x8,
x12
x21, x13, x2,
x11
x12, x1, x11,
x7
x13, x12, x15,
x11
x17, x11, x12,
x1
MLR best pair
(y=x1b1+ x2b2+b0)
x21, x2 x2, x13 x21, x12 x12, x6 x13, x5 x17, x2
PLS VIP value over 1 x21, x6 x2, x1, x6,
x3, x8, x12,
x13, x20, x9
x21, x17, x19,
x11, x14
x12, x13, x6,
x17, x20, x2,
x11, x15, x1,
x21
x13, x15, x12,
x3, x2, x5,
x11
x17, x11, x12,
x1, x7, x20,
x8, x19, x14,
x6, x2
PLS VIP value less
than 0.5
x10 x19, x16 x18, x15, x10 x10 x21, x10
Page 20 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Page 4/6
Table III: Correlation coefficients for fit of the model (r2) and for leave-one-out cross-
validation (r2loo) for all dependent variables
Type of linear equation bulk
density of
cylindrical
sample
prepared in
laboratory
air void
content of
cylindrical
sample
prepared in
laboratory
bulk
density of
cylindrical
sample
from
pavement
air void
content of
cylindrical
sample
from
pavement
stability of
cylindrical
sample
prepared in
laboratory
flow of
cylindrical
sample
prepared in
laboratory
The best:
y= x1b1+b0
r2
r2
loo
0.303
0.297
0.558
0.554
0.390
0.385
0.176
0.168
0.182
0.179
0.317
0.317
The best:
y= x1b1+ x2b2+b0
r2
r2
loo
0.661
0.656
0.593
0.589
0.489
0.482
0.248
0.238
0.247
0.237
0.360
0.349
y= x1b1+ x2b2+…+
x21b21+b0
r2
r2
loo
0. 813
0.793
0. 760
0.734
0. 607
0.563
0. 365
0.297
0. 449
0.393
0. 414
0.352
y= x1b1+ x2b2+…+
x21b21+…+x1x2b1_2+…
+x20x21b20_21+b0
r2
r2
loo
0.933
0,239
0. 913
0,197
0. 827
0,206
0. 722
0,089
0. 781
0,107
0. 755
0,112
Page 21 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Page 5/6
Table IV: Correlation coefficients for fit of the model (r2) and for leave-10%-out-cross-
validation (r2l10%o) for all dependent variables (Type of linear equation: y= x1b1+ x2b2+…+
x21b21+b0)
Trial
bulk density
of
cylindrical
sample
prepared in
laboratory
air void
content of
cylindrical
sample
prepared in
laboratory
bulk density
of
cylindrical
sample
from
pavement
air void
content of
cylindrical
sample
from
pavement
stability of
cylindrical
sample
prepared in
laboratory
flow of
cylindrical
sample
prepared in
laboratory
1 r2
r2l10%o
0,821
0,772 0,762 0,749
0,607 0,660
0,371 0,423
0,440 0,506
0,420 0,341
2 r2
r2l10%o
0,824 0,674
0,773 0,622
0,630 0,229
0,401 0,030
0,459 0,324
0,430 0,222
3 r2
r2l10%o
0,805
0,869
0,754
0,786
0,597
0,664
0,362
0,362
0,476
0,257
0,429
0,203
4 r2
r2l10%o
0,818
0,747
0,765
0,675
0,615
0,483
0,357
0,387
0,445
0,459
0,415
0,362
5 r2
r2l10%o
0,809
0,828
0,759
0,712
0,612
0,551
0,371
0,295
0,446
0,478
0,410
0,382
6 r2
r2l10%o
0,810
0,829
0,762
0,735
0,600
0,632
0,372
0,271
0,446
0,429
0,419
0,351
7 r2
r2l10%o
0,813
0,798
0,761
0,734
0,609
0,566
0,377
0,201
0,462
0,280
0,427
0,276
8 r2
r2l10%o
0,819
0,733
0,757
0,799
0,605
0,589
0,350
0,467
0,471
0,141
0,402
0,524
9 r2
r2l10%o
0,810
0,827
0,760
0,749
0,601
0,631
0,377
0,220
0,430
0,574
0,414
0,386
10 r2
r2l10%o
0,813
0,808
0,762
0,738
0,621
0,431
0,367
0,298
0,439
0,533
0,408
0,444
average r2
r2l10%o
0,814
0,788
0,761
0,730
0,610
0,544
0,370
0,295
0,452
0,398
0,417
0,349
Page 22 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Page 6/6
Table V: Correlation coefficients for fit of the model (r2) and for leave-10%-out cross-
validation (r2l10%o) for all dependent variables (Number of epochs: 200; Dimension of NN: 12X12)
Trial
bulk density
of
cylindrical
sample
prepared in
laboratory
air void
content of
cylindrical
sample
prepared in
laboratory
bulk density
of
cylindrical
sample
from
pavement
air void
content of
cylindrical
sample
from
pavement
stability of
cylindrical
sample
prepared in
laboratory
flow of
cylindrical
sample
prepared in
laboratory
1 r2
r2l10%o
0.811
0.539
0.802
0.655
0.673
0.439
0.550
0.383
0.604
0.349
0.648
0.217
2 r2
r2l10%o
0.786
0.295
0.789
0.356
0.637
0.085
0.514
0.009
0.565
0.238
0.622
0.375
3 r2
r2l10%o
0.754
0.663
0.779
0.736
0.638
0.420
0.519
0.158
0.639
0.504
0.655
0.084
4 r2
r2l10%o
0.799
0.627
0.799
0.572
0.647
0.444
0.479
0.438
0.596
0.238
0.652
0.353
5 r2
r2l10%o
0.802
0.720
0.791
0.561
0.644
0.551
0.496
0.207
0.609
0.330
0.619
0.678
6 r2
r2l10%o
0.773
0.621
0.774
0.609
0.663
0.430
0.529
0.130
0.566
0.19495
0.703
0.172
7 r2
r2l10%o
0.801
0.603
0.782
0.608
0.666
0.463
0.516
0.143
0.640
0.298
0.669
0.383
8 r2
r2l10%o
0.831
0.650
0.774
0.609
0.702
0.375
0.550
0.245
0.649
0.042
0.636
0.545
9 r2
r2l10%o
0.808
0.631
0.792
0.645
0.653
0.414
0.498
0.167
0.612
0.155
0.632
0.455
10 r2
r2l10%o
0.777
0.657
0.804
0.611
0.678
0.320
0.550
0.253
0.612
0.369
0.625
0.408
average r2
r2l10%o
0.794
0.601
0.789
0.596
0.660
0.394
0.520
0.213
0.609
0.272
0.646
0.367
Page 23 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Page 1/2
Figure captions:
Figure 1: Cut through built in binder asphalt layer (a);
The lines represent the upper and lower limits of sieving curve for stone aggregate in asphalt
concrete with maximal grain size 22 mm (b) prescribed by the national standard;
Figure 2: Mechanical testing of cylindrical asphalt specimen
Figure 3: Cylindrical samples of asphalt layers from the Ring around Ljubljana (b);
Figure 4: Regression plots of the experimental versus predicted Bulk density for three
linear models (a-c) and counter-propagation neural model (d). Statistical parameters of
training (r2, Root Mean Square-RMStraining) and LOO validation of the models (r
2loo, Root
Mean Square-RMStesting) are presented.
Figure 5: VIP (Variable Importance for the projection) parameters for bulk density of the
cylindrical sample, prepared in a laboratory. (a);
Fit of PLS model with increasing number of component (independent variables) for bulk
density of cylindrical sample, prepared in a laboratory (b). Modelling ability (r2) is increasing
and RMS is decreasing with increasing number of components. Modelling ability of PLS
models (r2train) for training set, containing 70% of data set, selected with Kennard&Stone
algorithm, is similar and values for test set are nearly as good as for modelling ability (r2
test).
Page 24 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
Page 2/2
Figure 6: Fit of PLS model with increasing number of component (independent variables) for
bulk density of cylindrical sample, prepared in a laboratory. Modelling ability (r2) is compared
to leave one out (r2loo), leave-10%-out cross-validation (r
2l10%o), leave-20%-out cross -validation
(r2
l20%o) and cross–validation parameter (q2), obtained from SIMCA program.
Figure 7: Regression coefficients of independent variables and mixed terms for bulk density of
the cylindrical sample, prepared in a laboratory (a);
Fit of PLS model with increasing number of component (independent variables and mixed
terms) for bulk density of cylindrical sample prepared in laboratory (b). Modelling ability (r2) is
increasing with increasing number of components, but RMS is decreasing after 70 components.
Figure 8: Final topmap of counterpropagation model for 444 asphalts. Stone Mastic Asphalts
are designated with number 1, Asphalt Concretes, used for wearing course, are designated with
number 2 and Asphalt Concretes, used for binder and base course, with number 3.
Figure 9: Final topmap of counterpropagation model for 482 asphalts. Stone Mastic Asphalts
are designated with number 1, Asphalt Concretes, used for wearing course, are designated with
number 2 and Asphalt Concretes, used for binder and base course, with number 3, Mastic
asphalts with number 4, special purpose asphalt with number 5 and Porous asphalts with
number 6.
Page 25 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
146x142mm (111 x 111 DPI)
Page 26 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
185x138mm (96 x 96 DPI)
Page 27 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
216x197mm (96 x 96 DPI)
Page 28 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
271x242mm (96 x 96 DPI)
Page 29 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
216x241mm (96 x 96 DPI)
Page 30 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
216x197mm (96 x 96 DPI)
Page 31 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
216x241mm (96 x 96 DPI)
Page 32 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960
For Peer Review
187x177mm (96 x 96 DPI)
Page 33 of 34
http://mc.manuscriptcentral.com/cem
Journal of Chemometrics
123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960