Mechanical Systems and Signal Processing: Francesco Di Maio, Kwok Leung Tsui, Enrico Zio
Mechanical Systems and Signal Processing: Francesco Di Maio, Kwok Leung Tsui, Enrico Zio
a r t i c l e i n f o abstract
Article history: In this paper we present a new procedure for estimating the bearing Residual Useful
Received 12 July 2011 Life (RUL) by combining data-driven and model-based techniques. Respectively, we
Received in revised form resort to (i) Relevance Vector Machines (RVMs) for selecting a low number of significant
24 February 2012
basis functions, called Relevant Vectors (RVs), and (ii) exponential regression to
Accepted 17 March 2012
Available online 11 April 2012
compute and continuously update residual life estimations. The combination of these
techniques is developed with reference to partially degraded thrust ball bearings and
Keywords: tested on real world vibration-based degradation data. On the case study considered,
Prognostics the proposed procedure outperforms other model-based methods, with the added value
Residual Useful Life
of an adequate representation of the uncertainty associated to the estimates of the
Relevance Vector Machines
quantification of the credibility of the results by the Prognostic Horizon (PH) metric.
Exponential regression
Bayesian techniques & 2012 Elsevier Ltd. All rights reserved.
1. Introduction
In this paper, we consider the prediction of the Residual Useful Life (RUL) of degrading thrust bearings. This is a matter
of significance in many production and manufacturing industries where rotating machinery plays a relevant role and
thrust bearings are fundamental components in support of its operation. Maintenance of these components is currently
performed under periodic schemes of inspections, which are not capable of giving due account to their actual health status.
On the other hand, experience tells us that thrust bearing failures without warning can result in even catastrophic
consequences [26,45]. This has motivated a movement, in recent years, towards approaches for monitoring and predicting
the trend of degradation of rotating machinery components within a philosophy of Condition-Based Maintenance (CBM)
[52,31,32]. CBM attempts at monitoring the machinery health status based on measurements taken without interrupting
machine operation, e.g., vibration-based signals, to recognize early that components are degrading and anticipate their life
evolution to provide maintenance engineers with the information on the components RUL, which is indeed the time
available to schedule maintenance, repair and replacement actions, before failure [25].
Great hopes are placed on CBM and RUL prediction, for the affordable and safe operation of industrial systems.
For sake of example, Fig. 1 illustrates a vibration-based degradation signal relative to three bearings tested in
[19,20,21]: the vibration is small and constant when the bearing is operating properly; then, a damage initiation occurs
first, followed by a propagation phase. Finally, when the total damage reaches a critical value, failure occurs [44].
n
Corresponding author.
E-mail addresses: [email protected] (F. Di Maio), [email protected], [email protected] (E. Zio).
0888-3270/$ - see front matter & 2012 Elsevier Ltd. All rights reserved.
http://dx.doi.org/10.1016/j.ymssp.2012.03.011
406 F. Di Maio et al. / Mechanical Systems and Signal Processing 31 (2012) 405–427
30 30
0 100 200 300 400 500 600 700 0 100 200 300 400 500 600 700 0 100 200 300 400 500 600 700
Time [min] Time [min] Time [min]
Physically speaking, the degradation of the considered bearings begins with subsurface minute cracks at the outer
raceway [21]. Then, the cracks propagate towards the surface and dislodge pieces of the raceway material to form small pits,
known as spalls. Spalls formation increases friction during rotation which causes increased vibrations and temperature [21].
When the vibration reaches a predefined threshold the bearing is considered to have failed (30 mVRMS in Fig. 1).
Different degradation rates during the propagation phase are due to different causes and working conditions [44]: an almost
linear relationship between vibration and time (Fig. 1, left) may be caused by normal bearing fatigue; a non linear relationship
such as in Fig. 1 (center) may be due to poor lubrification conditions and installation errors, whereas a non linear relationship
such as in Fig. 1 (right) may be caused by multiple competing degradation processes under different working conditions. A
severity measure independent of the working conditions, would be needed for diagnosing the cause of the bearing failure
[43,44,24]. This aspect of the problem is out of the scope of this work; for additional information, the interested reader may
consult [30,44,24,27].
The focus of this work is on component RUL estimation, which is still affected by the bearings working conditions. Also,
degradation may not be directly observable and/or the vibration measurements may be affected by large noise and disturbances.
From a methodological viewpoint, approaches for predicting rotating machinery failures can be grouped into two main
categories [23]: traditional reliability approaches and condition-based approaches. Traditional reliability approaches are
based on the distribution of event records of a population of identical units: their historical time-to-failure data are used to
estimate the population characteristics, e.g., the mean time to failure. Examples of these parametric failure models, are
Poisson, Exponential, Weibull and Log-Normal distributions [22,28,42]. These methods are effective for units produced in
high volumes, whereas they are not effective to characterize small numbers of component or even a single component
currently running on a particular machine [23], as it is in our case study.
Condition-based approaches can be divided into two categories: model-based and data-driven techniques [12]. Model-
based approaches attempt to set up comprehensive mathematical models to describe the physical phenomena underlying
the components degradation process and failure modes, and to estimate their RUL. Common model-based approaches of
bearing degradation modeling consist in relating the instantaneous defect area size to the rolling element bearing defect
growth rate, based on Paris-Erdogan crack growth model [5], or the bearing component natural frequency and acceleration
amplitude to the failure time, based on the stiffness damage rule model [38].
However, deterministic models have been found inadequate in addressing the stochastic nature of defect-propagation;
correspondingly, a number of stochastic models have been developed: more precisely, Markov and semi-Markov models
[6,3,55,14], Bayesian filtering models, such as Kalman filter [13,16,8], Extended Kalman filter [1] and particle filtering
[15,17] have been exploited for building exhaustive models of deteriorating components and for analyzing associated
reliability-based and condition-based maintenance policies [37,54,10,11,7]. For example, the Bayesian updating procedure
for residual-life distributions estimation from component degradation signals proposed in [19] assumes a first model in
which the degradation signal exhibits independent identically distributed (iid) errors for an exponential signal trajectory,
and a second model in which the error fluctuations follow a Brownian Motion process (BM); Bayesian updating is used to
improve the estimation of the stochastic parameters in the exponential model, thus eventually improving the estimate of
the true signal trajectory.
Still, uncertainty due to assumptions and simplifications in the models may pose limitations to their applicability in
practical industrial applications, where the failure-specific mechanistic knowledge is often hard to gather without
interrupting operation [23].
In such cases, data-driven techniques may serve the purpose as they utilize monitored operational data related to
system health and routinely collected from the machine, instead of building models based on comprehensive system
physics and human expertise. They can be beneficial when understanding of first principles of system operation is not
straightforward or when the system is so complex that developing an accurate model is prohibitively expensive.
Data-driven techniques can be divided into two categories: statistical techniques (regression methods, ARMA models, etc.) and
Artificial Intelligence (AI) techniques (neural networks, fuzzy systems, etc.). The most direct data-driven techniques for RUL
estimation attempt at fitting available data of component degradation by regression models and then extrapolating the evolution
F. Di Maio et al. / Mechanical Systems and Signal Processing 31 (2012) 405–427 407
up to failure. However, in practice, the component degradation history available may be short, incomplete and differ significantly
due to different operating conditions, so that a common extrapolation may lead to large errors and unreliable results [53].
With respect to AI techniques, the most commonly used prediction methods are based on Neural Networks [36,2,39].
For prognostic tasks, promising methods are Recurrent Neural Networks (RNNs) [49,33,56], Neuro–Fuzzy (NF) systems
[50,48,57,58] and Support Vector Machines (SVMs) [46,34]. In spite of the recognized potential of empirical, data-driven
techniques, limitations still exist for their use in safety-critical applications because of the lack of a systematic approach
for selecting the structure and parameters of the models and their black-box character which limits intuition with respect
to the understanding of their performance [51].
In this paper, we explore the combination of Relevance Vector Machines (RVMs) (a data driven-approach) and
exponential regression (a model-based approach), in an attempt to improve conventional methods which are either purely
data-driven, not incorporating any physics of the process into the computation, or solely model-based, which cannot
accommodate for un-modeled effects and can diverge quickly in the presence of unanticipated operating conditions.
Furthermore, the Bayesian approach underpinning RVMs is well suited to handle uncertainty since it stands on probability
distributions over both parameters and variables, and integrates out the nuisance terms [47,9]. Within this scheme, the
stochastic parameters of the exponential degradation model for an individual device are obtained by manipulating
multiple sources of information: (i) the real-time sensor information collected from the device through condition
monitoring, that is, the degradation signal from the individual device and (ii) the non-zero basis functions, the so called
Relevant Vectors (RVs), which are evaluated within the RVM framework of analysis as relevant for making good
predictions and avoiding over-fitting of the selected model. The objective is to obtain sparse solutions to the regression
task [47,18], to be used as surrogate model of the device’s degradation signal in making predictions about its RUL.
The paper contents are structured as follows. Section 2 contains the description of the procedure at the basis of the RUL
estimation, with an overview of the RVM basic framework. Section 3 presents the benchmark data of degrading thrust
bearings taken from [19]. In Section 4, the results of the application of the procedure here proposed are presented, and an
evaluation of its performance is given. Finally, some conclusions on the advantages and limitations of the procedure here
propounded are given in Section 5.
2. Methodology
Starting from time t¼1 throughout the time horizon of observation T, it is assumed that J successive measurements fj, j¼1,2,
y, J are taken at predefined inspection times Tj, (T1, T2, T3, y, TJ), where, computationally, Tj Tj 1 ¼n is the number of discrete
time steps between two successive inspections, along a degradation-to-failure trajectory developing in the component under
analysis. At each Tj, the available measurements are smoothed to clean the data from the noise; then, the RUL estimation for the
degrading component is performed by resorting to a combination of RVM followed by model fitting onto the identified RVs. Fig. 2
shows a schematic sketch of the computational framework, with reference to degradation signal f(t). For simplicity of illustration,
we consider a single feature as the degradation signal used for estimating the component evolution towards failure.
At each Tj, RVM regression is performed on the j available pairs of data {Tk,fk}, where k¼ 1,2, ..., j and f k ¼ f~ ðT k Þ, so as to
n n
find the Mrj most representative pairs fT l ,f l g, where, upon re-numbering, l ¼1,2, ..., M, and f l is the state estimate
n
provided by the RVM in correspondence of Tl. Then, fitting to the sparse dataset fT l ,f l g of non-zero basis functions is
performed to identify the unknown parameters of the model adopted. Finally, the fitted model is extrapolated up to a pre-
established failure threshold value d.
408 F. Di Maio et al. / Mechanical Systems and Signal Processing 31 (2012) 405–427
2.2.1.1. Linear model assumption. Given a set {Tk,fk} of j cleared measurements f k ¼ f~ ðT k Þ at some points Tk, k¼ 1,2, ..., j, a
linear model can be used:
f~ ðT j Þ ¼ yðT j Þ þ ej ð1Þ
F. Di Maio et al. / Mechanical Systems and Signal Processing 31 (2012) 405–427 409
where ej is an additional noise component of the measurement and the unknown function y(t) is a linear combination of
some known basis functions fj(t), i.e.,
X
yðtÞ ¼ wj fj ðtÞ ð2Þ
j
where the vector w ¼ fw1 ,. . .,wj g in Eq. (2) contains the linear combination weights. In many applications, the solution of
Eq. (2) suffers of over-fitting [47].
where a ¼ fa1 , a2 ,. . ., aj g is a vector of j hyper-parameters that controls how far from zero each weight is allowed to deviate.
The quantification of the optimal a, via iterative searching for the most probable posterior hyper-parameters [18],
allows the identification of M values Tl, l ¼1,2, y, M (upon re-numbering), with non-zero weighted radial basis functions;
actually, only these M inputs contribute to the state prediction in what follows and its vector is denoted by T M .
2.2.1.2.2. Constrained linear model. We assume that the conditional probability of the observations e f at the inspection
times T, pðf~ 9TÞ is Gaussian Nðf~ 9yðTÞ, s2 Þ, given Eq. (1) and assuming a constant noise variance s2 and the optimal a. State
estimations can be made in terms of the predictor distribution [18].
Z
pðf~ 9a, s2 Þ ¼ pðf~ 9w, s2 Þpðw9aÞdw
1
1 T 1=2 Tðb1 þ F a F T Þ1 f~
¼ ð2pÞJ=2 9b þ F a1 F 9 1=2f
e ð4Þ
where b ¼ s I. 2
n
Given the dataset fT, f~ g, the estimates f ðTÞ are equal to the mean of Eq. (4):
n
f ðTÞ ¼ mT F ðTÞ ð5Þ
where
1
T
S ¼ F bF þ aÞ ð7Þ
T
m ¼ S F b f~ ð8Þ
1
ˆ T
RUL
Safety threshold
30 ˆ T
RUL
ˆ T
RUL
j monitored data
j smoothed data
10
Normal
distribution of the
Failure time
5 state posterior
0
M selected state estimates to be fitted to the model
The fitted model is used to predict the degradation signal value at the future time steps. It has to be noted that the
procedure allows for an easy detection of deviations from the nominal signal value, so that no independent fault detection
module is needed to activate the prognostic module. In Fig. 3, it is shown that the estimated RUL at time Tj, RULðT ^ j Þ, is
n
derived by projecting into the future the M state estimates f ðT M Þ until Td, the time at which the state hits the
predetermined failure threshold d.
Lower and upper RUL uncertainty bounds (RUL ^ l ðT j Þ and RUL
^ u ðT j Þ, respectively) are also estimated by calculating the
d d n n
times T 95 and T 5 when the model prediction curves fitted on f 95 ðT M Þ and f 5 ðT M Þ exceed the threshold d, respectively.
Thus, at time Tj the RUL is taken equal to:
^ u=l ðT j Þ ¼ ðT d T j Þ 7ðT d T j Þ
RUL ð10Þ
95=5
The case study here illustrated has been analyzed previously with other model-based methods [19–21]. The database
consists of degradation signals of 25 rolling element thrust bearings obtained from a run-to-failure experiment under
accelerated testing conditions that initiate cracks in the bearing outer raceways [19]. For further information on the
experiment description and setup, the interested reader may refer to [4].
The monitored signal consists of the average of the first seven harmonics of the amplitude of the bearing defective
frequency, whose frequency spectra are acquired every 2 min. Generally, the first part represents the normal operation
phase where there are no cracks on the bearings, as shown in the three degradation signals plotted in Fig. 1. Then, after the
failure onset, the vibration readings increase as defects develop and worsen and the prognostic module is put in operation
until the bearings is considered to be in a failure state and cannot be used. As an indicator of the failure severity, the root
mean square errore (RMS) of the overall vibration signal is considered [21]. The critical value at which the failure occurs is
set equal to a failure threshold d ¼0.03 VRMS [19]. Among the 25 available degrading trajectories, 2 never reach the failure
threshold, so that they are discarded.
Four different degradation patterns are here considered, whose trajectories are shown in Figs. 4–7. A threshold equal to
2.5 m VRMS is used to cut out the stable signal when the bearing is working in nominal conditions, so that our analysis mainly
concentrates on the second phase of the degradation evolution. Note that the degradation trends of Figs. 4–7 could be detected
by using linear, polynomial, exponential trend fitting, etc. Yet, this case study is useful for demonstrating the capabilities of the
proposed procedure not only in terms of accuracy, but also of uncertainty evaluation and prognostic horizon.
4. Results
^
The application of the procedure for computing at the inspection time Tj the estimate RULðT j Þ (Section 2) of the thrust
bearing (Section 3) is hereafter illustrated with respect to the degradation pattern 1, of Fig. 4. Data smoothing, RVM
F. Di Maio et al. / Mechanical Systems and Signal Processing 31 (2012) 405–427 411
30
25
15
10
0 100 200 300 400 500 600 700 800 900 1000
Time [min]
30
25
Vibration Magnitude [μV]
20
15
10
30
25
Vibration Magnitude [μV]
20
15
10
30
25
15
10
regression and the exponential model fitted to the RVs for the bearing degradation pattern 1 are shown in Figs. 8–19, in
correspondence of the successive inspection times Tj until failure. The interval between two successive inspections is equal
to n¼40 min; the circles are the M RVs collected at each inspection time Tj; the thin continuous line represents the fitted
model degradation curve; the thin dashed lines are the upper and lower uncertainty bounds; the bold-dotted horizontal
line indicates the failure threshold d. Note that the use of probabilistic kernels in RVM varies the number of RVs at the
different time steps. The procedural steps of Section 2 are performed as follows:
30
25
Vibration Magnitude [μV]
20
15
10
5
Sensor data
Linear interpolation
0 Selected RV
Threshold value
0 100 200 300 400 500 600 700 800 900 1000
Time [min]
30
25
15
10
5
Sensor data
Linear interpolation
0 Selected RV
Threshold value
0 100 200 300 400 500 600 700 800 900 1000
Time [min]
30
25
Vibration Magnitude [μV]
20
15
10
5
Sensor data
Linear interpolation
0 Selected RV
Threshold value
0 100 200 300 400 500 600 700 800 900 1000
Time [min]
30
25
Vibration Magnitude [μV]
20
15
10
5 Sensor data
Linear interpolation
Selected RV
0 Threshold value
Fitted curve ±1
0 100 200 300 400 500 600 700 800 900 1000
Time [min]
30
25
15
10
5 Sensor data
Linear interpolation
Selected RV
0 Threshold value
Fitted curve ±1
0 100 200 300 400 500 600 700 800 900 1000
Time [min]
30
25
Vibration Magnitude [μV]
20
15
10
5 Sensor data
Linear interpolation
Selected RV
0 Threshold value
Fitted curve ±1
0 100 200 300 400 500 600 700 800 900 1000
Time [min]
30
25
Vibration Magnitude [μV]
20
15
10
5 Sensor data
Linear interpolation
Selected RV
0 Threshold value
Fitted curve ±1
0 100 200 300 400 500 600 700 800 900 1000
Time [min]
30
25
15
10
5 Sensor data
Linear interpolation
Selected RV
0 Threshold value
Fitted curve ±1
0 100 200 300 400 500 600 700 800 900 1000
Time [min]
30
25
Vibration Magnitude [μV]
20
15
10
5 Sensor data
Linear interpolation
Selected RV
0 Threshold value
Fitted curve ±1
0 100 200 300 400 500 600 700 800 900 1000
Time [min]
30
25
Vibration Magnitude [μV]
20
15
10
5 Sensor data
Linear interpolation
Selected RV
0 Threshold value
Fitted curve ±1α
0 100 200 300 400 500 600 700 800 900 1000
Time [min]
30
25
15
10
5 Sensor data
Linear interpolation
Selected RV
0 Threshold value
Fitted curve ±1
0 100 200 300 400 500 600 700 800 900 1000
Time [min]
30
25
Vibration Magnitude [μV]
20
15
10
5 Sensor data
Linear interpolation
Selected RV
0 Threshold value
Fitted curve ±1
0 100 200 300 400 500 600 700 800 900 1000
Time [min]
At each time Tj, j ¼1,2, ..., J, RVM regression is performed on the pairs of data {Tk,fk}, k¼ 1,2, ..., j, so as to find the M RVs
fT l ,f l g, l ¼1,2, ..., M (upon re-numbering). Then, the exponential model of bearing degradation, f(t)¼a þbect [19] is fitted to
n
n
the sparse dataset fT l ,f l g, by identification of the parameters a, b and c. The fitted model is, then, extrapolated to predict
the time Tj at which the vibration signal reaches its threshold value d, and the RUL estimate RULðT ^ j Þ; the lower and upper
uncertainty bounds of the RUL estimate are also calculated, according to Eq. (10).
4.2.1.1. Linear model assumption. The unknown function y(t) (Eq. (1)) is assumed to be a linear combination of some
gaussian basis functions fj(t) weighted by wj, according to Eq. (2).
F. Di Maio et al. / Mechanical Systems and Signal Processing 31 (2012) 405–427
Table 1
Hyper-parameters values estimated during the machine learning process based on the data of pattern 1.
8 9 8 9 8 9 8 9 8 9 8 9 8 9 8 9 8 9 8 9 8 9 8 9
> a =
< 1>
0:0 < 0:0 >
> = >
>
0:0 >
> >
> 3:4 >
> > 10:9 > >
> 465:8 >
> > 187:4 > >
>
0:0 >
> >
> 119:5 >
> >
>
412:9 >
> >
> 9:9 > > > 163:2 >
> > > > > > 323:3 >
> > > > > > > > > > >
15:2 9:3 < 0:0 >= >
> 7:4 >
> > 0:0 >
>
> >
> > >
> > 0:0 >
>
> >
> >
> 0:0 >
> >
> 14:9
>
> > 385:8 >
>
> >
> >
> 0:4 >
> >
> 93:1 >
> >
>
a ¼ ::: >
< >
= >
> >
> >
>
>
>
> >
> >
> >
> >
> >
>
>
>
>
> >
> >
> >
>
>
>
>
> >
> >
>
>
:a > ; >
: >
; >
< >
21:5 = > 413:6 >
> >
> 0:0 >
> >
> 0:0 >
> >
> > >
> >
> > > >
> 33:9 >
>
j 3:4 >
>
>
> 20:8 >
> 0:0 >
>
<
>
>
= >
> >
> >
>
>
>
>
> >
> 0:0 > >
> >
>
> 245:2 >
>
>
>
>
> 0:0 > >
> >
>
>
>
>
>
: ; >
> > >
> >
> > > >
> >
> > > >
> >
> > >
0:0 > 2:5 >
> >
> > 23:5 >
> > 0:0 < 134:0 = >
>
> 0:1 >
>
> >
> 47:4 >
> >
>
> 0:0 >
>
> >
> 0:0 >
> >
>
> 4:4 >
>
>
>
> >
> > > > > < = >
> >
> >
> > >
> >
> > >
: ; >
> >
> >
> > < 0:0 >
> = > 0:0 > > >
> >
> > 0:0 >
> >
7:8 > 0:0 >
>
> >
> > 0:0 >
> >
> >
>
>
0:0 >
>
>
0:0 >
>
<
>
>
= >
>
>
0:0 >
>
>
>
>
>
>
>
>
: ; > > > > > > > > >
>
> 84:6 >
> > > 0:0 >
> > >
>
> 0:0 >
> > 0:0 > > 0:0 < 0:0 > = >
> >
>
3:4 >
>
:
>
>
>
;
>
>
>
>
>
> >
>
>
>
> >
>
> >
> > > < 0:0 >
> =
> >
>
> 137:4 > >
> >
> >
> > > 30:4 >
> >
0:0 >
>
: >
;
>
>
>
137:4 >
>
> > 0:0 >
> >
> >
>
>
>
>
>
>
>
>
6:7 >
>
> >
0:0
>
>
> > >
> >
> > > >
> >
> > >
33:8 > 404:8 >
> >
> > 0:0 >
>
> >
> > 0:0 >
>
> >
> > 0:0 >
>
> >
>
>
>
> 0:0 > >
>
>
: >
; > > >
> >
> > > >
> >
536:0 > 26:3 >
>
> >
> > 79:5 >
> > >
>
> 0:0 >
>
> > 0:0 >
> >
>
>
> >
> >
> >
> >
> >
> >
> >
>
: ; > 253:5 >
> > >
> >
> > 0:0 >
> >
65:4 >
> >
> >
> 194:9 >
> >
> >
>
>
: >
; >
> >
> >
> >
>
179:9 > 393:5 >
> > > 133:4 >
> >
>
> >
> >
> >
>
: ; > 325:4 >
> >
534:3 >
> >
>
>
: >
;
411:0
417
418
Table 2
The mean value of the predicted state value and its variance for pattern 1, at each inspection time.
419
420 F. Di Maio et al. / Mechanical Systems and Signal Processing 31 (2012) 405–427
Table 4
Estimates of the parameters values of the exponential model describing the evolution of the degradation pattern 1.
1000
900
800
700
RUL [min]
600
500 140
120
100
400
80
300 60
40
20
200
0
280 320 360 400 440 480
100
0
0 100 200 300 400 500 600 700 800 900 1000
Time [min]
The computational time required for the estimation along one complete degradation trajectory of around 1000 min is of
few seconds on an Intels Atom 1.6 GHz.
In a similar manner, the procedure has been applied also to the degradation patterns 2, 3 and 4 (Figs. 5, 6 and 7,
respectively) to show the capability of the method to deal with different operating conditions that could initiate different
types of cracks resulting in different degrees of damage severity and crack size. For sake of brevity, we do not go through
the details of the procedure for each degradation pattern, as done before for pattern 1; in Figs. 21, 22 and 23, we directly
report the RUL estimates for patterns 2, 3 and 4, respectively.
The case study has also been analyzed by exponential trend fitting, and by exponential trend fitting with independent
identically distributed (iid) errors [19] and exponential degradation modeling with a Brownian Motion error process
model (BM) [19] (i.e., two model-based approaches). In the two model-based approaches, Bayesian updating was used to
estimate the unknown models, their posterior distributions and the residual-life distributions for any individual
component.
A comparison of the results thereby obtained is given in Table 5. For each of the 4 bearings tested, at each of three
values of Tj, i.e., for Tj A {0.5 Td, 0.75 Td, 0.9 Td} we compare the median, the 5th and 95th percentiles of the residual-life
distributions. In bold are highlighted the best results among exponential fitting, iid-based, BM-based and RVM-based
models. First, it can be noticed that exponential trend fitting (4th column of Table 5) cannot provide information on the
uncertainty of the predictions: therefore, the 5th and 95th percentiles of the residual-life distributions cannot be given in
this case and the comparison can only be made between the point-estimate (4th column) and the other medians (6th, 9th
600
500
400
RUL [min]
300 30
25
20
200 15
10
5
100 0
0 170 180 190 200 210 220 230
0
0 200 400 600 800 1000 1200
Time [min]
1000
800
RUL [min]
600
70
60
400 50
40
30
20
200 10
0
170 180 190 200 210 220
0
0 200 400 600 800 1000
Time [min]
MTTF
800 Actual Remaining Life
RUL estimate with uncertainty (RVM)
700
600
500
RUL [min]
400
300 50
40
200 30
20
10
100
0
170 180 190 200 210 220
0
0 100 200 300 400 500 600 700 800
Time [min]
Table 5
Values of the estimated RUL ^ l T j RUL
^ T j , its uncertainty interval [RUL ^ u T j ] and the actual RUL(Tj) evaluated at three prediction intervals using
exponential fitting, iid-based, BM-based and RVM-based models for four degrading patterns.
Bearing Actual life Prediction time Exponential fitting Iid-based model BM-based model RVM-based model
and 12th columns). On the other hand, the exponential fitting performs better than RVM at the beginning of life of bearing
2, because in this operational condition the trend can be easily fitted. Also the iid-based and BM-based approaches perform
better than RVM at the beginning of life of bearings 3 and 4, because the model-based feature allows including prior
knowledge on the future degradation evolution, whereas the RVM-based model has no knowledge until data become
available. Finally, it is important to notice that the RVM-based model performs better than the others towards the end of
the remaining life of all bearings, when it is more relevant to have trustworthy predictions for decisions regarding
maintenance, and shows adequate representation of the uncertainties in the predictions.
In practice, it is necessary to know also how good the prognostic estimates are before establishing a maintenance
schedule to control degradation and/or counteracting actions against failure. To this aim, we adopt the Prognostic Horizon
(PH) metric, defined as the difference between the current time index j and the instance i when the prediction crosses
the failure threshold [40], such that longer PH allows for more time to act based on a prediction that has some credibility.
The definition of PH may be complemented with an allowable error bound d around the true Td. PH then ensures that the
predicted estimates are within specified limits around the actual Td and hence the predictions may be considered
trustworthy; for instance, a PH with error bound of d ¼5% identifies when a given algorithm starts predicting estimates
that are within 5% of Td.
F. Di Maio et al. / Mechanical Systems and Signal Processing 31 (2012) 405–427 423
^
In Figs. 24–27, the RULðT j Þ estimates for the bearing degradation trajectories of Figs. 4–7 given by two approaches here
compared, i.e., exponential fitting and the proposed RVM-based model, are plotted in bold circles (RVM-based model) and
bold crosses (exponential fitting), together with both the 75% and 710% error bounds with respect to Td, plotted in
dotted lines; The predictions for the degradation trajectories of Figs. 24–27 enter the 90% and 80% accuracy at the times
listed in Tables 6 and 7 for the RVM-based model and the exponential fitting, respectively. The RVM-based model is shown
to provide estimates with high credibility (except for bearing 1), as soon as the prediction deviates from the MTTF at
detection time Tdet(RVM), i.e., the time in which the RVM algorithm jointly detects the onset of the degradation-to-failure
trajectory and performs the estimation of the available RUL. In this sense, the results indicate that the proposed method
1000
T det (exp) T det (RVM)
900
800
700
43% of residual life (RVM)
8% of residual life (exp)
RUL [min]
600
500
20% of residual life (RVM)
6% of residual life (exp)
400
300
200
100
0
0 100 200 300 400 500 600 700 800 900 1000
Time [min]
Fig. 24. PH values for the degrading pattern of Fig. 4 for exponential fitting and RVM-based model.
1200
T det (exp) T det (RVM)
1000
800
RUL [min]
400
78% of residual life (RVM)
32% of residual life (exp)
200
0
0 200 400 600 800 1000 1200
Time [min]
Fig. 25. PH values for the degrading pattern of Fig. 5 for exponential fitting and RVM-based model.
424 F. Di Maio et al. / Mechanical Systems and Signal Processing 31 (2012) 405–427
800
600
400
92% of residual life (RVM)
4% of residual life (exp)
200
0
0 200 400 600 800 1000
Time [min]
Fig. 26. PH values for the degrading pattern of Fig. 6 for exponential fitting and RVM-based model.
800
T det (exp) T det (RVM)
700
600
500
RUL [min]
300
92% of residual life (RVM)
4% of residual life (exp)
200
100
0
0 100 200 300 400 500 600 700 800
Time [min]
Fig. 27. PH values for the degrading pattern of Fig. 7 for exponential fitting and RVM-based model.
can give an high informative prediction at a sufficiently early stage of the component life, so as to allow taking preventive
maintenance actions with confidence.
In Table 7, the PH values for the exponential fitting of degradation trends of bearings 1–4 are given. It can be seen that
even though this approach is faster in detecting the crack onset (i.e., the Tdet(exp) vales in Table 7 are smaller than the
Tdet(RVM) ones in Table 6), the RUL estimates are not as reliable as for the RVM-based model, as shown by the PH values
much smaller than those in Table 6.
It can then be concluded that in the case study considered, exponential fitting cannot provide information on the
uncertainty of the predictions and these are less reliable than those provided by the other methods, in terms of the PH
F. Di Maio et al. / Mechanical Systems and Signal Processing 31 (2012) 405–427 425
Table 6
PH values for the degradation patterns of Figs. 4–7 for the RVM-based model.
Table 7
PH values for the degradation patterns of Figs. 4–7 for the exponential fitting model.
metric. On the other hand, the iid-based, BM-based, and RVM-based models have been shown to be all suitable for the
problem of RUL estimation. In general terms, they can be more or less effective depending on the quality and quantity of
available data. It is then up to the analyst to evaluate case-by-case the best approach, in a trade-off between computational
time and accuracy of predictions.
Based on the results obtained, the following general guidelines hold:
The RVM-based models outperform iid-based and BM-based models towards the end of components life.
Iid-based and BM-based models are better than RVM-based ones when the available measurements are insufficient for
an accurate estimate of the degradation functional form.
Iid-based and BM-based models allow uncertainty evaluation on the estimates of the RUL at the different inspection
times when a detailed degradation model and a large database of degradation patterns are available for estimating the
parameters values of the models [19].
RVM-based models allow uncertainty evaluation on the RUL estimates, without needing a database of degradation
patterns for estimating the parameters values of the model.
5. Conclusions
We have proposed a combination of RVM and model fitting as a prognostic procedure for estimating the RUL of
degrading components. We have successfully applied the procedure to partially degraded thrust ball bearings, based on
pre-processed data collected on individual degradation trajectories. The most relevant basis functions identified on the
smoothed data by the RVM are fitted to the degradation model, which is then extrapolated to failure for estimating the
residual useful life of the component.
The proposed procedure is capable of treating degradation signals with low signal-to-noise ratio for RUL estimation and
accounting for multiple competing degradation processes due to different working conditions and failure severity. Also,
key features of interest for CBM applications are the accuracy of the RUL estimates and the capability of uncertainty
evaluation, thanks to the Bayesian framework on which RVM relies on. Further, the inferred predictors are sparse in that
they contain relatively few non-zero basis functions, i.e., Relevant Vectors (RVs); this feature allows avoiding over-fitting
and making accurate predictions.
Finally, we have shown that the credibility of the results can be quantified by prognostic performance indicators, like
PH, to ensure that the predicted estimates can be considered trustworthy for decision-making in predictive maintenance
activities.
426 F. Di Maio et al. / Mechanical Systems and Signal Processing 31 (2012) 405–427
Acknowledgments
We thank the anonymous referees for their thorough review, comments and suggestions, that significantly contributed
to improve the quality of the work.
References
[1] B.D. Anderson, J.B. Moore, Optimal Filtering, Englewood Cliffs (NJ): Prentice Hall, 1979.
[2] E.B. Barlett, R.E. Uhrig, Nuclear power plant status diagnostics using an Artificial Neural Network, Nucl. Technol. 97 (1992).
[3] P. Baruah, R.B. Chinnam, HMMs for diagnostics and prognostics in machining processes, Int. J. Prod. Res. 43 (6) (2005) 1275–1293. 15.
[4] L. Bian, N. Gebraeel, Computing and updating first-passage time distributions for randomly evolving degradation signals, IIE Trans. http://dx.doi.org/
10.1080/0740817X.2011.649661.
[5] Y. Li, C. Billington, C. Zhang, T. Kurfess, S. Danyluk, S. Liang, Adaptive prognostics for rolling element bearing condition, Mech. Syst. Sig. Process. 13
(1) (1999) 103–113.
[6] C. Bérenguer, A. Grall, B. Castanier, Simulation and evaluation of condition based maintenance policies for multi-component continuous-state
deteriorating systems. In: Cottam M, Harvey D, Pape R, Tait J, Editors. Proceedings of the Foresight and Precaution Conference, pp. 275–282, 2000.
[7] F. Cadini, E. Zio, D. Avram, Monte Carlo-based filtering for fatigue crack growth estimation, Probab. Eng. Mech. 24 (2009) 367–373.
[8] W. Caesarendra, G. Niu, B.S. Yang, Machine condition prognosis based on sequential Monte Carlo method, Expert Syst. Appl. 37 (3) (2010)
2412–2420.
[9] W. Caesarendra, A. Widodo, B.S. Yang, Application of Relevance Vector Machine and logistic regression for machine degradation assessment, Mech.
Syst. Sig. Process. 24 (2010) 1161–1171.
[10] B. Castanier, C. Bérenguer, A. Grall, Stochastic maintenance planning for a repairable system which is inoperative during maintenance operations, in:
EJ Bonano, AL Camp, MJ Majors, RA Thompson (Eds.), Probabilistic Safety Assessment and Management, PSAM6, Elsevier, Amsterdam, 2002,
pp. 1365–1370.
[11] D. Chen, K.S. Trivedi, Optimization for condition-based maintenance with semi-Markov decision process, Reliab. Eng. Syst. Saf. 90 (2005) 25–29.
[12] L.H. Chiang, E. Russel, R. Braatz, Fault Detection and Diagnosis in Industrial Systems, Springer-Verlag, London, 2001.
[13] A. Doucet, On sequential simulation-based methods for Bayesian filtering, Technical Report. Dept. of Engineering, University of Cambridge, CUED-F-
ENGTR310, 1998.
[14] M. Dong, D. He, Hidden semi-Markov model-based methodology for multi-sensor equipment health diagnosis and prognosis, Eur. J. Operational Res.
178 (3) (2007) 858–878.
[15] A. Doucet, S. Godsill, C. Andreu, On sequential Monte Carlo sampling methods for Bayesian filtering, Stat. Comput. 10 (2000) 197–208.
[16] A. Doucet, J.F.G. de Freitas, N.J. Gordon, An introduction to sequential Monte Carlo methods, in: A Doucet, JFG de Freitas, NJ Gordon (Eds.), Sequential
Monte Carlo in Practice, Springer-Verlag, New York, 2001.
[17] P.M. Djuric, J.H. Kotecha, J. Zhang, Y. Huang, T. Ghirmai, M.F. Bugallo, J. Miguez, Particle filtering, IEEE Sig. Proc. (2003) 19–37.
[18] T. Fletcher, Relevance Vector Machines Explained, www.cs.ucl.ac.uk/staff/T.Fletcher, 2008.
[19] N.Z. Gebraeel, M.A. Lawley, R. Li, J.K. Ryan, Residual-life distributions from component degradation signals: a Bayesian approach, IIE Trans. 37 (2005)
543–557.
[20] N.,G. Gebraeel, Sensory — updated residual life distributions for components with exponential degradation patterns, IEEE Trans. Autom. Sci. Eng. 3
(4) (2006).
[21] N.G. Gebraeel, A. Elwany, J. Pan, Residual life predictions in the absence of prior degradation knowledge, IEEE Trans. Reliab. 58 (1) (2009).
[22] P.G. Groer, Analysis of time-to-failure with a Weibull model, Proceedings of the Maintenance and Reliability Conference, Knoxville, TN, USA, 2000,
pp. 59.01–59.04.
[23] A. Heng, S. Zhang, A.C.C. Tan, J. Mathew, Rotating machinery prognostics: state of the art, challenges and opportunities, Mech. Syst. Sig. Process. 23
(2009) 724–739.
[24] H. Hong, M. Liang, Fault severity assessment for rolling element bearings using the Lempel–Ziv compleaxity and continuous wavelet transform, J.
Sound Vib. 320 (2009) 452–468.
[25] A.K.S Jardine, D. Lin, D. Banjevic, A review on machinery diagnostics and prognostics implementing condition based maintenance, Mech. Syst. Sig.
Process. 20 (2006) 1483–1510.
[26] A. Kelly, Maintenance and its management, Proceedings of the Communication Conference, Surrey, England, 1989.
[27] Koyo-Jtekt Corporation, Ball & Roller Bearings: Failures, Causes and Countermeasures, Catalogue NO.B3001E, 2009.
[28] J.F. Lawless, Statistical Models and Methods for Lifetime Data, Wiley-Interscience, New York, 2002.
[30] Y. Li, S. Billington, C. Zhang, T. Kurfess, S. Danyluk, S. Liang, Adaptive prognostics for rolling element bearing condition, Mech. Syst. Sig. Process. 13
(1999) 103–113.
[31] L. Mann, A. Saxena, G.M. Knapp, Statistical-based or condition-based preventive maintenance, J. Qual. Maintenance Eng. 1 (1) (1995) 45–59.
[32] M. Marseguerra, E. Zio, L. Podofillini, Condition-based maintenance optimization by means of genetic algorithms and Monte Carlo simulation,
Reliab. Eng. Syst. Saf. 77 (2002) 151–165.
[33] A. More, M.C. Deo, Forecasting wind with neural networks, Mar. struct. 16 (2003) 35–49.
[34] M.C. Moura, I.D. Lins, E. Zio, E.L. Droguett, Combining particle swarm optimization and support vector regression for reliability prediction.
Proceedings of the 10th International PSAM Conference, 7–11 June 2010, Washington USA pp. 1–10.
[35] S.J. Orfanidis, Introduction to Signal Processing, Prentice-Hall, Englewood Cliffs, NJ, 1996.
[36] L. Peel, Data driven prognostics using a Kalman filter ensemble of Neural Network Models, International Conference on Prognostics and Health
Management, 2008.
[37] U. Pulkkinen, S. Uryas’ev, Optimal operational strategies for an inspected component. In: Petersen KE, Rasmussen B, Eds. Safety and Reliability ’92,
Proceedings of the European Safety and Reliability Conference ’92. London: Elsevier, pp. 896–907, 1992.
[38] J. Qiu, B.B. Set, S.Y. Liang, C. Zhang, Damage mechanics approach for bearing lifetime prognostics, Mech. Syst. Sig. Process. (2002) 817–829.
[39] T.V. Santosh, A. Srivastava, V.V.S. Sanyasi Rao, A.K. Gosh, H.S. Kushwaha, Diagnostic system for identification of accident scenarios in Nuclear Power
Plants using Artificial Neural Networks, Reliab. Eng. Syst. Saf. 94 (2009) 759–762.
[40] A. Saxena, J. Celaya, B. Saha, S. Saha, K. Goebel, Evaluating algorithm performance metrics tailored for prognostics, IEEE Aerospace Conference, Big
Sky MT, pp. 1–13. 2009.
[41] A. Savitzky, M.J.E. Golay, Smoothing and differentiation of data by simplified least squares procedures, Anal. Chem. 36 (1964) 1627–1639.
[42] A. Schömig, O. Rose, On the suitability of the Weibull distribution for the approximation of machine failures, Proceedings of the Industrial
Engineering Research Conference, Portland, OR, 2003.
[43] Y. Shao, K. Nezu, Prognosis of remaining bearing life using neural networks, Proc Instn Mech Engrs, 214, pp. 217-226.
[44] Y. Shao, K. Nezu, Design of mixture de-noising for detecting faulty bearing signals, J. Sound Vib. 282 (2005) 899–917.
[45] G.K. Singh, S.A. Saleh Al Kazzazb, Induction machine drive condition monitoring and diagnostic research—a survey, Electr. Power Syst. Res. 64 (2)
(2003) 145–158.
[46] V. Sotiris, M. Pecht, Support Vector Prognostics Analysis of Electronic Products and Systems, The AAAI Conference on Artificial Intelligence, 2007.
F. Di Maio et al. / Mechanical Systems and Signal Processing 31 (2012) 405–427 427
[47] M.E. Tipping, Sparse Bayesian learning and the relevance vector machine, J. Mach. Learn. Res. 1 (2001) 211–244.
[48] V.T. Tran, B.S. Yang, A.C.C. Tan, Multi-step ahead direct prediction for the machine condition prognosis using regression trees and neuro-fuzzy
systems, Expert Syst. Appl. 36 (5) (2009) 9378–9387.
[49] P. Tse, D. Atherton, Prediction of machine deterioration using vibration based fault trends and recurrent neural networks, Trans. ASME: J. Vib.
Acoust. 121 (1999) 355–362.
[50] W.Q. Wang, M.F. Goldnaraghi, F. Ismail, Prognosis of machine health condition using Neuro–Fuzzy Systems, Mech. Syst. Sig. Process. 18 (2004)
813–831.
[51] T. Wang, J. Yu, D. Siegel, J. Lee, A similarity based prognostic approach for remaining useful life estimation of engineered systems, International
Conference on Prognostics and Health Management, 2008.
[52] J.H. Williams, A. Davies, I.R. Drake, Condition-Based Maintenance and Machine Diagnostics, Chapman and Hall, 1994, ISBN 0 4 I2465 00 0, I87pp.
[53] J. Yan, M. Koc- , J. Lee, A prognostic algorithm for machine performance assessment and its application, Prod. Plann. Control 15 (8) (2004) 796–801.
[54] P.J. Vlok, J.L. Coetzee, D. Banjevic, A.K.S. Jardine, V. Makis, An application of vibration monitoring in proportional hazards models for optimal
component replacement decisions, J. OR Soc. 53 (2002) 193–202.
[55] X. Zhang, R. Xu, C. Kwan, S.Y. Liang, Q. Xie, L. Haynes, An integrated approach to bearing fault diagnostics and prognostics, Proceedings of American
Control Conference, Portland, OR, USA, 2005, pp. 2750–2755.
[56] E. Zio, M. Broggi, N. Pedroni, Nuclear reactor Dynamics On-Line Estimation by Locally Recurrent Neural Networks, Prog. Nucl. Energy 51 (3) (2009)
573–581.
[57] E. Zio, G. Gola, A neuro-fuzzy technique for fault diagnosis and its application to rotating machinery, Reliab. Eng. Syst. Saf. 94 (1) (2009) 78–88.
[58] E. Zio, F. Di Maio, A Data-driven Fuzzy, Approach for predicting the remaining useful life in Dynamic failure scenarios of a Nuclear System, Reliab.
Eng. Syst. Saf. 95 (1) (2010) 49–75.