0% found this document useful (0 votes)

112 views28 pages

Machines 05 00021 v3 PDF

Uploaded by

Auditio Mandhany

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

112 views28 pages

Machines 05 00021 v3 PDF

Uploaded by

Auditio Mandhany

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 28

machines

Review
A Review of Feature Extraction Methods in
Vibration-Based Condition Monitoring and Its
Application for Degradation Trend Estimation of
Low-Speed Slew Bearing
Wahyu Caesarendra 1,2 ID
and Tegoeh Tjahjowidodo 3, * ID

1 Mechanical Engineering Department, Diponegoro University, Semarang 50275, Indonesia;

[email protected]
2 School of Mechanical, Materials and Mechatronic Engineering, University of Wollongong,
Wollongong, NSW 2522, Australia
3 School of Mechanical and Aerospace Engineering, Nanyang Technological University,
Singapore 639798, Singapore
* Correspondence: [email protected]

Received: 9 August 2017; Accepted: 19 September 2017; Published: 27 September 2017

Abstract: This paper presents an empirical study of feature extraction methods for the application
of low-speed slew bearing condition monitoring. The aim of the study is to find the proper features
that represent the degradation condition of slew bearing rotating at very low speed (≈1 r/min)
with naturally defect. The literature study of existing research, related to feature extraction methods
or algorithms in a wide range of applications such as vibration analysis, time series analysis and
bio-medical signal processing, is discussed. Some features are applied in vibration slew bearing
data acquired from laboratory tests. The selected features such as impulse factor, margin factor,
approximate entropy and largest Lyapunov exponent (LLE) show obvious changes in bearing
condition from normal condition to final failure.

Keywords: vibration-based condition monitoring; feature extraction; low-speed slew bearing

1. Introduction
Slew bearings are large roller element bearings that are used in heavy industries such as mining
and steel milling. They are particularly used in turntables, cranes, cranes, rotatable trolleys, excavators,
reclaimers, stackers, swing shovels and ladle cars [1]. In this paper, the slew bearing being analyzed is
the one typically used in BlueScope Steel Australia; specifically for the coal loading into the steel ladle
turret, coal stacker and reclaimer. Slew bearing is different to typical rolling element bearing in terms
of geometry, load and speed application. Slew bearings are usually operated at very low rotational
speed, from 0.5 to 15 rpm in either continuous or reversible rotation and are designed with three axes
to support high axial and radial loads. The bearings have large diameter (e.g., approximately from
200 mm to 5000 mm). Due to the high geometric and cost issue, industries purchase these bearings for
replacement in a regular maintenance period. However, in some cases, these bearings may fail before
the predicted time. To prevent catastrophic failure and to effectively schedule the bearing purchase for
minimizing the replacement cost, a condition monitoring method for early failure detection is required.
Analyzing the vibration signatures of slew bearings that operate at a very low speed is more
complex than that of typical rolling element bearings. Slew bearing signals have low energy. They are
also non-stationary, non-linear, chaotic and practically hard to analyze. Due to their low energy,
the signal is masked by a higher background noise level. Therefore, it is difficult to extract the pertinent

Machines 2017, 5, 21; doi:10.3390/machines5040021 www.mdpi.com/journal/machines

Machines 2017, 5, 21 2 of 28

bearing signal. It is necessary to have reliable condition monitoring parameters or features that could
extract accurate information about the bearing condition from the signal.
The number of studies on slew bearing condition monitoring and early failure detection based
on vibration analysis is lower than for rolling element bearing. Studies of slew bearing condition
monitoring and life prediction utilizing vibration signal are presented in [2–7]. Žvokelj et al. [2]
presents the condition monitoring method utilizing vibration signal based on ensemble empirical
mode decomposition (EEMD) and principal component analysis (PCA). An extension work of [2] is
presented in [3] which used combined EEMD with kernel PCA. Recent studies of the slew bearing
fault detection and diagnosis is presented by Žvokelj et al. [4]. The paper presents combined EEMG
and multiscale independent component analysis (ICA). Nevertheless, studies of slew bearing failure
prediction and prognosis also increased recently [5–7]. Feng et al. [5] develops a reliability model
for residual life prediction based on Weibull analysis. A method for degradation trend estimation of
slew bearings was presented in [6]. The method employed PCA, particle swarm optimization (PSO)
and least squares support vector machine (LSSVM). Another residual life prediction model study of
slewing bearing can be found in [7]. The authors used PCA and support vector regression (SVR) to
construct the model.
In order to monitor the condition of slew bearings, a feature that represents the trends related to
degradation condition from normal to failure is necessary. Due to the unique vibration signal generated
by the roller and raceway contact in slew bearing, a suitable and sensitive feature extraction method
is mandatory in slew bearing fault diagnosis and prediction. There are numerous existing methods,
which are divided into different categories. For instance, Henao et al. [8] presented diagnostic methods
in a non-stationary condition. The methods are classified into four approaches: (1) frequency domain
approach; (2) time domain analysis; (3) slip-frequency; (4) diagnosis in the time-frequency domain.
This paper particularly presents an empirical study of feature extraction methods for vibration
signals acquired from naturally degraded slew bearings operating at very low rotational speed.
In terms of feature extraction methods, six categories are reviewed, namely (i) time-domain
features extraction; (ii) frequency-domain features extraction; (iii) time-frequency representation;
(iv) phase-space dissimilarity measurement; (v) complexity measurement; and (vi) other features.
Among the six categories, the time-domain features extraction method is found to be the most dominant
method in typical rolling element bearings. In addition, the third and fourth methods are suitable for
non-stationary, non-linear and chaotic signals.

2. Laboratory Slew Bearing Experiment

2.1. Slew Bearing Test-Rig and Sensor Location

The vibration data employed in this paper were acquired from a slew bearing test rig [1]. The test
rig was designed to replicate real working conditions of a slew bearing in a steel company, such as a
high-applied load and low rotational speed. The slew bearing used was a large thrust bearing with an
inner and outer diameter of 260 mm and 385 mm, respectively. The rotational speed of the lab test rig
can be operated from 1 to 12 r/min. A photo of the test rig and sensor location on the slew bearing is
shown in Figure 1.
Machines 2017, 5, 21 3 of 28
Machines 2017, 5, 21 3 of 28
Machines 2017, 5, 21 3 of 28

Figure1.1.Photo
Figure Photoof
ofslew
slew bearing
bearing lab
lab test
test rig
rig and
andsensor
sensorlocation.
location.
Figure 1. Photo of slew bearing lab test rig and sensor location.
2.2.2.2. Data
Data AcquisitionProcedure
Acquisition Procedure
2.2. Data Acquisition Procedure
The vibration data for continuous rotation was collected from four accelerometers that were
The vibration data for continuous rotation was collected from four accelerometers that
installed on the inner
The vibration datadiameter surfacerotation
for continuous at 90° wasto each other from
collected with four
4880accelerometers
Hz sampling that rate. were
The
were installed on the inner diameter surface at 90◦ to each other with 4880 Hz sampling
accelerometers
installed on the were IMIdiameter
inner 608A11 ICP (IMI Sensors,
surface at 90° toPCB each Piezotronics
other withDivision,
4880 HzNew York, NY,
sampling rate.USA)
The
rate. The accelerometers were IMI 608A11 ICP (IMI Sensors, PCB Piezotronics Division, New York,
type sensors. The
accelerometers wereaccelerometers
IMI 608A11 ICP were
(IMIconnected
Sensors, PCBto a Piezotronics
high-speed PicoScope
Division, NewDAQYork,(PS3424) (Pico
NY, USA)
NY, USA) type sensors. The accelerometers were connected to a high-speed PicoScope DAQ (PS3424)
Technology,
type sensors.StThe Neots, United Kingdom).
accelerometers The bearing
were connected to was subjected PicoScope
a high-speed to an axialDAQload of 15 tons.(Pico
(PS3424) The
(Pico Technology,
bearing began
Technology, St Neots,
St running
Neots, inUK).
United The
withbearing
2006Kingdom). Thewas
rotational subjected
speed
bearing to an However,
of 1 subjected
was r/min. axial load
to an ofprovide
to
axial 15 tons.
load of 15 The bearing
continuous
tons. The
began running
monitoring
bearing began in running
and 2006 with
produce rotational
2006 with speed
inrun-to-failure of 1data,
bearing
rotational r/min.
speedthe ofHowever,
bearing
1 r/min.datato provide tocontinuous
was collected
However, monitoring
on a continuous
provide daily basis
andfromproduce
February
monitoring run-to-failure
and toproduce bearing
August run-to-failure
2007 data,
with total the bearing
number
bearing of 139
data, data
the days.
bearing was
Todata collected
accelerate theonbearing
was collected a daily basisbasis
on aservice
daily from
life,
February
coal
fromdust to August
was injected
February 2007
to August with
into2007 total number
the bearing of 139
in number
with total mid-Aprildays.
of 2007To accelerate
(58 days
139 days. the bearing service
from the beginning).
To accelerate the bearing In life, coal
practice,
service dust
life,
was injected
especially
coal into
dust was the bearing
in steel-making
injected in
themid-April
intocompanies, in2007
bearingthe slew (58 days
bearing
mid-April 2007from
is (58 the
located
daysbeginning).
in open
from Inwhere
theair practice,
beginning). especially
theInbearing
practice, is in
steel-making
exposed
especiallytoincompanies,
a steel-makingthe slew
dusty environment bearing
companies, is located
and the
for in open
this reason
slew bearing is air
the where
coal
located inthe
dust wasbearing
open insertedis exposed
air where in the to a dusty
mid-April
bearing to
is
environment
simulate
exposed to and
the forworking
a real
dusty this reason the and
coal The
conditions.
environment dustthis
for was inserted
schematic
reason of
thethein mid-April
vibration
coal dust was to inserted
datasimulate inthe
acquisition real
fromworking
mid-April slewto
bearing
simulatetest-rig
conditions. the real
The is presented
working
schematic in Figure
conditions.
of the 2. The
vibration dataschematic
acquisitionof the from vibration data acquisition
slew bearing from slew in
test-rig is presented
bearing
Figure 2. test-rig is presented in Figure 2.
Speed sensor
Speed sensor

Belt Gear configuration

Belt Gear configuration
Drive ring
Drive ring
Slewing
bearing
Slewing
bearing
Hidraulic
pressure Belt
Hidraulic
pressure Belt

Motor
Motor

Figure 2. Schematic diagram of slew bearing test-rig and data acquisition flow process.
Figure
Figure 2. 2. Schematicdiagram
Schematic diagramof
ofslew
slewbearing
bearing test-rig
test-rig and
and data
dataacquisition
acquisitionflow
flowprocess.
process.
Machines 2017, 5, 21 4 of 28

3. Features Extraction Methods and Its Application on Slew Bearing Vibration Signal

3.1. Category 1: Time-Domain Features Extraction

3.1.1. Statistical Time-Domain Features

Statistical time-domain features such as mean, root mean square (RMS), standard deviation and
variance were usually used in past studies to identify the differences between one vibration signal
and another. More advanced statistical-based features such as skewness and kurtosis can be applied
to the signal which is not purely stationary. These features examine the probability density function
(PDF) of the signal. It is a well-known fact that if the condition of the bearing changes, the PDF also
changes, thus the skewness and kurtosis might also be affected. In particular, skewness is used to
measure whether the signal is negatively or positively skewed, while kurtosis measures the peak value
of the PDF and indicates if the signal is impulse in nature. For a signal with a normal distribution
i.e., normal bearing signal has a skewness value of zero.
In addition, kurtosis is obtained from the peak of the PDF of the vibration signal, while skewness
is obtained from the mean value of the PDF of the vibration signal. It is well known that the kurtosis
value of vibration signal from normal bearing is approximately three [9] and the skewness value of
approximately zero. Then, when the PDF of vibration signal changes due to faults, the kurtosis
value will increase to greater than three and the skewness value will shift to either negative
or positive. Other features which can be calculated from PDF of the vibration signal includes:
entropy, which calculates the histogram of the PDF and measures the degree of randomness of
the vibration signal; lower bound and upper bound histogram, which measure the lower and upper
values of the PDF respectively.
Aside from the statistical time-domain features mentioned above, there are other non-dimensional
features such as shape factor (defines as RMS divided by mean) and crest factor (defines as standard
deviation divided by RMS). Both features will change as the mean, RMS and standard deviation
change. The summary of the statistical time-domain features is presented in Table 1.

Table 1. Brief review of statistical time-domain features extraction applied in condition monitoring of
typical rolling element bearings [10].

Description
Feature Name
Brief Definition Formula
The RMS value increase gradually as fault developed. However, RMS is unable q
1 N 2
N ∑ i =1 x i
RMS to provide the information of incipient fault stage while it increases with the fault RMS =
development [11].
N 2
Variance Variance measures the dispersion of a signal around their reference mean value. ∑ i =1 ( x i − m )
Var = ( N −1) σ 2
Skewness quantifies the asymmetry behavior of vibration signal through its N
∑ i =1 ( x i − m )
3
Skewness Sk =
probability density function (PDF). ( N −1) σ 3

Kurtosis quantifies the peak value of the PDF. The kurtosis value for normal N
∑ i =1 ( x i − m )
4
Kurtosis Ku =
rolling element bearing is well-recognized as 3. ( N −1) σ 4
q
Shape factor is a value that is affected by an object’s shape but is independent of 1 N
N ∑ i =1 x i
2
Shape factor SF =
its dimensions [12]. 1
N
N
∑ i =1 | x i |

Crest factor (CF) calculates how much impact occur during the rolling element max | xi |
Crest factor CF = q
1 N
N ∑ i =1 x i
2
and raceway contact. CF is appropriate for “spiky signals” [12].
Entropy, e( p), is a calculation of the uncertainty and randomness of a sampled n
Entropy vibration data. Given a set of probabilities, ( p1 , p2 , . . . , pn ), the entropy can be e( p) = − ∑ p(zi ) log2 p(zi )
calculated using the formulas as shown in the right column. i =1
Machines 2017, 5, 21 5 of 28
Machines 2017, 5, 21 5 of 28

3.1.2.
3.1.2. Upper
Upper andand LowerBound
Lower BoundofofHistogram
Histogram

The The application

application of histograms
of the the histograms or a discrete
or a discrete PDF for PDF for time-domain
time-domain feature feature extraction
extraction is presentedis
presented
in [13]. Let d in
be[13]. Let
the number d be
ofthe number
division thatofgroup
division
thethat group
ranges, andthehiranges, ≤ i <hi dwith
with 0 and 0 ≤ i < d
be the columns be of
the columns of the histogram
the histogram for the time xi for the time x i
n
1
hi = ∑ 1ri ( xi ), ∀i, 0 ≤ i < d
n
(1)
hi = n ri ( xi ), ∀i , 0 ≤ i < d (1)
j =0 j =0 n
(
i (max( x )−min( x )) (i +1)(max( x )−min( x ))
1, i f i (max(ixi ) − min(ixi ))≤ x <(i + 1)(max( xi ) −i min( xi )) i
ri = r = 1, if d ≤x< d (2)
i 0, d otherwise d (2)
0, otherwise

The upper bound hU and lower bound h L of histogram are defined as
The upper bound hU and lower bound h L of histogram are defined as
hU h= =max ( xx i))++Δ ∆/2
max( /2 (3) (3)
U i

h L h= =max ( xx i))−−Δ ∆/2

max( /2 (4)
L i (4)
where ∆ =Δ (=max ( xxi ))− min( xi ))/(n − 1).
where (max( i − min( xi )) / ( n − 1) .
According to the features extraction results presented in Figure 3, some features, such as RMS,
According to the features extraction results presented in Figure 3, some features, such as RMS,
kurtosis, histogram upper bound and histogram lower bound, show the fluctuation in the last
kurtosis, histogram upper bound and histogram lower bound, show the fluctuation in the last
measurement day, approximately from the 90th day to the 139th day. These indicate that some
measurement day, approximately from the 90th day to the 139th day. These indicate that some features
features are sensitive to the slew bearing condition while others are less. The incipient damage
are sensitive to the slew bearing condition while others are less. The incipient damage occurred in day
occurred
90th thatinisday 90th
clearly that isby
indicated clearly indicatedhistogram
RMS, kurtosis, by RMS,upperkurtosis,
boundhistogram upper
and histogram bound
lower bound and
histogram
feature, lower
where bound feature, where
the fluctuation the fluctuation
is associated is associated
with roller and outer with
raceroller and However,
defects. outer race more
defects.
However, more appropriate and reliable features are necessary for the condition monitoring
appropriate and reliable features are necessary for the condition monitoring of slew bearing to detect of slew
bearing to detectdamage
an impending an impending damagethe
and to estimate and to estimate
degradation the degradation index.
index.

40 1
Skewness
RMS

20 0

0 -1
0 20 40 60 80 100 120 140 0 20 40 60 80 100 120 140

200 2000
Shape factor
Kurtosis

100 0

0 -2000
0 20 40 60 80 100 120 140 0 20 40 60 80 100 120 140
4
x 10
2 5
Crest factor

Entropy

0 0

-2 -5
0 20 40 60 80 100 120 140 0 20 40 60 80 100 120 140

0 400
Histogram upper

Histogram lower

-200 200

-400 0
0 20 40 60 80 100 120 140 0 20 40 60 80 100 120 140
t, days t, days

Figure 3. Statistical time-domain features calculated from the vibration slew bearing signal.
Figure 3. Statistical time-domain features calculated from the vibration slew bearing signal.
Machines2017,
Machines 2017,5,5,21
21 66of
of28
28

3.1.3. Autoregressive (AR) Coefficients

3.1.3. Autoregressive (AR) Coefficients
In other studies [13], autoregressive (AR) coefficients of the vibration signal are calculated as
In other studies [13], autoregressive (AR) coefficients of the vibration signal are calculated
bearing vibration features. It is known that a faulty vibration signal of typical rolling element bearing
as bearing vibration features. It is known that a faulty vibration signal of typical rolling element
produces different autoregressive coefficients compared to that of normal vibration signal. The
bearing produces different autoregressive coefficients compared to that of normal vibration signal.
following is a common univariate time series model to calculate the AR coefficients:
The following is a common univariate time series model to calculate the AR coefficients:
n
yt = a1 yt −1 + a2 yt − 2 + . . . + an yt − n + ε t =  ainyt −1 + ε t (5)
yt = a1 yt−1 + a2 yt−2 + . . . + an yt−n + ε t i= =1
∑ a i y t −1 + ε t (5)
i =1
where a1 to a n are the autoregressive coefficients, yt is the digitized vibration signal, n is the
whereofa1the
order an are
to AR ( n= 8 ) and ε is
the autoregressive
model coefficients,
the residual yt (Gaussian
is the digitized noise). signal, n is the order of
whitevibration
the AR
Themodel = 8) and εkurtosis,
RMS,(nskewness, is the residual
entropy(Gaussian
and ARwhite noise). have been applied in vibration
coefficients
The RMS, skewness, kurtosis, entropy and AR
datasets from induction motor [14] and rolling element bearing coefficients have
[15].been
Theseapplied in vibration
papers have made datasets
great
from inductiontomotor
contributions the [14] and rolling
research areas element
on the bearing
bearing[15]. These papers
condition have made
monitoring andgreat
faultcontributions
diagnosis.
to the research
However, areas on were
the features the bearing
applied condition monitoring
for artificially and fault
bearing diagnosis.
damage. However,
In practice, the the features
damage is
were applied for artificially bearing damage.
typically developed naturally rather than artificially. In practice, the damage is typically developed naturally
rather
Thethan
ARartificially.
coefficients results are presented in Figure 4. It is shown that most of the features are
less relevant coefficients
The AR to representresults are presented
the slew in Figure except
bearing condition 4. It is shown
for ARthat most of the
coefficient features
4 which are less
is shown
relevant to represent the slew
fluctuating in the last measurement. bearing condition except for AR coefficient 4 which is shown fluctuating
in the last measurement.

1 0.5
AR coef 1

AR coef 2

0 0

-1 -0.5
0 20 40 60 80 100 120 140 0 20 40 60 80 100 120 140

1 0.2
AR coef 3

AR coef 4

0 0

-1 -0.2
0 20 40 60 80 100 120 140 0 20 40 60 80 100 120 140

0.5 0.5
AR coef 5

AR coef 6

0 0

-0.5 -0.5
0 50 100 150 0 50 100 150

0.2 0.5
AR coef 7

AR coef 8

0 0

-0.2 -0.5
0 50 100 150 0 50 100 150
t, days t, days

Figure 4. AR features (up to 8 coefficients) calculated from the vibration slew bearing signal.
Figure 4. AR features (up to 8 coefficients) calculated from the vibration slew bearing signal.

Other time domain features such as impulse factor (IF) and margin factor (MF) have been
recently used [12,16,17]. Similar to CF, IF is used to measure how much impact is generated from the
bearing defect. The CF measures the impact by dividing the maximum absolute value of vibration
Machines2017,
Machines 2017,5,5,21
21 77of
of28
28

signal and RMS value, while IF divides the maximum absolute value by the mean of absolute value.
Other time
The formula domain features
is presented such as impulse factor (IF) and margin factor (MF) have been recently
as follows:
used [12,16,17]. Similar to CF, IF is used to measure how much impact is generated from the bearing
defect. The CF measures the impact by dividing max xi
IF = the maximum absolute value of vibration signal and
1 value (6)
RMS value, while IF divides the maximum absolute

N
i =1
xi by the mean of absolute value. The formula
is presented as follows: N
max| xi |
Margin factor (MF) measures the levelIF of=impact
1 N
between rolling element and raceway. The MF (6)
is calculated by dividing the maximum absolute N∑ i =1 | xof
value i | vibration signal to the RMS of absolute

valueMargin
of vibration
factorsignal (7):
(MF) measures the level of impact between rolling element and raceway. The MF is
calculated by dividing the maximum absolute value of vibration signal to the RMS of absolute value
max xi
of vibration signal (7): MF = 2
 1 max N | xi |  (7)
MF =  x
i =1 p i  2
 N1 ∑iN=1 | xi |
 (7)
N
The progressionofofslew
The progression slew bearing
bearing deterioration
deterioration of IFof IFMF
and and MF features
features are moreare more compared
apparent apparent
compared to RMS, kurtosis and histogram lower features as presented
to RMS, kurtosis and histogram lower features as presented in Figure 5. in Figure 5.

60
Impulse factor

0
0 20 40 60 80 100 120 140

150
Margin factor

100

0
0 20 40 60 80 100 120 140
t, days

Figure 5.
Figure Impulsefactor
5. Impulse factor(IF)
(IF)and
andmargin
marginfactor
factor(MF)
(MF)feature
feature calculated
calculated from
from the
the vibration
vibration slew
slew
bearingsignal.
bearing signal.

3.1.4.
3.1.4.Hjorts’
Hjorts’Parameters
Parameters
In
Inaddition
additionto tothe
thetime-domain
time-domainfeatures features mentioned
mentionedin in previous
previous subsections,
subsections,Hjorts’
Hjorts’parameters
parameters
also
also falls
falls into
into this
this category
category [18]. [18]. These These features features are are calculated
calculated based
based on
on the
the first
first and
and the
the second
second
derivatives of the vibration signal. In the time series context, the numerical
derivatives of the vibration signal. In the time series context, the numerical values for the derivatives values for the derivatives
are
are obtained
obtained as as the
the differences
differences between between the the current
current value value and
and the
the prior
prior value.
value. The
The first
first Hjorth’s
Hjorth’s
parameter
parameterisis called
called ‘activity’.
‘activity’. The Theactivity activityof ofvibration
vibrationsignal signalcan
canbebecomputed
computedby bysimply
simplycalculating
calculating
the
the variance of of the
thevibration
vibrationsignal signalamplitude amplitude σx2 ,σwhere
x
2
, where σ x standard
σx is the is the standard
deviation deviation of the
of the sampled
signal x =
vibrationvibration
sampled ( x1 , xx2 ,=x3( x, 1.,. x. 2, ,xxN3 ,)...., xN ) .
signal
The ‘mobility’ parameter of vibration signal is calculated as the square root of the ratio of the
activity of the first derivative and the activity of the vibration signal.
Machines 2017, 5, 21 8 of 28

Machines 2017, 5, 21 8 of 28
The ‘mobility’ parameter of vibration signal is calculated as the square root of the ratio of the
activity of the first derivative and the activity of the vibration signal.
σ '
mobility = σxx 0 (8)
mobility = σ x (8)
σx
where σσ0 x'is is
where thethe standard
standard deviation
deviation of first
of the the first derivative
derivative of theofvibration
the vibration
signal.signal. The numerical
The numerical values
x
values
for for the derivatives
the derivatives can be obtained
can be obtained as the differences
as the differences betweenbetween
samplessamples as follows.
as follows.
'
x 0x==xx((ii+
+1)1)−−
xi xi = 1,
i =i 1, , NN−− 1
. . ...., (9)
(9)

The last
The last parameter
parameter called
called ‘complexity’
‘complexity’ which
which is
is calculated
calculated as
as the
the ratio
ratio of
of mobility
mobility of
of the
the first
first
derivative and
derivative and the
the mobility
mobility of
of the
the vibration
vibration signal.
signal.

σ σx 0
σ x00 //σ
" '
complexity ==
complexity x x
(10)
(10)
σσxx 0 /σ
' / σx
These
These parameters
parametershave have been used
been in electroencephalography
used in electroencephalography (EEG) (EEG)
signalssignals
to detecttothe epileptic
detect the
seizures
epileptic[19]. They [19].
seizures have never been used
They have neverinbeen
vibration
usedbearing signal except
in vibration bearingforsignal
activity feature,
except forwhich is
activity
similar to the variance feature in the statistical time-domain features extraction. It can be
feature, which is similar to the variance feature in the statistical time-domain features extraction. It seen from
Figure 6 thatfrom
can be seen the progression
Figure 6 thatofthebearing condition
progression is difficult
of bearing to track
condition using mobility
is difficult to trackand complexity
using mobility
feature.
and complexity feature. In contrast, the activity or variance feature shows one highest peakthe
In contrast, the activity or variance feature shows one highest peak at approximately at
90th day and continue
approximately the 90th fluctuation on the last
day and continue measurement
fluctuation on the days.
last measurement days.

600

400
Activity

200

0
0 20 40 60 80 100 120 140
1.5
Mobility

0.5
0 20 40 60 80 100 120 140
4
Complexity

1
0 20 40 60 80 100 120 140
t, days

Figure 6. Hjorts’ features calculated from the vibration slew bearing signal.
Figure 6. Hjorts’ features calculated from the vibration slew bearing signal.

3.1.5. Mathematical Morphology (MM) Operators

3.1.5. Mathematical Morphology (MM) Operators
In image processing, mathematical morphology (MM) was introduced as a non-linear method to
In image processing, mathematical morphology (MM) was introduced as a non-linear method
analyze two dimensional (2D) image data including binary images and grey-level images based on set
to analyze two dimensional (2D) image data including binary images and grey-level images based
theory [20]. The fundamental principle of the MM method is to modify the shape of the original signal
on set theory [20]. The fundamental principle of the MM method is to modify the shape of the original
by transforming it through its intersection with another object called the ‘structuring element’ (SE).
signal by transforming it through its intersection with another object called the ‘structuring element’
Four operators including erosion, dilation, closing and opening are used to achieve the transformation.
(SE). Four operators including erosion, dilation, closing and opening are used to achieve the
transformation.
After having been applied successfully in 2D image processing, the MM method has been used
in biomedical research field to analyze one dimensional (1D) EEG signals [21–23] and
electrocardiography (ECG) signals [24]. The first known study of MM in 1D vibration signals was
Machines 2017, 5, 21 9 of 28

After having been applied successfully in 2D image processing, the MM method has been used in
biomedical research field to analyze one dimensional (1D) EEG signals [21–23] and electrocardiography
(ECG) signals [24]. The first known study of MM in 1D vibration signals was presented by Nikolaou
and Antoniadis [25]. The authors used the MM method for vibration data with inner race and
outer race faults. The effects of four MM operators on the enveloping impulse-type signals were
examined. The result shows that the ‘closing’ operator is more suitable for extracting the impulse
signal corresponding to the bearing fault type. The SE value was empirically investigated within the
range of 0.2 T to 1.2 T. The optimum SE for a signal with additive noise was selected as 0.6 T (0.6 times
the pulse repetition period). In addition, T is the period of each bearing component frequency such as
BPFO, BPFI and BSF.
The original discrete one dimensional (1D) signal and structuring element (SE) must be
first determined to use the four basic operators in the MM method. Suppose the discrete 1D
vibration signal x = {x(1), x(2), . . . , x( N )}, which functions over a domain x(n) = (1, 2, . . . , N ).
Let S = {S(1), S(2), . . . , S( M)} be the SE, which is the discrete function over a domain
S(m) = (1, 2, . . . , M), where N and M are integers and N ≥ M. Then, the four morphology operators for
the discrete 1D vibration signal can be defined as follows:

• Erosion: also refer to as min filter.

( xΘS)(n) = min( x (n + m) − S(m)) (11)

• Dilation: also refer to as max filter.

( x ⊕ S)(n) = max( x (n − m) + S(m)) (12)

• Closing: Dilates 1D signal and then erodes the dilated signal using the similar structuring element
for both operations.
( x •S)(n) = (( x ⊕ S)ΘS)(n) (13)

• Opening: Erodes 1D signal and then dilates the eroded signal using the similar structuring element
for both operations.
( x ◦ S)(n) = (( xΘS) ⊕ S)(n) (14)

where n ∈ (1, 2, . . . , N ) and m ∈ (1, 2, . . . , M), the notations Θ, ⊕, •, ◦ denote the erosion operator
or Minkowski subtraction, dilation operator or Minkowski addition, closing operator and opening
operator in the MM method, respectively.
The original simulated 1D signal is shown in the first row of Figure 7. The various effects of the
basic MM operators application in the simulated 1D vibration signal are presented in the second to
fifth row of Figure 7. The second row of Figure 7 to the fifth row of Figure 7 illustrate the effect of
erosion, dilation, closing and opening, respectively. Please note that the solid blue line represents the
original signal and the dashed red line is the signal after being processed or transformed by the MM
operators. The simulated signal has zero mean value, which is typical to 1D vibration signals. The work
principle of the basic MM operators can be summarized as follows: erosion of x (n) by the structuring
element A(m) reduces the positive part and enlarges the negative part of x (n), while dilation of x (n)
by the structuring element A(m) reduces the negative part and enlarges the positive part of x (n).
In addition, closing of x (n) smoothens the signal x (n) by cutting down any impulses in the negative
part and opening x (n) smoothens the signal x (n) by cutting down its impulses in the positive part.
In rolling element bearing, the MM method has been studied. The multi-scale morphological
analysis was developed by Zhang et al. [26]. The method shows independency of empirical rules in
terms of SE selection, but the technique is rather complex. Wang et al. [27] proposed an improved
morphological filter as features of periodic impulse signals. An average weighted combination of
open-closing and close-opening morphological operators was employed to eliminate the statistical
Machines 2017, 5, 21 10 of 28

terms of SE selection, but the technique is rather complex. Wang et al. [27] proposed an improved
Machines 2017, 5, 21 10 of 28
morphological filter as features of periodic impulse signals. An average weighted combination of
open-closing and close-opening morphological operators was employed to eliminate the statistical
deflection of amplitude and to extract the impulse component from the original signal. To optimize
the SE,
SE, they
theyused
usedaanew newcriterion
criterion called
called impulse
impulse attenuation
attenuation (IMA).
(IMA). DongDonget al.et[28]
al. presented
[28] presented the
the fault
fault diagnosis
diagnosis of element
of rolling rolling bearings
element using
bearings using the
the modified modified morphological
morphological method. The
method. The morphological
morphological
operator used inoperator used in
their method wastheir
the method
average was thecombination
weight average weight combination
of the closing andof the closing
opening and
operator.
opening
They operator.
proposed Theycriterion
a new proposed a new
called SNR criterion called
criterion SNR criterion
to optimize to optimize
the length the length
of SE. Another of SE.
recently
Another recently
published researchpublished
on MM methodresearchwas onpresented
MM method by Rajwas
andpresented by The
Murali [29]. Raj authors
and Murali [29]. The
combined the
authors combined
MM operators withthe MMinference
fuzzy operators to with fuzzy
classify the inference to classify
artificial bearing the artificial
defects. bearing
The vibration defects.
data used
The vibration
in [29] data used
were similar to theindata
[29] used
were insimilar
Zhangtoettheal.data
[26].used in Zhang et al. [26].

10
Amplitude

5
0
-5
0 5 10 15 20 25 30 35 40 45 50 55
10
Amplitude

5
0
-5
0 5 10 15 20 25 30 35 40 45 50 55
Data number (n)

Figure 7. Illustration of four MM operators (second to fifth row).

The application
The application of of the
the MMMM method
method in in detecting
detecting thethe bearing
bearing fault
fault frequencies
frequencies from
from among
among
impulsive-type signals
impulsive-type signalshave havebeenbeen demonstrated
demonstrated [26–29].
[26–29]. TheseThese
signalssignals are generated
are generated from known from
known bearing conditions such as inner and/or outer race faults. It is
bearing conditions such as inner and/or outer race faults. It is acknowledged that the signals from acknowledged that the
signals from
artificial faultsartificial
are easier faults
to beare easier to
identified thanbe those
identified than those
from natural fromfaults.
bearing natural bearing
Besides, faults.
typically,
Besides,
the typically,signals
impulse-type the impulse-type signals visually
are easily identified are easily identified
without visually without
any additional any additional
tool or processing step
tool or processing step [28]. The application of MM method to the naturally
[28]. The application of MM method to the naturally damaged slew bearing is presented in Figures damaged slew bearing is
presented
8–10. in Figures 8–10.
The MM MMmethod
methodhas hasbeen used
been effectively
used effectivelyfor the
for damage detection
the damage of highofspeed
detection highrolling
speed bearing.
rolling
It is because
bearing. It is when
because thewhen
damagethe occurred, the contact
damage occurred, thebetween the damage
contact between the spot
damageandspot
rolling
andelement
rolling
generatesgenerates
element detectabledetectable
impulse due to thedue
impulse contact
to thebetween
contactthe defect spot
between and race
the defect way
spot or roller.
and race wayIn the
or
case of naturally degraded low-speed slew bearing, the multiple damage e.g.,
roller. In the case of naturally degraded low-speed slew bearing, the multiple damage e.g., outer race,outer race, inner race
and roller
inner race areandtypically
roller areoccurred
typicallyatoccurred
close sequence
at close time and the
sequence timemagnitude of impulseofisimpulse
and the magnitude very low. is
Thus, it is Thus,
very low. difficult
it istodifficult
identifytothe originthe
identify of impulse
origin offrom outer
impulse race,
from inner
outer raceinner
race, or roller
race ordamage.
roller
Therefore, the featurethe
damage. Therefore, extraction result of MM
feature extraction forof
result the
MM three
forconditions: BPFO, BPFIBPFO,
the three conditions: and BSF
BPFIpresent
and BSF an
identicalan
present figure.
identical figure.
Machines 2017, 5, 21 11 of 28
Machines 2017, 5, 21 11 of 28
Machines 2017, 5, 21 11 of 28

Figure 8.
Figure 8. RMS
RMS feature
feature calculated
calculated from
from MM
MM operators
operators for
for 0.9
0.9 TT (T
(T == 1/BPFI):
1/BPFI): (a)
(a) erosion;
erosion; (b)
(b) dilation;
dilation;
Figure
(c) 8. RMS
closing and feature
(d) calculated from MM operators for 0.9 T (T = 1/BPFI): (a) erosion; (b) dilation;
opening.
(c) closing and (d) opening.
(c) closing and (d) opening.

Figure 9. RMS feature calculated from MM operators for 0.9 T (T = 1/BPFO): (a) erosion; (b) dilation;
Figure
Figure 9. RMS
9.
(c) closing and feature
RMS feature calculated from
calculated
(d) opening. from MM
MM operators
operators for
for 0.9
0.9 TT (T
(T == 1/BPFO):
1/BPFO): (a)
(a) erosion;
erosion; (b)
(b) dilation;
dilation;
(c) closing and (d) opening.
(c) closing and (d) opening.
Machines 2017, 5, 21 12 of 28
Machines 2017, 5, 21 12 of 28

Figure 10.
Figure 10. RMS
RMS feature
feature calculated
calculated from
from MM
MM operators
operators for
for 0.9
0.9 TT (T
(T == 1/BSF):
1/BSF): (a)
(a) erosion;
erosion; (b)
(b) dilation;
dilation;
(c) closing and (d) opening.
(c) closing and (d) opening.

3.2. Category 2: Frequency-Domain Features Extraction

3.2. Category 2: Frequency-Domain Features Extraction
To apply frequency-domain features, the time-domain vibration signals must initially be
To apply frequency-domain features, the time-domain vibration signals must initially be
transformed into frequency-domain signals using fast Fourier transform (FFT). FFT is a common
transformed into frequency-domain signals using fast Fourier transform (FFT). FFT is a common
method in vibration analysis for bearing fault detection. FFT measures the dominant frequency of the
method in vibration analysis for bearing fault detection. FFT measures the dominant frequency of the
repetitive impulse period of certain faults due to the contact between rolling elements and defective
repetitive impulse period of certain faults due to the contact between rolling elements and defective
spot. Therefore, the type of bearing defects such as outer race, inner race or ball defect can be detected.
spot. Therefore, the type of bearing defects such as outer race, inner race or ball defect can be detected.
FFT performed effectively in stationary periodic signals; however, it is less effective for non-
FFT performed effectively in stationary periodic signals; however, it is less effective for non-stationary
stationary signals that arise from time-dependent events [30].
signals that arise from time-dependent events [30].

3.2.1. Statistical
3.2.1. Statistical Frequency-Domain Features
Frequency-Domain Features
Other frequency-domain
Other frequency-domain features
features such
such asas frequency
frequency centre
centre (FC),
(FC), root mean square
root mean square frequency
frequency
(RMSF) and
(RMSF) and root
root variance
variance frequency
frequency (RVF)
(RVF) have
have been
been studied
studied in
in [13].
[13]. When
When fault
fault exists,
exists, the
the frequency
frequency
element changes, and the values of the FC, RMSF and RVF also change. The FC
element changes, and the values of the FC, RMSF and RVF also change. The FC and RMSF indicateand RMSF indicate
the
the position changes of main frequencies, while the RVF shows the convergence
position changes of main frequencies, while the RVF shows the convergence of the power spectrum. of the power
spectrum.feature
Another Another feature
called mediancalled medianisfrequency
frequency sometimesisused.
sometimes used. The
The median median
frequency frequencyby
is measures is
measures by dividing the area of the amplitude spectrum into two equal part. In addition,
dividing the area of the amplitude spectrum into two equal part. In addition, the mean frequency (FC) the mean
frequency
can (FC) as
be defined canthe
befirst-order
defined asmoment
the first-order
of the moment of the Fourier
Fourier spectrum, RVF spectrum, RVF as themoment,
as the second-order second-
order moment, SS as the third-order moment and SK as the fourth-order
SS as the third-order moment and SK as the fourth-order moment of the Fourier spectrum moment of the Fourier
[18].
spectrum [18]. These features can be calculated
These features can be calculated as follows: as follows:


N '
∑ N xx0 xxi
i =2 i i
FC== i=2N i 2
FC (15)
(15)
2π 
N
2π ∑i=1 xxi 2
i
i =1


N
i =2
( xi' )2
MSF = (16)
4π 2  i =1 xi2
N
Machines 2017, 5, 21 13 of 28

2
∑iN=2 ( xi0 )
MSF = (16)
Machines 2017, 5, 21 4π 2 ∑iN=1 xi2 13 of 28
√
RMSF=
RMSF = MSFMSF (17) (17)
p
2
RVF
RVF MSF −−
= = MSF FCFC 2
(18) (18)

The result of FC,ofRMSF

The result andand
FC, RMSF RVF feature
RVF featureextraction
extraction isispresented
presented in Figure
in Figure 11. 11.

1000

500
FC

0
0 20 40 60 80 100 120 140
1500

1000
RMSF

500

0
0 20 40 60 80 100 120 140
1000

800
RVF

600

400
0 20 40 60 80 100 120 140
t, days

FigureFigure Frequency-domain
11. 11. features:
Frequency-domain features: FC, and
FC, RMSF RMSFRVF and RVFfrom
calculated calculated from
the vibration slewthe vibration
bearing signal. slew
bearing signal.
3.2.2. Spectral Skewness, Spectral Kurtosis, Spectral Entropy and Shannon Entropy Feature
Recently,
3.2.2. Spectral new methods
Skewness, Spectral such as spectral
Kurtosis, skewness,
Spectral Entropyspectral kurtosis, spectral
and Shannon Entropyentropy and
Feature
Shannon entropy have been developed. Spectral skewness (SS) and spectral kurtosis (SK) are the
Recently,
advancednew methods
statistical such as
measures spectral
applied skewness,
to the magnitudespectral
spectrum. kurtosis, spectral the
The SS measures entropy andofShannon
symmetry
the distribution of the spectral magnitude values around its mean [31]. It
entropy have been developed. Spectral skewness (SS) and spectral kurtosis (SK) are the advanced is defined by
statistical measures applied to the magnitude Β /2 −1 spectrum. The SS measures the symmetry of the
L

2  (| X (k , n) | −μ | X | )3
distribution of the spectral magnitude SS( values around its mean [31]. It is defined by
n) = k =0 (19)
ΒL ⋅ σ 3|X |
B L /2−1 3
The SK measures the distribution of2the∑spectral k, n)−µ| Xvalues
(X (magnitude |) and compare to a Gaussian

distribution. It is defined by SS(n) = k =0
(19)
ΒL / 2 −1 B L · σ|3X |
2 
k =0
(| X (k , n) | −μ | X | ) 4

(20)
SK(n) = −3
The SK measures the distribution of the spectral
ΒL ⋅ σ 4|Xmagnitude
|
values and compare to a Gaussian
distribution. It is defined by
Antoni and Randall [32] presents the development of classical kurtosis analysis called spectral
kurtosis (SK) to deal with the transient behaviour
B L /2−1 in a signal. SK can4 be used to detect incipient faults
even in the presence of resilient masking2 ∑ noise.(X (k, n)−µ|ofX |spectral
)
The concept statistic is adopted in this
method to supplement the SK classical power k = 0
spectral density (PSD). PSD ideally gives zeros values at
(n) = 4
−3 (20)
L · σ| X | noise, and it gives high positive values at
those frequencies when the signal has stationary BGaussian
those frequencies during the occurrence of the transients. When the noise-to-signal ratio is high, the
transient
Antoni andsignal will [32]
Randall be buried in thethe
presents background
development noise, of thus the incipient
classical faultanalysis
kurtosis cannot becalled
clearlyspectral
detected. SK can overcome this problem by analyzing the entire frequency band
kurtosis (SK) to deal with the transient behaviour in a signal. SK can be used to detect incipient and to select the faults
sensitive frequency band that correspond to the bearing condition.
even in the presence of resilient masking noise. The concept of spectral statistic is adopted in this
method3.3.
to Category
supplement the classical
3: Time-Frequency power spectral density (PSD). PSD ideally gives zeros values at
Representation
those frequencies when the signal has stationary Gaussian noise, and it gives high positive values
Time-frequency domain representation methods such as short-time Fourier transform (STFT),
at thosewavelet
frequencies during
transform the occurrence
and Wigner-Ville of the (WVD)
distribution transients. When the
are commonly noise-to-signal
used ratio
for non-stationary or is high,
the transient signal will be buried in the background noise, thus the incipient fault cannot be clearly
detected. SK can overcome this problem by analyzing the entire frequency band and to select the
sensitive frequency band that correspond to the bearing condition.
Machines 2017, 5, 21 14 of 28

3.3. Category 3: Time-Frequency Representation

Time-frequency domain representation methods such as short-time Fourier transform (STFT),
wavelet transform and Wigner-Ville distribution (WVD) are commonly used for non-stationary or
transient signal. These methods implement a mapping of one-dimensional time-domain signals to a
two-dimensional function of time and frequency. The objective is to provide a true time-frequency
Machines 2017, 5, 21 14 of 28
representation of a signal. A review of time-frequency analysis methods for machinery fault diagnosis
has beentransient
presentedsignal.byThese
Fengmethods
et al. [33].
implement a mapping of one-dimensional time-domain signals to a
two-dimensional function of time and frequency. The objective is to provide a true time-frequency
3.3.1. Short-Time Fourier
representation of a Transform (STFT)of time-frequency analysis methods for machinery fault
signal. A review
diagnosis has been presented by Feng et al. [33].
Due to its low computational complexity and definite physical meaning, STFT is often used as an
initial pre-processing
3.3.1. Short-Timetool to analyze
Fourier Transform non-stationary
(STFT) vibration signals. The STFT maps the vibration
signal into a two-dimensional (2D) function of time and frequency [34]. STFT divides a non-stationary
Due to its low computational complexity and definite physical meaning, STFT is often used as
signal into small windows of equal time frame. The Fourier transformation is then applied to the time
an initial pre-processing tool to analyze non-stationary vibration signals. The STFT maps the
segment. Unlikesignal
vibration FFT where the frequency(2D)
into a two-dimensional transformation
function of timeincludes the whole
and frequency original
[34]. STFT signal,
divides a non-in STFT
original stationary
signal is signal
decomposed
into smallusing FFTofatequal
windows pre-defined time
time frame. Theintervals. By doing this,
Fourier transformation theapplied
is then defect signal
to the time segment. Unlike FFT where the frequency transformation includes the
frequency can be identified in a particular window of time. The key to this method is the selection ofwhole original
the timesignal, in STFT
interval. Theoriginal
STFT cansignal
beisexpressed
decomposed asusing FFT [35]:
follows at pre-defined time intervals. By doing this,
the defect signal frequency can be identified in a particular window of time. The key to this method
is the selection of the time interval. The STFT
t+ can be expressed as follows [35]:
ZT/2
STFT( f , t) = t +T / 2
w(t − τ ) x (τ )e j2π f τ dτ (21)

j 2π f τ
STFT ( f , t ) = w (t − τ ) x(τ ) e dτ (21)
t− T/2 t −T / 2

where STFT ( f , t ) is short-term frequency spectrum, t is the time, f is frequency, w (t − τ ) is the

where STFT( f , t) is short-term frequency spectrum, t is the time, f is frequency, w(t − τ ) is the shifted
shifted window, x(τ ) denotes the input vibration signal, exp ( j 2π f τ ) is the complex exponential,
window, x (τ ) denotes the input vibration signal, exp( j2π f τ ) is the complex exponential, T is the
T is the window interval and w (τ ) is the windowing function which satisfies w (τ ) = 0 for
window interval and w(τ ) is the windowing function which satisfies w(τ ) = 0 for |τ | => T/2.
τ => T / 2 .
By taking the square magnitude of the STFT in (21), an energy density using the spectrogram
S(t, f ) method can bethe
By taking square magnitude of the STFT in (21), an energy density using the spectrogram
obtained
S(t , f ) method can be obtained
S(t, f ) = |STFT(t, f )|2 (22)
2
S(t , f ) = STFT(t, f ) (22)
Equation (22) represents the energy density of the vibration signal x (τ ) windowed by w(τ ).
The visualization
Equation (22)ofrepresents
windowthe length
energyand hopofsize
density the is presented
vibration signal (τ ) windowed
in xFigure w(τ ) .
12. Thebyvibration signal
with the sample index i is split
The visualization into overlapping
of window length and hopblocks
size isofpresented
length Bin L and
Figure block index
12. The n. While,
vibration signalthe hop
with
size Hs is thethe sample from
distance index thei isstart
split of
into
oneoverlapping
particularblocks
blockofand length Β L and blockblock.
its subsequent n . While,
index The STFT result
the hop size
of one day slew bearing Η s is the distance from the start of one particular block and its
signal measurement is shown in Figure 13 for 10 s vibration signal subsequent block. The with
STFT result of one day slew bearing signal measurement is shown in Figure
window length of 2048, hop size of 256 and number of FFT points of 16,384. It can be seen that during 13 for 10 s vibration
signal with window length of 2048, hop size of 256 and number of FFT points of 16,384. It can be seen
10 s sampled signal, the amplitudes at lower frequencies were higher than the amplitude at high
that during 10 s sampled signal, the amplitudes at lower frequencies were higher than the amplitude
frequencies (approximately at 1300 Hz and 1750 Hz). This indicates that slew bearing frequencies were
at high frequencies (approximately at 1300 Hz and 1750 Hz). This indicates that slew bearing
dominated by lowwere
frequencies frequency
dominatedcontents.
by low frequency contents.

Figure
Figure 12. 12. Schematic
Schematic visualization
visualization of of
hophopsize
size and
and overlapping
overlapping block length
block in STFT
length algorithm.
in STFT algorithm.
Machines 2017, 5, 21 15 of 28
Machines 2017, 5, 21 15 of 28

Figure 13. STFT of the vibration slew bearing signal (data on 3 May).

3.3.2. Wavelet
3.3.2. Transform and
Wavelet Transform and Wavelet
WaveletDecomposition
Decomposition
Wavelet transform
Wavelet transform is is well-known
well-known as as an
an appropriate
appropriate method
method to to analyze
analyze non-stationary
non-stationary and and
transient signal
transient signal [14,36].
[14,36]. Wavelet
Wavelet transformation
transformation employs
employs wavelet
wavelet function
function instead
instead of
of sinusoidal
sinusoidal
function as the basis function. It enhances a scale variable in addition
function as the basis function. It enhances a scale variable in addition to the time variable to the time variable in thein
inner
the
2 2
product
inner transformation
product transformation [33].[33].
For For
anyanylimited energy
limited energy signal xx((t )t)∈∈
signal LL(R() R, )the
, thewavelet
waveletfunction
function is
defined as [26]
1 +∞ τ−t
Z
WTx (t, a) = √ 1 +∞ x (τ )ψ τ − t dτ (23)
 a dτ
WTx (t , a) = a −∞ x(τ )ψ   (23)
a 
where wavelet ψ[(τ − t)/a] is derived by dilating and translating
a −∞
the wavelet basis ψ(t), a is
√
the
wherescale
wavelet ψ [(τ −( at ) />a] 0)is, derived
parameter t is the by
time shift and
dilating and 1/ a is a normalization
translating the wavelet basis ψ (to
factor a is the
t ) , maintain
energy preservation.
scale parameter (a > 0) , t is the time shift and 1/ a is a normalization factor to maintain energy
A review of the wavelet transform applications in machine condition monitoring and
preservation.
fault diagnostics is presented by Peng et al. [37]. As a non-stationary signal analysis method,
A review of the wavelet transform applications in machine condition monitoring and fault
wavelet transform can be applied in machine fault diagnostics as a feature extraction method [37].
diagnostics is presented by Peng et al. [37]. As a non-stationary signal analysis method, wavelet
Since the wavelet transformations can provide an excellent energy concentration properties due to
transform can be applied in machine fault diagnostics as a feature extraction method [37]. Since the
the uses of the wavelet basis function; the wavelet transform can represent the signal with a certain
wavelet transformations can provide an excellent energy concentration properties due to the uses of
number of coefficients. The process is usually called wavelet decomposition.
the wavelet basis function; the wavelet transform can represent the signal with a certain number of
Wavelet decomposition technique decomposes a non-stationary vibration signal into linear form
coefficients. The process is usually called wavelet decomposition.
of time-scale units. It decomposes and organizes original signal into several signal components
Wavelet decomposition technique decomposes a non-stationary vibration signal into linear form
according to the translation of the wavelet function. This changes the scale and shows the transition
of time-scale units. It decomposes and organizes original signal into several signal components
of each frequency component [38]. Through this decomposition, a selected coefficient can be used
according to the translation of the wavelet function. This changes the scale and shows the transition
directly as the fault features. In wavelet decomposition, the input signal is basically decomposed
of each frequency component [38]. Through this decomposition, a selected coefficient can be used
into two coefficients. The input signal that is passed through the low-pass filter becomes the
directly as the fault features. In wavelet decomposition, the input signal is basically decomposed into
‘approximation coefficient’. The input signal which is passed through the high-pass filter becomes the
two coefficients. The input signal that is passed through the low-pass filter becomes the
‘detail coefficient.’ Since the low-frequency component is the most sensitive part to extract the useful
‘approximation coefficient’. The input signal which is passed through the high-pass filter becomes
information related to the bearing condition, the ‘approximate coefficient’ is passed to the next wavelet
the ‘detail coefficient.’ Since the low-frequency component is the most sensitive part to extract the
decomposition loop in multilevel wavelet decomposition process. Figure 14 shows three level wavelet
useful information related to the bearing condition, the ‘approximate coefficient’ is passed to the next
decomposition structures. The structure illustrates the decomposition of slew bearing vibration signal
wavelet decomposition loop in multilevel wavelet decomposition process. Figure 14 shows three
into approximate coefficient (A) and detailed coefficient (D) at each level. For further processing,
level wavelet decomposition structures. The structure illustrates the decomposition of slew bearing
features such as mean, variance, skewness and kurtosis are calculated from the detailed coefficients of
vibration signal into approximate coefficient (A) and detailed coefficient (D) at each level. For further
level 3 (D3) [39]. The features extraction results are presented in Figure 15, it can be seen that kurtosis
processing, features such as mean, variance, skewness and kurtosis are calculated from the detailed
coefficients of level 3 (D3) [39]. The features extraction results are presented in Figure 15, it can be
seen that kurtosis can predict the damage compared to mean, variance and skewness calculated from
Machines 2017, 5, 21 16 of 28
Machines 2017, 5, 21 16 of 28

can predict
signal the
D3 of thedamage
waveletcompared to mean,
decomposition duevariance and skewness
to the significant calculated
changes from
after day 90 ofsignal D3 of the
the bearing life
wavelet
time. decomposition due to the significant changes after day 90 of the bearing life time.
The
Theapplication
applicationofofwavelet
waveletdecomposition
decompositiontotothe thetransient
transientstator
statorcurrent
currentsignal
signalacquired
acquiredfrom
from
induction
inductionmotor
motorisispresented
presentedby byNiu
Niuetetal.
al.[40].
[40].Statistical
Statisticalfeatures
featureshave
havealso
alsobeen
beenused
usedtotoreveal
revealthe
the
fault information from the selected coefficients. Another study that uses the wavelet coefficients
fault information from the selected coefficients. Another study that uses the wavelet coefficients as as the
fault features
the fault for ball
features forbearing is presented
ball bearing by Liu
is presented byetLiu
al. et
[41].
al. [41].
Amplitude (mV)

200

100

-100