Fastemd-Cca Algorithm For Unsupervised and Fast Removal of Eyeblink Artifacts From Electroencephalogram
Fastemd-Cca Algorithm For Unsupervised and Fast Removal of Eyeblink Artifacts From Electroencephalogram
Perak, Malaysia
d Department of Neuroscience, Universiti Sains Malaysia, Kelantan, Malaysia
e Laboratoire Electronique, Informatique et Image (Le2i), ERL VIBOT CNRS 6000,
Universite de Bourgogne,France
Abstract
Online detection and removal of eye blink (EB) artifacts from electroencephalo-
gram (EEG) would be very useful in medical diagnosis and Brain-Computer
Interface (BCI). In this work, approaches that combine unsupervised eyeblink
artifact detection with Empirical Mode Decomposition (EMD), and Canoni-
cal Correlation Analysis (CCA), is proposed to automatically identify eyeblink
artifacts and remove them in an online manner. First eyeblink artifact re-
gions are automatically identified and an eyeblink artifact template is extracted
via EMD, which incorporates an alternate interpolation technique, the Akima
spline interpolation. The removal of eyeblink artifact components relies on the
elimination of EEG canonical components obtained through CCA, based on
cross-correlation with the extracted eyeblink artifact template. The proposed
algorithm is evaluated and analysed with respect to its ability in removing eye-
blink artifacts and retaining neural information of the EEG signals. Analysis
proved that the proposed algorithm, FastEMD-CCA, is effective in eyeblink ar-
Preprint submitted to Biomedical Signal Processing and Control October 13, 2019
© 2019 published by Elsevier. This manuscript is made available under the CC BY NC user license
https://creativecommons.org/licenses/by-nc/4.0/
tifact removal with an average accuracy, sensitivity, specificity and error rate of
97.9%, 97.65%, 99.22% and 2.1% respectively. The algorithm is able to clean
and remove eyeblink artifacts from a 14-channel EEG of length 1 second, at an
average time of 63 milliseconds. This makes it a feasible solution for applications
requiring online removal of eyeblink artifacts.
Keywords: Electroencephalogram (EEG), Enhanced Empirical Mode
Decomposition (FastEMD), Canonical Correlation Analysis (CCA), Eyeblink
Artifact.
1. Introduction
2
plitudes of around 10 times greater than the actual brain signals, noticeable in
the delta wave range and can last up to 200ms to 400ms [2, 3]. The eyeblink
potential propagates and spreads out to all EEG electrodes but in various con-
duction volume - higher conduction near the frontal and parietal regions while
the conduction in the occipital region is very low. The frontal region is the most
prone region to contamination from eyeblink artifacts as it is closest to the eyes.
Fig. 1 shows the positions of EEG electrodes following the 10-20 system. The
Fp1 and Fp2 electrode positions, which are closest to the eyes and highlighted
in Fig. 1, can be used to capture the eyeblink artifacts.
For reliable analysis of EEG signals, it is therefore essential that these arti-
facts be removed. Traditionally artifact removal is done after the EEG signal
has been recorded, either manually or automatically. However, in clinical moni-
toring such as continuous epilepsy monitoring and the brain-computer interface
(BCI), where EEG signals are analyzed and manipulated as they are being
recorded, an online artifact removal solution is required [4]. Various techniques
are available for de-noising purposes, which will be discussed below. The most
common method is to have eyeblink artifact regions identified through man-
ual inspection and these segments are removed. This method can cause a loss
3
of information as the EEG segments being removed may contain useful neu-
rological information. Regression-based methods [5–7] perform a regression or
correlation test between the signal to be processed and a reference signal. For
example, electrooculogram (EOG) signal can be used as the reference signal to
be compared with the EEG signal. The segment of the EEG signal that highly
correlates with the EOG is then assumed to be related to the eyeblink artifacts
and thus removed. However, since EOG also contains some EEG potentials due
to the close proximity of EOG electrodes to the frontal region of the brain, arti-
fact removal via regression methods may also remove important EEG data. In
addition, a reference electrode is obligatory in regression-based methods, which
may cause discomfort to patients when there is an extra pair of electrodes placed
around the eyes especially for longer EEG recordings.
In [8], Principal Component Analysis (PCA) is used to isolate out com-
ponents of the highest variance between EOG and EEG signals. The highest
variance components are the principal components and are classified as eyeblink
components. Similar to regression-based methods, PCA also requires additional
EOG electrodes to be applied. Besides that, it is not able to completely separate
some artifacts from the raw EEG signal in the event that both the eyeblink and
EEG signals have comparable amplitudes [9]. On the other hand, Independent
Component Analysis (ICA) [10, 11] is proven to be able to remove eyeblink
artifacts, as well as artifacts from different sources, but it may not be suit-
able for online applications as visual inspection on the independent components
(ICs) is required to manually identify and select ICs corresponding to artifacts
[12]. To overcome this, some work has been done to automate artifact detection
and removal by combining ICA with other methods like Wavelet or Empirical
Mode Decomposition. However, in all these cases the computational complexity
stands out as a limiting factor for ICA to be used in online applications [13].
The Wavelet transforms on the other hand depends on choosing a suitable de-
composition mother wavelet. The mother wavelet is a function comprising sine
and cosine waves, thus most of the time it will not characterize or adapt to
non-linear EEG signals, producing decomposition errors [12].
4
Canonical Correlation Analysis (CCA) [14], has been used in muscle and
eyeblink artifact removal and has been proven to be the fastest among other
de-noising techniques discussed above [15]. However, if CCA is to be used alone
to remove artifacts, it still requires an additional reference signal to identify
the artifact events. Instead of using a reference signal, Empirical Mode Decom-
position (EMD) can be used to extract the eyeblink artifact signal from the
EEG signal [15–18]. EMD [19] is an algorithm that decomposes a signal with-
out requiring any pre-knowledge or pattern of interest, unlike other de-noising
techniques. In a comparative study on extracting out a biomedical signal [20],
EMD is proven to be more accurate compared to the wavelet transform. On
top of that, a method combining EMD and CCA (EMD-CCA) in [15], is shown
to outperform CCA, FastICA and EMD-FastICA in terms of artifact removal
accuracy, when evaluated on an EEG signal added with a synthetically gener-
ated eyeblink artifact. Despite the fact that it can accurately remove artifacts
from the EEG signal compared to other techniques, the algorithm is relatively
slow due to its iterative nature.
Most of the techniques on eyeblink artifact removal discussed above are used
only for offline artifact removal. Since applications such as BCI and epilepsy
monitoring require online signal processing, artifact removal methods and algo-
rithms should be capable of online processing. Hence, to cater to online artifact
removal, the methods or algorithm should satisfy a few criteria. The most im-
portant requirement is that the algorithm should be fully automatic without
any expert’s intervention. Secondly, online applications should avoid utilizing
additional electrodes around the artifact originating regions, such as EOG, as it
may cause discomfort and inconvenience to the subject during long-term EEG
recordings. Finally, online implementation requires the artifact removal algo-
rithm to have minimal computational complexity so that the algorithm doesn’t
introduce an unacceptable time delay.
Researchers have studied hybrid techniques to detect and remove eyeblink
artifact from EEG signal which may be useful for online applications [21–23].
Some of these techniques are discussed here. Lawhern et al. in [24] used the
5
Auto-regressive (AR) model for artifact feature selection followed by a Support
Vector Machine (SVM) classifier for training purposes to detect the artifacts.
Nguyen et al. [25], have reported their work on ocular artifact removal by
combining Wavelets and Artificial Neural Network (ANN), and naming their
technique Wavelet Neural Network (WNN). This technique requires an EOG
reference channel to train the ANN classifier. Zhao et al. [13] used Discrete
Wavelet Transformation (DWT) and an Adaptive Predictor Filter (APF) to
remove ocular artifacts from EEG signals. Daly et al. [26] have developed
a software plugin GUI, called the Fully Online and Automated Artifact Re-
moval for Brain-Computer Interfacing (FORCe). This plugin works based on
the combination of Wavelet Decomposition, Independent Component Analy-
sis and thresholding. FORCe runs in MATLAB and it is stated that it can
be used for online BCI applications, making it the only software plugin that
is able to perform significantly faster. Most recently, Tonachini et al. in [27]
has developed an online automatic artifact rejection using artifact subspace re-
construction (ASR), online recursive independent component analysis (ORICA)
and an IC classifier. However, the author has stated that ASR had negligible
effect on eyeblink artifact removal, and the time it took for ORICA to converge
well enough on the blink-related IC for the artifact to be removed is 26 seconds,
which is a significant amount of time.
To the extent of the authors’ knowledge, every online artifact removal tech-
nique discussed above depends on either a dedicated artifact reference recording
or some kind of training data that records artifacts separately for training pur-
poses, which may add some time delay to the techniques in online applications.
This work first focuses on introducing a novel unsupervised eyeblink artifact
detection algorithm which identifies eyeblink artifact regions effectively, assist-
ing subsequent artifact removal process. Secondly, the performance of EMD is
improved with various enhancements to resolve the processing time inefficiency
of the algorithm. Next, the enhanced version of EMD is applied on the most
relevant eyeblink artifact region identified through the unsupervised artifact de-
tection algorithm to extract out a suitable eyeblink artifact template. Finally,
6
our work makes use of the artifact template extracted as a reference in iden-
tifying subsequent eyeblink artifacts instead of relying on an EOG recording.
With the help of the artifact template, the identified eyeblink artifact regions
are subjected to CCA for eyeblink artifact removal in online applications. The
direction of the work is to provide an application-centric solution for online ap-
plications with reasonable/reduced complexity and enhanced performance. The
developed algorithm neither depends on a separate EOG recording or an ex-
pert’s advice for eyeblink events identification, thus removing any constraints in
terms of automation for online implementation. Additionally, no training data
is required beforehand for the algorithm to learn and identify eyeblink artifacts.
The developed algorithm is compared with one of the state-of-the-art methods,
i.e. FORCe, due to its effectiveness in removing eyeblink artifacts and its low
computation time. The next section elaborates the proposed algorithm and the
materials used in this work, while results and discussions are presented in Sec-
tion 3. Finally, we conclude the paper with some recommendations in Section
4.
Since the frontal region of the brain is the nearest region to the position of
eyes, eyeblink artifacts can be easily captured in this region, so the Fp1 and
Fp2 electrodes should hypothetically exhibit high correlation whenever there is
an occurrence of an eyeblink. To validate this theory, the correlation coefficient
is computed between Fp1 and Fp2 in windows of 500 samples (1.95 seconds).
As eyeblink artifacts can last up to 800ms [2, 3], this window size will allow
at least one eyeblink artifact to fall within the window. The test has revealed
that segments of Fp1 and Fp2 without eyeblink artifact produce correlation
below than 0.7, whereas segments containing eyeblink artifact results in higher
correlation, usually more than 0.9 as illustrated in [28] and shown in Fig. 2.
7
Correlation Coefficient (CC) between Fp1 & Fp2
100
Fp1
Fp2
50
-100
-150
-200
0 1000 2000 3000 4000 5000 6000 7000 8000 9000 10000
Samples
where, X[t] is the EEG signal’s amplitude at time t, and for any given window
starting at sample point n, X[t] is evaluated from t = n to t = n + 500, and µ
8
is the mean of that particular window.
An experiment conducted by the authors in [28] has shown that the thresh-
old for eyeblink artifact’s onset point and eyeblink potentials dominating the
EEG window in question can be correctly determined by taking two standard
deviation, 2σ width from the mean of the displacement distribution acquired,
as in Eq. (2). Any absolute value beyond 2σ is classified as an eyeblink artifact
potential and the first sample that exceeds this threshold is considered as the
eyeblink artifact’s starting point.
Later, the onset of eyeblink artifact is moved 100 samples (0.39 seconds)
ahead. The reason for setting the onset point in advanced of 100 samples before
the threshold is to provide a buffer for any subsequent analysis. The end point
of the eyeblink artifact is then set to 256 samples, or 1 second, after the first
sample with an amplitude displacement crossing the threshold. The eyeblink
artifact region is therefore taken to be from the onset of eyeblink till the end
point of the eyeblink. Thus, an eyeblink which can last up to 0.8 seconds (205
samples) in duration completely fit into this window (100+256=356 samples).
Several eyeblink artifact regions are searched and saved in a similar way un-
til any two eyeblink artifact regions exhibit correlation coefficient of more that
0.9 between them. The correlation coefficient value of more than 0.9 is chosen
assuming that a high correlation between the eyeblink artifact regions denotes
repetitiveness or similarity in the blinking pattern of an individual. Hence these
regions with high similarity or correlation will be subjected for further anal-
ysis, which is the EMD algorithm in this research work. Fig. 3 summarizes
the algorithm in a flowchart. Fig. 4 shows the plot of a real EEG signal’s eye-
blink artifact regions identified through the proposed eyeblink artifact detection
algorithm located on the Fp1 channel.
9
Start
NO
CC > 0.85
YES
NO
CC > 0.8 or 0.9
YES
End
10
EB Artifact Regions EEG signal
60
EB Artifact Region
40
20
Amplitude(microvolts)
0
-20
-40
-60
-80
0 1000 2000 3000 4000 5000 6000
Samples
called the Intrinsic Mode Functions (IMF). Each IMF extracted out from the
original signal is a lower oscillating trend compared to its predecessor. Adding
up all IMFs and the remaining residual signal obtained from the decomposition
would reconstruct the original signal. Each IMF should satisfy the following
criteria as in [19]:
n−1
X
X(t) = xi (t) + Rn (t) (3)
i=1
Each sifting loop produces the i-th IMF of the algorithm, xi (t). The recursive
sifting discontinues after the algorithm extracts out n − 1 IMFs, the instance
11
where the residual signal, Rn (t) becomes a monotonic trend. The algorithm is
relatively slow because it reiterates itself until the final residual signal becomes
a monotonic function.
12
at every extremum point to ensure continuity and spline curvature smoothness.
Since envelope construction through CSI force two adjacent splines to be con-
tinuous at first and second derivatives, the formed envelopes are susceptible
to overshoots and undershoots. This produces an erroneous mean estimation
during sifting and this error could eventually get transferred and added to the
whole data set on every iteration of EMD’s sifting process, resulting in an in-
accurate and unreliable decomposition. While the envelope construction of ASI
depends only on the slopes of adjacent segments with continuity up to first
order derivative. Although ASI produces envelopes that are not as smooth as
the CSI does, but it demonstrates a better decomposition accuracy. This also
reduces the necessity to solve large system equations which in turn, reduces the
computation time.
13
which is sufficient to segregate out the EEG trend and the eyeblink trend.
2
X
XEEG (t) = xi (t) (4)
i=1
5
X
Xeyeblink (t) = xi (t) + R6 (t) (5)
i=3
This automatically reduces the computation time and the algorithm does not
have to repeat itself until a monotonic residue is acquired. The eyeblink artifact
template obtained by adding up the 3rd IMF onwards with the residual signal
is shown in Fig. 5.
EB Artifact Template
60
40
Amplitude(microvolts)
20
-20
-40
-60
0 50 100 150 200 250 300 350 400
Samples
14
heavy. As an option, EMD can be applied repetitively on short segments, when-
ever an eyeblink artifact event is captured, provided the occurrence of eyeblinks
are known. Unfortunately, EMD gets computationally inefficient and slow on
repetitive application to a huge dataset especially during online recording and
analysis, which may even disrupt the recording task.
To resolve this, two eyeblink regions identified in section 2.1 with cross-
correlation of more than 0.9, indicated with boxes in Fig. 6 are subjected to
EMD separately. EMD is applied only on two most correlating eyeblink artifact
regions, thus keeping the number of EMD applications lowest as possible. These
two eyeblink regions are chosen as these regions are repetitive in terms of the
blinking pattern, which can be assumed as a general eye blinking pattern for that
particular EEG signal. This prevents EMD to be used repetitively, especially
when the EEG signal is processed in an online manner. This method is differ-
ent compared to what is being practised in classical artifact removal technique
through EMD, where EMD will be applied to remove the artifacts whenever an
artifact event is identified. The low oscillating IMFs obtained through EMD are
then added, as in Eq. (5), and averaged out to get an eyeblink artifact template.
40
20
Amplitude(microvolts)
-20
-40
-60
-80
0 1000 2000 3000 4000 5000 6000
Samples
15
Stoping Criterion for EMD
The work also adopts a stopping criterion for EMD based on the standard
deviation which was introduced in [19]. The standard deviation (SD) is defined
as the normalized squared difference between two sifting iterations, which is
assumed to indicate consistency between two sifting outputs. The SD value
calculated from two consecutive sifting outputs, yj (t) and yj−1 (t) should be less
than a pre-determined value, normally 0.2 or 0.3 to stop the sifting iteration in
EMD.
k
" #
X |yj−1 (t) − yj (t)|2
SD = 2 (t) ≤ 0.2 (6)
t=0
yj−1
where k is the number of samples in the original signal, X(t).
An updated flowchart of EMD with enhancements discussed above is shown
in Fig. 7.
16
START
i=1:5, j=1
Original signal = X(t) = R0(t)
Input signal = z(t)
Start: z(t) = X(t) = y0(t)
Is SD < 0.2?
NO
YES
NO
Is i > = 5?
YES
END
Hence, the clean EEG sources of Sx (t) and Sy (t) can be estimated by taking
17
the weighted de-mixing matrix, A onto the observed EEG signals:
A = W −1 (8)
The source signals, which are considered as the canonical variates u(t) and
v(t) are obtained through linear combinations between the de-mixing matrices
and mean removed observed EEG variables, x̂(t) and ŷ(t), where n is the number
of EEG sample in one channel and p is the number of channels of the EEG
recording:
U = AT X̂
(12)
V = B T Ŷ
The purpose of CCA is finding the de-mixing matrices A and B such that the
correlation, ρ between U and V , is maximized, or as large as possible. For exam-
ple, the de-mixing matrices a1 = [a11 , a12 , ..., a1p ]T and b1 = [b11 , b12 , ..., b1p ]T
are computed such that the coefficient of canonical correlation between the first
pair of canonical variates u1 and v1 is maximized:
ρ1 = corr(u1 , v1 ) (13)
where,
u1 = a1 T x̂
(14)
v1 = b1 T ŷ
18
The second and following pairs of canonical variates are computed in a sim-
ilar way, provided that the second pair of canonical variates are uncorrelated
with the first pair and other pairs of canonical variates. This procedure is re-
peated until enough canonical variate pairs are obtained. CCA was initially
proposed in [30] by Hotelling. In EEG’s artifact removal, CCA was employed
in several works to remove muscle and ocular artifacts. CCA is implemented by
De Clercq et al. [14] to remove muscle artifacts from the EEG signal, followed
by Hallez et al. in [31] with CCA and the blind source separation approach.
Later, Zhao et al. [32] used the Wavelet in combination with CCA to remove
ocular artifacts from EEG. Sweeney et al. [18] then use the Ensemble EMD with
CCA to remove artifact from the EEG signal. On the other hand, M.Soomro
et al. [15] has used the CCA to the entire signal with conventional EMD for
removal of eyeblink artifacts in a short length of EEG signal. In this work, CCA
is applied in windows to obtain canonical components and used with the com-
bination of enhanced EMD for eyeblink artifact removal from real EEG signals
of long durations.
where N represents the length of the window, X(t) represents the contaminated
EEG signal, and XEB (t) represents the eyeblink artifact template extracted
from FastEMD.
Windows that exhibit high similarity scores with the eyeblink artifact tem-
plate are subjected to CCA. CCA estimates the canonical components that
maximize temporal correlation within the specified window. The most perti-
nent artifactual canonical components, U , usually the first row of the canonical
components are forced to become zero in order for it to behave non-artifactual.
19
The artifact-free canonical components are termed as Uclean . Then, clean EEG
segment is reconstructed by taking the inverse of the de-mixing matrix, Ax into
the non-artifactual source, Uclean :
IMF 1
EEG Trend
IMF 2
Enhanced IMF 3
EB artifact
Raw EEG Empirical
region
Signal Mode
identification
Decomposition
IMF 4
EB artifact
template
IMF 5
RESIDUAL
SiGNAL
Cross
Raw EEG NO
Correlation > 0.5
Segments
?
YES
Remove EB Canonical
artifact canonical Correlation
components Analysis (CCA)
CLEAN EEG
SIGNAL
EEG signal Unsupervised EB
acquisition artifact region detection EB artifact template extraction EB Artifact removal in windows / segments
20
On the other hand, a synthetic EEG signal can be generated through pink
noise, Y (t) for a duration of 10 seconds, 2560 samples at a sampling frequency of
256 Hz. EEG and eyeblink artifact models simulated through pink noise and ex-
ponential function are shown in Fig. 9(a) and 9(b) respectively. Both synthetic
EEG signal and eyeblink artifact are mixed to acquire a set of synthetically
contaminated EEG signal, X(t) as in Fig. 9(c).
-20
0 1 2 3 4 5 6 7 8 9 10
(b) Simulated EB Artifact
20
Amplitude(uV)
-20
0 1 2 3 4 5 6 7 8 9 10
(c) Contaminated EEG Signal
20
-20
0 1 2 3 4 5 6 7 8 9 10
Time(s)
Fig. 9: (a) Synthetic EEG Signal, (b) Synthetic Eyeblink Artifact, (c) Contaminated EEG
Signal
21
EEG signals are recorded following the 10-20 international standardization with
free electrodes placed on the scalp. The EEG signals were collected from 10 par-
ticipants with 6 recordings from each participant, resulting in 60 EEG data sets.
The participants are aged between 30 and 55 years. All recorded signals are of
different durations, which were recorded at a sampling rate of 256 Hz.
22
2.5.1. Compared Approaches and Evaluation Criteria
The proposed algorithm, FastEMD-CCA is compared with two existing tech-
niques or algorithms. Evaluation on the approaches are performed in MATLAB
2018b on Windows 7 Professional(64-bit OS, 4GB RAM).
23
while CCeb estimates the resemblance of removed eyeblink artifacts compared
to synthetic eyeblink artifacts. RMSE measures the removal and reconstruction
error for eyeblink and EEG signals respectively. The RMSE is calculated by
finding the difference between synthetically generated eyeblink artifacts with
removed eyeblink artifacts, RM SEeb and synthetically generated EEG signals
with reconstructed signals, RM SEeeg after processing with the suggested tech-
niques. The SNR is used in this analysis to determine the ratio of signal to
artifact that remains after eyeblink artifact has been removed from the con-
taminated EEG signal. The SNR ratio is calculated before and after eyeblink
artifact removal, using Eq. (23) and (24).
cov(Y, Y1 )
CCeeg = (19)
std(Y ) ∗ std(Y1 )
cov(Z, Z1 )
CCeb = (20)
std(Z) ∗ std(Z1 )
r Pn
t=1 (Y (t) − Y1 (t))2
RMSEeeg = (21)
n
r Pn
t=1 (Z(t) − Z1 (t))2
RMSEeb = (22)
n
std(Y )
SNRbefore = 10 log (23)
std(Y − X)
std(Y )
SNRafter = 10 log (24)
std(Y − Y1 )
where X(t) represents the synthetically contaminated EEG signals, Y (t) refers
to the simulated/synthetic EEG signals generated using pink noise, Y1 (t) cor-
responds to the reconstructed EEG signals which are free from artifacts, Z(t)
refers to the synthetic eyeblink artifact and Z1 (t) corresponds to the extracted
24
eyeblink artifact. From the performance metrics, 95% of confidence interval
has been estimated so that the probability of the performance is repetitive over
95% of the time, if the evaluation to be repeated multiple times in another time
frame.
25
3. Results and Discussions
-20
0 1 2 3 4 5 6 7 8 9 10
(b) Reconstructed EEG via FastEMD-CCA
5
Amplitude(uV)
Fig. 10: (a) Mixed EEG and Eyeblink Signal, (b) Reconstructed EEG Signal, (c) Extracted
Eyeblink Artifact
3.1.1. Discussion
The algorithms are evaluated on 100 trials of synthetically contaminated
signals to ensure the performance exhibited by the algorithms are reliable and
repetitive. The confidence interval for 95% of confidence level is determined for
26
De-noising via SWT
(a) Contaminated EEG
20
-20
0 1 2 3 4 5 6 7 8 9 10
(b) Reconstructed EEG via SWT
Amplitude(uV) 5
Fig. 11: (a) Mixed EEG and Eyeblink Signal, (b) Reconstructed EEG Signal, (c) Extracted
Eyeblink Artifact
each of the performance metrics. The 95% confidence level is chosen so that the
estimation of results are statistically sound. CC value normally lies between -1
and 1, in which a value approaching 1 indicates a higher correlation or simi-
larity. RMSE value that approaches zero signifies a more precise and accurate
signal reconstruction, relative to the synthetic signals. The SNR measures the
scale of eyeblink artifacts that have been removed from the noisy EEG signal
and the degree of neural signal preservation. The effectiveness of the evaluated
algorithms in preserving the underlying neural information in an EEG signal
27
can be deduced through CC value that approaches near 1, RMSE close to 0 and
higher SNR value. In this analysis between Wavelet and the proposed technique,
FastEMD-CCA has produced higher CC values on average compared to SWT,
0.7478 in reconstructing the EEG signal and 0.9754 in extracting out the eye-
blink artifact. The error produced by FastEMD-CCA is 14% percent lower than
the error produced by SWT in reconstructing the EEG signal. While in extract-
ing out the eyeblink artifact, FastEMD-CCA has produced an error of 66% lower
than SWT. This indicates that the FastEMD-CCA algorithm is able to remove
eyeblink artifact components appropriately from the contaminated EEG signal
in comparison with SWT. From Table 1, FastEMD-CCA yields very high SNR,
close to 4 dB on average from -10dB before artifact correction, which denotes
a higher ratio of neural information has been preserved. Alternatively, SWT
produced nearly 2dB of SNR on average from -10dB before artifact elimination.
This shows that the FastEMD-CCA is a better choice in removing eyeblink ar-
tifacts, and at the same time, it is able to preserve underlying EEG components
better, by not introducing much distortion to the neural signal. In terms of
computation time, the SWT is way faster than the FastEMD-CCA. It has to be
emphasized here that SWT removes artifacts only from a single channel EEG
signal, hence faster computation time, while FastEMD-CCA performs the ar-
tifact elimination from a multichannel EEG signal. Moreover, SWT is applied
to the entire signal for processing which is not applicable for online applica-
tions, while the FastEMD-CCA algorithm process the EEG signals in windows.
SWT also relies on manual selection of appropriate mother wavelet, comprises
sine and cosine functions, which may not represent a basis function for non-
stationary biomedical signals. Selecting an inappropriate mother wavelet could
lead to inaccuracy in reconstructing artifact-free EEG signals. Furthermore, the
accuracy of SWT is also sensitive to the selection of thresholding function which
could have an effect on preserving or discarding the neural information in an
EEG signal. Considering the performance shown by FastEMD-CCA by means
of accuracy in removing artifacts, it’s used for evaluation in removing artifacts
in real EEG signals.
28
3.2. Evaluation on Real EEG Signals
Results in Table 2 were obtained through offline analysis performed on the
artifact removed EEG signals in an online manner through the proposed tech-
nique, FastEMD-CCA and the state-of-the-art algorithm, FORCe. Fig. 12 and
13 shows an example of an entire EEG signal, reconstructed using FORCe algo-
rithm and the proposed algorithm, FastEMD-CCA respectively. Fig. 14 and 15
show a short portion of the EEG signal, reconstructed using FORCe algorithm
and the proposed algorithm, FastEMD-CCA respectively.
50
-50
Amplitude(microvolts)
-100
0 1 2 3 4 5 6 7 8
104
(b)-Artifact Free EEG Signal
100
50
-50
-100
0 1 2 3 4 5 6 7 8
Samples 104
3.2.1. Discussion
Accuracy is a measurement of correct detection of eyeblink artifacts by the
algorithms, thus removing them, and also how well the algorithms could retain
the artifact-free EEG segments after artifact correction is performed. The pro-
posed algorithm has achieved an average of 97.9% accuracy compared to 91.7%
29
(a) Observed EEG Signal
100
50
Amplitude(microvolts)
-50
-100
0 1 2 3 4 5 6 7 8
104
(b) Artifact Free EEG Signal
100
50
-50
-100
0 1 2 3 4 5 6 7 8
Samples 104
50
-50
Amplitude(microvolts)
-100
0 1000 2000 3000 4000 5000 6000 7000 8000 9000 10000
50
-50
-100
0 1000 2000 3000 4000 5000 6000 7000 8000 9000 10000
Samples
30
(a) Observed EEG Signal
100
50
-50
Amplitude(microvolts)
-100
0 1000 2000 3000 4000 5000 6000 7000 8000 9000 10000
50
-50
-100
0 1000 2000 3000 4000 5000 6000 7000 8000 9000 10000
Samples
8.30%. This denotes that both algorithms are still susceptible to miss out an
eyeblink artifact, however, the proposed algorithm is more reliable in detect-
ing and removing eyeblink artifacts in online applications compared to FORCe.
Sensitivity, on the other hand, is a measurement of how sensitive the algorithms
are in detecting and removing the eyeblink artifacts in comparison with the
actual number of observed eyeblink artifacts. The results indicate the proposed
algorithm, FastEMD-CCA has achieved 97.65% of sensitivity, 8.18% higher than
that of the FORCe algorithm. This shows that FastEMD-CCA could identify
and remove eyeblink artifacts relatively better than FORCe could. The sen-
sitivity of FORCe in identifying and removing the artifacts is 89.47 %. This
31
Performance Representation
120.00%
100.00%
Percentage
80.00%
60.00%
40.00%
20.00%
0.00%
1 4 7 10 13 16 19 22 25 28 31 34 37 40 43 46 49 52 55 58
EEG Datasets
100.00
80.00
Time (s)
60.00
40.00
20.00
0.00
389 294 337 330 306 321 345 271 316 335 289 304 359 334 272 346 298 291 331 263 299 359 278 325 337 253 288 339 335 314
EEG Length (s)
lower percentage level of sensitivity could have been due to the inability of the
algorithm in identifying some of the artifact events. The identification of arti-
fact related ICs in FORCe during ICA application on the wavelet coefficients
are dependent on manually adjusted threshold values, which classifies or make
a binary decision whether an IC is artifactual. So, having manually adjusted
fix thresholds may lead to detection errors, thereby not removing some of the
artifacts. On a separate note, the performance of the algorithms in retaining
the neural information of an EEG signal is evaluated through specificity. Speci-
ficity is the ratio of undistorted artifact-free EEG segments before and after
artifact elimination is performed. The ideal expectation is to have these por-
tions undistorted after the artifacts have been removed. FastEMD-CCA and
FORCe records an average specificity of 99.22% and 98.65% respectively, which
signifies that both algorithms doesn’t introduce much distortion to the neural
information of the EEG signals under evaluation. From the comparison, it is
clear that FastEMD-CCA has achieved better performance than FORCe on the
32
same set of EEG signals. The average computation time FastEMD-CCA took to
remove eyeblink artifacts from all 14 channels of these 60 EEG data-sets with
an average signal length of 312s ( 5 minutes) is 19.73 seconds, while FORCe
took 85.10 seconds. The computation time of FastEMD-CCA is at least 4 times
faster than that of FORCe.
The results have pointed out that the proposed algorithm, FastEMD-CCA
is highly accurate in removing eyeblink artifacts, proved by accuracy, error rate
and sensitivity measurement. It is also capable of retaining underlying EEG
data in uncontaminated EEG portions which were indicated by the specificity
percentage. Apart from this, the algorithm is also able to remove eyeblink
artifacts when the eyeblink artifacts are in continuous sequence as highlighted
in Fig. 17. The computation time of the algorithm is low as well, with an
average of 63 milliseconds processing time to remove artifacts from 1-second
length of EEG signal with 14 channels (256 samples x 14 EEG channels). This
makes it a feasible solution for applications requiring online removal of eyeblink
artifacts, with very low distortion to the neural signal.
50
-50
Amplitude(microvolts)
-100
0 1000 2000 3000 4000 5000 6000 7000 8000 9000 10000
50
-50
-100
0 1000 2000 3000 4000 5000 6000 7000 8000 9000 10000
Samples
33
4. Conclusion
Acknowledgment
The authors would like to thank the Ministry of Education, Malaysia for sup-
porting this research through the Fundamental Research Grant Scheme, FRGS
34
(FRGS/2/2014/TK03/UTP/02/1) and the Higher Institution Centre of Excel-
lence (HICoE) Scheme.
References
[3] S. Sanei, J. A. Chambers, EEG Signal Processing, John Wiley & Sons,
2013.
[8] P. Berg, M. Scherg, Dipole Modelling of Eye Activity and its Application
to the Removal of Eye Artefacts from the EEG and MEG, Clinical Physics
and Physiological Measurement 12 (A) (1991) 49.
35
[9] T. D. Lagerlund, F. W. Sharbrough, N. E. Busacker, Spatial Filtering of
Multichannel Electroencephalographic Recordings through Principal Com-
ponent Analysis by Singular Value Decomposition, Journal of Clinical Neu-
rophysiology 14 (1) (1997) 73–82.
36
Empirical Mode Decomposition (EMD), in: Biomedical Engineering Inter-
national Conference (BMEiCON), IEEE, 2012, pp. 1–5.
[23] S. Çınar, N. Acır, A Novel System for Automatic Removal of Ocular Arte-
facts in EEG by Using Outlier Detection Methods and Independent Com-
ponent Analysis, Expert Systems with Applications 68 (2017) 36–44.
37
[24] V. Lawhern, W. D. Hairston, K. McDowell, M. Westerfield, K. Robbins,
Detection and Classification of Subject-generated Artifacts in EEG Signals
Using Autoregressive Models, Journal of Neuroscience Methods 208 (2)
(2012) 181–189.
[27] L. Pion-Tonachini, S.-H. Hsu, C.-Y. Chang, T.-P. Jung, S. Makeig, Online
Automatic Artifact Rejection Using the Real-time EEG Source-mapping
Toolbox (REST), in: 2018 40th Annual International Conference of the
IEEE Engineering in Medicine and Biology Society (EMBC), IEEE, 2018,
pp. 106–109.
38
[31] H. Hallez, M. De Vos, B. Vanrumste, P. Van Hese, S. Assecondi,
K. Van Laere, P. Dupont, W. Van Paesschen, S. Van Huffel, I. Lemahieu,
Removing Muscle and Eye Artifacts Using Blind Source Separation Tech-
niques in Ictal EEG Source Imaging, Clinical Neurophysiology 120 (7)
(2009) 1262–1272.
39