International Scholarly Research Notices - 2014 - Al-Fahoum - Methods of EEG Signal Features Extracti
International Scholarly Research Notices - 2014 - Al-Fahoum - Methods of EEG Signal Features Extracti
ISRN Neuroscience
Volume 2014, Article ID 730218, 7 pages
http://dx.doi.org/10.1155/2014/730218
Review Article
Methods of EEG Signal Features Extraction Using Linear
Analysis in Frequency and Time-Frequency Domains
Copyright © 2014 A. S. Al-Fahoum and A. A. Al-Fraihat. This is an open access article distributed under the Creative Commons
Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is
properly cited.
Technically, a feature represents a distinguishing property, a recognizable measurement, and a functional component obtained
from a section of a pattern. Extracted features are meant to minimize the loss of important information embedded in the signal. In
addition, they also simplify the amount of resources needed to describe a huge set of data accurately. This is necessary to minimize
the complexity of implementation, to reduce the cost of information processing, and to cancel the potential need to compress
the information. More recently, a variety of methods have been widely used to extract the features from EEG signals, among
these methods are time frequency distributions (TFD), fast fourier transform (FFT), eigenvector methods (EM), wavelet transform
(WT), and auto regressive method (ARM), and so on. In general, the analysis of EEG signal has been the subject of several studies,
because of its ability to yield an objective mode of recording brain stimulation which is widely used in brain-computer interface
researches with application in medical diagnosis and rehabilitation engineering. The purposes of this paper, therefore, shall be
discussing some conventional methods of EEG feature extraction methods, comparing their performances for specific task, and
finally, recommending the most suitable method for feature extraction based on performance.
Raw EEG
take 𝑖𝐷 to be the point of start of the 𝑖th sequence. Then 𝐿
of length 2𝑀 represents data segments that are formed. The
resulting output periodograms give
2
1 𝑀−1
Segment
detection ≈ (𝑖) −𝑗2𝜋𝑓𝑛
𝑃𝑥𝑥 (𝑓) = ∑ 𝑥 (𝑛)𝑤(𝑛)𝑒 . (2)
𝑀𝑈 𝑛=0 𝑖
Feature Here, in the window function, 𝑈 gives normalization factor
extraction of the power and is chosen such that
1 𝑀−1 2
Classification 𝑈= ∑ 𝑤 (𝑛) , (3)
𝑀 𝑛=0
techniques, and neural networks. This is done by exploiting 2.2. Wavelet Transform (WT) Method. WT plays an impor-
the algorithmic characteristics of the feature vector of the data tant role in the recognition and diagnostic field: it compresses
input and thus gives rise to a hypothesis [10, 15]. the time-varying biomedical signal, which comprises many
This paper presents a short review of mathematical data points, into a small few parameters that represents the
methods for extracting features from EEG signals. The review signal [14].
considers five different methods for EEG signal extracting. As the EEG signal is nonstationary [7], the most suitable
The adopted approach is such that a full literature review way for feature extraction from the raw data is the use of
is introduced for the five different techniques, summarizing the time-frequency domain methods like wavelet transform
their strengths and weaknesses. (WT) which is a spectral estimation technique in which
any general function can be expressed as an infinite series
2. Methods of wavelets [20–22]. Since WT allows the use of variable
sized windows, it gives a more flexible way of time-frequency
Different articles were used to extract advantages and disad- representation of a signal. In order to get a finer low-
vantages of selected methods by thoroughly reviewing chosen frequency resolution, WT long time windows are used; in
articles including the main methods for linear analysis of contrast in order to get high-frequency information, short
one-dimensional signals in the frequency or time-frequency time windows are used [13].
23567872, 2014, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2014/730218, Wiley Online Library on [19/08/2025]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
ISRN Neuroscience 3
combination of the whole of noise subspace eigenvectors Table 1: Comparison between FFT and AR [8].
[25, 26]. This technique is depicted by
Method Frequency resolution Spectral leakage
1 FFT Low High
𝑃MIN (𝑓, 𝐾) = .
𝐴 (𝑓)2 (12)
AR High Low
WT High Low
All the aforementioned eigenvector methods can best
address the signal that is composed of many distinctive
sinusoids embedded in noise. Consequently, they are prone
to yield false zeros and thus resulting in a relatively poor 2.5. Autoregressive Method. Autoregressive (AR) methods
statistical accuracy [26]. estimate the power spectrum density (PSD) of the EEG using
a parametric approach. Therefore, AR methods do not have
problem of spectral leakage and thus yield better frequency
2.4. Time-Frequency Distributions. These methods require resolution unlike nonparametric approach. Estimation of
noiseless signals to provide good performance. Therefore, PSD is achieved by calculating the coefficients, that is, the
very restricted preprocessing stage is necessary to get rid of parameters of the linear system under consideration. Two
all sorts of artifacts. Being time-frequency methods they deal methods used to estimate AR models are briefly described
with the stationary principle; windowing process is therefore below [18, 19].
required in the preprocessing module [29]. The definition of
TFD for a signal 𝑥(𝑛) was generalized by Cohen as [30]
2.5.1. Yule-Walker Method. In this method, AR parameters or
1 ∞ ∞ coefficients are estimated by exploiting the resulting biased
𝑃 (𝑡, 𝑤) = ∫ ∫ 𝐴 (𝜃, 𝜏) Φ (𝜃, 𝜏) 𝑒−𝑗𝜃𝑡−𝑗𝜔𝜏 𝑑𝜃 𝑑𝜏,
2𝜋 −∞ −∞ approximate of the autocorrelation data function. This is done
(13) by subsequently finding the minimization of the least squares
of the forward prediction error as given below [31]:
where
1 ∞ 𝜏 𝜏 𝑟 (0)𝑥𝑥 𝑟 (−1)𝑥𝑥 ⋅ ⋅ ⋅ 𝑟(−𝑝 + 1)𝑥𝑥 𝑎 (1)
𝐴 (𝜃, 𝜏) = ∫ 𝑥 (𝑢 + ) 𝑥∗ (𝑢 − ) 𝑒𝑗𝜃𝑢 𝑑𝑢. (14)
2𝜋 −∞ 2 2 [ 𝑟 (1)𝑥𝑥 𝑟 (0)𝑥𝑥 . . . 𝑟(−𝑝 + 2)𝑥𝑥 ] [ ]
[ ] [ 𝑎 (1) ]
[ .. .. .. ] × [ .. ] ,
𝐴(𝜃, 𝜏) is popularly known as ambiguity Function, and [ . . d . ] [ . ]
Φ(𝜃, 𝜏) refers to kernel of the distribution, while 𝑟 and 𝑤 are 𝑟 (𝑝 − 1) 𝑟 (𝑝 − 2)𝑥𝑥 ⋅⋅⋅ 𝑟 (0)𝑥𝑥
[ 𝑥𝑥 ] [𝑎 (𝑝)]
time and frequency dummy variables, respectively. (18)
Smooth pseudo-Wigner-Ville (SPWV) distribution is a
variant method which incorporates smoothing by indepen- where 𝑟𝑥𝑥 can be defined by
dent windows in time and frequency, namely, 𝑊𝑤 (𝜏) and
𝑊𝑡 (𝑡) [29]:
1 𝑁−𝑚−1 ∗
𝑟𝑥𝑥 (𝑚) = ∑ 𝑥 (𝑛) 𝑥 (𝑛 + 𝑚) , 𝑚 ≥ 0. (19)
SPWV (𝑡, 𝑤) 𝑁 𝑁=0
∞ ∞
𝜏
=∫ 𝑊𝑤 (𝜏) [∫ 𝑊𝑡 (𝑢 − 𝑡) 𝑥 (𝑢 + ) Calculating the above set of (𝑝 + 1) linear equations, the AR
−∞ −∞ 2 (15) coefficients can be obtained:
𝜏 2
× 𝑥∗ (𝑢 − ) 𝑑𝑢] 𝑒−𝑗𝜔𝜏 𝑑𝜏. 𝐵𝑈
𝜎𝑤𝑝
2 𝑃𝑥𝑥 (𝑓) =
2 , (20)
1 + ∑𝑘=1 𝑎 𝑝 (𝑘)𝑒−𝑗2𝜋𝑓𝑘
𝑃 _
The feature extraction using this method is based on the
energy, frequency, and the length of the principal track. Each
segment gives the values 𝐸𝑘 , 𝐹𝑘 , and 𝐿 𝑘 . The EEG signal is while 𝜎̂𝑤𝑝 gives the approximated lowest mean square error
firstly divided into 𝑘 segments; then, the construction of a of the 𝑝th-order predictor given as follows:
three-dimensional feature vector for each segment will take
𝑝
place. Energy of each segment 𝑘 can be calculated as follows: 2
2
𝜎𝑤𝑝 = 𝐸𝑝𝑓 = 𝑟𝑥𝑥 (0) ∏ [1 − 𝑎𝑘 (𝑘) ] . (21)
∞ ∞
𝑘=1
𝐸𝑘 = ∫ ∫ 𝜗𝑘 (𝑡, 𝑓) 𝑑𝑡 𝑑𝑓, (16)
−∞ −∞
where 𝜗𝑘 (𝑡, 𝑓) stands for the time-frequency representation 2.5.2. Burg’s Method. It is an AR spectral estimation based
of the segment. However, to calculate the frequency of each on reducing the forward and backward prediction errors
segment 𝑘, we make use of the marginal frequency as follows: to satisfy Levinson-Durbin recursion [8]. Burg’s method
estimates the reflection coefficient directly without the need
∞
𝐹𝑘 = ∫ 𝜗𝑘 (𝑡, 𝑓) 𝑑𝑡. (17) to calculate the autocorrelation function. This method has the
−∞ following strength: Burg’s method can estimate PSD’s data
records to look exactly like the original data value. It can
Finally, for achieving good results, noiseless EEG signals yield intimately packed sinusoids in signals once it contains
or a well-denoised signal should be used for TFD [30]. minimal level of noise.
23567872, 2014, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2014/730218, Wiley Online Library on [19/08/2025]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
ISRN Neuroscience 5
Analysis
Method name Advantages Disadvantages Suitability
method
(i) Weakness in analyzing nonstationary
signals such as EEG
(i) Good tool for stationary signal (ii) It does not have good spectral
processing estimation and cannot be employed for
(ii) It is more appropriate for narrowband analysis of short EEG signals Narrowband,
Fast fourier Frequency
signal, such as sine wave (iii) FFT cannot reveal the localized stationary
transform domain
(iii) It has an enhanced speed over spikes and complexes that are typical signals
virtually all other available methods in among epileptic seizures in EEG signals
real-time applications (iv) FFT suffers from large noise
sensitivity, and it does not have shorter
duration data record
(i) It has a varying window size, being
broad at low frequencies and narrow at
high frequencies
Both time and Transient and
Wavelet (ii) It is better suited for analysis of
Needs selecting a proper mother wavelet freq. domain, stationary
transform sudden and transient signal changes
and linear signal
(iii) Better poised to analyze irregular
data patterns, that is, impulses existing at
different time instances
Lowest eigenvalue may generate false
Provides suitable resolution to evaluate Frequency Signal buried
Eigenvector zeros when Pisarenko’s method is
the sinusoid from the data domain with noise
employed
(i) The time-frequency methods are
oriented to deal with the concept of
(i) It gives the feasibility of examining stationary; as a result, windowing process
Time Both time and
great continuous segments of EEG signal is needed in the preprocessing module Stationary
frequency frequency
(ii) TFD only analyses clean signal for (ii) It is quite slow (because of the signal
distribution domains
good results gradient ascent computation)
(iii) Extracted features can be dependent
on each other
(i) AR limits the loss of spectral problems
(i) The model order in AR spectral
and yields improved frequency resolution
estimation is difficult to select
(ii) Gives good frequency resolution
(ii) AR method will give poor spectral
(iii) Spectral analysis based on AR model Signal with
estimation once the estimated model is Frequency
Autoregressive is particularly advantageous when short sharp spectral
not appropriate, and model’s orders are domain
data segments are analyzed, since the features
incorrectly selected
frequency resolution of an analytically
(iii) It is readily susceptible to heavy
derived AR spectrum is infinite and does
biases and even large variability
not depend on the length of analyzed data
most significant specification of this method since it ensures [2] X.-Y. Wang, J. Jin, Y. Zhang et al., “Brain control: human-
the suitable time frequency resolution in all frequency range computer integration control based on brain-computer inter-
[26]. Autoregression analysis suffers from speed and is not face approach,” Acta Automatica Sinica, vol. 39, no. 3, pp. 208–
always applicable in real-time analysis while FFT appears to 221, 2013.
be the least efficient of the discussed methods because of its [3] G. Al-Hudhud, “Affective command-based control system inte-
inability to examine nonstationary signals. The strength of grating brain signals in commands control systems,” Computers
AR method can be emphasized by further comparing its in Human Behavior, vol. 30, pp. 535–541, 2014.
performance with that of classical FFT as shown in Table 1. [4] J. L. S. Blasco, E. Iáñez, A. Úbeda, and J. M. Azorı́n, “Visual
evoked potential-based brain-machine interface applications to
It is highly recommend to use AR method in conjunction
assist disabled people,” Expert Systems with Applications, vol. 39,
with more conservative methods, such as periodograms, to no. 9, pp. 7908–7918, 2012.
help to choose the correct model order and to avoid getting
[5] H. Cecotti, “Spelling with non-invasive brain-computer inter-
fooled by spurious spectral features [32]. faces—current and future trends,” Journal of Physiology, vol. 105,
The most important application for eigenvectors is to eva- no. 1–3, pp. 106–114, 2011.
luate frequencies and powers of signals from noise corrupted [6] I. S. Kotchetkov, B. Y. Hwang, G. Appelboom, C. P. Kellner,
signal; the principle of this method is the decomposition of and E. S. Connolly Jr., “Brain-computer interfaces: military,
the correlation matrix of the noise corrupted. Three methods neurosurgical, and ethical perspective,” Neurosurgical Focus,
for eigenvectors module were discussed: Pisarenko, multiple vol. 28, no. 5, article E25, 2010.
signal classification (MUSIC), and minimum norm [27]. The [7] E. D. Übeyli, “Statistics over features: EEG signals analysis,”
good thing about the eigenvector method is that it produces Computers in Biology and Medicine, vol. 39, no. 8, pp. 733–741,
frequency spectra of high resolution even when the signal-to- 2009.
noise ratio (SNR) is low. However, this method may produce [8] R. Agarwal, J. Gotman, D. Flanagan, and B. Rosenblatt, “Auto-
spurious zeros leading to poor statistical accuracy [26]. matic EEG analysis during long-term monitoring in the ICU,”
The TFD method offers the possibility to analyze rela- Electroencephalography and Clinical Neurophysiology, vol. 107,
tively long continuous segments of EEG data even when the no. 1, pp. 44–58, 1998.
dynamics of the signal are rapidly changing. At the same time [9] A. Meyer-Lindenberg, “The evolution of complexity in human
a good resolution both in time and frequency is necessary, brain development: an EEG study,” Electroencephalography and
making this method not preferable to use in many cases [30]. Clinical Neurophysiology, vol. 99, no. 5, pp. 405–411, 1996.
Table 2 shows the summary of advantages and disad- [10] A. Subasi and E. Erçelebi, “Classification of EEG signals using
vantages of the above-mentioned methods, their accuracies, neural network and logistic regression,” Computer Methods and
Programs in Biomedicine, vol. 78, no. 2, pp. 87–99, 2005.
speeds, and suitability to make it easier to compare their
[11] A. Subasi, “EEG signal classification using wavelet feature extra-
performances.
ction and a mixture of expert model,” Expert Systems with Appli-
cations, vol. 32, no. 4, pp. 1084–1093, 2007.
4. Conclusion [12] H. Jasper and L. D. Proctor, Reticular Formation of the Brain,
Five of the well-known methods for frequency domain and 1958.
time-frequency domain methods were discussed. Acclaim [13] M. R. N. Kousarrizi, A. A. Ghanbari, M. Teshnehlab, M. Ali-
about the definite priority of methods according to their yari, and A. Gharaviri, “Feature extraction and classification
capability is very hard. The findings indicate that each of EEG signals using wavelet transform, SVM and artificial
neural networks for brain computer interfaces,” in Proceedings
method has specific advantages and disadvantages which
of the International Joint Conference on Bioinformatics, Systems
make it appropriate for special type of signals. Frequency Biology and Intelligent Computing (IJCBS ’09), pp. 352–355,
domain methods may not provide high-quality performance August 2009.
for some EEG signals. In contrast, time-frequency methods, [14] D. Cvetkovic, E. D. Übeyli, and I. Cosic, “Wavelet transform fea-
for instance, may not provide detailed information on EEG ture extraction from human PPG, ECG, and EEG signal respo-
analysis as much as frequency domain methods. It is crucial to nses to ELF PEMF exposures: a pilot study,” Digital Signal Proce-
make clear the of the signal to be analyzed in the application ssing, vol. 18, no. 5, pp. 861–874, 2008.
of the method, whenever the performance of analyzing [15] H. Kordylewski, D. Graupe, and K. Liu, “A novel large-memory
method is discussed. Considering this, the optimum method neural network as an aid in medical diagnosis applications,”
for any application might be different. IEEE Transactions on Information Technology in Biomedicine,
vol. 5, no. 3, pp. 202–209, 2001.
Conflict of Interests [16] I. Güler and E. D. Übeyli, “Adaptive neuro-fuzzy inference sys-
tem for classification of EEG signals using wavelet coefficients,”
The authors declare that there is no conflict of interests Journal of Neuroscience Methods, vol. 148, no. 2, pp. 113–121,
regarding the publication of this paper. 2005.
[17] E. D. Übeyli, “Wavelet/mixture of experts network structure
References for EEG signals classification,” Expert Systems with Applications,
vol. 34, no. 3, pp. 1954–1962, 2008.
[1] L. Mayaud, M. Congedo, A. van Laghenhove et al., “A compar- [18] A. Subasi, M. K. Kiymik, A. Alkan, and E. Koklukaya, “Neural
ison of recording modalities of P300 Event Related Potentials network classification of EEG signals by using AR with MLE
(ERP) for Brain-Computer Interface (BCI) paradigm,” Clinical preprocessing for epileptic seizure detection,” Mathematical and
Neurophysiology, vol. 43, no. 4, pp. 217–227, 2013. Computational Applications, vol. 10, no. 1, pp. 57–70, 2005.
23567872, 2014, 1, Downloaded from https://onlinelibrary.wiley.com/doi/10.1155/2014/730218, Wiley Online Library on [19/08/2025]. See the Terms and Conditions (https://onlinelibrary.wiley.com/terms-and-conditions) on Wiley Online Library for rules of use; OA articles are governed by the applicable Creative Commons License
ISRN Neuroscience 7