A Hybrid Forecasting Model For Prediction of Stock Value of Tata Steel Using Support Vector Regression and Particle Swarm Optimization
A Hybrid Forecasting Model For Prediction of Stock Value of Tata Steel Using Support Vector Regression and Particle Swarm Optimization
A HYBRID FORECASTING MODEL FOR PREDICTION OF STOCK VALUE OF TATA STEEL USING
SUPPORT VECTOR REGRESSION AND PARTICLE SWARM OPTIMIZATION
INTRODUCTION
ABSTRACT:
Stock market analysis has always been an essential part
Financial time series forecasting has always draws a lot of the financial sector of any country. Most of investors
of attention from investors and researchers. The are presently depending upon Intelligent Trading Systems
inclination of stock market is extremely complex and is for prediction of stock market price based on various
inclined by various factors. Hence to find the most conditions. Precision of these forecast systems is
significant factors to the stock market is really important. necessary for better investment decisions with minimum
But the high noise and difficulty residing in the financial risk factors. Prediction of stock price has been beneficial
data makes this job very challenging. Many researchers for both the individual and institutional investors.
have used support vector regression (SVR) and Predicting stock market price is a moderately challenging
comparatively overcome this challenge. As the dormant task. Technological analysis is an admired approach to
high noises in the data impair the performance, reducing study the stock market analysis.
the noise would be competent while constructing the
forecasting model. To achieve this task, integration of Researchers use various machine learning and artificial
SVR with particle swarm optimization (PSO) is proposed intelligent approaches to forecast future trends or price.
in this research work. This paper analyzes a series of Artificial neural network (ANN), support vector machine
technological indicators used in usual studies of the stock (SVM), and logistic regression (LR) have been used by
market and executes support vector regression and many for this kind of forecasting tasks. Among all these
particle swarm optimization algorithm. SVM is considered to one of the best performing
technique provided appropriate initialization of its
The performance of the proposed approach is evaluated regularization parameters is made.
with 18 years’ daily transactional data of Tata Steel
stocks price from Bombay Stock Exchange (BSE). We used the support vector regression and particle swarm
Empirical results show that the proposed model enhances optimization technique for forecasting the stock price of
the performance of the previous prediction model. TATA STEEL. Support vector regression requires its
hyper parameters (i.e., cost and gamma) to be optimized
This approach is compared with existing models with real to perform better and hence particle swarm optimization
data set and gives more accurate results which give more (PSO) is used to optimize the same. Technical indicators
accuracy with MAPE 0.7 % (approximately). used in this analysis are calculated from the historical
trading data. Lagged data in the time series domain have
Keywords: Stock market, Financial time series always been influencing the forecasting accuracy. The
forecasting; Support vector regression; particle swarm availability of lagged data for our proposed model PSO-
optimization. SVR leads to better performance than standard SVR.
1719
International Journal of Pure and Applied Mathematics Special Issue
4, the methodology and the process involved in the hybrid Propagation Technique (BP) and Support Vector Machine
model under study, i.e., PSO-SVR, is explained. In Technique (SVM) to forecast future prices exchange in
Section-5, experimental analysis are presented and the Indian stock market.They have shown that Support
finally, the paper is concluded in Section-6. Vector Machines gives the better overview than that
conventional methods. Yongsheng Ding, Xinping Song &
1. LITERATURE REVIEW Yueming Zen 2008, [9] constructed Support Vector
Machine based on basic data forecast to the stock crises
The authors Vapnik et al, 1999 [1] represented that and the financial position of the companies in the Chinese
support vector machine is a learning system paying market. They have applied 10-fold cross-validation and
attention to statistical learning theory. Support vector grid-search technique to obtain the optimal hyper
machine has been utilized by Kim KJ, 2003 [2] and Hu, parameters C and γ for different kernel functions. They
Su, Hao & Tang 2009 [3] for forecasting financial time have compared the prediction performance of the Support
series. Kim KJ analyzed the effect of the value of the Vector Machines with four dissimilar kernels and
upper bound C and the kernel parameter δ2 in Support cocluded the Radial Basis Function kernel (RBF) is the
Vector Machine and concluded that the prediction best performance among four. They also statistically
performances of Support Vector Machines are sensitive compared the prediction accuracy with Back Propagation
to the value of these parameters.Tony Van Gestel, Johan Neural Network (BPNN), Multiple Discriminate
A. K. Suykens, Dirk-Emma Baestaens et al , 2001[4] Analysis(MDA) and logistic regression (Logit). The
proposed the model combining Bayesian evidence results of empirical analysis show that the RBF kernel
framework with least squares support vector machines for SVM superior than other kernel SVM and BPNN, MDA,
nonlinear regression and validated on the forecast of the and Logit models. Shen, Shunrong, Haomiao Jiang &
weekly US short term T-bill rate and the daily closing Tongda Zhang 2012 [10] proposed a forecast algorithm
prices of the DAX30 stock index. Wei Huang, Yoshiteru which makes use of the sequential among global stock
Nakamori, & Shou-Yang Wang, 2005 [5] summarized the markets and different financial substance to predict the
stock trading decision support systems and proposed next day stock value using Support Vector Machines.
Supprt vector machine is a superior tool for financial They have used the same algorithm with individual
stock market prediction. Yuling Lin, Haixiang Guo & regression algorithm to forecast the actual growth in the
Jinglu Hu in 2013, [6] propose a Support Vector Machine markets. At last they build a basic trading model and
based stock market prediction model. They implemented distinguish its performance with the existing algorithm.
the piecewise linear principle, and the characteristic Puspanjali Mohapatra, Soumya Das, Tapas Kumar Patra
weights are integrated to put up the optimal separating & Munnangi Anirudh, 2013 [11] proposed a comparative
hyperplane, which assesses for stock indicator and control study of particle swarm optimization (PSO) based hybrid
over fitting on stock market expectation. They tried this swarmnet and simple functional link artificial neural
methodology on Taiwan stock market datasets and network(FLANN) model. Both the models are initially
establish that this method performs result in compare to trained with least mean square (LMS) algorithm, then
the conventional stock market prediction system. Lucas with particle swarm optimization (PSO) algorithm. The
Lai & James Liu, 2014 [7] implemented the Support models are predicted the stock price of two different
Vector Machine and Least Square Support Vector datasets NIFTY and NASDAQ on different time horizons
Machine models for prediction of stock market. They (one day, one week, and one month) ahead. The
have considered three ate systems- General performance is evaluated on the basis of Root Mean
Autoregressive Conditional Heteroskedasticity Square Error (RMSE) and Mean Absolute Percentage
(GARCH), Support Vector Regression (SVR) and Least Error (MAPE). It was verified that PSO based hybrid
Square Support Vector Machine (LSSVM) with the swarmnet performed better in comparison to PSO based
wavelet kernel for configuration of three narrative FLANN model, simple hybrid model trained with LMS
algorithms namely Wavelet-based GARCH and simple FLANN model trained with LMS.
(WL_GARCH), Wavelet-based SVR (WL_SVR) and Mohammed Siddique, Debdulal Panda, Sumanjit Das at
Wavelet-based Least Square Support Vector Machine el., 2017, [12] proposed a hybrid model to forecast stock
(WL_LSSVM) to resolve the non-parametric and non- price using Artificial Neural Network (ANN) model
linear financial time series issue. Shom Prasad Das & optimized by particle swarm optimization (PSO), which
Sudarsan Padhy,2012 [8] incorporated the Back consisting of an effective algorithm for predicting next
1720
International Journal of Pure and Applied Mathematics Special Issue
day high price of Yahoo stock value and Microsoft stock by their labels and it maximize the distance from
value. M. Karazmodeh, S. Nasiri, and S. Majid Hashemi, hyperplane to the support vector.
2013, [13], proposed an improved hybrid Improved via
Genetic Algorithm based on Support Vector Machines The basic concept of SVM is to maximize the margin
(IPSOSVM) system to predict the future stock prices. hyperplane in the feature space. The principle of normal
Rohit Choudhry, and Kumkum Garg, 2008, [14], Support Vector Machine for Regression (SVR) model, a
proposed a hybrid GA-SVM system for predicting the supervised machine learning technique developed by
future stock prices. Cheng-Lung Huang, Jian-Fan Dun, Vapnik et al.[1], is described below.
2008 [15], proposed a new hybrid PSO–SVM system to
solve continuous valued and discrete valued PSO version. Given a sample data-set S = (x1; y1); (x2; y2); :::(xk; yk)
They have shown that experimental results optimize the representing k input-output pairs, where each xi ϵ X is a
model parameters and search the discriminating feature subset of Rn, denoting the n dimensional input sample
subset simultaneously. space and matching target values yi ϵ Y is a subset of R
for (i = 1; 2; :::; k). The objective of this regression
2. METHODOLOGY USED problem is to find a function f : Rn → R, to approximate
the value of y for hidden and unlabeled x, which is not
1. Support Vector Machine for Regression present in the training sample data-set. Through a
Support Vector Machines is one of the best binary nonlinear mapping function ϕ, the input data is mapped
classifiers. SVM create a decision boundary such that the from Rn to a higher dimensional space Rm, where m > n,
majority of the points in one category falls on one side of and hence the estimating function f is defined as
the boundary while most points of other category fall on
the other side of the boundary. Consider an n-dimensional f(x) = w T ϕ(x) + b -------------------------------------------(1)
feature vector X = (x1, x2, ... ... xn). We can define a
hyperplane
where w ϵ Rm is the regression coefficient vector, b ϵ R, is
the bias or threshold value.The objective of the support
α α x α x … … α x vector regression is to find a function f that has the most
ϵ-deviation from the target y , .We want to determine w
α α x 0
and b such that the value of f(x) can be determined by
minimizing the risk.
Then elements in one category will be such that the sum Rreg (w) = ||w||2 + K ∑ L ϵ y , f x -----------------(2)
is greater than 0, while elements in the other category will
have the sum be less than 0. We construct a label, α
∑ α x Y, where Y ϵ {-1, 1} is the label where, where K determines the trade-off between the
classifier.We can rewrite the hyperplane equation using flatness of the f(x) and the amount up to which deviations
inner products Y α ∑ β Y X i ∗ X,Where ∗ greater than ϵ are tolerated. Also K is the penalty factor
represents the inner product operator and inner product is which is a user defined constant that determines the
weighted by its label. transaction between the training error and the penalizing
term ||w||2 and L ϵ y , f x , is the ϵ-intensive loss
The margin of the optimal hyperplane is obtained by function, defined as
maximizing the distance from the plane to any point. The
maximum margin hyperplane (MMH) splits the data very y, f x ∈ , |y f x | ∈
well. The essential aspect is that only the points Lϵ y , f x ----(3)
0, |y f x | ∈
neighboring to the boundary of the hyperplane are
participated in selection; all other points are irrelevant.
These points are known as the support vectors, and the The minimization of risk functional equation (2) can be
hyperplane is known as a Support Vector Classifier reformulated by introducing non-negative slack variables
(SVC) as it places each support vector in one class or in γ and ξ as
the other class. The inner products in SVC are weighted
1721
International Journal of Pure and Applied Mathematics Special Issue
1722
International Journal of Pure and Applied Mathematics Special Issue
1723
International Journal of Pure and Applied Mathematics Special Issue
1724
International Journal of Pure and Applied Mathematics Special Issue
REFERENCE:
1725
International Journal of Pure and Applied Mathematics Special Issue
1726
1727
1728