English Premier League (EPL) Soccer Matches Prediction Using An Adaptive Neuro-Fuzzy Inference System (ANFIS)
English Premier League (EPL) Soccer Matches Prediction Using An Adaptive Neuro-Fuzzy Inference System (ANFIS)
net/publication/276485002
CITATIONS READS
2 2,516
2 authors:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Frank Amadin on 22 June 2016.
ABSTRACT
Prediction of English Premiership League (EPL) matches has been on the heart and minds of researcher
over the pass decades, but none has sufficiently introduced and Adaptive Neuro-Fuzzy Inference System
(ANFIS) approach for these prediction which has served as the focal aim of this research paper using
seven premier league teams and nine linguistic values. Matric Laboratory (MATLAB) 7.0 served as the
tool of implementation highlighting various views. The ANFIS training was successful completed at
epoch 2 and having an error of 1.41237e-006. The model was further used to predict the outcome of 7
matches with a successful rate of 71%.
Keywords: ANFIS, Premiership, Soccer, Predication, Layers
1 Introduction
It is no doubt that the most viewed sport in the world is soccer having above 10 billon fan all over the
world (Tony, 2014). The English league is the most watched soccer league all over the world having
more than 2 billion fans (Jonathan, 2014 and Tony, 2014). It comprises of 20 teams and each team plays
38 match in a season which spans ten months. A team plays 19 matches at home (at the city in which
the club originates) and 19 match away (Jonathan, 2014). A win for any match regardless of home or
away is 3 points a draw is 1 point while a loss earn no point. At the end of each season the 3 teams at
the lower end (bottom) of the league are relegated to a lower division (Tony, 2014). The outcome of any
match is a win, a draw or a loss (Jonathan, 2014 and Tony, 2014). To calculate the number of games
played in a season using the hand shaking lemma we represent each team as a node and a matched
played as an edge. There are two parallel edges between each node (one for home match and the other
for away match). So, the number of game played = (38*20) / 2 =380 matches. Since the edge contribute
twice to the degree of the graph.
DOI: 10.14738/tmlai.32.1027
Publication Date: 28th April, 2015
URL: http://dx.doi.org/10.14738/tmlai.32.1027
Transactions on Machine Learning and Artificial Intelligence Volume 3 , Issue 2, April 2015
implemented with two additional hidden layers (a layer for the home team with a weight of 1 and the
other for the position of the team on the table of ranking) which was utilized in predicting nine matches,
with an 80% accuracy Aditya et al (2013) used a Multi Nominal Logic Regression (MNLR) and Support
Vector Machines (SVM) to predict the outcome of a match utilizing 3 fundamental approaches. In their
first approach Multi nominal Logistic was enacted for the training phase in which performance
optimization metrics derived from current matches were considered, rather than taking the average of
previous matches and during testing they predicted the outcome between two teams using the number
of matches past played. In the second approach, they trained in the same way they later test the data
and instead of using the feature vector as the performance metric vector corresponding to the current
match, they used Key Performance Parameters (KPP). The third approach tried to find a global set of
parameter which was independent of the competing teams.
Ian and Phil (2013), forecasted international; soccer matches result using bivariate discrete distribution.
They collected a total of 8,735 international soccer results from two main sources. The data for the
period 1993-2001 was obtained from the archive of International Soccer Results (ISR) and the data for
the period 2001 to 2004 were obtained from the Record Sport Soccer Statistics Foundation (RSSSF)
archive. Data on the Federation of International Football Association (FIFA) world rankings were
collected from the FIFA website for each month during the years from 1993 to 2004. The model was
based on copula functions. The copula regression model forecasts 41.8% of the results correctly.
From the review of related literature An Adaptive Neuro-Fuzzy Inference System (ANFIS) approach has
not been adopted previously for EPL matches prediction which is serving as the focal point of our
research.
c. Layer 3 (Rule Layer): The third layer is the rule layer. Each neuron in this layer
corresponds to a single first-order Sugeno fuzzy rule receiving signal from the
membership function layer and computes the truth value of the rule (Jang, 1993 and
1997. The nodes calculate the ratios of the rule’s firing strength to the sum of all the
rules firing strength using the equation 4. The result is a normalised firing strength.
wi
O3,i wi i 1, 2 (4)
w1 w2
d. Layer 4 (Normalization layer): The fourth layer is the normalization layer. Each neuron
in this layer receives signals from all rule neurons in the third layer, and calculates the
normalized firing strength of a given rule. The nodes compute a parameter function on
the layer 3 output using equation 5. Parameters in this layer are called consequent
parameters.
O4,i wi fi wi ( pi x qi y ri ) (5)
e. Layer 5 (Defuzzification layer): The fifth layer is the defuzzification layer. Each neuron in
this layer is connected to the respective normalization neuron in the fourth layer (Jang,
1997 and Nagnevitsky, 2002). It is a single node that aggregates the overall output as
the summation of all incoming signals using equation 6
URL:http://dx.doi.org/10.14738/tmlai.32.1027 36
Transactions on Machine Learning and Artificial Intelligence Volume 3 , Issue 2, April 2015
The ANFIS training was completed at epoch 2 and having an error of 1.41237e-006. The model was
further used to predict the outcome of 7 matches and 5 outcomes out of the matches were accurately
predicted.
URL:http://dx.doi.org/10.14738/tmlai.32.1027 38
Transactions on Machine Learning and Artificial Intelligence Volume 3 , Issue 2, April 2015
5 Conclusion
The ANFIS model was able to make an accurate prediction of 5 out of 7 matches which is promising but
this implies that there might be more factors which were not included in our model that determine the
outcome of a soccer match. So, further study is encouraged to increase the accuracy of prediction of a
soccer match suing more applicable variables.
REFERENCES
[1] Aditya S. T., Aditya P. and Vikesh K. (2013)” Game ON! Predicting English Premier League
Match Outcomes” retrieved from gameON.com
[2] Ian M. and Phil S (2013), ”Forecasting international soccer match results using bivariate
discrete distributions” Centre for Operational Research and Applied Statistics, Salford Business
School, University of Salford, Salford, Manchester UK.
[3] Jang, R (1991). "Fuzzy Modeling Using Generalized Neural Networks and Kalman Filter
Algorithm". Proceedings of the 9th National Conference on Artificial Intelligence, Anaheim, CA,
USA, pp. 762–767. Retrieved on October 15th 2014 from www.irrolecom/article/00098.pdf
[4] Jang, J. (1993). "ANFIS: adaptive-network-based fuzzy inference system". IEEE Transactions on
Systems, Man and Cybernetics Volume23 issue3. doi:10.1109/21.256541.
[5] Jang, S (1997) “Neuro-Fuzzy and Soft Computing “Prentice Hall, Pp 335–368, ISBN 0-13-
261066-3
[6] Jonathan T. (2014), “2014-15 English Premier League and MLS TV”rtrived online from
http://www.philly.com/philly/blogs/thegoalkeeper/NBC-Sports-2014-Premier-League-TV-
schedule-for-August-through-November.html#sd6Wr3imqE4ehOdH.99
[7] Nauck, D., Klawon F. and R. Kruse, (1997), “Foundations of Neuro-Fuzzy Systems”, J. Wiley &
Sons pages 312
[10] Yue W. M. (2003)” Prediction on Soccer Matches using Multi-Layer Perceptron” ID: 903-051-
7735
URL:http://dx.doi.org/10.14738/tmlai.32.1027 40