Neural_Networks_in_Data_Mining
Neural_Networks_in_Data_Mining
www.jatit.org
ABSTRACT
Companies have been collecting data for decades, building massive data warehouses in which to store it.
Even though this data is available, very few companies have been able to realize the actual value stored in
it. The question these companies are asking is how to extract this value. The answer is Data mining.
There are many technologies available to data mining practitioners, including Artificial Neural Networks,
Regression, and Decision Trees. Many practitioners are wary of Neural Networks due to their black box
nature, even though they have proven themselves in many situations. This paper is an overview of artificial
neural networks and questions their position as a preferred tool by data mining practitioners.
Keywords: Artificial Neural Network (ANN), neural network topology, Data mining, back propagation
algorithm, Advantages.
1. INTRODUCTION:
Income is a very important socio-economic
Data mining is the term used to describe indicator. If a bank knows a person’s income, they
the process of extracting value from a database. A can offer a higher credit card limit or determine if
data-warehouse is a location where information is they are likely to want information on a home loan
stored. The type of data stored depends largely on or managed investments. Even though this financial
the type of industry and the company. Many institution had the ability to determine a customer’s
companies store every piece of data they have income in two ways, from their credit card
collected, while others are more ruthless in what application, or through regular direct deposits into
they deem to be “important”. their bank account, they did not extract and utilize
Consider the following example of a financial this information.
institution failing to utilize their data-warehouse.
Another example of where this institution has failed
to utilize its data-warehouse is in cross-selling 2. ARTIFICIAL NEURAL NETWORKS:
insurance products (e.g. home, life and motor
vehicle insurance). By using transaction An artificial neural network (ANN), often just
information they may have the ability to determine called a "neural network" (NN), is a mathematical
if a customer is making payments to another model or computational model based on biological
insurance broker. This would enable the institution neural networks, in other words, is an emulation of
to select prospects for their insurance products. biological neural system. It consists of an
These are simple examples of what could be interconnected group of artificial neurons and
achieved using data mining. processes information using a connectionist
Four things are required to data-mine effectively: approach to computation. In most cases an ANN is
high-quality data, the “right” data, an adequate an adaptive system that changes its structure based
sample size and the right tool. There are many tools on external or internal information that flows
available to a data mining practitioner. These through the network during the learning phase.
include decision trees, various types of regression
and neural networks.
37
Journal of Theoretical and Applied Information Technology
www.jatit.org
38
Journal of Theoretical and Applied Information Technology
www.jatit.org
Neural networks essentially comprise three pieces: Prediction, Clustering, Association Rules.
the architecture or model; the learning algorithm; Classification and prediction is a predictive model,
and the activation functions. Neural networks are but clustering and association rules are descriptive
programmed or “trained” to “. . . store, recognize, models.
and associatively retrieve patterns or database The most common action in data mining is
entries; to solve combinatorial optimization classification. It recognizes patterns that describe
problems; to filter noise from measurement data; to the group to which an item belongs. It does this by
control ill-defined problems; in summary, to examining existing items that already have been
estimate sampled functions when we do not know classified and inferring a set of rules. Similar to
the form of the functions.” It is precisely these two classification is clustering. The major difference
abilities (pattern recognition and function being that no groups have been predefined.
estimation) which make artificial neural networks Prediction is the construction and use of a model to
(ANN) so prevalent a utility in data mining. As data assess the class of an unlabeled object or to assess
sets grow to massive sizes, the need for automated the value or value ranges of a given object is likely
processing becomes clear. With their “model-free” to have. The next application is forecasting. This is
estimators and their dual nature, neural networks different from predictions because it estimates the
serve data mining in a myriad of ways. future value of continuous variables based on
Data mining is the business of answering questions patterns within the data. Neural networks,
that you’ve not asked yet. Data mining reaches depending on the architecture, provide associations,
deep into databases. Data mining tasks can be classifications, clusters, prediction and forecasting
classified into two categories: Descriptive and to the data mining industry.
predictive data mining. Descriptive data mining Financial forecasting is of considerable practical
provides information to understand what is interest. Due to neural networks can mine valuable
happening inside the data without a predetermined information from a mass of history information and
idea. Predictive data mining allows the user to be efficiently used in financial areas, so the
submit records with unknown field values, and the applications of neural networks to financial
system will guess the unknown values based on forecasting have been very popular over the last
previous patterns discovered form the database. few years. Some researches show that neural
Data mining models can be categorized according networks performed better than conventional
to the tasks they perform: Classification and statistical approaches in financial forecasting and
39
Journal of Theoretical and Applied Information Technology
www.jatit.org
The simplified process for training a FFNN is as 3.2. The Back Propagation Algorithm:
follows: Backpropagation, or propagation of error, is a
1. Input data is presented to the network and common method of teaching artificial neural
propagated through the network until it reaches networks how to perform a given task.The back
the output layer. This forward process produces propagation algorithm is used in layered feed-
a predicted output. forward ANNs. This means that the artificial
2. The predicted output is subtracted from the neurons are organized in layers, and send their
actual output and an error value for the signals “forward”, and then the errors are
networks is calculated. propagated backwards. The back propagation
3. The neural network then uses supervised algorithm uses supervised learning, which means
learning, which in most cases is back that we provide the algorithm with examples of the
propagation, to train the network. Back inputs and outputs we want the network to
propagation is a learning algorithm for compute, and then the error (difference between
adjusting the weights. It starts with the weights actual and expected results) is calculated. The idea
between the output layer PE’s and the last of the back propagation algorithm is to reduce this
hidden layer PE’s and works backwards error, until the ANN learns the training data.
through the network. Summary of the technique:
4. Once back propagation has finished, the 1. Present a training sample to the neural
forward process starts again, and this cycle is network.
continued until the error between predicted and 2. Compare the network's output to the desired
actual outputs is minimized. output from that sample. Calculate the error in
each output neuron.
40
Journal of Theoretical and Applied Information Technology
www.jatit.org
3. For each neuron, calculate what the output 5. Assign "blame" for the local error to neurons at
should have been, and a scaling factor, how the previous level, giving greater responsibility
much lower or higher the output must be to neurons connected by stronger weights.
adjusted to match the desired output. This is 6. Repeat the steps above on the neurons at the
the local error. previous level, using each one's "blame" as its
4. Adjust the weights of each neuron to lower the error..
local error.
Actual Algorithm: fraud detection, telecommunications, medicine,
1. Initialize the weights in the network (often marketing, bankruptcy prediction, insurance, the
randomly) list goes on. The following are examples of where
2. repeat neural networks have been used.
* for each example e in the training set do Accounting
1. O = neural-net-output(network, e) ; Identifying tax fraud
forward pass Enhancing auditing by finding irregularities
2. T = teacher output for e Finance
3. Calculate error (T - O) at the output Signature and bank note verification
units Risk Management
4. Compute delta_wi for all weights Foreign exchange rate forecasting
from hidden layer to output layer ; Bankruptcy prediction
backward pass Customer credit scoring
5. Compute delta_wi for all weights Credit card approval and fraud detection
from input layer to hidden layer ; Forecasting economic turning points
backward pass continued Bond rating and trading
6. Update the weights in the network Loan approvals
* end Economic and financial forecasting
3. until all examples classified correctly or Marketing
stopping criterion satisfied Classification of consumer spending pattern
4. return(network) New product analysis
Identification of customer characteristics
Sale forecasts
4. REVIEW OF LITERATURE REPORTING Human resources
NEURAL NETWORK PERFORMANCE: Predicting employee’s performance and
There are numerous examples of commercial behavior
applications for neural networks. These include; Determining personnel resource requirements
41
Journal of Theoretical and Applied Information Technology
www.jatit.org
Artificial Neural Networks offer qualitative researchers use them, in particular those with
methods for business and economic systems that statistical backgrounds. Thus, neural networks are
traditional quantitative tools in statistics and becoming very popular with data mining
econometrics cannot quantify due to the complexity practitioners, particularly in medical research,
in translating the systems into precise mathematical finance and marketing. This is because they have
functions. Hence, the use of neural networks indata proven their predictive power through comparison
mining is a promising field of research especially with other statistical techniques using real data sets.
given the ready availability of large mass of data Due to design problems neural systems need further
sets and the reported ability of neural networks to research before they are widely accepted in
detect and assimilate relationships between a large industry. As software companies develop more
numbers of variables. sophisticated models with user-friendly interfaces
In most cases neural networks perform as well or the attraction to neural networks will continue to
better than the traditional statistical techniques to grow.
which they are compared. Resistance to using these
“black boxes” is gradually diminishing as more
REFERENCES
42