MODULE 3
ASSOCIATION RULES AND CLASSIFICATION CONCEPTS
JUBILIN JOSE
ASST. PROFESSOR
DEPT. OF COMPUTER SCIENCE AND APPLICATIONS
MAR ELIAS COLLEGE, KOTTAPPADY
SUPERVISED VS UNSUPERVISED LEARNING
Supervised learning is also known as classification. The training data are accompanied by
labels indicating the class of each observation. New data are then classified using a model
built from the training set.
Unsupervised learning is also known as clustering. In unsupervised learning, the class labels of
the training data are unknown. A set of measurements or observations is given, with the aim of
establishing whether classes or clusters exist in the data.
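The contrast can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the four data points are made up, the supervised part uses a 1-nearest-neighbour rule, and the unsupervised part uses a tiny k-means with k = 2. Neither technique is named in these notes; they are stand-ins chosen for brevity.

```python
# Hypothetical observations: two numeric features each.
# Supervised learning: every training observation carries a class label.
labeled = [((1.0, 1.2), "A"), ((0.8, 1.0), "A"),
           ((5.0, 5.1), "B"), ((5.2, 4.9), "B")]

def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def classify_1nn(train, point):
    """Return the label of the closest labeled training observation."""
    _, label = min(train, key=lambda item: sq_dist(item[0], point))
    return label

# Unsupervised learning: the same observations, but with the labels removed.
unlabeled = [features for features, _ in labeled]

def two_means(points, iterations=10):
    """Minimal k-means with k=2, seeded with the first and last points."""
    centers = [points[0], points[-1]]
    for _ in range(iterations):
        clusters = [[], []]
        for p in points:
            nearest = min((0, 1), key=lambda i: sq_dist(p, centers[i]))
            clusters[nearest].append(p)
        # Move each center to the mean of its cluster (keep it if the cluster is empty).
        centers = [
            tuple(sum(c) / len(cluster) for c in zip(*cluster)) if cluster else centers[i]
            for i, cluster in enumerate(clusters)
        ]
    return clusters

predicted = classify_1nn(labeled, (4.8, 5.0))  # uses the known labels
clusters = two_means(unlabeled)                # discovers groups without labels
print(predicted)
print(clusters)
```

The classifier can name the class ("A" or "B") because the labels were supplied. The clustering step can only report that two groups exist; giving them a meaning is left to the analyst.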
WHAT IS CLASSIFICATION AND WHAT IS PREDICTION?
Classification predicts categorical class labels. It builds a model from the training set and the
values of a class label attribute, and then uses that model to classify new data.
Example: Categorizing bank loan applications as safe or risky.
Prediction models continuous-valued functions, i.e., it predicts unknown or missing numeric values.
Example: Predicting the EMI of a purchased product based on the customer's occupation and income.
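The difference in output type can be shown with a small sketch. All figures below are hypothetical, income is the only input used (occupation is left out to keep the example short), and the two methods (a single income cut-off and a least-squares line) are assumptions made for illustration, not methods prescribed by these notes.

```python
# Hypothetical training data: (monthly income in thousands, loan class).
loans = [(20, "risky"), (25, "risky"), (40, "safe"), (55, "safe")]

def fit_threshold(rows):
    """Place the cut-off midway between the highest 'risky' and lowest 'safe' income."""
    max_risky = max(x for x, y in rows if y == "risky")
    min_safe = min(x for x, y in rows if y == "safe")
    return (max_risky + min_safe) / 2

cutoff = fit_threshold(loans)

def classify(income):
    """Classification: the output is a categorical label."""
    return "safe" if income >= cutoff else "risky"

# Hypothetical training data: (monthly income in thousands, EMI in thousands).
emi_history = [(20, 2.0), (30, 3.0), (40, 4.0), (50, 5.0)]

def fit_line(rows):
    """Least-squares fit of y = slope * x + intercept."""
    n = len(rows)
    mean_x = sum(x for x, _ in rows) / n
    mean_y = sum(y for _, y in rows) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in rows)
             / sum((x - mean_x) ** 2 for x, _ in rows))
    return slope, mean_y - slope * mean_x

slope, intercept = fit_line(emi_history)

def predict_emi(income):
    """Prediction: the output is a continuous numeric value."""
    return slope * income + intercept

print(classify(45))     # a label: "safe"
print(predict_emi(45))  # a number close to 4.5
```

Both functions take the same kind of input; what separates classification from prediction here is whether the result is a class label or a number.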
ISSUES IN CLASSIFICATION AND PREDICTION
1. One major issue is preparing the data for classification and prediction:
It includes data cleaning, relevance analysis, and data transformation and reduction.
2. Another issue is evaluating classification and prediction methods:
The evaluation criteria include accuracy, speed, robustness, scalability, and interpretability.
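Two of the items above can be made concrete with a short sketch: data preparation (cleaning and transformation) and one evaluation criterion (accuracy). The records, the choice of min-max scaling as the transformation, and the helper names are all assumptions made for illustration.

```python
def clean(rows):
    """Data cleaning: drop rows that contain a missing (None) value."""
    return [r for r in rows if None not in r]

def min_max(values):
    """Data transformation: rescale values to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:  # all values identical; avoid division by zero
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

def accuracy(actual, predicted):
    """Evaluation: fraction of observations whose predicted label matches the actual one."""
    return sum(a == p for a, p in zip(actual, predicted)) / len(actual)

# Hypothetical raw records: (income, class); one record has a missing income.
raw = [(20, "risky"), (None, "safe"), (40, "safe"), (60, "safe")]

cleaned = clean(raw)
scaled = min_max([income for income, _ in cleaned])

# Hypothetical test labels and the labels some classifier produced for them.
actual = ["safe", "risky", "safe", "safe"]
predicted = ["safe", "risky", "risky", "safe"]
score = accuracy(actual, predicted)

print(cleaned)  # the record with the missing income is gone
print(scaled)   # [0.0, 0.5, 1.0]
print(score)    # 0.75
```

The other criteria (speed, robustness, scalability, interpretability) are not single numbers like accuracy and usually need timing runs, noisy or larger data sets, or human judgement to assess.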