Data Mining & Warehousing Lab (B.Tech 6th Sem)
Assignment: 02
CSN16102 AY 24-25 Even sem
Objective: 01
Create a Weather Table with the help of Data Mining Tool WEKA EXPLORER in .arff
format.
Descriptions:
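Create a Weather table as a training data set with the following attributes: outlook
{sunny, rainy, overcast}, temperature numeric, humidity numeric, windy {TRUE, FALSE},
play {yes, no}. Nominal values are case-sensitive, so the values used in the data rows
must match the declared values exactly.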
Creation of Weather Table:
@relation weather
@attribute outlook {sunny, rainy, overcast}
@attribute temperature numeric
@attribute humidity numeric
@attribute windy {TRUE, FALSE}
@attribute play {yes, no}
@data
sunny,85,85,FALSE,no
sunny,80,90,TRUE,no
overcast,83,86,FALSE,yes
rainy,70,96,FALSE,yes
rainy,68,80,FALSE,yes
rainy,65,70,TRUE,no
overcast,64,65,TRUE,yes
sunny,72,95,FALSE,no
sunny,69,70,FALSE,yes
rainy,75,80,FALSE,yes
sunny,75,70,TRUE,yes
overcast,72,90,TRUE,yes
overcast,81,75,FALSE,yes
rainy,71,91,TRUE,no
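Before loading the file into WEKA Explorer, it can help to confirm that the table has the expected shape. The following is a minimal Python sketch, not part of WEKA: the names `ARFF_TEXT` and `parse_arff` are hypothetical, and the reader handles only the numeric and nominal attributes used in this table (no quoted values, dates, or sparse rows).

```python
from collections import Counter

# The weather table from this assignment, one @attribute declaration per line.
ARFF_TEXT = """\
@relation weather
@attribute outlook {sunny, rainy, overcast}
@attribute temperature numeric
@attribute humidity numeric
@attribute windy {TRUE, FALSE}
@attribute play {yes, no}
@data
sunny,85,85,FALSE,no
sunny,80,90,TRUE,no
overcast,83,86,FALSE,yes
rainy,70,96,FALSE,yes
rainy,68,80,FALSE,yes
rainy,65,70,TRUE,no
overcast,64,65,TRUE,yes
sunny,72,95,FALSE,no
sunny,69,70,FALSE,yes
rainy,75,80,FALSE,yes
sunny,75,70,TRUE,yes
overcast,72,90,TRUE,yes
overcast,81,75,FALSE,yes
rainy,71,91,TRUE,no
"""


def parse_arff(text):
    """Hypothetical helper: read a simple ARFF string into (attributes, rows)."""
    attributes = []  # (name, "numeric") or (name, [nominal values])
    rows = []
    in_data = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("%"):  # skip blank lines and comments
            continue
        if in_data:
            values = [v.strip() for v in line.split(",")]
            row = {}
            for (name, kind), value in zip(attributes, values):
                if value == "?":              # ARFF marker for a missing value
                    row[name] = None
                elif kind == "numeric":
                    row[name] = float(value)
                else:
                    if value not in kind:     # undeclared nominal value
                        raise ValueError(f"{value!r} not declared for {name}")
                    row[name] = value
            rows.append(row)
        elif line.lower().startswith("@attribute"):
            _, name, spec = line.split(None, 2)
            if spec.startswith("{"):
                kind = [v.strip() for v in spec.strip("{}").split(",")]
            else:
                kind = spec.lower()
            attributes.append((name, kind))
        elif line.lower().startswith("@data"):
            in_data = True
    return attributes, rows


attributes, rows = parse_arff(ARFF_TEXT)
class_counts = Counter(row["play"] for row in rows)
print(len(attributes), "attributes,", len(rows), "instances")
print(dict(class_counts))
```

The check on nominal values is the useful part here: a row containing `true` instead of `TRUE` would raise an error, which mirrors the case-sensitivity of the declared values.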
Objective: 02
Apply Pre-Processing techniques to the training data set of Weather Table using WEKA
EXPLORER and KNOWLEDGEFLOW.
1) Add attribute climate {tropical, dry, mild, continental, polar}
2) Remove
3) Attribute selection
4) Fill missing values
5) Normalization
6) Discretization
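To see what two of the listed techniques compute, the sketch below applies min-max normalization and equal-width discretization to the temperature column of the Weather table. It is plain Python written for illustration, not WEKA code: the function names and the choice of 3 bins are assumptions. WEKA's unsupervised Normalize filter scales numeric attributes to [0, 1] by default, and its unsupervised Discretize filter uses equal-width bins by default (with 10 bins rather than 3), so the results in Explorer should follow the same idea.

```python
# Temperature values from the 14 instances of the Weather table.
temperature = [85, 80, 83, 70, 68, 65, 64, 72, 69, 75, 75, 72, 81, 71]


def min_max_normalize(values):
    """Rescale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:                      # constant column: avoid division by zero
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def equal_width_bins(values, n_bins):
    """Return a bin index (0 .. n_bins-1) for each value using equal-width bins."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0 for _ in values]
    width = (hi - lo) / n_bins
    # The maximum value would land in bin n_bins, so clamp it into the last bin.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


normalized = min_max_normalize(temperature)
bins = equal_width_bins(temperature, 3)   # 3 bins is an illustrative choice

for t, n, b in zip(temperature, normalized, bins):
    print(f"{t:>3}  normalized={n:.3f}  bin={b}")
```

With a minimum of 64 and a maximum of 85, each of the 3 bins is 7 degrees wide, so 64 to 70 falls in bin 0, 71 to 77 in bin 1, and 78 to 85 in bin 2.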
Descriptions:
Real-world databases are highly susceptible to noise, missing values, and
inconsistencies because of their huge size. Pre-processing the data improves its
quality, and therefore the quality of the mining results, and it also makes the
mining process more efficient.
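As an illustration of handling missing values, the sketch below fills each gap with the column mean for numeric attributes and the most frequent value (mode) for nominal attributes. This is the strategy documented for WEKA's ReplaceMissingValues filter, but the code is plain Python, not WEKA itself. The function name `fill_missing` is hypothetical, and the sample rows are made up for the example: the Weather table in this assignment has no missing values, so two entries are blanked out here (None stands for the ARFF `?` marker).

```python
from collections import Counter


def fill_missing(rows):
    """Replace None with the column mean (numeric) or mode (nominal)."""
    columns = list(rows[0].keys())
    fill = {}
    for col in columns:
        present = [r[col] for r in rows if r[col] is not None]
        if all(isinstance(v, (int, float)) for v in present):
            fill[col] = sum(present) / len(present)             # mean
        else:
            fill[col] = Counter(present).most_common(1)[0][0]   # mode
    return [
        {c: (fill[c] if r[c] is None else r[c]) for c in columns}
        for r in rows
    ]


# Illustrative rows only; the None entries are assumptions for this example.
sample = [
    {"outlook": "sunny", "humidity": 85.0},
    {"outlook": None,    "humidity": 90.0},
    {"outlook": "sunny", "humidity": None},
    {"outlook": "rainy", "humidity": 80.0},
]

filled = fill_missing(sample)
for row in filled:
    print(row)
```

The missing outlook becomes "sunny" (the most frequent value) and the missing humidity becomes the mean of 85, 90, and 80.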
Lab practice:
Practice all the other available preprocessing techniques on the diabetes dataset and
prepare an analysis report. Write the steps in your lab file.