EXPERIMENT – 1.
Student Name: Jatin Sharma UID: 20BCS6697
Branch: CSE-AIML Section/Group:20AIML4-B
Semester: 5 Date of Performance: 3.09.22
Subject Name: Data mining & Warehousing Lab Subject Code: 20CSF-333
1. Aim of the practical:
Apply Pre-Processing techniques to the training data set of the Weather Table.
2. Task to be done:
• Apply add, remove and normalization on weather table
3. Program Code:
Code on Notepad:
Visualization:
Add:
Remove:
Normalization:
4. Observations/Discussions (For applied/experimental sciences/materials-based labs):
WEKA
Weka is a collection of machine learning algorithms for data mining tasks. It contains tools for data preparation,
classification, regression, clustering, association rules mining, and visualization.
Normalization
It is used to scale the data of an attribute so that it falls in a smaller range, such as -1.0 to 1.0. It is generally
useful for classification algorithms.
Normalize Procedure:
➢ In Filters, we have Supervised and Unsupervised data.
➢ Click on Unsupervised data and select the attribute Normalize.
➢ Select the attributes temperature, and humidity to Normalize.
➢ Click on Apply button and then Save.
➢ Click on the Edit button, it shows a new Weather Table with normalized values on Weka.
Add Procedure:
➢ In Filters, we have Supervised and Unsupervised data.
➢ Click on Unsupervised data and select the attribute Add.
➢ A new window is opened.
➢ In that, we enter attribute index, type, data format, and nominal label values for Climate.
Remove Procedure:
➢ In Filters, we have Supervised and Unsupervised data.
➢ Click on Unsupervised data and select the attribute Remove.
➢ Select the attributes windy, and play to Remove.
➢ Click the Remove button and then Save.
➢ Click on the Edit button, it shows a new Weather Table on Weka.
10. Result:
Successfully implemented the preprocessing techniques (add, remove and normalization) on the weather table
with the help of the data mining tool WEKA.
Learning outcomes (What I have learnt):
1. Learnt installation of WEKA
2. Learnt implementing the data mining techniques using the notepad and WEKA.
Evaluation Grid (To be created as per the SOP and Assessment guidelines by the faculty):
Sr. No. Parameters Marks Obtained Maximum Marks
1.
2.
3.